Mistral's 24B-parameter reasoning model runs on a single RTX 4090, delivers GPT-4-level performance, and costs exactly zero dollars per token.