Chinese AI startup DeepSeek launches DeepSeek-V3, a large 671-billion parameter mannequin, shattering benchmarks and rivaling top proprietary techniques.
This smaller mannequin approached the mathematical reasoning capabilities of GPT-4 and outperformed another Chinese model, Qwen-72B. https://quicknote.io/97f78d70-df47-11ef-a9bd-a57b99780c19