Chinese AI startup DeepSeek launches DeepSeek-V3, a massive 671-billion parameter model, shattering benchmarks and rivaling high proprietary methods.
This smaller mannequin approached the mathematical reasoning capabilities of GPT-four and outperformed another Chinese model, Qwen-72B. https://s.id/deepseek1