Chinese AI startup DeepSeek launches DeepSeek-V3, a large 671-billion parameter model, shattering benchmarks and rivaling prime proprietary methods.
This smaller model approached the mathematical reasoning capabilities of GPT-four and outperformed one other Chinese model, Qwen-72B. https://files.fm/deepseek1