These models are designed to “think and reflect” before giving responses, aiming to narrow the gap between China and the United States in AI technology.
Alibaba Cloud stated that QwQ matched or outperformed o1 in certain benchmark tests. The tech firm said it ranked higher in two maths evaluations and performed equally in problem-solving and coding. These allow them to handle complex tasks better than earlier generative AI models, according to OpenAI.
Other companies have also joined the race. Hangzhou-based DeepSeek introduced its r1 model, outperforming OpenAI’s o1 in three out of six benchmarks evaluating maths, programming, and scientific tasks.