model reasoning

5 days ago · ai

Qwen3-Max Thinking beats Gemini 3 Pro and GPT-5.2 on Humanity's Last Exam (with search)

Chinese AI and tech firms continue to impress with their development of cutting-edge, state-of-the-art AI language models. Today, the one drawing eyeballs is Al...

#Qwen3-Max-Thinking #Alibaba Cloud #large language model #AI benchmarking #model reasoning
3 weeks ago · ai

Why We Think: How We Can Improve Models' Ability to Think

#artificial intelligence #machine learning #model reasoning #deep learning #large language models #AI research
3 weeks ago · ai

Why Your AI Is Suddenly Making You Wait. ⏳

The Shift: From “Guessing” to “Reasoning” Old AI models were “Fast Guessers.” They predicted the next word based on patterns, delivering instant responses but...

#AI latency #model reasoning #slow inference #OpenAI o1 #DeepSeek R1 #prompt engineering #AI hallucinations #model performance
3 weeks ago · ai

TII’s Falcon H1R 7B can out-reason models up to 7x its size — and it’s (mostly) open

For the last two years, the prevailing logic in generative AI has been one of brute force: if you want better reasoning, you need a bigger model. While 'small'...

#generative AI #large language models #Falcon H1R #TII #model reasoning #open-source AI
1 month ago · ai

Evaluating chain-of-thought monitorability

OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. Our findings show th...

#chain-of-thought #monitorability #model reasoning #evaluation suite #OpenAI #AI interpretability #LLM safety
1 month ago · ai

New Gemini API updates for Gemini 3

Nov. 25, 2025: What’s new in the Gemini API for Gemini 3. Simplified parameters for thinking control: a new thinking_level parameter lets you set the depth of...

#Gemini 3 #Google AI #Gemini API #thinking_level #multimodal #autonomous coding #agentic capabilities #model reasoning