Evaluating AI’s ability to perform scientific research tasks
OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific research....
Gemini 3, our most intelligent model, is now available for developers via the Gemini API. To support its state‑of‑the‑art reasoning, autonomous coding, multimod...
OpenAI has officially released GPT-5.2, and the reactions from early testers — among whom OpenAI seeded the model several days prior to public release, in some...
GPT-5.2 is our most advanced frontier model for everyday professional work, with state-of-the-art reasoning, long-context understanding, coding, and vision. Use...
Introduction: Tired of AI that's a black box? Frustrated by complex systems that are difficult to debug and adapt? What if you could build intelligent systems w...
This new technique enables LLMs to dynamically adjust the amount of computation they use for reasoning, based on the difficulty of the question....
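The teaser doesn't describe the mechanism, so here is a minimal hedged sketch of the general idea of difficulty-gated test-time computation: estimate how hard a question looks, then scale the reasoning budget accordingly. Every name below (`estimate_difficulty`, `reasoning_budget`, the keyword heuristic) is an illustrative assumption, not the technique from the article.

```python
# Illustrative sketch only: a toy difficulty estimator gates how many
# reasoning tokens a model is allowed to spend on a question.

def estimate_difficulty(question: str) -> float:
    """Toy proxy: math/proof keywords and length push the score toward 1.0."""
    q = question.lower()
    signals = sum(tok in q for tok in ("prove", "integral", "derive", "optimal"))
    return min(1.0, 0.25 * signals + min(len(q), 400) / 800)

def reasoning_budget(question: str, min_tokens: int = 128, max_tokens: int = 4096) -> int:
    """Map estimated difficulty linearly onto a chain-of-thought token budget."""
    d = estimate_difficulty(question)
    return int(min_tokens + d * (max_tokens - min_tokens))

print(reasoning_budget("What is 2 + 2?"))                     # small budget
print(reasoning_budget("Prove the integral of x^2 over..."))  # larger budget
```

A real system would presumably learn the difficulty signal rather than hard-code keywords; the point is only that compute becomes a function of the input, not a fixed constant.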
AI Model Nears a Perfect Score on the Putnam. An AI math model recently scored 118/120 on one of the hardest human exams. Beyond solving problems, it learned to...
Think Like HATEOAS: How Agentic RAG Dynamically Navigates Knowledge
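The article's own code isn't shown here, so the following is a hedged sketch of the HATEOAS analogy it names: each retrieved chunk carries outgoing links, and the agent navigates knowledge by following them instead of issuing one flat query. The `Chunk` type, the toy `STORE`, and `navigate` are all made up for illustration.

```python
# Sketch of HATEOAS-style retrieval: retrieved chunks expose "links"
# (relation -> chunk id), and the agent hops along them. Illustrative only.
from dataclasses import dataclass, field

@dataclass
class Chunk:
    id: str
    text: str
    links: dict[str, str] = field(default_factory=dict)  # relation -> chunk id

STORE = {
    "intro": Chunk("intro", "RAG overview...", {"details": "arch", "eval": "bench"}),
    "arch":  Chunk("arch", "Retriever + generator...", {"eval": "bench"}),
    "bench": Chunk("bench", "Evaluation setup...", {}),
}

def navigate(start: str, want: str, max_hops: int = 5) -> Chunk:
    """Follow the 'want' relation until a chunk no longer exposes it."""
    node = STORE[start]
    for _ in range(max_hops):
        nxt = node.links.get(want)
        if nxt is None:
            break
        node = STORE[nxt]
    return node

print(navigate("intro", "eval").text)  # -> "Evaluation setup..."
```

The parallel to hypermedia is that the client (here, the agent) discovers where it can go next from the response itself, rather than from a hard-coded retrieval plan.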
I went into the Makiai article about OpenAI’s o4-mini and o4-mini-high expecting just another technical breakdown full of benchmarks I’d skim and forget. Instea...
MLLMs exhibit strong reasoning on isolated queries, yet they operate de novo -- solving each problem independently and often repeating the same mistakes. Existi...
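The abstract names the gap (each problem solved de novo, mistakes repeated) but the excerpt cuts off before the proposed fix, so the sketch below is an assumption, not the paper's method: a simple cross-problem experience memory that stores lessons from past failures and surfaces the most similar ones before a new attempt. `ExperienceMemory` and its string-similarity recall are hypothetical.

```python
# Hypothetical cross-problem "experience memory": store (problem, lesson)
# pairs and recall lessons from the most similar past problems. Not the
# paper's architecture; a minimal illustration of reusing experience.
from difflib import SequenceMatcher

class ExperienceMemory:
    def __init__(self) -> None:
        self.records: list[tuple[str, str]] = []  # (problem, lesson)

    def add(self, problem: str, lesson: str) -> None:
        self.records.append((problem, lesson))

    def recall(self, problem: str, k: int = 2) -> list[str]:
        """Return lessons attached to the k most similar stored problems."""
        scored = sorted(
            self.records,
            key=lambda r: SequenceMatcher(None, r[0], problem).ratio(),
            reverse=True,
        )
        return [lesson for _, lesson in scored[:k]]

mem = ExperienceMemory()
mem.add("count objects in cluttered image", "double-check occluded items")
mem.add("read chart axis labels", "units may differ per axis")
print(mem.recall("count people in a crowded photo"))
```

A production system would likely use embedding similarity and learned lesson extraction; the point is only that state persists across queries instead of being discarded.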