[Paper] Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs
Hallucinations in large language models remain a persistent challenge, particularly in multilingual and generative settings where factual consistency is difficu...
Article URL: https://openai.com/index/introducing-gpt-5-3-codex/ Comments URL: https://news.ycombinator.com/item?id=46902638 Points: 181 Comments: 41...
The deep learning revolution has a curious blind spot: the spreadsheet. While Large Language Models (LLMs) have mastered the nuances of human prose and image gene...
Article URL: https://openai.com/index/introducing-openai-frontier/ Comments URL: https://news.ycombinator.com/item?id=46899770 Points: 8 Comments: 0...
Meta has completed pre-training of 'Avocado,' its next-generation large language model (LLM), reportedly "the most ... in Meta's history"...
The case against pre-built tools in agentic architectures. The post "Plan–Code–Execute: Designing Agents That Create Their Own Tools" appeared first on Towards Data Science.
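As a rough illustration of the pattern the title names, here is a minimal, hypothetical Python sketch of a plan–code–execute loop in which the agent synthesizes its own tool rather than selecting from a pre-built toolbox. The `llm_generate` function is a placeholder assumption standing in for any code-generating model call; nothing here reflects the article's actual implementation.

```python
# Hypothetical sketch of a plan-code-execute agent loop.
# `llm_generate` stands in for any LLM call that returns Python source;
# it is an assumption, not an API from the article.

from typing import Callable


def llm_generate(prompt: str) -> str:
    """Placeholder: a real agent would call a code-generating model here."""
    # Canned response so the sketch runs end to end.
    return (
        "def tool(numbers):\n"
        "    return sum(numbers) / len(numbers)\n"
    )


def plan(task: str) -> str:
    """Plan step: describe the tool the task needs."""
    return f"Write a Python function `tool(numbers)` that solves: {task}"


def code(spec: str) -> Callable:
    """Code step: have the model write the tool, then compile it."""
    source = llm_generate(spec)
    namespace: dict = {}
    exec(source, namespace)  # NOTE: sandbox this in any real system
    return namespace["tool"]


def execute(task: str, payload):
    """Execute step: build the tool on demand and run it on the input."""
    tool = code(plan(task))
    return tool(payload)


if __name__ == "__main__":
    print(execute("compute the mean of a list", [3, 5, 7]))  # -> 5.0
```

The design point the title argues for is that the tool is created per task at run time instead of being hand-built in advance.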
Article URL: https://twitter.com/karpathy/status/2018804068874064198 Comments URL: https://news.ycombinator.com/item?id=46883528 Points: 29 Comments: 4...
Language agents have shown strong promise for task automation. Realizing this promise for increasingly complex, long-horizon tasks has driven the rise of a sub-...
A group of Apple and Tel-Aviv University researchers figured out a way to speed up AI-based text-to-speech generation without sacrificing intelligibility. Here’...
We are confusing "size" with "smarts." The next leap in artificial intelligence will not come from a larger data center, but from a more constrained environment....
Article URL: https://research.google/blog/towards-a-science-of-scaling-agent-systems-when-and-why-agent-systems-work/ Comments URL: https://news.ycombinator.com...
The rise of Large Language Models (LLMs) has enabled a new paradigm for bridging authorial intent and player agency in interactive narrative. We consider this p...