AI research

1 day ago · ai · - · -

NanoGPT Slowrun: Language Modeling with Limited Data, Infinite Compute

Compute grows much faster than data. Our current scaling laws require proportional increases in both to scale, but the asymmetry in their growth means intellige...

#language modeling #scaling laws #compute vs data #data efficiency #NanoGPT #Q Labs #generalization #AI research
1 day ago · ai · - · -

Something is afoot in the land of Qwen

Recent developments at Alibaba’s Qwen team. I’m behind on writing about Qwen 3.5, a remarkable family of open‑weight models released by Alibaba’s Qwen team over...

#Qwen #Alibaba #open-weight models #large language models #AI research #team departures
2 days ago · ai · - · -

March 11 - Strategies for Validating World Models and Action-Conditioned Video

Join us on March 11 for “Debugging the Future: Strategies for Validating World Models and Action‑Conditioned Video” workshop with Nick Lotz from Voxel51 – regis...

#world models #action‑conditioned video #video generation #model validation #temporal consistency #large‑scale video datasets #Voxel51 #AI research
3 days ago · ai · - · -

Language Model Contains Personality Subnetworks

Abstract: Humans shift between different personas depending on social context. Large Language Models (LLMs) demonstrate a similar flexibility in adopting differen...

#large language models #persona subnetworks #model interpretability #parameter masking #LLM behavior #AI research
6 days ago · ai · - · -

Featured video: Coding for underwater robotics

Internship Overview: During a summer internship at MIT Lincoln Laboratory, Ivy Mahncke, an undergraduate student of robotics engineering at Olin College of Engi...

#ai #ai-research #academia
1 week ago · it · - · -

OpenAI Announces Major Expansion of London Office

The San Francisco‑based AI lab is growing its research team in London. The move puts it in direct competition with Google DeepMind for top research talent in th...

#OpenAI #London office #company expansion #AI research #DeepMind #talent competition #UK tech
1 week ago · ai · - · -

Riley Walz, the Jester of Silicon Valley, Is Joining OpenAI

The software engineer is famous for his online stunts. Now he’s joining the company behind ChatGPT to work on new ways for humans to use AI systems....

#OpenAI #Riley Walz #ChatGPT #AI hiring #Silicon Valley #AI research
1 week ago · ai · - · -

Enhancing maritime cybersecurity with technology and policy

Originally from the small Balkan country of Montenegro, Strahinja Strajo Janjusevic says his life has unfolded in unexpected ways, for which he is deeply gratef...

#ai #ai-research #academia
1 week ago · ai · - · -

Guide Labs debuts a new kind of interpretable LLM

The challenge of wrangling a deep learning model is often understanding why it does what it does: whether it’s xAI’s repeated struggle sessions to fine‑tune Gro...

#interpretable LLM #Guide Labs #Steerling-8B #explainable AI #open-source model #LLM tracing #AI research
1 week ago · ai · - · -

Researchers baked 3x inference speedups directly into LLM weights — without speculative decoding

Multi‑Token Prediction (MTP): Boosting Throughput for Agentic AI Workflows. As agentic AI workflows multiply the cost and latency of long reasoning chains, a tea...

#LLM #inference speedup #multi-token prediction #model weights #speculative decoding #AI research
1 week ago · ai · - · -

Our First Proof submissions

February 20, 2026. We’re sharing our proof attempts for Fi...

#AI research #mathematical proofs #First Proof challenge #OpenAI #machine learning #formal verification
2 weeks ago · ai · - · -

Google’s new Gemini Pro model has record benchmark scores—again

Image: Google Gemini (https://techcrunch.com/wp-content/uploads/2026/01/google-gemini-jagmeet-singh-techcrunch.jpg?w=1024). Image Credits: Jagmeet Singh / TechCrunch. Goog...

#Google #Gemini #LLM #large language model #benchmark #AI research #machine learning
