large language models

Sort:

1 day ago · ai · - · -

New KV cache compaction technique cuts LLM memory 50x without accuracy loss

Enterprise‑Scale Memory Bottleneck in Large Language Models Large‑document or long‑horizon AI applications quickly run into a memory bottleneck. As the context...

#LLM #KV cache #memory compression #Attention Matching #MIT research #inference optimization #large language models
3 days ago · ai · - · -

The paradox of AI memory: remembering everything is easy. Remembering wisely is hard.

The Problem with Naive Memory But here's what nobody talks about: naive memory is expensive. And not just in dollars. Give an agent a massive context window an...

#AI memory #context window #large language models #agent architecture #forgetting #structured extraction #summarization
3 days ago · ai · - · -

Stripe has been doing amazing work with their llms.txt, and this guide covers it well. This might be worth adding to your own llms.txt file too!

!https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprof...

#stripe #llms.txt #large-language-models #ai #guide #developer-tools
3 days ago · ai · - · -

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time

Microsoft Releases Phi‑4‑reasoning‑vision‑15B Microsoft announced on Tuesday the launch of Phi‑4‑reasoning‑vision‑15B, a compact open‑weight multimodal AI mode...

#Microsoft #Phi-4-reasoning-vision-15B #multimodal AI #open-weight model #large language models #computer vision #AI reasoning #open-source AI #HuggingFace #GitHub
3 days ago · ai · - · -

Something is afoot in the land of Qwen

Recent developments at Alibaba’s Qwen team I’m behind on writing about Qwen 3.5, a remarkable family of open‑weight models released by Alibaba’s Qwen team over...

#Qwen #Alibaba #open-weight models #large language models #AI research #team departures
3 days ago · ai · - · -

Rethinking AI Assistants: A Privacy-First Approach with Google Gemini

Are you sure you want to hide this comment? It will become hidden in your post, but will still be visible via the comment's permalink. Hide child comments as we...

#google-gemini #privacy #ai-assistants #large-language-models #data-protection
4 days ago · ai · - · -

Alibaba’s Qwen tech lead steps down after major AI push

Background Junyang Lin, a central technical leader on Alibaba’s Qwen team, announced on X that he was “stepping down” from the project — without providing furt...

#Alibaba #Qwen #large language models #AI leadership #open-weight models #AI competition
5 days ago · ai · - · -

Language Model Contains Personality Subnetworks

Abstract Humans shift between different personas depending on social context. Large Language Models LLMs demonstrate a similar flexibility in adopting differen...

#large language models #persona subnetworks #model interpretability #parameter masking #LLM behavior #AI research
6 days ago · ai · - · -

Your AI is a Confident Liar: How to Actually Fix Factual Hallucinations

Let’s be honest: we’ve all been there. You’re deep into a sprint, building a shiny new feature powered by a Large Language Model LLM. You feed it a complex prom...

#AI hallucination #large language models #LLM reliability #prompt engineering #factual accuracy #AI safety #generative AI
6 days ago · software · - · -

🚀 Création d'une application PHP MCP pour publier des articles Darkwood

markdown Applications MCP : simplifier le flux éditorial Les grands modèles de langage sont déjà performants pour la génération de texte. Ce qui manque encore,...

#PHP #MCP #large-language-models #content-generation #editorial-workflow #blog-publishing #AI-integration
1 week ago · ai · - · -

Perplexity Announces 'Computer,' an AI Agent That Assigns Work To Other AI Agent

Overview joshuarkhttps://slashdot.org/~joshuark shares a report from Ars Technica: Perplexity has introducedhttps://www.perplexity.ai/hub/blog/introducing-perp...

#Perplexity #AI agents #multi‑agent systems #workflow automation #large language models #task orchestration
1 week ago · ai · - · -

Anthropic ditches its core safety promise

Anthropic’s Shift Away From Its Core Safety Promise Anthropic, founded with the mission to build AI systems aligned with human values, has long positioned itse...

#Anthropic #AI safety #AI alignment #large language models #AI governance #trustworthy AI

Newer posts

Older posts