large-language-models — Page 6

Sort:

3 weeks ago · ai · - · -

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

Dynamic Memory Sparsification DMS Researchers at NVIDIA have introduced Dynamic Memory Sparsification DMS, a technique that can cut the memory cost of large‑la...

#Nvidia #large language models #dynamic memory sparsification #KV cache compression #LLM reasoning efficiency #memory optimization #AI research
3 weeks ago · ai · - · -

How caching helps in LLM Application?

What is caching? Caching is the technique of storing frequently accessed data in a temporary, high‑speed storage e.g., Redis. It reduces the compute load on th...

#caching #large-language-models #token-cost #performance-optimization #retrieval-augmented-generation
3 weeks ago · ai · - · -

Anthropic raises $30B in Series G funding at $380B post-money valuation

Series G Funding Overview We have raised $30 billion in Series G funding led by GIC and Coatue, valuing Anthropic at $380 billion post‑money. The round was co‑...

#Anthropic #AI funding #enterprise AI #Claude #large language models
3 weeks ago · ai · - · -

[Paper] CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

AI agents are increasingly used to solve real-world tasks by reasoning over multi-turn user interactions and invoking external tools. However, applying reinforc...

#reinforcement learning #large language models #tool-use agents #checklist rewards #RLHF
3 weeks ago · software · - · -

[Paper] Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting

Unit testing is essential for verifying the functional correctness of code modules (e.g., classes, methods), but manually writing unit tests is often labor-inte...

#unit testing #large language models #few-shot prompting #test generation #software quality
3 weeks ago · ai · - · -

Can you clone Gemini by asking it enough questions? Google says attackers tried

!https://www.androidauthority.com/wp-content/uploads/2024/02/Google-Gemini-logo-on-smartphone-stock-photo-7.jpg TL;DR - Google report claims one campaign sent o...

#model extraction #AI security #Google Gemini #adversarial attacks #large language models
3 weeks ago · ai · - · -

ai;dr

ai; didn't read For me, writing is the most direct window into how someone thinks, perceives, and groks the world. Once you outsource that to an LLM, I'm not su...

#large-language-models #AI-generated-content #code-assistance #Claude #productivity
3 weeks ago · ai · - · -

Gemini 3 Deep Think: Advancing science, research and engineering

Our most specialized reasoning mode is now updated to solve modern science, research and engineering challenges....

#Gemini 3 #DeepMind #AI reasoning #Scientific AI #Large language models
3 weeks ago · ai · - · -

Are We Over-Engineering LLM Stacks Too Early?

Introduction I’ve been building with LLMs for a while now, and I keep noticing the same pattern. A project starts simple: python response = client.responses.cr...

#large-language-models #prompt-engineering #token-optimization #retrieval-augmented-generation #LLM-architecture
3 weeks ago · ai · - · -

GPT-5 outperforms federal judges in legal reasoning experiment

Article URL: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6155012 Comments URL: https://news.ycombinator.com/item?id=46982792 Points: 166 Comments: 127...

#GPT-5 #legal reasoning #large language models #AI benchmarks #judicial decision-making
3 weeks ago · ai · - · -

Apple’s Siri revamp reportedly delayed… again

Background Apple has been promising a new‑and‑improved, AI‑powered Siri since it first unveiledhttps://techcrunch.com/2024/06/10/apple-intelligence-is-the-comp...

#Apple Siri #generative AI #large language models #iOS updates #Google Gemini
3 weeks ago · ai · - · -

[Paper] Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Supervised fine-tuning (SFT) on chain-of-thought data is an essential post-training step for reasoning language models. Standard machine learning intuition sugg...

#chain-of-thought #fine-tuning #large language models #data efficiency

Newer posts

Older posts