large-language-models — Page 11

Sort:

1 month ago · ai · - · -

LLM-as-a-Courtroom

Article URL: https://falconer.com/notes/llm-as-a-courtroom/ Comments URL: https://news.ycombinator.com/item?id=46784210 Points: 30 Comments: 6...

#large-language-models #legal-tech #AI-ethics #courtroom-automation
1 month ago · ai · - · -

Going Beyond the Context Window: Recursive Language Models in Action

Explore a practical approach to analysing massive datasets with LLMs The post Going Beyond the Context Window: Recursive Language Models in Action appeared firs...

#large-language-models #context-window #recursive-models #data-analysis #machine-learning
1 month ago · ai · - · -

Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs

Back to Articles !https://huggingface.co/avatars/598fc631df6c75ae458eed55f883820e.svghttps://huggingface.co/Omar-Alkaabi !https://huggingface.co/avatars/874c3ea...

#Arabic LLM #dialect benchmark #Emirati dialect #NLP evaluation #large language models
1 month ago · ai · - · -

[Paper] ctELM: Decoding and Manipulating Embeddings of Clinical Trials with Embedding Language Models

Text embeddings have become an essential part of a variety of language applications. However, methods for interpreting, exploring and reversing embedding spaces...

#embedding language models #clinical trials #biomedical NLP #synthetic data #large language models
1 month ago · ai · - · -

[Paper] Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Typical reinforcement learning (RL) methods for LLM reasoning waste compute on hard problems, where correct on-policy traces are rare, policy gradients vanish, ...

#reinforcement learning #large language models #off-policy learning #sample efficiency #prefix conditioning
1 month ago · ai · - · -

[Paper] MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts

Large Language Models are increasingly optimized for deep reasoning, prioritizing the correct execution of complex tasks over general conversation. We investiga...

#large language models #AI safety #reasoning benchmarks #emergency response #natural language processing
1 month ago · ai · - · -

[Paper] Design Techniques for LLM-Powered Interactive Storytelling: A Case Study of the Dramamancer System

The rise of Large Language Models (LLMs) has enabled a new paradigm for bridging authorial intent and player agency in interactive narrative. We consider this p...

#large language models #interactive storytelling #natural language processing #AI research
1 month ago · ai · - · -

[Paper] POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

Reinforcement learning (RL) has improved the reasoning abilities of large language models (LLMs), yet state-of-the-art methods still fail to learn on many train...

#reinforcement learning #large language models #reasoning #privileged exploration #machine learning
1 month ago · ai · - · -

[Paper] Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Can a model learn to escape its own learning plateau? Reinforcement learning methods for finetuning large reasoning models stall on datasets with low initial su...

#self-improvement #meta-reinforcement-learning #large-language-models #curriculum-generation #machine-learning-research
1 month ago · ai · - · -

[Paper] Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, particularly when augmented with search mechanisms that enabl...

#dependency-aware reasoning #large language models #retrieval-augmented generation #multi-hop question answering #persistent memory
1 month ago · ai · - · -

DeepSeek-R1: The AI That Learned to Think (and Had an 'Aha Moment')

Imagine an AI that stops mid‑sentence, realizes it made a mistake, and says: “Wait, wait. That’s an aha moment I can flag here.” This isn’t science fiction—it h...

#DeepSeek-R1 #large language models #reinforcement learning #metacognition #chain-of-thought #AI reasoning
1 month ago · ai · - · -

The assistant axis: situating and stabilizing the character of LLMs

Article URL: https://www.anthropic.com/research/assistant-axis Comments URL: https://news.ycombinator.com/item?id=46684708 Points: 4 Comments: 0...

#LLM #large language models #AI alignment #assistant behavior #Anthropic research

Newer posts

Older posts