ai — Page 18 | EUNO.NEWS

Sort:

2 weeks ago · ai · - · -

[Paper] Find, Fix, Reason: Context Repair for Video Reasoning

Reinforcement learning has advanced video reasoning in large multi-modal models, yet dominant pipelines either rely on on-policy self-exploration, which plateau...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Detecting and Suppressing Reward Hacking with Gradient Fingerprints

Reinforcement learning with verifiable rewards (RLVR) typically optimizes for outcome rewards without imposing constraints on intermediate reasoning. This leave...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] BAGEL: Benchmarking Animal Knowledge Expertise in Language Models

Large language models have shown strong performance on broad-domain knowledge and reasoning benchmarks, but it remains unclear how well language models handle s...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] CollideNet: Hierarchical Multi-scale Video Representation Learning with Disentanglement for Time-To-Collision Forecasting

Time-to-Collision (TTC) forecasting is a critical task in collision prevention, requiring precise temporal prediction and comprehending both local and global pa...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Adaptive multi-fidelity optimization with fast learning rates

In multi-fidelity optimization, biased approximations of varying costs of the target function are available. This paper studies the problem of optimizing a loca...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Enhancing AI and Dynamical Subseasonal Forecasts with Probabilistic Bias Correction

Decision-makers rely on weather forecasts to plant crops, manage wildfires, allocate water and energy, and prepare for weather extremes. Today, such forecasts e...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Optimizing Korean-Centric LLMs via Token Pruning

This paper presents a systematic benchmark of state-of-the-art multilingual large language models (LLMs) adapted via token pruning - a compression technique tha...

#research #paper #ai #nlp
2 weeks ago · ai · - · -

[Paper] A Two-Stage, Object-Centric Deep Learning Framework for Robust Exam Cheating Detection

Academic integrity continues to face the persistent challenge of examination cheating. Traditional invigilation relies on human observation, which is inefficien...

#research #paper #ai #machine-learning #computer-vision
2 weeks ago · ai · - · -

Beyond Prompting: Using Agent Skills in Data Science

In my last articlehttps://towardsdatascience.com/beyond-code-generation-ai-for-the-full-data-science-workflow/, I shared how to use MCP to integrate LLMs into y...

#ai #data-science #tutorial
2 weeks ago · ai · - · -

[Paper] Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations

Large language models are increasingly deployed in settings where reliability matters, yet output-level uncertainty signals such as token probabilities, entropy...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

Building a Fast Multilingual OCR Model with Synthetic Data

Back to Articleshttps://huggingface.co/blog !https://huggingface.co/avatars/a514f0d2b2f9937dd6fd97560f8319a8.svghttps://huggingface.co/emelryan Training a high...

#ai #ai-models #opensource
2 weeks ago · ai · - · -

[Paper] JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models

Adapter-based methods have become a cost-effective approach to continual learning (CL) for Large Language Models (LLMs), by sequentially learning a low-rank upd...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

[Paper] AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency

Large language models (LLMs) increasingly rely on chain-of-thought (CoT) reasoning to solve complex tasks. Yet ensuring that the reasoning trace both contribute...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

Anthropic launches Claude Design following Opus 4.7 model upgrade

!https://9to5mac.com/wp-content/uploads/sites/6/2026/04/claude-design.webp?w=1600 Claude Design is Anthropic’s latest research preview Powered by Opus 4.7, Clau...

#Anthropic #Claude Design #Opus 4.7 #generative AI #design automation #AI design assistant #Mac tools
2 weeks ago · ai · - · -

[Paper] On the Rejection Criterion for Proxy-based Test-time Alignment

Recent works proposed test-time alignment methods that rely on a small aligned model as a proxy that guides the generation of a larger base (unaligned) model. T...

#research #paper #ai #nlp
2 weeks ago · ai · - · -

[Paper] Training Time Prediction for Mixed Precision-based Distributed Training

Accurate prediction of training time in distributed deep learning is crucial for resource allocation, cost estimation, and job scheduling. We observe that the f...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Sentiment Analysis of German Sign Language Fairy Tales

We present a dataset and a model for sentiment analysis of German sign language (DGS) fairy tales. First, we perform sentiment analysis for three levels of vale...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

You Don’t Need Many Labels to Learn

Introduction usually comes with an implicit assumption: you need a lot of labeled data. At the same time, many models are capable of discovering structure in d...

#ai #data-science #tutorial
2 weeks ago · ai · - · -

[Paper] Robust Synchronisation for Federated Learning in The Face of Correlated Device Failure

Probabilistic Synchronous Parallel (PSP) is a technique in distributed learning systems to reduce synchronization bottlenecks by sampling a subset of participat...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Prototype-Grounded Concept Models for Verifiable Concept Alignment

Concept Bottleneck Models (CBMs) aim to improve interpretability in Deep Learning by structuring predictions through human-understandable concepts, but they pro...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

Jacob Andreas and Brett McGuire named Edgerton Award winners

MIT Associate Professor Jacob Andreashttps://www.eecs.mit.edu/people/jacob-andreas/ of the Department of Electrical Engineering and Computer Science EECS and MI...

#ai #ai-research #academia
2 weeks ago · ai · - · -

[Paper] LLMSniffer: Detecting LLM-Generated Code via GraphCodeBERT and Supervised Contrastive Learning

The rapid proliferation of Large Language Models (LLMs) in software development has made distinguishing AI-generated code from human-written code a critical cha...

#research #paper #ai #nlp
2 weeks ago · ai · - · -

[Paper] Neurosymbolic Repo-level Code Localization

Code localization is a cornerstone of autonomous software engineering. Recent advancements have achieved impressive performance on real-world issue benchmarks. ...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Combining Convolution and Delay Learning in Recurrent Spiking Neural Networks

Spiking neural networks (SNNs) are rapidly gaining momentum as an alternative to conventional artificial neural networks in resource constrained edge systems. I...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] ECG-Lens: Benchmarking ML & DL Models on PTB-XL Dataset

Automated classification of electrocardiogram (ECG) signals is a useful tool for diagnosing and monitoring cardiovascular diseases. This study compares three tr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

[Paper] Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials

Universal Machine Learning Interatomic Potentials (uMLIPs), pre-trained on massively diverse datasets encompassing inorganic materials and organic molecules acr...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

The 270-Second Rule: How to Cut Claude Code API Costs by 90% with Smart

Key Takeaways - Anthropic's prompt cache has a 5‑minute TTL. - Orchestrator loops running faster than 270 seconds pay ~10 % of full input token costs. What Cha...

#Anthropic #Claude #prompt cache #API cost optimization #LLM #token pricing #orchestrator loops #developer tips
2 weeks ago · ai · - · -

Designing ChatGPT Prompts & Workflows Like a Developer

!Cover image for Designing ChatGPT Prompts & Workflows Like a Developerhttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=a...

#prompt engineering #ChatGPT #LLM #AI tools #developer workflow #prompt design
2 weeks ago · ai · - · -

Profling Claude Converstaions

!Cover image for Profling Claude Converstaionshttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=auto/https%3A%2F%2Fdev-to-...

#Claude #LLM #token limits #AI profiling #cost management #developer tools
2 weeks ago · ai · - · -

[Paper] Frenetic Cat-inspired Particle Optimization: a Markov state-switching hybrid swarm optimizer with application to cardiac digital twinning

Designing optimizers that remain effective under tight evaluation budgets is critical in expensive black-box settings such as cardiac digital twinning. We propo...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] Enhancing Discrete Particle Swarm Optimization for Hypergraph-Modeled Influence Maximization

Influence maximization (IM) is a fundamental problem in complex network analysis, with a wide range of real-world applications. To date, existing approaches to ...

#research #paper #ai
2 weeks ago · ai · - · -

[Paper] Neuromorphic Parameter Estimation for Power Converter Health Monitoring Using Spiking Neural Networks

Always-on converter health monitoring demands sub-mW edge inference, a regime inaccessible to GPU-based physics-informed neural networks. This work separates sp...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

Bringing AI-driven protein-design tools to biologists everywhere

Artificial intelligence is already proving it can accelerate drug development and improve our understanding of disease. But to turn AI into novel treatments we...

#ai #ai-research #academia
2 weeks ago · ai · - · -

[Paper] CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval

Code search, framed as information retrieval (IR), underpins modern software engineering and increasingly powers retrieval-augmented generation (RAG), improving...

#research #paper #ai #machine-learning
2 weeks ago · ai · - · -

My Manus AI Credit Usage After 30 Days — The Data

I tracked every Manus AI task for 30 days. Here’s what I found about credit usage and optimization. Task Categorization | Category | % of Tasks | Avg Credits |...

#Manus AI #credit usage #cost optimization #prompt engineering #AI task modes #A/B testing #quality vs cost
2 weeks ago · ai · - · -

Why Your AI Agent Has Root Access to Everything (And How to Fix It in 3 Lines of Python)

I’ve been building AI agents at work and kept running into the same problem: every framework lets agents call any registered tool with zero safety checks. An ag...

#AI agents #security #prompt injection #Python #tool sandboxing
2 weeks ago · ai · - · -

George Orwell Predicted the Rise of 'AI Slop' in Nineteen Eighty-Four

Categories: Literature, Technology | Date: April 16 th, 2026 | 3 Commentshttps://www.openculture.com/2026/04/how-george-orwell-predicted-the-rise-of-ai-slop.htm...

#artificial intelligence #AI-generated content #George Orwell #1984 #cultural commentary #AI in literature
2 weeks ago · ai · - · -

[Paper] Why Fine-Tuning Encourages Hallucinations and How to Fix It

Large language models are prone to hallucinating factually incorrect statements. A key source of these errors is exposure to new factual information through sup...

#research #paper #ai #machine-learning #nlp
2 weeks ago · ai · - · -

OpenAI starts offering a biology-tuned LLM

Model Tuning and Skepticism To address LLMs’ tendencies toward sycophancy and over‑enthusiasm, OpenAI says it has tuned the model to be more skeptical, making...

#OpenAI #biology-tuned LLM #GPT #model skepticism #hallucination mitigation #drug target prediction #AI in biotech
2 weeks ago · ai · - · -

Understanding Transformers Part 8: Shared Weights in Self-Attention

!Cover image for Understanding Transformers Part 8: Shared Weights in Self-Attentionhttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=...

#transformers #self-attention #shared-weights #deep-learning #neural-networks #nlp
2 weeks ago · ai · - · -

Why Generative AI Isn’t Enough, Enter Agentic Systems

Generative AI vs Agentic AI From Content Creation to Autonomous Action As we move beyond AWS DeepRacer and the “AWS AI League,” the shift from model‑ML design...

#generative AI #agentic AI #AI engineering #autonomous AI #large language models #AWS AI League
2 weeks ago · ai · - · -

Kiwi-chan Progress Report: Steady Mining!

Devlog: Kiwi-chan's Great Oak Adventure – Or, How My LLM Became a Lumberjack Again! Hey tech enthusiasts and fellow pixel pioneers! It's another glorious day i...

#LLM #Minecraft bot #autonomous AI #game AI #Kiwi-chan #AI mining
2 weeks ago · ai · - · -

OpenAI takes aim at Anthropic with beefed-up Codex that gives it more power over your desktop

OpenAI’s Codex Revamp Targets Anthropic’s Claude Code There is currently a low‑grade war between OpenAI and Anthropic over who can release the most convenient...

#OpenAI #Codex #AI coding assistant #desktop automation #AI tools #Anthropic #Claude Code #productivity
2 weeks ago · ai · - · -

OpenAI debuts GPT-Rosalind, a new limited access model for life sciences, and broader Codex plugin on Github

The Journey from Lab Hypothesis to Pharmacy Shelf The journey from a laboratory hypothesis to a pharmacy shelf is one of the most grueling marathons in modern...

#OpenAI #GPT-Rosalind #domain-specific LLM #life sciences AI #biotech research #specialized models #AI for drug discovery
2 weeks ago · ai · - · -

The only way to fight deepfakes is by making deepfakes

I was unsure if my parents would notice that the voice on the other end wasn't mine—or that it was mine, sort of, but it wasn't me. The voice said hello, asked...

#deepfakes #synthetic media #voice cloning #AI security #misinformation
2 weeks ago · ai · - · -

OpenAI drastically updates Codex desktop app to use all other apps on your computer, generate images, preview webpages

OpenAI Codex Desktop Update – 3 Million Weekly Developers OpenAI announced a massive update to its Codex developer environment Mac & Windows desktop apps as it...

#OpenAI #Codex #desktop app #AI assistant #code generation #app integration #productivity tools
2 weeks ago · ai · - · -

Anthropic releases Claude Opus 4.7: How to try it, benchmarks, safety

Claude Opus 4.7 is Anthropic's most intelligent model available to the general public. In a press release, Anthropic noted that Opus 4.7 is not as powerful as C...

#Anthropic #Claude Opus 4.7 #large language model #hybrid reasoning #AI safety #benchmarking #coding assistance
2 weeks ago · ai · - · -

Google Maps taps Gemini to crack down on political vandalism and spammy reviews

!https://www.androidauthority.com/wp-content/uploads/2025/09/google-maps-my-maps-custom-map-example-3.jpg TL;DR - Google Maps users have a history of submitting...

#Google Maps #Gemini #AI moderation #spam detection #political vandalism #review filtering

Newer posts

Older posts