ai — Page 26 | EUNO.NEWS

Sort:

3 weeks ago · ai · - · -

[Paper] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Text-to-video diffusion models have enabled open-ended video synthesis, but often struggle with generating the correct number of objects specified in a prompt. ...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] E-3DPSM: A State Machine for Event-Based Egocentric 3D Human Pose Estimation

Event cameras offer multiple advantages in monocular egocentric 3D human pose estimation from head-mounted devices, such as millisecond temporal resolution, hig...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds

Robotic manipulation with deformable objects represents a data-intensive regime in embodied learning, where shape, contact, and topology co-evolve in ways that ...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction

This paper addresses the task of large-scale 3D scene reconstruction from long video sequences. Recent feed-forward reconstruction models have shown promising r...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts

Multimodal Mixture-of-Experts (MoE) models have achieved remarkable performance on vision-language tasks. However, we identify a puzzling phenomenon termed Seei...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Text-to-Audio-Video (T2AV) generation is rapidly becoming a core interface for media creation, yet its evaluation remains fragmented. Existing benchmarks largel...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Group Relative Policy Optimization (GRPO) has emerged as the de facto Reinforcement Learning (RL) objective driving recent advancements in Multimodal Large Lang...

#research #paper #ai #machine-learning #nlp #computer-vision
3 weeks ago · ai · - · -

[Paper] Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

Visual decoding from brain signals is a key challenge at the intersection of computer vision and neuroscience, requiring methods that bridge neural representati...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] RewardFlow: Generate Images by Optimizing What You Reward

We introduce RewardFlow, an inversion-free framework that steers pretrained diffusion and flow-matching models at inference time through multi-reward Langevin d...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] PSI: Shared State as the Missing Layer for Coherent AI-Generated Instruments in Personal AI Agents

Personal AI tools can now be generated from natural-language requests, but they often remain isolated after creation. We present PSI, a shared-state architectur...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

On-policy distillation (OPD) trains student models under their own induced distribution while leveraging supervision from stronger teachers. We identify a failu...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

Google’s Gemini AI can answer your questions with 3D models and simulations

Google's latest upgrade for Gemini will allow the chatbot to generate interactive 3D models and simulations in response to your questions. With the new feature,...

#Google #Gemini #AI chatbot #3D models #interactive simulations #generative AI #LLM #visualization
3 weeks ago · ai · - · -

[Paper] Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest

Today's large language models (LLMs) are trained to align with user preferences through methods such as reinforcement learning. Yet models are beginning to be d...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] What Drives Representation Steering? A Mechanistic Case Study on Steering Refusal

Applying steering vectors to large language models (LLMs) is an efficient and effective model alignment technique, but we lack an interpretable explanation for ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] ClawBench: Can AI Agents Complete Everyday Online Tasks?

AI agents may be able to automate your inbox, but can they automate other routine aspects of your life? Everyday online tasks offer a realistic yet unsolved tes...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts

Large language models (LLMs) can struggle to memorize factual knowledge in their parameters, often leading to hallucinations and poor performance on knowledge-i...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Large language models (LLMs) can perform remarkably complex tasks, yet the fine-grained details of how these capabilities emerge during pretraining remain poorl...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Differentially Private Language Generation and Identification in the Limit

We initiate the study of language generation in the limit, a model recently introduced by Kleinberg and Mullainathan [KM24], under the constraint of differentia...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] sciwrite-lint: Verification Infrastructure for the Age of Science Vibe-Writing

Science currently offers two options for quality assurance, both inadequate. Journal gatekeeping claims to verify both integrity and contribution, but actually ...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] PIArena: A Platform for Prompt Injection Evaluation

Prompt injection attacks pose serious security risks across a wide range of real-world applications. While receiving increasing attention, the community faces a...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] The Impact of Dimensionality on the Stability of Node Embeddings

Previous work has established that neural network-based node embeddings return different outcomes when trained with identical parameters on the same dataset, ju...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions

Reinforcement Learning with Verifiable Rewards (RLVR) has significantly improved large language model (LLM) reasoning in formal domains such as mathematics and ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Quantization Impact on the Accuracy and Communication Efficiency Trade-off in Federated Learning for Aerospace Predictive Maintenance

Federated learning (FL) enables privacy-preserving predictive maintenance across distributed aerospace fleets, but gradient communication overhead constrains de...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Persistence-Augmented Neural Networks

Topological Data Analysis (TDA) provides tools to describe the shape of data, but integrating topological features into deep learning pipelines remains challeng...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis

Despite significant advances in Large Reasoning Models (LRMs) driven by reinforcement learning with verifiable rewards (RLVR), this paradigm is fundamentally li...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

Google Gemma 4 in your pocket: How to run the latest AI fully offline

!https://www.androidauthority.com/wp-content/uploads/2024/02/gemma-header.jpg - Google’s AI Edge Gallery app is now officially available on the Google Play Stor...

#Gemma 4 #offline AI #Google AI Edge Gallery #on-device inference #mobile AI
3 weeks ago · ai · - · -

[Paper] Multi-Modal Learning meets Genetic Programming: Analyzing Alignment in Latent Space Optimization

Symbolic regression (SR) aims to discover mathematical expressions from data, a task traditionally tackled using Genetic Programming (GP) through combinatorial ...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Robust Multi-Objective Optimization for Bicycle Rebalancing in Shared Mobility Systems

Dock-based bike-sharing systems exhibit spatial imbalances between bicycle supply and user demand, often addressed through overnight truck-based rebalancing. Th...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] CIAO - Code In Architecture Out - Automated Software Architecture Documentation with Large Language Models

Software architecture documentation is essential for system comprehension, yet it is often unavailable or incomplete. While recent LLM-based techniques can gene...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Introducing Echo Networks for Computational Neuroevolution

For applications on the extreme edge, minimal networks of only a few dozen artificial neurons for event detection and classification in discrete time signals wo...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

Strength and Destiny Collide: ‘Samson: A Tyndalston Story’ Arrives in the Cloud

A timeless story of grit, faith and rebellion takes center stage as Samson: A Tyndalston Story joins the GeForce NOWhttps://www.nvidia.com/en-us/geforce-now/ li...

#ai #gpu #nvidia
3 weeks ago · ai · - · -

New technique makes AI models leaner and faster while they’re still learning

Compressing State‑Space Models During Training Training a large artificial‑intelligence model is expensive—not only in dollars, but also in time, energy, and c...

#model compression #training efficiency #state-space models #control theory #CompreSSM #MIT CSAIL #AI hardware optimization #large language models
3 weeks ago · ai · - · -

[Paper] Exploration of Pareto-preserving Search Space Transformations in Multi-objective Test Functions

Benchmark problems are an important tool for gaining understanding of optimization algorithms. Since algorithms often aim to perform well on benchmarks, biases ...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] LegoDiffusion: Micro-Serving Text-to-Image Diffusion Workflows

Text-to-image generation executes a diffusion workflow comprising multiple models centered on a base diffusion model. Existing serving systems treat each workfl...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Internal noise in deep neural networks: interplay of depth, neuron number, and noise injection step

This paper examines the influence of internal Gaussian noise on the performance of deep feedforward neural networks, focusing on the role of the noise injection...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Analysis of Search Heuristics in the Multi-Armed Bandit Setting

We consider the classic Multi-Armed Bandit setting to understand the exploration/exploitation tradeoffs made by different search heuristics. Since many search h...

#research #paper #ai
3 weeks ago · ai · - · -

The Tool Harness Meta Didnt Tell You About

Meta just dropped Muse Spark, their first major model release in a year. The benchmarks show it competitive with Claude Opus 4.6 and GPT 5.4, but that isn’t the...

#Meta #Muse Spark #LLM #AI tools #browser tool #content search #code interpreter #AI model benchmarks #large language model
3 weeks ago · ai · - · -

Google makes it easy to deepfake yourself

Overview YouTube Shorts is rolling out a new AI‑powered feature that gives creators an easy way to realistically clone themselves on camera. The launch, hinted...

#YouTube Shorts #deepfake #AI avatar #generative AI #synthetic media #video creation
3 weeks ago · ai · - · -

I stopped writing prompts and started writing Python

The Prompt Chaos For a year I treated LLMs like a command line: type instructions, pray for output, tweak wording, add “IMPORTANT:”, move sentences around like...

#LLM #prompt engineering #DSPy #Python #Stanford NLP #AI tooling
3 weeks ago · ai · - · -

Claude mixes up who said what and that's not OK

The bug Claude sometimes sends messages to itself and then thinks those messages came from the user. This is the worst bug I’ve seen from an LLM provider, but...

#Claude #LLM #AI bug #hallucination #AI safety #prompt injection
3 weeks ago · ai · - · -

[Paper] LogAct: Enabling Agentic Reliability via Shared Logs

Agents are LLM-driven components that can mutate environments in powerful, arbitrary ways. Extracting guarantees for the execution of agents in production envir...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[그게 뭔가요] 뮤즈 스파크, AI 경쟁 탈락했던 메타의 반전 카드

메타가 새로운 AI 모델 ‘뮤즈 스파크Muse Spark’를 공개했다. 지난해 수조 원 규모의 AI 조직 개편과 인재 영입 이후 처음 선보이는 결과물이다. 뮤즈 스파크는 무엇이며, 왜 주목받고, 어떤 평가를 받고 있는지 살펴본다. 뮤즈 스파크, 어떤 모델인가 뮤즈 스파크는 메타의 새 A...

#Meta #Muse Spark #multimodal AI #vision-language model #native multimodal inference #large language model #AI research
3 weeks ago · ai · - · -

[Paper] Kuramoto Oscillatory Phase Encoding: Neuro-inspired Synchronization for Improved Learning Efficiency

Spatiotemporal neural dynamics and oscillatory synchronization are widely implicated in biological information processing and have been hypothesized to support ...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] PyVRP$^+$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems

Designing high-performing metaheuristics for NP-hard combinatorial optimization problems, such as the Vehicle Routing Problem (VRP), remains a significant chall...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

Gemini meets NotebookLM is Google’s latest powerful integration

!https://www.androidauthority.com/wp-content/uploads/2024/02/Google-Gemini-logo-on-smartphone-stock-photo-7.jpg TL;DR - Google is adding Notebooks to Gemini to...

#Google Gemini #NotebookLM #AI integration #AI productivity tools #Google AI #Notebooks feature #chat AI
3 weeks ago · ai · - · -

Meta's New Model Has 16 Tools. Here's What They Do.

Overview Meta has just released Muse Spark, its first new model since Llama 4, arriving about a year after the previous release. Benchmarks place it alongside...

#Meta #Muse Spark #AI model #tool integration #Python sandbox #pandas #numpy #matplotlib #scikit-learn #OpenCV #image generation #Segment Anything #AI tools
3 weeks ago · ai · - · -

Boost Training Goodput: How Continuous Checkpointing Optimizes Reliability in Orbax and MaxText

'MARCH 31, 2026

#continuous checkpointing #Orbax #MaxText #model training reliability #training performance #fault tolerance #Google AI tools
3 weeks ago · ai · - · -

Closing the knowledge gap with agent skills

Large language models LLMs have fixed knowledge, being trained at a specific point in time. Software engineering practices are fast‑paced and change often, with...

#large language models #agent skills #knowledge gap #software engineering #Gemini API #DeepMind #SDK updates

Newer posts

Older posts