paper — Page 55 | EUNO.NEWS

Sort:

1 month ago · ai · - · -

[Paper] One-step Latent-free Image Generation with Pixel Mean Flows

Modern diffusion/flow-based models for image generation typically exhibit two core characteristics: (i) using multi-step sampling, and (ii) operating in a laten...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Discovering Hidden Gems in Model Repositories

Public repositories host millions of fine-tuned models, yet community usage remains disproportionately concentrated on a small number of foundation checkpoints....

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Hybrid Transformer architectures, which combine softmax attention blocks and recurrent neural networks (RNNs), have shown a desirable performance-throughput tra...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Exploring Reasoning Reward Model for Agents

Agentic Reinforcement Learning (Agentic RL) has achieved notable success in enabling agents to perform complex reasoning and tool use. However, most methods sti...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] UEval: A Benchmark for Unified Multimodal Generation

We introduce UEval, a benchmark to evaluate unified models, i.e., models capable of generating both images and text. UEval comprises 1,000 expert-curated questi...

#research #paper #ai #nlp #computer-vision
1 month ago · ai · - · -

[Paper] DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Manipulating dynamic objects remains an open challenge for Vision-Language-Action (VLA) models, which, despite strong generalization in static manipulation, str...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Late Breaking Results: Conversion of Neural Networks into Logic Flows for Edge Computing

Neural networks have been successfully applied in various resource-constrained edge devices, where usually central processing units (CPUs) instead of graphics p...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions

Large Vision-Language Models (VLMs) often answer classic visual illusions 'correctly' on original images, yet persist with the same responses when illusion fact...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] DynaWeb: Model-Based Reinforcement Learning of Web Agents

The development of autonomous web agents, powered by Large Language Models (LLMs) and reinforcement learning (RL), represents a significant step towards general...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

Due to limited supervised training data, large language models (LLMs) are typically pre-trained via a self-supervised 'predict the next word' objective on a vas...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Audio-Visual Foundation Models, which are pretrained to jointly generate sound and visual content, have recently shown an unprecedented ability to model multi-m...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data

In pruning, the Lottery Ticket Hypothesis posits that large networks contain sparse subnetworks, or winning tickets, that can be trained in isolation to match t...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Reasoning-oriented Large Language Models (LLMs) have achieved remarkable progress with Chain-of-Thought (CoT) prompting, yet they remain fundamentally limited b...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training

Matrix functions such as square root, inverse roots, and orthogonalization play a central role in preconditioned gradient methods for neural network training. T...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] StepShield: When, Not Whether to Intervene on Rogue Agents

Existing agent safety benchmarks report binary accuracy, conflating early intervention with post-mortem analysis. A detector that flags a violation at step 8 en...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Full-image relighting remains a challenging problem due to the difficulty of collecting large-scale structured paired data, the difficulty of maintaining physic...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Early and Prediagnostic Detection of Pancreatic Cancer from Computed Tomography

Pancreatic ductal adenocarcinoma (PDAC), one of the deadliest solid malignancies, is often detected at a late and inoperable stage. Retrospective reviews of pre...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference

Large Language Models (LLMs) deliver state-of-the-art performance on complex reasoning tasks, but their inference costs limit deployment at scale. Small Languag...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] SMOG: Scalable Meta-Learning for Multi-Objective Bayesian Optimization

Multi-objective optimization aims to solve problems with competing objectives, often with only black-box access to a problem and a limited budget of measurement...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems

Frontier large language models (LLMs) excel as autonomous agents in many domains, yet they remain untested in complex enterprise systems where hidden workflows ...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the sta...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers

Current generative video models excel at producing novel content from text and image prompts, but leave a critical gap in editing existing pre-recorded videos, ...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] Creative Image Generation with Diffusion Model

Creative image generation has emerged as a compelling area of research, driven by the need to produce novel and high-quality images that expand the boundaries o...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clin...

#research #paper #ai #nlp

Newer posts

Older posts