research — Page 101

Sort:

2 months ago · ai · - · -

[Paper] Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

This paper examines the exploration-exploitation trade-off in reinforcement learning with verifiable rewards (RLVR), a framework for improving the reasoning of ...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning

Standard practice across domains from robotics to language is to first pretrain a policy on a large-scale demonstration dataset, and then finetune this policy, ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] SFTok: Bridging the Performance Gap in Discrete Tokenizers

Recent advances in multimodal models highlight the pivotal role of image tokenization in high-resolution image generation. By compressing images into compact la...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos

Prior works on 3D hand trajectory prediction are constrained by datasets that decouple motion from semantic supervision and by models that weakly link reasoning...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] How Good is Post-Hoc Watermarking With Language Model Rephrasing?

Generation-time text watermarking embeds statistical signals into text for traceability of AI-generated content. We explore *post-hoc watermarking* where an LLM...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] In-Context Algebra

We investigate the mechanisms that arise when transformers are trained to solve arithmetic on sequences where tokens are variables whose meaning is determined o...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Impacts of Racial Bias in Historical Training Data for News AI

AI technologies have rapidly moved into business and research applications that involve large text corpora, including computational journalism research and news...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image

Reward models (RMs) are essential for training large language models (LLMs), but remain underexplored for omni models that handle interleaved image and text seq...

#research #paper #ai #nlp #computer-vision
2 months ago · software · - · -

[Paper] Checking the HAL Interface Specification Continuously, Right from the Start

The correct use of a Hardware Abstraction Layer (HAL) interface in embedded applications is crucial to prevent malfunctions, crashes, or even hardware damage. S...

#research #paper #software
2 months ago · ai · - · -

[Paper] LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation

Video Large Language Models (VLLMs) unlock world-knowledge-aware video understanding through pretraining on internet-scale data and have already shown promise o...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning

Equipping large language models (LLMs) with search engines via reinforcement learning (RL) has emerged as an effective approach for building search agents. Howe...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Cartesian-nj: Extending e3nn to Irreducible Cartesian Tensor Product and Contracion

Equivariant atomistic machine learning models have brought substantial gains in both extrapolation capability and predictive accuracy. Depending on the basis of...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies

A significant challenge for robot learning research is our ability to accurately measure and compare the performance of robot policies. Benchmarking in robotics...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Training Together, Diagnosing Better: Federated Learning for Collagen VI-Related Dystrophies

The application of Machine Learning (ML) to the diagnosis of rare diseases, such as collagen VI-related dystrophies (COL6-RD), is fundamentally limited by the s...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] On the Universal Representation Property of Spiking Neural Networks

Inspired by biology, spiking neural networks (SNNs) process information via discrete spikes over time, offering an energy-efficient alternative to the classical...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] LLMCache: Layer-Wise Caching Strategies for Accelerated Reuse in Transformer Inference

Transformer-based language models have achieved remarkable performance across a wide range of tasks, yet their high inference latency poses a significant challe...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels

Prosody -- the melody of speech -- conveys critical information often not captured by the words or text of a message. In this paper, we propose an information-t...

#research #paper #ai #nlp
2 months ago · software · - · -

[Paper] Toward Systematic Counterfactual Fairness Evaluation of Large Language Models: The CAFFE Framework

Nowadays, Large Language Models (LLMs) are foundational components of modern software systems. As their influence grows, concerns about fairness have become inc...

#research #paper #software
2 months ago · ai · - · -

[Paper] Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning

Reactive jammers pose a severe security threat to robotic-swarm networks by selectively disrupting inter-agent communications and undermining formation integrit...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Delay-Aware Multi-Stage Edge Server Upgrade with Budget Constraint

In this paper, the Multi-stage Edge Server Upgrade (M-ESU) is proposed as a new network planning problem, involving the upgrading of an existing multi-access ed...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Inside Out: Uncovering How Comment Internalization Steers LLMs for Better or Worse

While comments are non-functional elements of source code, Large Language Models (LLM) frequently rely on them to perform Software Engineering (SE) tasks. Yet, ...

#research #paper #software
2 months ago · software · - · -

[Paper] An Empirical Study of the Realism of Mutants in Deep Learning

Mutation analysis is a well-established technique for assessing test quality in the traditional software development paradigm by injecting artificial faults int...

#research #paper #software
2 months ago · devops · - · -

[Paper] Efficient Bitcoin Meta-Protocol Transaction and Data Discovery Through nLockTime Field Repurposing

We describe the Lockchain Protocol, a lightweight Bitcoin meta-protocol that enables highly efficient transaction discovery at zero marginal block space cost, a...

#research #paper #devops
2 months ago · ai · - · -

[Paper] ParamExplorer: A framework for exploring parameters in generative art

Generative art systems often involve high-dimensional and complex parameter spaces in which aesthetically compelling outputs occupy only small, fragmented regio...

#research #paper #ai #machine-learning

Newer posts

Older posts