Source

arXiv

5752 posts from this source

Sort:

2 months ago · ai · - · -

[Paper] LAER-MoE: Load-Adaptive Expert Re-layout for Efficient Mixture-of-Experts Training

Expert parallelism is vital for effectively training Mixture-of-Experts (MoE) models, enabling different devices to host distinct experts, with each device proc...

#mixture-of-experts #distributed training #load balancing #model parallelism #GPU optimization
2 months ago · ai · - · -

[Paper] LoRA-based Parameter-Efficient LLMs for Continuous Learning in Edge-based Malware Detection

The proliferation of edge devices has created an urgent need for security solutions capable of detecting malware in real time while operating under strict compu...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] CL API: Real-Time Closed-Loop Interactions with Biological Neural Networks

Biological neural networks (BNNs) are increasingly explored for their rich dynamics, parallelism, and adaptive behavior. Beyond understanding their function as ...

#closed-loop neurotech #real-time API #Python DSL #bio‑neural interfaces #sub‑millisecond latency
2 months ago · ai · - · -

[Paper] Evolution With Purpose: Hierarchy-Informed Optimization of Whole-Brain Models

Evolutionary search is well suited for large-scale biophysical brain modeling, where many parameters with nonlinear interactions and no tractable gradients need...

#evolutionary algorithms #brain modeling #curriculum learning #dynamic mean field #neuroscience AI
2 months ago · ai · - · -

[Paper] Predictive Associative Memory: Retrieval Beyond Similarity Through Temporal Co-occurrence

Current approaches to memory in neural systems rely on similarity-based retrieval: given a query, find the most representationally similar stored state. This as...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] SurfPhase: 3D Interfacial Dynamics in Two-Phase Flows from Sparse Videos

Interfacial dynamics in two-phase flows govern momentum, heat, and mass transfer, yet remain difficult to measure experimentally. Classical techniques face intr...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Diffusion-Pretrained Dense and Contextual Embeddings

In this report, we introduce pplx-embed, a family of multilingual embedding models that employ multi-stage contrastive learning on a diffusion-pretrained langua...

#dense retrieval #multilingual embeddings #diffusion pretraining #contrastive learning #retrieval models
2 months ago · ai · - · -

[Paper] YOR: Your Own Mobile Manipulator for Generalizable Robotics

Recent advances in robot learning have generated significant interest in capable platforms that may eventually approach human-level competence. This interest, c...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Supervised fine-tuning (SFT) on chain-of-thought data is an essential post-training step for reasoning language models. Standard machine learning intuition sugg...

#chain-of-thought #fine-tuning #large language models #data efficiency
2 months ago · ai · - · -

[Paper] Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling

Preference optimization for diffusion and flow-matching models relies on reward functions that are both discriminatively robust and computationally efficient. V...

#diffusion-models #latent-reward-modeling #generative-AI #preference-learning #computer-vision
2 months ago · ai · - · -

[Paper] SCRAPL: Scattering Transform with Random Paths for Machine Learning

The Euclidean distance between wavelet scattering transform coefficients (known as paths) provides informative gradients for perceptual quality assessment of de...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] GENIUS: Generative Fluid Intelligence Evaluation Suite

Unified Multimodal Models (UMMs) have shown remarkable progress in visual generation. Yet, existing benchmarks predominantly assess Crystallized Intelligence, w...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows

Hierarchical goal-conditioned reinforcement learning (H-GCRL) provides a powerful framework for tackling complex, long-horizon tasks by decomposing them into st...

#reinforcement learning #hierarchical RL #normalizing flows #data-efficient learning #offline RL
2 months ago · ai · - · -

[Paper] LCIP: Loss-Controlled Inverse Projection of High-Dimensional Image Data

Projections (or dimensionality reduction) methods P aim to map high-dimensional data to typically 2D scatterplots for visual exploration. Inverse projection met...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] TabICLv2: A better, faster, scalable, and open tabular foundation model

Tabular foundation models, such as TabPFNv2 and TabICL, have recently dethroned gradient-boosted trees at the top of predictive benchmarks, demonstrating the va...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Weight Decay Improves Language Model Plasticity

The prevailing paradigm in large language model (LLM) development is to pretrain a base model, then perform further training to improve performance and model be...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight

As LLM-based agents increasingly operate in high-stakes domains with real-world consequences, ensuring their behavioral safety becomes paramount. The dominant o...

#neuro-symbolic #formal verification #LLM oversight #Dafny #Z3
2 months ago · ai · - · -

[Paper] Just on Time: Token-Level Early Stopping for Diffusion Language Models

Diffusion language models generate text through iterative refinement, a process that is often computationally inefficient because many tokens reach stability lo...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] From Circuits to Dynamics: Understanding and Stabilizing Failure in 3D Diffusion Transformers

Reliable surface completion from sparse point clouds underpins many applications spanning content creation and robotics. While 3D diffusion transformers attain ...

#diffusion models #3d generation #transformer stability #mechanistic interpretability #spectral entropy
2 months ago · devops · - · -

[Paper] Min-Sum Uniform Coverage Problem by Autonomous Mobile Robots

We study the min-sum uniform coverage problem for a swarm of n mobile robots on a given finite line segment and on a circle having finite positive radius, where...

#research #paper #devops
2 months ago · ai · - · -

[Paper] PhyCritic: Multimodal Critic Models for Physical AI

With the rapid development of large multimodal models, reliable judge and critic models have become essential for open-ended evaluation and preference alignment...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] HairWeaver: Few-Shot Photorealistic Hair Motion Synthesis with Sim-to-Real Guided Video Diffusion

We present HairWeaver, a diffusion-based pipeline that animates a single human image with realistic and expressive hair dynamics. While existing methods success...

#diffusion models #few-shot synthesis #hair animation #computer vision
2 months ago · ai · - · -

[Paper] Learning to Compose for Cross-domain Agentic Workflow Generation

Automatically generating agentic workflows -- executable operator graphs or codes that orchestrate reasoning, verification, and repair -- has become a practical...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] TEGRA: Text Encoding With Graph and Retrieval Augmentation for Misinformation Detection

Misinformation detection is a critical task that can benefit significantly from the integration of external knowledge, much like manual fact-checking. In this w...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] FastFlow: Accelerating The Generative Flow Matching Models with Bandit Inference

Flow-matching models deliver state-of-the-art fidelity in image and video generation, but the inherent sequential denoising process renders them slower. Existin...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] GameDevBench: Evaluating Agentic Capabilities Through Game Development

Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind. A key challenge is the scarcity of evaluation testbeds tha...

#benchmark #game-development #multimodal-ai #coding-agents #research
2 months ago · ai · - · -

[Paper] Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away

Reinforcement learning (RL) based post-training for explicit chain-of-thought (e.g., GRPO) improves the reasoning ability of multimodal large-scale reasoning mo...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Can Large Language Models Make Everyone Happy?

Misalignment in Large Language Models (LLMs) refers to the failure to simultaneously satisfy safety, value, and cultural dimensions, leading to behaviors that d...

#large language models #misalignment #benchmark #AI safety #NLP
2 months ago · ai · - · -

[Paper] Direct Learning of Calibration-Aware Uncertainty for Neural PDE Surrogates

Neural PDE surrogates are often deployed in data-limited or partially observed regimes where downstream decisions depend on calibrated uncertainty in addition t...

#uncertainty quantification #neural operators #PDE surrogates #model calibration #machine learning research
2 months ago · ai · - · -

[Paper] DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

In the current landscape of Large Language Models (LLMs), the curation of large-scale, high-quality training data is a primary driver of model performance. A ke...

#reinforcement-learning #LLM-fine-tuning #data-pipelines #research-paper
2 months ago · ai · - · -

[Paper] First International StepUP Competition for Biometric Footstep Recognition: Methods, Results and Remaining Challenges

Biometric footstep recognition, based on a person's unique pressure patterns under their feet during walking, is an emerging field with growing applications in ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] SteuerLLM: Local specialized large language model for German tax law analysis

Large language models (LLMs) demonstrate strong general reasoning and language understanding, yet their performance degrades in domains governed by strict forma...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Chatting with Images for Introspective Visual Thinking

Current large vision-language models (LVLMs) typically rely on text-only reasoning based on a single-pass visual encoding, which often leads to loss of fine-gra...

#vision-language models #dynamic vision encoder #multimodal AI #reinforcement learning #research paper
2 months ago · ai · - · -

[Paper] PuriLight: A Lightweight Shuffle and Purification Framework for Monocular Depth Estimation

We propose PuriLight, a lightweight and efficient framework for self-supervised monocular depth estimation, to address the dual challenges of computational effi...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Fine-Tuning GPT-5 for GPU Kernel Generation

Developing efficient GPU kernels is essential for scaling modern AI systems, yet it remains a complex task due to intricate hardware architectures and the need ...

#GPT-5 #reinforcement learning #GPU kernel generation #Triton #code synthesis
2 months ago · ai · - · -

[Paper] FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

Agents powered by large language models (LLMs) are increasingly adopted in the software industry, contributing code as collaborators or even autonomous develope...

#LLM coding agents #benchmark #software feature development #evaluation #AI research
2 months ago · software · - · -

[Paper] Deriving and Validating Requirements Engineering Principles for Large-Scale Agile Development: An Industrial Longitudinal Study

In large scale agile systems development, the lack of a unified requirements engineering (RE) process is a major challenge, exacerbated by the absence of high l...

#requirements engineering #agile development #software process #empirical study #industry research
2 months ago · ai · - · -

[Paper] Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search

Multi-task policy search is a challenging problem because policies are required to generalize beyond training cases. Curriculum learning has proven to be effect...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] PELLI: Framework to effectively integrate LLMs for quality software generation

Recent studies have revealed that when LLMs are appropriately prompted and configured, they demonstrate mixed results. Such results often meet or exceed the bas...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] VulReaD: Knowledge-Graph-guided Software Vulnerability Reasoning and Detection

Software vulnerability detection (SVD) is a critical challenge in modern systems. Large language models (LLMs) offer natural-language explanations alongside pre...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Amortized Inference of Neuron Parameters on Analog Neuromorphic Hardware

Our work utilized a non-sequential simulation-based inference algorithm to provide an amortized neural density estimator, which approximates the posterior distr...

#amortized inference #neuromorphic hardware #simulation-based inference #neural density estimation #AdEx neuron model
2 months ago · ai · - · -

[Paper] Amortized Inference of Neuron Parameters on Analog Neuromorphic Hardware

Our work utilized a non-sequential simulation-based inference algorithm to provide an amortized neural density estimator, which approximates the posterior distr...

#neuromorphic computing #simulation-based inference #amortized inference #neural density estimation #AdEx neuron model
2 months ago · software · - · -

[Paper] Hidden Licensing Risks in the LLMware Ecosystem

Large Language Models (LLMs) are increasingly integrated into software systems, giving rise to a new class of systems referred to as LLMware. Beyond traditional...

#research #paper #software
2 months ago · devops · - · -

[Paper] BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

The rapid growth of large language model (LLM) deployments has made cost-efficient serving systems essential. Recent efforts to enhance system cost-efficiency a...

#LLM serving #bayesian optimization #cost efficiency #heterogeneous GPUs #query routing
2 months ago · ai · - · -

[Paper] A Unified Experimental Architecture for Informative Path Planning: from Simulation to Deployment with GuadalPlanner

The evaluation of informative path planning algorithms for autonomous vehicles is often hindered by fragmented execution pipelines and limited transferability b...

#informative path planning #ROS 2 #robotics simulation #open-source framework #autonomous vehicles
2 months ago · ai · - · -

[Paper] Assessing Vision-Language Models for Perception in Autonomous Underwater Robotic Software

Autonomous Underwater Robots (AURs) operate in challenging underwater environments, including low visibility and harsh water conditions. Such conditions present...

#vision-language models #underwater robotics #model uncertainty #perception benchmark #multimodal AI
2 months ago · ai · - · -

[Paper] ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents

Large Language Model (LLM) agents have shown promising potential in automating Instructional Systems Design (ISD), a systematic approach to developing education...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] MindPilot: Closed-loop Visual Stimulation Optimization for Brain Modulation with EEG-guided Diffusion

Whereas most brain-computer interface research has focused on decoding neural signals into behavior or intent, the reverse challenge-using controlled stimuli to...

#research #paper #ai

Newer posts

Older posts