Source

arXiv

5752 posts from this source

Sort:

3 months ago · ai · - · -

[Paper] Protein Autoregressive Modeling via Multiscale Structure Generation

We present protein autoregressive modeling (PAR), the first multi-scale autoregressive framework for protein backbone generation via coarse-to-fine next-scale p...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Contrastive Continual Learning for Model Adaptability in Internet of Things

Internet of Things (IoT) deployments operate in nonstationary, dynamic environments where factors such as sensor drift, evolving user behavior, and heterogeneou...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Rethinking the Trust Region in LLM Reinforcement Learning

Reinforcement learning (RL) has become a cornerstone for fine-tuning Large Language Models (LLMs), with Proximal Policy Optimization (PPO) serving as the de fac...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] CoWTracker: Tracking by Warping instead of Correlation

Dense point tracking is a fundamental problem in computer vision, with applications ranging from video analysis to robotic manipulation. State-of-the-art tracke...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] PerpetualWonder: Long-Horizon Action-Conditioned 4D Scene Generation

We introduce PerpetualWonder, a hybrid generative simulator that enables long-horizon, action-conditioned 4D scene generation from a single image. Current works...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Laminating Representation Autoencoders for Efficient Diffusion

Recent work has shown that diffusion models can generate high-quality images by operating directly on SSL patch features rather than pixel-space latents. Howeve...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Multi-layer Cross-Attention is Provably Optimal for Multi-modal In-context Learning

Recent progress has rapidly advanced our understanding of the mechanisms underlying in-context learning in modern attention-based neural networks. However, exis...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism

Large language models have transformed many applications but remain expensive to train. Sparse Mixture of Experts (MoE) addresses this through conditional compu...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] CRoSS: A Continual Robotic Simulation Suite for Scalable Reinforcement Learning with High Task Diversity and Realistic Physics Simulation

Continual reinforcement learning (CRL) requires agents to learn from a sequence of tasks without forgetting previously acquired policies. In this work, we intro...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Subliminal Effects in Your Data: A General Mechanism via Log-Linearity

Training modern large language models (LLMs) has become a veritable smorgasbord of algorithms and datasets designed to elicit particular behaviors, making it cr...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] When LLaVA Meets Objects: Token Composition for Vision-Language-Models

Current autoregressive Vision Language Models (VLMs) usually rely on a large number of visual tokens to represent images, resulting in a need for more compute e...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] From Evaluation to Design: Using Potential Energy Surface Smoothness Metrics to Guide Machine Learning Interatomic Potential Architectures

Machine Learning Interatomic Potentials (MLIPs) sometimes fail to reproduce the physical smoothness of the quantum potential energy surface (PES), leading to er...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation

From generating headlines to fabricating news, the Large Language Models (LLMs) are typically assessed by their final outputs, under the safety assumption that ...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say 'I Don't Know'

Large language models often struggle to recognize their knowledge limits in closed-book question answering, leading to confident hallucinations. While decompose...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] The Key to State Reduction in Linear Attention: A Rank-based Perspective

Linear attention offers a computationally efficient yet expressive alternative to softmax attention. However, recent empirical results indicate that the state o...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] PDF-HR: Pose Distance Fields for Humanoid Robots

Pose and motion priors play a crucial role in humanoid robotics. Although such priors have been widely studied in human motion recovery (HMR) domain with a rang...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] El Agente Quntur: A research collaborator agent for quantum chemistry

Quantum chemistry is a foundational enabling tool for the fields of chemistry, materials science, computational biology and others. Despite of its power, the pr...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] El Agente Estructural: An Artificially Intelligent Molecular Editor

We present El Agente Estructural, a multimodal, natural-language-driven geometry-generation and manipulation agent for autonomous chemistry and molecular modell...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Fluid Representations in Reasoning Models

Reasoning language models, which generate long chains of thought, dramatically outperform non-reasoning language models on abstract problems. However, the inter...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] LitS: A novel Neighborhood Descriptor for Point Clouds

With the advancement of 3D scanning technologies, point clouds have become fundamental for representing 3D spatial data, with applications that span across vari...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] It's not a Lottery, it's a Race: Understanding How Gradient Descent Adapts the Network's Capacity to the Task

Our theoretical understanding of neural networks is lagging behind their empirical success. One of the important unexplained phenomena is why and how, during th...

#research #paper #ai #machine-learning #computer-vision
3 months ago · software · - · -

[Paper] When Code Becomes Abundant: Redefining Software Engineering Around Orchestration and Verification

Software Engineering (SE) faces simultaneous pressure from AI automation (reducing code production costs) and hardware-energy constraints (amplifying failure co...

#research #paper #software
3 months ago · software · - · -

[Paper] Do Developers Read Type Information? An Eye-Tracking Study on TypeScript

Statically-annotated types have been shown to aid developers in a number of programming tasks, and this benefit holds true even when static type checking is not...

#research #paper #software
3 months ago · ai · - · -

[Paper] Toward Reliable and Explainable Nail Disease Classification: Leveraging Adversarial Training and Grad-CAM Visualization

Human nail diseases are gradually observed over all age groups, especially among older individuals, often going ignored until they become severe. Early detectio...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] XtraLight-MedMamba for Classification of Neoplastic Tubular Adenomas

Accurate risk stratification of precancerous polyps during routine colonoscopy screenings is essential for lowering the risk of developing colorectal cancer (CR...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Horizon-LM: A RAM-Centric Architecture for LLM Training

The rapid growth of large language models (LLMs) has outpaced the evolution of single-GPU hardware, making model scale increasingly constrained by memory capaci...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

True self-evolution requires agents to act as lifelong learners that internalize novel experiences to solve future problems. However, rigorously measuring this ...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Omni-modal Large Language Models (Omni-LLMs) have demonstrated strong capabilities in audio-video understanding tasks. However, their reliance on long multimoda...

#research #paper #ai #nlp
3 months ago · software · - · -

[Paper] Beyond the Control Equations: An Artifact Study of Implementation Quality in Robot Control Software

A controller -- a software module managing hardware behavior -- is a key component of a typical robot system. While control theory gives safety guarantees for s...

#research #paper #software
3 months ago · software · - · -

[Paper] Demonstrating ARG-V's Generation of Realistic Java Benchmarks for SV-COMP

The SV-COMP competition provides a state-of-the-art platform for evaluating software verification tools on a standardized set of verification tasks. Consequentl...

#research #paper #software
3 months ago · ai · - · -

[Paper] Speaker-Aware Simulation Improves Conversational Speech Recognition

Automatic speech recognition (ASR) for conversational speech remains challenging due to the limited availability of large-scale, well-annotated multi-speaker di...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation

Building machine translation (MT) systems for low-resource languages is notably difficult due to the scarcity of high-quality data. Although Large Language Mode...

#machine translation #in-context learning #low-resource languages #large language models #NLP research
3 months ago · ai · - · -

[Paper] Impact of diversity on bounded archives for multi-objective local search

This work tackles two critical challenges related to the development of metaheuristics for Multi-Objective Optimization Problems (MOOPs): the exponential growth...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Supporting software engineering tasks with agentic AI: Demonstration on document retrieval and test scenario generation

The introduction of large language models ignited great retooling and rethinking of the software development models. The ensuing response of software engineerin...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Evolutionary Mapping of Neural Networks to Spatial Accelerators

Spatial accelerators, composed of arrays of compute-memory integrated units, offer an attractive platform for deploying inference workloads with low latency and...

#research #paper #ai
3 months ago · it · - · -

[Paper] A TEE-based Approach for Preserving Data Secrecy in Process Mining with Decentralized Sources

Process mining techniques enable organizations to gain insights into their business processes through the analysis of execution records (event logs) stored by i...

#trusted execution environment #process mining #data privacy #secure multi‑party computation #Intel SGX
3 months ago · devops · - · -

[Paper] Six Times to Spare: LDPC Acceleration on DGX Spark for AI-Native Open RAN

Low-density parity-check (LDPC) decoding is one of the most computationally intensive kernels in the 5G New Radio (NR) physical layer and must complete within a...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Towards Structured, State-Aware, and Execution-Grounded Reasoning for Software Engineering Agents

Software Engineering (SE) agents have shown promising abilities in supporting various SE tasks. Current SE agents remain fundamentally reactive, making decision...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] Entanglement improves coordination in distributed systems

Coordination in distributed systems is often hampered by communication latency, which degrades performance. Quantum entanglement offers fundamentally stronger c...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Real-time processing of analog signals on accelerated neuromorphic hardware

Sensory processing with neuromorphic systems is typically done by using either event-based sensors or translating input signals to spikes before presenting them...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Trust The Typical

Current approaches to LLM safety fundamentally rely on a brittle cat-and-mouse game of identifying and blocking known threats via guardrails. We argue for a fre...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Landscape-aware Automated Algorithm Design: An Efficient Framework for Real-world Optimization

The advent of Large Language Models (LLMs) has opened new frontiers in automated algorithm design, giving rise to numerous powerful methods. However, these appr...

#research #paper #ai
3 months ago · software · - · -

[Paper] A Framework of Critical Success Factors for Agile Software Development

Despite the popularity of Agile software development, achieving consistent project success remains challenging. This systematic literature review identifies cri...

#agile #critical success factors #software development methodology #systematic literature review
3 months ago · software · - · -

[Paper] What's in a Benchmark? The Case of SWE-Bench in Automated Program Repair

The rapid progress in Automated Program Repair (APR) has been fueled by advances in AI, particularly large language models (LLMs) and agent-based systems. SWE-B...

#research #paper #software
3 months ago · software · - · -

[Paper] AgenticAKM : Enroute to Agentic Architecture Knowledge Management

Architecture Knowledge Management (AKM) is crucial for maintaining current and comprehensive software Architecture Knowledge (AK) in a software project. However...

#research #paper #software
3 months ago · ai · - · -

[Paper] SPEAR: An Engineering Case Study of Multi-Agent Coordination for Smart Contract Auditing

We present SPEAR, a multi-agent coordination framework for smart contract auditing that applies established MAS patterns in a realistic security analysis workfl...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Scalable Explainability-as-a-Service (XaaS) for Edge AI Systems

Though Explainable AI (XAI) has made significant advancements, its inclusion in edge and IoT systems is typically ad-hoc and inefficient. Most current methods a...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] A logical re-conception of neural networks: Hamiltonian bitwise part-whole architecture

We introduce a simple initial working system in which relations (such as part-whole) are directly represented via an architecture with operating and learning ru...

#research #paper #ai #machine-learning

Newer posts

Older posts