Source

arXiv

5804 posts from this source

3 months ago · ai · - · -

[Paper] ObjectForesight: Predicting Future 3D Object Trajectories from Human Videos

Humans can effortlessly anticipate how objects might move or change through interaction--imagining a cup being lifted, a knife slicing, or a lid being closed. W...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] Measuring and Fostering Peace through Machine Learning and Artificial Intelligence

We used machine learning and artificial intelligence: 1) to measure levels of peace in countries from news and social media and 2) to develop on-line tools that...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] Learning Latent Action World Models In The Wild

Agents capable of reasoning and planning in the real world require the ability of predicting the consequences of their actions. While world models possess this ...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Stochastic Deep Learning: A Probabilistic Framework for Modeling Uncertainty in Structured Temporal Data

I propose a novel framework that integrates stochastic differential equations (SDEs) with deep generative models to improve uncertainty quantification in machin...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] CAOS: Conformal Aggregation of One-Shot Predictors

One-shot prediction enables rapid adaptation of pretrained foundation models to new tasks using only one labeled example, but lacks principled uncertainty quant...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents

We present MineNPC-Task, a user-authored benchmark and evaluation harness for testing memory-aware, mixed-initiative LLM agents in open-world Minecraft....

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Internal Representations as Indicators of Hallucinations in Agent Tool Selection

Large Language Models (LLMs) have shown remarkable capabilities in tool calling and tool usage, but suffer from hallucinations where they choose incorrect tools...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching

Brain Magnetic Resonance Imaging (MRI) plays a central role in studying neurological development, aging, and diseases. One key application is Brain Age Predicti...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] MoE3D: A Mixture-of-Experts Module for 3D Reconstruction

MoE3D is a mixture-of-experts module designed to sharpen depth boundaries and mitigate flying-point artifacts of existing feed-forward 3D r...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI

Pervasive AI increasingly depends on on-device learning systems that deliver low-latency and energy-efficient computation under strict resource constraints. Liq...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Stock Market Price Prediction using Neural Prophet with Deep Neural Network

Stock market price prediction is a significant interdisciplinary research domain that lies at the intersection of finance, statistics, and economics. Forecas...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Mechanisms of Prompt-Induced Hallucination in Vision-Language Models

Large vision-language models (VLMs) are highly capable, yet often hallucinate by favoring textual prompts over visual evidence. We study this failure mode in a ...

#research #paper #ai #machine-learning #nlp #computer-vision
3 months ago · ai · - · -

[Paper] An interpretable data-driven approach to optimizing clinical fall risk assessment

In this study, we aim to better align fall risk prediction from the Johns Hopkins Fall Risk Assessment Tool (JHFRAT) with additional clinically meaningful measu...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] LELA: an LLM-based Entity Linking Approach with Zero-Shot Domain Adaptation

Entity linking (mapping ambiguous mentions in text to entities in a knowledge base) is a foundational step in tasks such as knowledge graph construction, questi...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Cutting AI Research Costs: How Task-Aware Compression Makes Large Language Model Agents Affordable

When researchers deploy large language models for autonomous tasks like reviewing literature or generating hypotheses, the computational bills add up quickly. A...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning

Large language models (LLMs) have revolutionized text-based code automation, but their potential in graph-oriented engineering workflows remains under-explored....

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop

The rapid advancement of large language models (LLMs) has led to growing interest in using synthetic data to train future models. However, this creates a self-c...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Chain-of-thought (CoT) reasoning has emerged as a powerful tool for multimodal large language models on video understanding tasks. However, its necessity and ad...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] CoV: Chain-of-View Prompting for Spatial Reasoning

Embodied question answering (EQA) in 3D environments often requires collecting context that is distributed across multiple viewpoints and partially occluded. Ho...

#research #paper #ai #machine-learning #computer-vision
3 months ago · ai · - · -

[Paper] Inside Out: Evolving User-Centric Core Memory Trees for Long-Term Personalized Dialogue Systems

Existing long-term personalized dialogue systems struggle to reconcile unbounded interaction streams with finite context constraints, often succumbing to memory...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] Reverse-engineering NLI: A study of the meta-inferential properties of Natural Language Inference

Natural Language Inference (NLI) has been an important task for evaluating language models for Natural Language Understanding, but the logical properties of the...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] RelayLLM: Efficient Reasoning via Collaborative Decoding

Using Large Language Models (LLMs) for complex reasoning is often hindered by high computational costs and latency, while resource-efficient Small Language Models (SL...

#research #paper #ai #machine-learning #nlp
3 months ago · ai · - · -

[Paper] DocDancer: Towards Agentic Document-Grounded Information Seeking

Document Question Answering (DocQA) focuses on answering questions grounded in given documents, yet existing DocQA agents lack effective tool utilization and la...

#research #paper #ai #nlp
3 months ago · ai · - · -

[Paper] A Lightweight and Explainable Vision-Language Framework for Crop Disease Visual Question Answering

Visual question answering for crop disease analysis requires accurate visual understanding and reliable language generation. This work presents a lightweight vi...

#research #paper #ai #nlp #computer-vision
3 months ago · devops · - · -

[Paper] Nalar: An agent serving framework

LLM-driven agentic applications increasingly automate complex, multi-step tasks, but serving them efficiently remains challenging due to heterogeneous component...

#research #paper #devops
3 months ago · ai · - · -

[Paper] ECLIPSE: An Evolutionary Computation Library for Instrumentation Prototyping in Scientific Engineering

Designing scientific instrumentation often requires exploring large, highly constrained design spaces using computationally expensive physics simulations. These...

#research #paper #ai
3 months ago · ai · - · -

[Paper] Advanced Multimodal Learning for Seizure Detection and Prediction: Concept, Challenges, and Future Directions

Epilepsy is a chronic neurological disorder characterized by recurrent unprovoked seizures, affects over 50 million people worldwide, and poses significant risk...

#research #paper #ai
3 months ago · devops · - · -

[Paper] Asynchronous Secure Federated Learning with Byzantine aggregators

Privacy-preserving federated averaging is a central approach for protecting client privacy in federated learning. In this paper, we study this problem in an asy...

#research #paper #devops
3 months ago · software · - · -

[Paper] AVX / NEON Intrinsic Functions: When Should They Be Used?

A cross-configuration benchmark is proposed to explore the capacities and limitations of AVX / NEON intrinsic functions in a generic context of development proj...

#research #paper #software
3 months ago · devops · - · -

[Paper] Parallel Quadratic Selected Inversion in Quantum Transport Simulation

Driven by Moore's Law, the dimensions of transistors have been pushed down to the nanometer scale. Advanced quantum transport (QT) solvers are required to accur...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests

Pull request (PR) descriptions generated by AI coding agents are the primary channel for communicating code changes to human reviewers. However, the alignment b...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] A Longitudinal Analysis of Gamification in Untappd: Ethical Reflections on a Social Drinking Application

This paper presents a longitudinal ethical analysis of Untappd, a social drinking application that gamifies beer consumption through badges, streaks, and social...

#research #paper #software
3 months ago · devops · - · -

[Paper] Proof of Commitment: A Human-Centric Resource for Permissionless Consensus

Permissionless consensus protocols require a scarce resource to regulate leader election and provide Sybil resistance. Existing paradigms such as Proof of Work ...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Neural-Symbolic Integration with Evolvable Policies

Neural-Symbolic (NeSy) Artificial Intelligence has emerged as a promising approach for combining the learning capabilities of neural networks with the interpret...

#research #paper #ai #machine-learning
3 months ago · devops · - · -

[Paper] Cognitive Infrastructure: A Unified DCIM Framework for AI Data Centers

This work presents DCIM 3.0, a unified framework integrating semantic reasoning, predictive analytics, autonomous orchestration, and unified connectivity for ne...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Training a Custom CNN on Five Heterogeneous Image Datasets

Deep learning has transformed visual data analysis, with Convolutional Neural Networks (CNNs) becoming highly effective in learning meaningful feature represent...

#research #paper #ai #computer-vision
3 months ago · ai · - · -

[Paper] MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs

The pervasive 'memory wall' bottleneck is significantly amplified in modern large-scale Mixture-of-Experts (MoE) architectures. MoE's inherent architectural spa...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training

Graph Neural Networks (GNNs) are powerful tools for learning graph-structured data, but their scalability is hindered by inefficient mini-batch generation, data...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] Extending Delta Debugging Minimization for Spectrum-Based Fault Localization

This paper introduces DDMIN-LOC, a technique that combines Delta Debugging Minimization (DDMIN) with Spectrum-Based Fault Localization (SBFL). It can be applied...

#research #paper #software
3 months ago · devops · - · -

[Paper] Quantifying Autoscaler Vulnerabilities: An Empirical Study of Resource Misallocation Induced by Cloud Infrastructure Faults

Resource autoscaling mechanisms in cloud environments depend on accurate performance metrics to make optimal provisioning decisions. When infrastructure faults ...

#research #paper #devops
3 months ago · ai · - · -

[Paper] Mechanism Design for Federated Learning with Non-Monotonic Network Effects

Mechanism design is pivotal to federated learning (FL) for maximizing social welfare by coordinating self-interested clients. Existing mechanisms, however, ofte...

#research #paper #ai #machine-learning
3 months ago · software · - · -

[Paper] 4D-ARE: Bridging the Attribution Gap in LLM Agent Requirements Engineering

We deployed an LLM agent with ReAct reasoning and full data access. It executed flawlessly, yet when asked 'Why is completion rate 80%?', it returned metrics in...

#research #paper #software
3 months ago · ai · - · -

[Paper] Timeliness-Oriented Scheduling and Resource Allocation in Multi-Region Collaborative Perception

Collaborative perception (CP) is a critical technology in applications like autonomous driving and smart cities. It involves the sharing and fusion of informati...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] AdaptEval: A Benchmark for Evaluating Large Language Models on Code Snippet Adaptation

Recent advancements in large language models (LLMs) have automated various software engineering tasks, with benchmarks emerging to evaluate their capabilities. ...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Paradoxical noise preference in RNNs

In recurrent neural networks (RNNs) used to model biological neural networks, noise is typically introduced during training to emulate biological variability an...

#research #paper #ai #machine-learning
3 months ago · ai · - · -

[Paper] Advancing Language Models for Code-related Tasks

Recent advances in language models (LMs) have driven significant progress in various software engineering tasks. However, existing LMs still struggle with compl...

#research #paper #ai #machine-learning #nlp
3 months ago · devops · - · -

[Paper] Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks

We present a new blocking linearizable stack implementation which utilizes sharding and fetch&increment to achieve significantly better performance than all...

#research #paper #devops
