Source

arXiv

1659 posts from this source

Sort:

1 week ago · software · - · -

[Paper] Econstellar: An Open-Source AI-Augmented Research Engine for Computational Financial Econometrics

Turning a promising economic idea into a credible empirical finding is, in practice, an expensive undertaking: it demands a great deal of specialised computatio...

#research #paper #software
1 week ago · ai · - · -

[Paper] Synthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of Layer-Local Training

Forward-Forward (FF) learning [Hinton, 2022] replaces backpropagation with strictly layer-local goodness updates. Recent FF-CNN work has narrowed the gap to BP ...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Synthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of Layer-Local Training

Forward-Forward (FF) learning [Hinton, 2022] replaces backpropagation with strictly layer-local goodness updates. Recent FF-CNN work has narrowed the gap to BP ...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] When Surface Form Changes Moderation Decisions: A Paired Study of Code-Mixed Workflow Instability

Hate moderation is often evaluated as classification on clean English inputs, but deployed systems must route content to actions such as ALLOW, FLAG, or REVIEW....

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] Development of a Structured Approach for Establishing Mission Engineering Requirements

This paper addresses the question: How can mission effectiveness be systematically defined or approximated in the absence of customer requirements? Legacy requi...

#research #paper #software
1 week ago · ai · - · -

[Paper] Enhancing Software Engineering Through Closed-Loop Memory Optimization

Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. How...

#research #paper #ai #machine-learning
1 week ago · devops · - · -

[Paper] PoCQ: Proof of Contribution Quality as a Lightweight Blockchain Consensus for Secure Federated Learning

Decentralized Federated Learning (FL) removes reliance on centralized coordinators but remains vulnerable to model poisoning, unreliable validation, and high va...

#research #paper #devops
1 week ago · ai · - · -

[Paper] The End of Software Engineering: How AI Agents Are Fundamentally Restructuring the Software Paradigm

For over half a century, software engineering has operated on a foundational premise: human engineers decompose problems, encode decision logic into static code...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] From Prediction to Self: Developmental Conditions for Agency in Minimal Neural Systems

How does a system that merely predicts the world come to distinguish its own causal influence from everything else? We trace this transition in a minimal 192-di...

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] SmellBench: Towards Fine-Grained Evaluation of Code Agents on Refactoring Tasks

Code Agents have achieved remarkable advances in recent years, exhibiting strong capabilities across a wide range of software engineering tasks. However, their ...

#research #paper #software
1 week ago · ai · - · -

[Paper] ADK Arena: Evaluating Agent Development Kits via LLM-as-a-Developer

The rapid proliferation of Agent Development Kits (ADKs), SDK-level frameworks for building LLM-powered autonomous agents, has outpaced any empirical understand...

#research #paper #ai #machine-learning
1 week ago · devops · - · -

[Paper] Latent Reasoning Guidance for Parallel Code Translation

Tackling complex coding tasks often requires autonomous agents and iterative repair pipelines. These increasingly rely on large amounts of test-time computation...

#research #paper #devops
1 week ago · devops · - · -

[Paper] Bitcoin After Block Rewards

Bitcoin's block reward is scheduled to decline to zero, raising concerns about whether the network can remain secure once miners rely solely on transaction fees...

#research #paper #devops
1 week ago · devops · - · -

[Paper] SET: Stream-Event-Triggered Scheduling for Efficient CUDA Graph Pipelines

Achieving peak GPU performance remains a significant challenge as the system throughput is constrained by host-device synchronization delays and kernel scheduli...

#research #paper #devops
1 week ago · ai · - · -

[Paper] Mutation Without Variation: Convergence Dynamics in LLM-Driven Program Evolution

When an LLM repeatedly mutates a program, does it explore new forms or circle back to the same ones? We study this question by analyzing LLM-driven mutation cha...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations

Training Data Attribution (TDA) seeks to trace a model's predictions back to its training data. The gold standard for TDA relies on causal interventions, observ...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Controllable Dynamic 3D Shape Generation via 3D Trajectories and Text

We introduce T2Mo, a feed-forward framework for controllable dynamic 3D shape generation conditioned on 3D trajectories and text. Due to the inherent ambiguity ...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models

Audio-language models (ALMs) often follow text that conflicts with audio, even when the audio evidence is clear. This raises a basic question: is the audio-supp...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Streaming Communication in Multi-Agent Reasoning

Multi-agent reasoning systems adopt a 'generate-then-transfer' paradigm that forces end-to-end latency to scale linearly with pipeline depth. We introduce Strea...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Reinforcement Learning from Rich Feedback with Distributional DAgger

Reasoning models have advanced rapidly, but the dominant reinforcement learning from verifiable rewards (RLVR) recipe remains surprisingly narrow: sample many r...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Multi-Column RBF Neural Network Using Adaptive and Non-Adaptive Particle Swarm Optimization

The radial basis function neural network (RBFN) trained with a gradient descending algorithm provides an effective fully connected structure in both shallow and...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers

Vehicle body type is a significant determinant of cyclist injury severity in overtaking crashes, yet automated tools for classifying vehicles into injury-risk-r...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

When post-trained language models fail on reasoning problems, the common test-time-scaling response is to spend more compute on additional attempts, and the fai...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

Recent developments in multi-view image editing with generative models have brought us a step closer toward general 3D content generation and customization. Mos...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] BBOmix: A Tabular Benchmark for Hyperparameter Optimization of Unsupervised Biological Representation Learning

The rapid advancement of high-throughput sequencing has led to large, high-dimensional omics datasets. Deep unsupervised learning architectures, particularly Au...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Generating Financial Time Series by Matching Random Convolutional Features

Generating realistic financial time series is challenging as training data is often limited to a single historical path. With such scarce data, overfitting is h...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Activation-Based Active Learning for In-Context Learning: Challenges and Insights

Deep active learning has previously been explored for LLM in-context sample selection, but not with methods that utilise recent advances in understanding of tra...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Deep Embedded Multiplicative DMD for Algebra-Preserving Koopman Learning

Koopman theory turns nonlinear dynamics into a linear spectral problem. In computation, however, everything depends on a hard finite-dimensional choice: the obs...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Towards Efficient and Evidence-grounded Mobility Prediction with LLM-Driven Agent

Individual-level mobility prediction is central to urban simulation, transportation planning, and policy analysis. Supervised sequence models achieve strong acc...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

Preserving data privacy is an important topic in structural data management and data mining. However, the issue of privacy leakage in distributed causal structu...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Geometry Gaussians: Decoupling Appearance and Geometry in Gaussian Splatting

After the success of 3D Gaussian Splatting (3DGS) for novel view synthesis, many works have explored how to also use it for geometric surface representation. Ho...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Self-Evaluation Is Already There: Eliciting Latent Judge Calibration in Base LLMs with Minimal Data

Large language models are increasingly evaluated by other models, raising a natural question: can a model predict how a judge will score its own output? We find...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Audio Interaction Model

Audio is an inherently interactive modality, yet today's Large Audio Language Models (LALMs) are offline, and streaming audio models each handle only a single t...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Continual Visual and Verbal Learning Through a Child's Egocentric Input

Children learn the meanings of words from a continuous, temporally structured stream of egocentric experience. Recent work shows that neural networks can also l...

#research #paper #ai #machine-learning #nlp #computer-vision
1 week ago · software · - · -

[Paper] How Software Engineering Students Use LLMs to Write Research Papers: An Experience Report

Large language models are increasingly becoming part of software engineering education, including activities involving empirical software engineering and eviden...

#research #paper #software
1 week ago · ai · - · -

[Paper] Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

Large language models (LLMs) are increasingly proposed as clinical agents, yet static, single-turn benchmarks cannot capture how a model dynamically delivers ca...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Who Needs Labels? Adapting Vision Foundation Models With the Metadata You Already Have

We propose a label-free approach to adapt powerful but generic vision foundation models to specialized scientific domains. Standard supervised fine-tuning is of...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Arithmetic Pedagogy for Language Models

We investigate whether methods of human mathematics pedagogy can guide the training of language models toward arithmetic reasoning. Building on the GASING metho...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Identifying Gems from Roman RAPIDly

The Nancy Grace Roman Space Telescope (Roman), set for launch as early as September 2026, will conduct wide-field infrared imaging surveys with unprecedented sp...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] ZipSplat: Fewer Gaussians, Better Splats

Feed-forward 3D Gaussian Splatting methods reconstruct a scene from posed or pose-free images in a single forward pass, yet current approaches predict one Gauss...

#research #paper #ai #computer-vision
2 weeks ago · devops · - · -

[Paper] Graph Traversal on Tensor Cores: A BFS Framework for Modern GPUs

Modern GPUs have Tensor Cores (TCs) capable of extremely high-throughput matrix operations, yet graph algorithms remain difficult to accelerate because of their...

#research #paper #devops
2 weeks ago · ai · - · -

[Paper] InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space

Language-guided photo retouching aims to adjust color and tone while preserving geometry and texture. Recently, diffusion-based retouching shows a superior visu...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] MaCo-GAN: Manifold-Contrastive Adversarial Learning for Single Image Super-Resolution

Conventional Generative Adversarial Networks (GANs) for Single Image Super-Resolution (SISR) often struggle with hallucinated artifacts, largely because standar...

#research #paper #ai #computer-vision
2 weeks ago · ai · - · -

[Paper] Self-Reflective APIs: Structure Beats Verbosity for AI Agent Recovery

When an AI agent calls an API and hits a validation error, it needs more than what went wrong -- it needs what to do next. A self-reflective API returns, on val...

#research #paper #ai #machine-learning
2 weeks ago · software · - · -

[Paper] TeleSWEBench: A Commit-Driven Benchmark for Evaluating LLM-Powered Software Engineering in Telecommunications

With the telecommunications field embracing zero touch management alongside novel O-RAN and AI-RAN frameworks, contemporary telecom networks now function as imm...

#research #paper #software
2 weeks ago · software · - · -

[Paper] Code Lifespan Survival Analysis (CLSA): Predicting the Survival of Source Code Lines Using AST-Aware Mining

Context: Predicting which source lines will be deleted - and when - matters for maintenance, technical debt, and review prioritization. Existing MSR approaches ...

#research #paper #software
2 weeks ago · ai · - · -

[Paper] From Prompt to Process: a Process Taxonomy and Comparative Assessment of Frameworks Supporting AI Software Development Agents

AI tools for programming are no longer just autocomplete or chat assistants: they organize themselves as development frameworks, with process, roles, artifacts ...

#research #paper #ai #machine-learning
2 weeks ago · devops · - · -

[Paper] The local complexity of certifying parity

In this paper, we consider the problem of locally certifying that the size of a network is even, or more generally, congruent to some fixed number. The parity p...

#research #paper #devops

Newer posts

Older posts