Source

arXiv

5856 posts from this source

Sort:

4 months ago · ai · - · -

[Paper] DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders

Video diffusion models have revolutionized generative video synthesis, but they are imprecise, slow, and can be opaque during generation -- keeping users in the...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] LitePT: Lighter Yet Stronger Point Transformer

Modern neural architectures for 3D point cloud processing contain both convolutional layers and attention blocks, but the best way to assemble them remains uncl...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Towards Scalable Pre-training of Visual Tokenizers for Generation

The quality of the latent space in visual tokenizers (e.g., VAEs) is crucial for modern generative models. However, the standard reconstruction-based training p...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Beyond surface form: A pipeline for semantic analysis in Alzheimer's Disease detection from spontaneous speech

Alzheimer's Disease (AD) is a progressive neurodegenerative condition that adversely affects cognitive abilities. Language-related changes can be automatically ...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Recurrent Video Masked Autoencoders

We present Recurrent Video Masked-Autoencoders (RVM): a novel video representation learning approach that uses a transformer-based recurrent neural network to a...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners

Generalization remains the central challenge for interactive 3D scene generation. Existing learning-based approaches ground spatial understanding in limited sce...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction

Recent feed-forward reconstruction models like VGGT and π^3 achieve impressive reconstruction quality but cannot process streaming videos due to quadratic memor...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Feedforward 3D Editing via Text-Steerable Image-to-3D

Recent progress in image-to-3D has opened up immense possibilities for design, AR/VR, and robotics. However, to use AI-generated 3D assets in real applications,...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] JoVA: Unified Multimodal Learning for Joint Video-Audio Generation

In this paper, we present JoVA, a unified framework for joint video-audio generation. Despite recent encouraging advances, existing methods face two critical li...

#research #paper #ai #computer-vision
4 months ago · ai · - · -

[Paper] Towards Effective Model Editing for LLM Personalization

Personalization is becoming indispensable for LLMs to align with individual user preferences and needs. Yet current approaches are often computationally expensi...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Towards Interactive Intelligence for Digital Humans

We introduce Interactive Intelligence, a novel paradigm of digital human that is capable of personality-aligned expression, adaptive interaction, and self-evolu...

#research #paper #ai #nlp #computer-vision
4 months ago · ai · - · -

[Paper] Directional Textual Inversion for Personalized Text-to-Image Generation

Textual Inversion (TI) is an efficient approach to text-to-image personalization but often fails on complex prompts. We trace these failures to embedding norm i...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] A Scientific Reasoning Model for Organic Synthesis Procedure Generation

Solving computer-aided synthesis planning is essential for enabling fully automated, robot-assisted synthesis workflows and improving the efficiency of drug dis...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] A stylometric analysis of speaker attribution from speech transcripts

Forensic scientists often need to identify an unknown speaker or writer in cases such as ransom calls, covert recordings, alleged suicide notes, or anonymous on...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] SEDULity: A Proof-of-Learning Framework for Distributed and Secure Blockchains with Efficient Useful Work

The security and decentralization of Proof-of-Work (PoW) have been well-tested in existing blockchain systems. However, its tremendous energy waste has raised c...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance

As the online learning landscape evolves, the need for personalization is increasingly evident. Although educational resources are burgeoning, educators face ch...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation

Safety alignment mechanisms in large language models prevent responses to harmful queries through learned refusal behavior, yet these same mechanisms impede leg...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] Large-Language Memorization During the Classification of United States Supreme Court Cases

Large-language models (LLMs) have been shown to respond in a variety of ways for classification tasks outside of question-answering. LLM responses are sometimes...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] World Models Can Leverage Human Videos for Dexterous Manipulation

Dexterous manipulation is challenging because it requires understanding how subtle hand motion influences the environment through contact with objects. We intro...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] From Code to Field: Evaluating the Robustness of Convolutional Neural Networks for Disease Diagnosis in Mango Leaves

The validation and verification of artificial intelligence (AI) models through robustness assessment are essential to guarantee the reliable performance of inte...

#research #paper #ai #machine-learning #computer-vision
4 months ago · devops · - · -

[Paper] Design in Tiles: Automating GEMM Deployment on Tile-Based Many-PE Accelerators

Tile-based many-Processing Element (PE) accelerators can achieve competitive performance on General Matrix Multiplication (GEMM), but they are extremely hard to...

#research #paper #devops
4 months ago · ai · - · -

[Paper] Universality of high-dimensional scaling limits of stochastic gradient descent

We consider statistical tasks in high dimensions whose loss depends on the data only through its projection into a fixed-dimensional subspace spanned by the par...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion

Stuttering detection breaks down when disfluencies overlap. Existing parametric models struggle to distinguish complex, simultaneous disfluencies (e.g., a 'bloc...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Temporal Tokenization Strategies for Event Sequence Modeling with Large Language Models

Representing continuous time is a critical and under-explored challenge in modeling temporal event sequences with large language models (LLMs). Various strategi...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] LightTopoGAT: Enhancing Graph Attention Networks with Topological Features for Efficient Graph Classification

Graph Neural Networks have demonstrated significant success in graph classification tasks, yet they often require substantial computational resources and strugg...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models

We introduce the Do-Undo task and benchmark to address a critical gap in vision-language models: understanding and generating physically plausible scene transfo...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Building general-purpose reasoning models with reinforcement learning (RL) entails substantial cross-domain heterogeneity, including large variation in inferenc...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] DA-SSL: self-supervised domain adaptor to leverage foundational models in turbt histopathology slides

Recent deep learning frameworks in histopathology, particularly multiple instance learning (MIL) combined with pathology foundational models (PFMs), have shown ...

#research #paper #ai #machine-learning #computer-vision
4 months ago · ai · - · -

[Paper] Textual Gradients are a Flawed Metaphor for Automatic Prompt Optimization

A well-engineered prompt can increase the performance of large language models; automatic prompt optimization techniques aim to increase performance without req...

#research #paper #ai #machine-learning #nlp
4 months ago · devops · - · -

[Paper] astroCAMP: A Community Benchmark and Co-Design Framework for Sustainable SKA-Scale Radio Imaging

The Square Kilometre Array (SKA) project will operate one of the world's largest continuous scientific data systems, sustaining petascale imaging under strict p...

#research #paper #devops
4 months ago · ai · - · -

[Paper] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Autoregressive models (ARMs) are hindered by slow sequential inference. While masked diffusion models (MDMs) offer a parallel alternative, they suffer from crit...

#research #paper #ai #machine-learning #nlp
4 months ago · ai · - · -

[Paper] DP-CSGP: Differentially Private Stochastic Gradient Push with Compressed Communication

In this paper, we propose a Differentially Private Stochastic Gradient Push with Compressed communication (termed DP-CSGP) for decentralized learning over direc...

#research #paper #ai #machine-learning
4 months ago · ai · - · -

[Paper] Reproducing and Dissecting Denoising Language Models for Speech Recognition

Denoising language models (DLMs) have been proposed as a powerful alternative to traditional language models (LMs) for automatic speech recognition (ASR), motiv...

#research #paper #ai
4 months ago · devops · - · -

[Paper] Janus: Disaggregating Attention and Experts for Scalable MoE Inference

Large Mixture-of-Experts (MoE) model inference is challenging due to high resource demands and dynamic workloads. Existing solutions often deploy the entire mod...

#research #paper #devops
4 months ago · software · - · -

[Paper] How Low Can You Go? The Data-Light SE Challenge

Much of software engineering (SE) research assumes that progress depends on massive datasets and CPU-intensive optimizers. Yet has this assumption been rigorous...

#research #paper #software
4 months ago · ai · - · -

[Paper] Fine-tuned LLM-based Code Migration Framework

The study presents the outcomes of research and experimental validation in the domain of automated codebase migration, with a focus on addressing challenges in ...

#research #paper #ai #nlp
4 months ago · ai · - · -

[Paper] SIGMA: An AI-Empowered Training Stack on Early-Life Hardware

An increasing variety of AI accelerators is being considered for large-scale training. However, enabling large-scale training on early-life AI accelerators face...

#research #paper #ai #nlp
4 months ago · software · - · -

[Paper] Mapping of the system of software-related emissions and shared responsibilities

The global climate is experiencing a rapid and unprecedented warming trend. The ICT sector is a notable contributor to global greenhouse gas emissions, with its...

#research #paper #software
4 months ago · software · - · -

[Paper] A Data Annotation Requirements Representation and Specification (DARS)

With the rise of AI-enabled cyber-physical systems, data annotation has become a critical yet often overlooked process in the development of these intelligent i...

#research #paper #software
4 months ago · ai · - · -

[Paper] From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents

While Large Language Model (LLM) agents show great potential for automated UI navigation such as automated UI testing and AI assistants, their efficiency has be...

#research #paper #ai #machine-learning
4 months ago · software · - · -

[Paper] QMon: Monitoring the Execution of Quantum Circuits with Mid-Circuit Measurement and Reset

Unlike classical software, where logging and runtime tracing can effectively reveal internal execution status, quantum circuits possess unique properties, such ...

#research #paper #software
4 months ago · software · - · -

[Paper] PSALM: applying Proportional SAmpLing strategy in Metamorphic testing

Metamorphic testing (MT) alleviates the oracle problem by checking metamorphic relations (MRs) across multiple test executions. The fault detection effectivenes...

#research #paper #software
4 months ago · software · - · -

[Paper] UCRBench: Benchmarking LLMs on Use Case Recovery

Use cases are widely employed to specify functional requirements, yet existing benchmarks are scarce and face the risk of being misaligned with actual system be...

#research #paper #software
4 months ago · devops · - · -

[Paper] Temporal parallelisation of continuous-time maximum-a-posteriori trajectory estimation

This paper proposes a parallel-in-time method for computing continuous-time maximum-a-posteriori (MAP) trajectory estimates of the states of partially observed ...

#research #paper #devops
4 months ago · devops · - · -

[Paper] SPARS: A Reinforcement Learning-Enabled Simulator for Power Management in HPC Job Scheduling

High-performance computing (HPC) clusters consume enormous amounts of energy, with idle nodes as a major source of waste. Powering down unused nodes can mitigat...

#research #paper #devops
4 months ago · software · - · -

[Paper] A Decision Support Framework for Blockchain Pattern Selection Based on Soft Goals

Blockchain technology is gaining momentum across many sectors. Whereas blockchain solutions have important positive effects on the business domain, they also in...

#research #paper #software
4 months ago · devops · - · -

[Paper] Towards Secure Decentralized Applications and Consensus Protocols in Blockchains (on Selfish Mining, Undercutting Attacks, DAG-Based Blockchains, E-Voting, Cryptocurrency Wallets, Secure-Logging, and CBDC)

With the rise of cryptocurrencies, many new applications built on decentralized blockchains have emerged. Blockchains are full-stack distributed systems where m...

#research #paper #devops
4 months ago · devops · - · -

[Paper] Toward Self-Healing Networks-on-Chip: RL-Driven Routing in 2D Torus Architectures

We investigate adaptive minimal routing in 2D torus networks on chip NoCs under node fault conditions comparing a reinforcement learning RL based strategy to an...

#research #paper #devops

Newer posts

Older posts