[Paper] Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference
Attention is the dominant source of latency during long-context LLM inference, an increasingly popular workload with reasoning models and RAG. We propose Kascad...
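The abstract is cut off before the method itself, so the sketch below only illustrates the general idea named in the title: score every cached key for a query, but run the softmax and the value reduction over just the top-k of them. This is a minimal NumPy sketch under assumed names and shapes, not Kascade's actual algorithm.

```python
import numpy as np

def topk_sparse_attention(q, K, V, k=64):
    """One decode-step query of generic top-k sparse attention:
    score all cached keys, but attend only to the k best-scoring ones."""
    d = q.shape[-1]
    scores = K @ q / np.sqrt(d)              # (n_ctx,) attention logits
    idx = np.argpartition(scores, -k)[-k:]   # indices of the top-k keys
    w = np.exp(scores[idx] - scores[idx].max())
    w /= w.sum()                             # softmax over selected keys only
    return w @ V[idx]                        # weighted sum of k selected values

# Toy usage: 4096-token KV cache, head dim 64, attend to 64 tokens.
rng = np.random.default_rng(0)
K, V = rng.standard_normal((4096, 64)), rng.standard_normal((4096, 64))
out = topk_sparse_attention(rng.standard_normal(64), K, V)
print(out.shape)  # (64,)
```

Note that the selection step here still scans every key; practical sparse-attention methods make the selection itself cheap, which is where the algorithmic work lies.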
Large Language Models are increasingly deployed as judges (LaaJ) in code generation pipelines. While attractive for scalability, LaaJs tend to overlook domain s...
Rank-based zeroth-order (ZO) optimization -- which relies only on the ordering of function evaluations -- offers strong robustness to noise and monotone transfo...
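To make "relies only on the ordering of function evaluations" concrete, here is a minimal rank-based ZO step in NumPy: the raw values of f are thrown away and only their ranks shape the update, so rescaling the noise or applying any monotone transform to f leaves the step unchanged. The population size, rank utilities, and step rule are generic assumptions, not the paper's estimator.

```python
import numpy as np

def rank_zo_step(f, x, rng, sigma=0.1, pop=8, lr=0.05):
    """One rank-based zeroth-order step for minimization: evaluate f at
    random perturbations of x and build the update from the *ranking*
    of those evaluations only, never their magnitudes."""
    eps = rng.standard_normal((pop, x.size))          # candidate directions
    vals = np.array([f(x + sigma * e) for e in eps])  # black-box evaluations
    order = np.argsort(vals)                          # best (lowest f) first
    w = np.linspace(1.0, -1.0, pop)                   # rank utilities, sum to 0
    step = (w[:, None] * eps[order]).sum(axis=0) / pop
    return x + lr * step                              # toward better-ranked directions

# Toy usage: minimize a quadratic using only function-value orderings.
rng = np.random.default_rng(0)
f = lambda z: float(z @ z)
x = np.ones(5)
for _ in range(300):
    x = rank_zo_step(f, x, rng)
print(f(x))  # far below the starting value f(ones) = 5.0
```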
TensorFlow is a framework for building applications that learn from data. It runs on small phones and on large servers alike, so the same idea can be used at home or i...
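As a minimal example of the learn-from-data workflow the abstract gestures at, the snippet below fits a tiny classifier with the standard Keras API; the toy data and architecture are made up for illustration, and the phone deployment path (TFLite conversion) is only noted in a comment.

```python
import numpy as np
import tensorflow as tf

# Toy data: classify whether the features sum past a threshold.
x = np.random.rand(256, 4).astype("float32")
y = (x.sum(axis=1) > 2.0).astype("int32")

model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(4,)),
    tf.keras.layers.Dense(2, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x, y, epochs=5, verbose=0)
print(model.predict(x[:3], verbose=0))

# The same trained model can be converted with tf.lite.TFLiteConverter
# to run on phones, which is the portability point the abstract makes.
```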
The evolution of Large Language Model (LLM) serving towards complex, distributed architectures--specifically the P/D-separated, large-scale DP+EP paradigm--intr...
As an increasing number of software systems reach unprecedented scale, relying solely on code-level abstractions is becoming impractical. While architectural ab...
Symbolic regression (SR) has emerged as a powerful method for uncovering interpretable mathematical relationships from data, offering a novel route to both scie...
Existing video generation models struggle to maintain long-term spatial and temporal consistency due to the dense, high-dimensional nature of video signals. To ...
Interpreting the internal activations of neural networks can produce more faithful explanations of their behavior, but is difficult due to the complex structure...
This paper proposes a dual-engine AI architectural method designed to address the complex problem of exploring potential trajectories in the evolution of art. W...
In mathematical models of interacting biological organisms, where external interventions may alter behavior over time, traditional models that assume fixed par...
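To make the contrast with fixed-parameter models concrete, here is a small assumed example: a standard predator-prey system in which the predation rate changes when an external intervention begins. The model choice (Lotka-Volterra), the intervention time, and all rates are illustrative assumptions, not the paper's model.

```python
import numpy as np
from scipy.integrate import solve_ivp

def lotka_volterra(t, z, alpha=1.0, gamma=1.0, delta=0.1):
    """Predator-prey dynamics with a time-varying predation rate beta(t):
    an external intervention at t = 20 alters the interaction, which a
    fixed-parameter model cannot represent."""
    x, y = z                                   # prey, predator populations
    beta = 0.2 if t < 20 else 0.05             # intervention changes behavior
    return [alpha * x - beta * x * y,          # prey: growth minus predation
            delta * beta * x * y - gamma * y]  # predator: gain minus death

sol = solve_ivp(lotka_volterra, t_span=(0, 40), y0=[10.0, 5.0],
                t_eval=np.linspace(0, 40, 9))
print(sol.y.round(2))  # populations before and after the intervention
```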
Early-Exit (EE) is a Large Language Model (LLM) architecture that accelerates inference by allowing easier tokens to be generated using only a subset of the mod...
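A minimal sketch of the EE idea with stub components: a shared exit head checks its confidence after every layer, and the forward pass stops as soon as the prediction is confident enough, so "easy" tokens use fewer layers. The confidence-threshold rule, the shared head, and the toy shapes are assumptions; the abstract is truncated before the paper's actual exit criterion.

```python
import numpy as np

def forward_with_early_exit(h, layers, exit_head, threshold=0.9):
    """Run transformer layers one by one; after each, an exit head predicts
    the next token, and we stop early once its max probability clears the
    threshold, skipping the remaining layers."""
    for depth, layer in enumerate(layers, start=1):
        h = layer(h)                          # one transformer block (stubbed)
        logits = exit_head(h)
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()                  # softmax over the vocabulary
        if probs.max() >= threshold:          # "easy" token: exit early
            return int(probs.argmax()), depth
    return int(probs.argmax()), len(layers)   # "hard" token: full depth used

# Toy usage: 8 stub layers and a fixed projection to a 10-token vocabulary.
rng = np.random.default_rng(0)
W = rng.standard_normal((16, 10))
layers = [lambda h, s=s: np.tanh(h + 0.1 * s) for s in range(8)]
token, depth = forward_with_early_exit(rng.standard_normal(16), layers,
                                       lambda h: h @ W)
print(token, depth)  # depth < 8 means the token exited early
```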