EUNO.NEWS EUNO.NEWS
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
  • All (21181) +146
    • AI (3169) +10
    • DevOps (940) +5
    • Software (11185) +102
    • IT (5838) +28
    • Education (48)
  • Notice
  • All (21181) +146
  • AI (3169) +10
  • DevOps (940) +5
  • Software (11185) +102
  • IT (5838) +28
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    [Paper] Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference

    Attention is the dominant source of latency during long-context LLM inference, an increasingly popular workload with reasoning models and RAG. We propose Kascad...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Beyond Blind Spots: Analytic Hints for Mitigating LLM-Based Evaluation Pitfalls

    Large Language Models are increasingly deployed as judges (LaaJ) in code generation pipelines. While attractive for scalability, LaaJs tend to overlook domain s...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Explicit and Non-asymptotic Query Complexities of Rank-Based Zeroth-order Algorithms on Smooth Functions

    Rank-based zeroth-order (ZO) optimization -- which relies only on the ordering of function evaluations -- offers strong robustness to noise and monotone transfo...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

    TensorFlow is a tool that helps people make apps that can learn from data. It runs on tiny phones and on huge servers, so the same idea can be used at home or i...

    #TensorFlow #machine learning #distributed systems #deep learning #AI frameworks #heterogeneous computing
  • 1 month ago · ai

    [Paper] Staggered Batch Scheduling: Co-optimizing Time-to-First-Token and Throughput for High-Efficiency LLM Inference

    The evolution of Large Language Model (LLM) serving towards complex, distributed architectures--specifically the P/D-separated, large-scale DP+EP paradigm--intr...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Embedding Software Intent: Lightweight Java Module Recovery

    As an increasing number of software systems reach unprecedented scale, relying solely on code-level abstractions is becoming impractical. While architectural ab...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Introduction to Symbolic Regression in the Physical Sciences

    Symbolic regression (SR) has emerged as a powerful method for uncovering interpretable mathematical relationships from data, offering a novel route to both scie...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Spatia: Video Generation with Updatable Spatial Memory

    Existing video generation models struggle to maintain long-term spatial and temporal consistency due to the dense, high-dimensional nature of video signals. To ...

    #research #paper #ai #machine-learning #computer-vision
  • 1 month ago · ai

    [Paper] Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants

    Interpreting the internal activations of neural networks can produce more faithful explanations of their behavior, but is difficult due to the complex structure...

    #research #paper #ai #machine-learning #nlp
  • 1 month ago · ai

    [Paper] Artism: AI-Driven Dual-Engine System for Art Generation and Critique

    This paper proposes a dual-engine AI architectural method designed to address the complex problem of exploring potential trajectories in the evolution of art. W...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Learning Model Parameter Dynamics in a Combination Therapy for Bladder Cancer from Sparse Biological Data

    In a mathematical model of interacting biological organisms, where external interventions may alter behavior over time, traditional models that assume fixed par...

    #research #paper #ai #machine-learning
  • 1 month ago · ai

    [Paper] Dynamic Rebatching for Efficient Early-Exit Inference with DREX

    Early-Exit (EE) is a Large Language Model (LLM) architecture that accelerates inference by allowing easier tokens to be generated using only a subset of the mod...

    #research #paper #ai #machine-learning

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026