research — Page 102

Sort:

2 months ago · devops · - · -

[Paper] Efficient CPU-GPU Collaborative Inference for MoE-based LLMs on Memory-Limited Systems

Large Language Models (LLMs) have achieved impressive results across various tasks, yet their high computational demands pose deployment challenges, especially ...

#research #paper #devops
2 months ago · ai · - · -

[Paper] AI4EOSC: a Federated Cloud Platform for Artificial Intelligence in Scientific Research

In this paper, we describe a federated compute platform dedicated to support Artificial Intelligence in scientific workloads. Putting the effort into reproducib...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Topic Modelling Black Box Optimization

Choosing the number of topics T in Latent Dirichlet Allocation (LDA) is a key design decision that strongly affects both the statistical fit and interpretabilit...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Hypernetworks That Evolve Themselves

How can neural networks evolve themselves without relying on external optimizers? We propose Self-Referential Graph HyperNetworks, systems where the very machin...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference

Attention is the dominant source of latency during long-context LLM inference, an increasingly popular workload with reasoning models and RAG. We propose Kascad...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Using a Sledgehammer to Crack a Nut? Revisiting Automated Compiler Fault Isolation

Background: Compilers are fundamental to software development, translating high-level source code into executable software systems. Faults in compilers can have...

#research #paper #software
2 months ago · ai · - · -

[Paper] Beyond Blind Spots: Analytic Hints for Mitigating LLM-Based Evaluation Pitfalls

Large Language Models are increasingly deployed as judges (LaaJ) in code generation pipelines. While attractive for scalability, LaaJs tend to overlook domain s...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Improving Low-Latency Learning Performance in Spiking Neural Networks via a Change-Perceptive Dendrite-Soma-Axon Neuron

Spiking neurons, the fundamental information processing units of Spiking Neural Networks (SNNs), have the all-or-zero information output form that allows SNNs t...

#research #paper #ai
2 months ago · ai · - · -

[Paper] Explicit and Non-asymptotic Query Complexities of Rank-Based Zeroth-order Algorithms on Smooth Functions

Rank-based zeroth-order (ZO) optimization -- which relies only on the ordering of function evaluations -- offers strong robustness to noise and monotone transfo...

#research #paper #ai #machine-learning
2 months ago · devops · - · -

[Paper] FlexKV: Flexible Index Offloading for Memory-Disaggregated Key-Value Store

Disaggregated memory (DM) is a promising data center architecture that decouples CPU and memory into independent resource pools to improve resource utilization....

#research #paper #devops
2 months ago · software · - · -

[Paper] Analysis of Design Patterns and Benchmark Practices in Apache Kafka Event-Streaming Systems

Apache Kafka has become a foundational platform for high throughput event streaming, enabling real time analytics, financial transaction processing, industrial ...

#research #paper #software
2 months ago · devops · - · -

[Paper] Lotus: Optimizing Disaggregated Transactions with Disaggregated Locks

Disaggregated memory (DM) separates compute and memory resources, allowing flexible scaling to achieve high resource utilization. To ensure atomic and consisten...

#research #paper #devops
2 months ago · ai · - · -

[Paper] Staggered Batch Scheduling: Co-optimizing Time-to-First-Token and Throughput for High-Efficiency LLM Inference

The evolution of Large Language Model (LLM) serving towards complex, distributed architectures--specifically the P/D-separated, large-scale DP+EP paradigm--intr...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] LLM4Perf: Large Language Models Are Effective Samplers for Multi-Objective Performance Modeling (Copy)

The performance of modern software systems is critically dependent on their complex configuration options. Building accurate performance models to navigate this...

#research #paper #software
2 months ago · software · - · -

Seven Core Activities of Great Digital Teams (RAADDDR)

Overview Most organisations don’t fail at “digital” because they can’t build software; they fail because they under‑invest in the work that delivers better dig...

#digital teams #RAADDDR #software development process #research #analysis #architecture #design #development #delivery #operations
2 months ago · ai · - · -

[Paper] Embedding Software Intent: Lightweight Java Module Recovery

As an increasing number of software systems reach unprecedented scale, relying solely on code-level abstractions is becoming impractical. While architectural ab...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Introduction to Symbolic Regression in the Physical Sciences

Symbolic regression (SR) has emerged as a powerful method for uncovering interpretable mathematical relationships from data, offering a novel route to both scie...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Spatia: Video Generation with Updatable Spatial Memory

Existing video generation models struggle to maintain long-term spatial and temporal consistency due to the dense, high-dimensional nature of video signals. To ...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] In Pursuit of Pixel Supervision for Visual Pre-training

At the most basic level, pixels are the source of the visual information through which we perceive the world. Pixels contain information at all levels, ranging ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

In recent multimodal research, the diffusion paradigm has emerged as a promising alternative to the autoregressive paradigm (AR), owing to its unique decoding a...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Predictive Concept Decoders: Training Scalable End-to-End Interpretability Assistants

Interpreting the internal activations of neural networks can produce more faithful explanations of their behavior, but is difficult due to the complex structure...

#research #paper #ai #machine-learning #nlp
2 months ago · ai · - · -

[Paper] Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering

We present Gaussian Pixel Codec Avatars (GPiCA), photorealistic head avatars that can be generated from multi-view images and efficiently rendered on mobile dev...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Artism: AI-Driven Dual-Engine System for Art Generation and Critique

This paper proposes a dual-engine AI architectural method designed to address the complex problem of exploring potential trajectories in the evolution of art. W...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Multi-View Foundation Models

Foundation models are vital tools in various Computer Vision applications. They take as input a single RGB image and output a deep feature representation that i...

#research #paper #ai #computer-vision

Newer posts

Older posts