ai — Page 16 | EUNO.NEWS

3 weeks ago · ai · - · -

[Paper] Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching

We study dense and mixture-of-experts (MoE) transformers in a tiny-scale pretraining regime under a shared LLaMA-style decoder training recipe. The sparse model...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

Exploring Patterns of Survival from the Titanic Dataset

Introduction: The Titanic shipwreck was a major historical incident that shaped how we view human survival during disasters. Even a century later, this tragic incid...

#ai #data-science #tutorial
3 weeks ago · ai · - · -

[Paper] Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs

When an omnimodal large language model accepts a question whose textual premise contradicts what it actually sees or hears, does the failure lie in perception o...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

LLMs are widely adopted in production, pushing inference systems to their limits. Disaggregated LLM serving (e.g., PD separation and KV state disaggregation) im...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders

EEG foundation models achieve state-of-the-art clinical performance, yet the internal computations driving their predictions remain opaque: a barrier to clinica...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Children's English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety

Large Language Models (LLMs) are widely applied in educational practices, such as for generating children's stories. However, the generated stories are often to...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] DisAgg: Distributed Aggregators for Efficient Secure Aggregation in Federated Learning

Federated learning enables collaborative model training across distributed clients, yet vanilla FL exposes client updates to the central server. Secure-aggregat...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] RTLC -- Research, Teach-to-Learn, Critique: A three-stage prompting paradigm inspired by the Feynman Learning Technique that lifts LLM-as-judge accuracy on JudgeBench with no fine-tuning

LLM-as-a-judge is now the default measurement instrument for open-ended generation, but on the public JudgeBench benchmark even strong instruction-tuned judges ...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Dual-axis attribution of zebrafish tectal microcircuits for energy-efficient and robust neurocomputing

Biological neural circuits contain specialized substructures that support distinct computational functions, yet many bio-inspired neural networks borrow biologi...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Texture Regenerating and Grafting Using Genome-Driven Neural Cellular Automata

This study significantly advances multi-texture synthesis using Neural Cellular Automata (NCAs) by introducing a novel training methodology that enables robust ...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] MARLIN: Multi-Agent Game-Theoretic Reinforcement Learning for Sustainable LLM Inference in Cloud Datacenters

Large Language Models (LLMs) have become increasingly prevalent in cloud-based platforms, propelled by the introduction of AI-based consumer and enterprise serv...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark

Agentic AI is changing the way users get work done. Following the success of OpenClaw (https://www.nvidia.com/en-us/ai/build-a-claw/), the community is embracing n...

#ai #gpu #nvidia
3 weeks ago · ai · - · -

[Paper] Nonsmooth Set-Gradient Ascent to the Pareto Front via Layered Hypervolume and Magnitude Indicators

A nonsmooth set-gradient ascent method is developed for moving finite approximation sets toward the Pareto front in multiobjective optimization. The method opti...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Asynchronous stochastic gradient descent (ASGD) is a standard way to exploit heterogeneous compute resources in distributed learning: instead of forcing fast wo...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

Generative recommendation (GR) has emerged as a promising paradigm that replaces fragmented, scenario-specific architectures with unified Transformer-based mode...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] The Geno-Synthetic Algorithm: Type-Factored Coevolutionary Optimization for Heterogeneous Genotypes and Assembled Phenotypes

Many real-world optimization problems are not naturally homogeneous vectors but composite design objects with heterogeneous parameters: integers, real values, B...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Constitutional Governance in Metric Spaces

Computational social choice and algorithmic decision theory offer rich aggregation theory but no end-to-end, polynomial-time process for egalitarian self-govern...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] AI Harness Engineering: A Runtime Substrate for Foundation-Model Software Agents

Foundation models have transformed automated code generation, yet autonomous software-engineering agents remain unreliable in realistic development settings. Th...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Hierarchical Transformer Preconditioning for Interactive Physics Simulation

Neural preconditioners for real-time physics simulation offer promising data-driven priors, but they often fail to capture long-range couplings efficiently beca...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

Building a safe, effective sandbox to enable Codex on Windows

When I joined the Codex engineering team in September 2025, Codex for Windows didn’t have a sandbox implementation, meaning that Windows users were forced to cho...

#ai #ai-models #llm
3 weeks ago · ai · - · -

[Paper] TRUST-TAEA: A trustworthiness-guided two-archive evolutionary algorithm with variable-grouping sparse search for large-scale multi-objective optimization

Large-scale multi-objective optimization remains challenging because high-dimensional decision spaces, complex variable interactions, and limited function evalu...

#research #paper #ai
3 weeks ago · ai · - · -

[Paper] Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven Validation

Biological neural networks (BNNs) have been established as a powerful and adaptive substrate that offers the potential for incredibly energy- and data-efficient i...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] The Readability Spectrum: Patterns, Issues, and Prompt Effects in LLM-Generated Code

As Large Language Models (LLMs) are transforming software development, the functional quality of generated code has become a central focus, leaving readability,...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] FiTS: Interpretable Spiking Neurons via Frequency Selectivity and Temporal Shaping

Spiking Neural Networks (SNNs) are a promising framework for event-driven temporal processing. Prior work has improved temporal modeling through richer neuron d...

#research #paper #ai
3 weeks ago · it · - · -

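D.CAMP Partners with JR East to Connect Seven Korean Startups with Tokyo Smart-City Proof-of-Concept Opportunities

Program overview: D.CAMP, a startup growth partner, is working with East Japan Railway Company (JR East), Japan's largest railway and urban development company, to offer Korean startups proof-of-concept opportunities in a Japanese smart-city project...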

#smart city #startup partnership #JR East #Korean startups #AI #digital health #energy tech #open innovation
3 weeks ago · ai · - · -

[Paper] ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery

Advances in large language models (LLMs) have recently opened new and promising avenues for small-molecule drug discovery. Yet existing LLM-based approaches for...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Computer-use agents (CUAs) automate on-screen work, as illustrated by GPT-5.4 and Claude. Yet their reliability on complex, low-frequency interactions is still ...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Recent large vision-language models (VLMs) remain fundamentally constrained by a persistent dichotomy: understanding and generation are treated as distinct prob...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] EgoForce: Forearm-Guided Camera-Space 3D Hand Pose from a Monocular Egocentric Camera

Reconstructing the absolute 3D pose and shape of the hands from the user's viewpoint using a single head-mounted camera is crucial for practical egocentric inte...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Autoregressive video generation aims at real-time, open-ended synthesis. Yet, cinematic storytelling is not merely the endless extension of a single scene; it r...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] From Web to Pixels: Bringing Agentic Search into Visual Perception

Visual perception connects high-level semantic understanding to pixel-level perception, but most existing settings assume that the decisive evidence for identif...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

In this paper, we propose AlphaGRPO, a novel framework that applies Group Relative Policy Optimization (GRPO) to AR-Diffusion Unified Multimodal Models (UMMs) t...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Revisiting Photometric Ambiguity for Accurate Gaussian-Splatting Surface Reconstruction

Surface reconstruction with differentiable rendering has achieved impressive performance in recent years, yet the pervasive photometric ambiguities have strictl...

#research #paper #ai #computer-vision
3 weeks ago · ai · - · -

[Paper] LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

Long-term memory is crucial for agents in specialized web environments, where success depends on recalling interface affordances, state dynamics, workflows, and...

#research #paper #ai #nlp
3 weeks ago · ai · - · -

[Paper] Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

We introduce Pion, a spectrum-preserving optimizer for large language model (LLM) training based on orthogonal equivalence transformation. Unlike additive optim...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Elastic Attention Cores for Scalable Vision Transformers

Vision Transformers (ViTs) achieve strong data-driven scaling by leveraging all-to-all self-attention. However, this flexibility incurs a computational cost tha...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] Task-Adaptive Embedding Refinement via Test-time LLM Guidance

We explore the effectiveness of an LLM-guided query refinement paradigm for extending the usability of embedding models to challenging zero-shot search and clas...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Learning, Fast and Slow: Towards LLMs That Adapt Continually

Large language models (LLMs) are trained for downstream tasks by updating their parameters (e.g., via RL). However, updating parameters forces them to absorb ta...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

In settings where labeled verifiable training data is the binding constraint, each checked example should be allocated carefully. The standard practice is to us...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Computer Use Agents (CUAs) can act through both atomic GUI actions, such as click and type, and high-level tool calls, such as API-based file operations, but th...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Recent advances in joint audio-video generation have been remarkable, yet real-world applications demand strong per-modality fidelity, cross-modal alignment, an...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai · - · -

[Paper] MEME: Multi-entity & Evolving Memory Evaluation

LLM-based agents increasingly operate in persistent environments where they must store, update, and reason over information across many sessions. While prior be...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts

Sparse Mixture-of-Experts (SMoE) models enable scaling language models efficiently, but training them remains challenging, as routing can collapse onto few expe...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Reward Hacking in Rubric-Based Reinforcement Learning

Reinforcement learning with verifiable rewards has enabled strong post-training gains in domains such as math and coding, though many open-ended settings rely o...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference

We introduce KV-Fold, a simple, training-free long-context inference protocol that treats the key-value (KV) cache as the accumulator in a left fold over sequen...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Solve the Loop: Attractor Models for Language and Reasoning

Looped Transformers offer a promising alternative to purely feed-forward computation by iteratively refining latent representations, improving language modeling...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai · - · -

[Paper] Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

Extreme weather and volatile wholesale electricity markets expose residential consumers to catastrophic financial risks, yet demand response at the distribution...

#research #paper #ai #machine-learning
3 weeks ago · ai · - · -

[Paper] Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

The continued improvements in language model capability have unlocked their widespread use as drivers of autonomous agents, for example in coding or computer us...

#research #paper #ai #machine-learning #nlp
