[Paper] Multi-Region Cloud Service Provisioning Using AI-Based Spot Fleets
Cloud service platforms increasingly rely on elastic infrastructures to support dynamic workloads. Spot instances provide discounted computing resources but int...
Representation Autoencoders (RAEs) leverage frozen vision foundation models (VFMs) as tokenizer encoders, providing robust high-level representations that facil...
Survival analysis aims to estimate a time-to-event distribution from data with censored observations. Many existing methods either impose structural assumptions...
Real-time cognitive load assessment from eye-tracking signals could enable adaptive human-centered AI in safety-critical applications such as d...
Real-time cognitive load assessment is essential for adaptive human-computer interaction but remains challenging due to limited labeled data and poor cross-subj...
Large language models (LLMs) exhibit systematic political bias across a variety of sensitive contexts. We find that LLMs handle counterpart topics from opposing...
Large language models (LLMs) are typically trained on shuffled corpora, so that knowledge is fixed at training time and temporal grounding …
Children with rare genetic diseases often exhibit distinctive facial phenotypes, yet developing computer vision systems for early diagnosis remains challenging ...
As generative image models evolve rapidly, the perceptual gap between generated and real images continues to narrow, making AI-generated image detection increas...
Biomedical knowledge graphs (KGs) treat disease associations as static facts, but temporal information is crucial for clinical reasoning, e.g., a symptom diagno...
Every Python function deployed as an LLM tool must today exist in two forms: an HTTP endpoint for human-facing clients and CI pipelines, and an MCP tool registr...
We investigate whether acoustic emotion recognition models can serve as proxies for the Pathos dimension in political speech analysis, as operationalised by the...
Autoregressive video diffusion models have enabled real-time, action-conditioned world generation. However, sustaining a persistent world, where revisiting a pr...
As wearable and mobile devices become increasingly embedded in daily life, they offer a practical way to continuously sense human motion in the wild. But inerti...
Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items passing through one ...
We introduce Tokenization with Split Trees (ToaST), a subword tokenization method that directly optimizes compression under a new recursive inference procedure....
Trauma resuscitation is a clinical process for treating life-threatening physiological disorders in safety-critical environments, driven by the experience of he...
Skills are increasingly used to package agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, skills often need to e...
The advent of cardless artificial intelligence (AI) banking heralds a paradigm shift in the financial landscape, offering users unprecedented security and conve...
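Today, tool-calling agents are typically evaluated or tested on static datasets such as input commands, agent responses, and the associated execution traces.
AI coding agents are submitting a growing number of pull requests (Agentic-PRs) to open-source repositories, but their performance is typically evaluated by whether a PR is merged or rejected…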
Recent advances in coding agents have shown remarkable progress in software issue resolution. In practice, real-world issues are typically bug fixes or feature ...
In Opportunistic Networks (OppNets), information dissemination can rely only on transient pairwise radio contacts between mobile devices (peers). Designi...
Reducing collective communication latency is a critical goal for large model training and inference in both academia and industry. Many-to-many communications, ...
Erasure codes are a critical component in reliable storage systems today, and many blockchain systems use consensus protocols that involve erasure codes to redu...
We observe that existing model interpretation methods generally ignore the baseline, and such neglect often results in imprecise or even incorrect interpretatio...
Hybrid language models like Jamba mix attention layers with State Space Models (SSMs), creating two memory cache types with opposite profiles: Key-Value (KV) ca...
Does the relationship between learning rules and brain alignment generalize across species? We extend our prior finding that untrained CNNs match backpropagatio...
Scientific workflows are pipelines of interdependent tasks. They are increasingly executed on shared Kubernetes clusters via workflow engines such as Nextflow. ...
Symbolic regression with genetic programming (GPSR) may suffer from overfitting and structural bloat, especially when noise is present. In this paper we evaluat...
As large language models (LLMs) are increasingly deployed for software engineering, constructing high-quality benchmarks is crucial for evaluating not just the ...
Generative Artificial Intelligence (GenAI) is rapidly reshaping software development, with growing emphasis on accelerating productivity and optimizing performa...
Despite strong predictive results in the clinical machine learning literature, the translation of these models into bedside use remains limited by systems-level...
The Thousand Brains Theory (TBT) and the open-source Monty framework model object recognition through sensorimotor inference, with objects actively …
The advent of edge computing has enabled resource-constrained clients to delegate intensive computational tasks to distributed edge servers, especially within I...
To reduce user costs and maximize cluster utilization, large model training increasingly leverages volatile but inexpensive GPU capacity, such as spot instances...
We study fixed-cardinality maximization of the inverse-matrix Solow-Polasky diversity, equivalently finite metric magnitude for the exponential kernel, on one-...
The integration of machine learning with domain-specific physics is transforming the design, monitoring, and control of electricity systems, where data scarcity...
We develop a mean-field theory of dropout as a perturbation of critical signal propagation at the edge of chaos. Dropout shifts the perfect-alignment fixed poin...
Pretrained diffusion models serve as frozen teachers, feeding downstream pipelines such as text-to-3D, single-step distillation, and data attribution. The teache...
Scaling test-time compute by iteratively updating a latent state has emerged as a powerful paradigm for reasoning. Yet the internal mechanisms that enable these...
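Currently, strengthening the image understanding, generation, and editing capabilities of Unified Multimodal Models (UMMs) relies mainly on mixed multi-task training...
Hyperparameter transfer allows optimal optimization hyperparameters found at small scale to be extrapolated to large scale, making it essential for training large language models.
Equivariant graph neural network (GNN) methods achieve the highest sequence recovery rates in antibody complementarity-determining region (CDR) design, but …
Discrete diffusion models excel at visual synthesis but rely on slow, iterative decoding. Existing single-step distillation methods attempt to bypass this bottleneck.
Precise measurement of the kinetic Sunyaev-Zel'dovich (kSZ) effect, a probe of the large-scale distribution of baryonic matter and a key observable in cosmology.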
Recent advances in artificial intelligence (AI) have accelerated the growth of both human-authored and AI-generated research outputs, placing increasing strain ...
Deep research, in which an agent searches the open web, collects evidence, and derives an answer through extended reasoning, is a prominent use case for frontie...