[논문] 표면 형태 변화가 검열 판단에 미치는 영향: 코드 혼합 워크플로 불안정성에 대한 쌍별 연구
Hate moderation is often evaluated as classification on clean English inputs, but deployed systems must route content to actions such as ALLOW, FLAG, or REVIEW....
1025 posts from this source
Hate moderation is often evaluated as classification on clean English inputs, but deployed systems must route content to actions such as ALLOW, FLAG, or REVIEW....
This paper addresses the question: How can mission effectiveness be systematically defined or approximated in the absence of customer requirements? Legacy requi...
Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. How...
Decentralized Federated Learning (FL) removes reliance on centralized coordinators but remains vulnerable to model poisoning, unreliable validation, and high va...
For over half a century, software engineering has operated on a foundational premise: human engineers decompose problems, encode decision logic into static code...
How does a system that merely predicts the world come to distinguish its own causal influence from everything else? We trace this transition in a minimal 192-di...
Code Agents have achieved remarkable advances in recent years, exhibiting strong capabilities across a wide range of software engineering tasks. However, their ...
The rapid proliferation of Agent Development Kits (ADKs), SDK-level frameworks for building LLM-powered autonomous agents, has outpaced any empirical understand...
Tackling complex coding tasks often requires autonomous agents and iterative repair pipelines. These increasingly rely on large amounts of test-time computation...
Bitcoin's block reward is scheduled to decline to zero, raising concerns about whether the network can remain secure once miners rely solely on transaction fees...
Achieving peak GPU performance remains a significant challenge as the system throughput is constrained by host-device synchronization delays and kernel scheduli...
When an LLM repeatedly mutates a program, does it explore new forms or circle back to the same ones? We study this question by analyzing LLM-driven mutation cha...
Training Data Attribution (TDA)는 모델의 예측을 해당 훈련 데이터로 추적하려고 합니다. TDA의 금본위 표준은 인과적 개입(causal interventions)과 관찰…
We introduce T2Mo, a feed-forward framework for controllable dynamic 3D shape generation conditioned on 3D trajectories and text. Due to the inherent ambiguity ...
Audio-language models (ALMs) often follow text that conflicts with audio, even when the audio evidence is clear. This raises a basic question: is the audio-supp...
멀티 에이전트 추론 시스템은 ‘generate-then-transfer’ 패러다임을 채택하여 엔드‑투‑엔드 지연 시간이 파이프라인 깊이에 따라 선형적으로 증가하도록 강제합니다. 우리는 Strea…를 소개합니다.
추론 모델은 빠르게 발전했지만, 검증 가능한 보상으로부터의 강화 학습(RLVR)이라는 지배적인 레시피는 놀라울 정도로 좁은 범위에 머물고 있다: 많은 r을 샘플링한다.
The radial basis function neural network (RBFN) trained with a gradient descending algorithm provides an effective fully connected structure in both shallow and...
Vehicle body type is a significant determinant of cyclist injury severity in overtaking crashes, yet automated tools for classifying vehicles into injury-risk-r...
When post-trained language models fail on reasoning problems, the common test-time-scaling response is to spend more compute on additional attempts, and the fai...
Recent developments in multi-view image editing with generative models have brought us a step closer toward general 3D content generation and customization. Mos...
The rapid advancement of high-throughput sequencing has led to large, high-dimensional omics datasets. Deep unsupervised learning architectures, particularly Au...
Generating realistic financial time series is challenging as training data is often limited to a single historical path. With such scarce data, overfitting is h...
Deep active learning has previously been explored for LLM in-context sample selection, but not with methods that utilise recent advances in understanding of tra...
Koopman theory turns nonlinear dynamics into a linear spectral problem. In computation, however, everything depends on a hard finite-dimensional choice: the obs...
Individual-level mobility prediction은 urban simulation, transportation planning 및 policy analysis의 핵심이다. Supervised sequence models는 강력한 정확도를 달성한다.
Preserving data privacy is an important topic in structural data management and data mining. However, the issue of privacy leakage in distributed causal structu...
After the success of 3D Gaussian Splatting (3DGS) for novel view synthesis, many works have explored how to also use it for geometric surface representation. Ho...
Large language models는 점점 다른 모델에 의해 평가되고 있으며, 이는 자연스러운 질문을 제기한다: 모델이 판사가 자신의 출력에 대해 어떻게 점수를 매길지 예측할 수 있을까? 우리는 …
Audio is an inherently interactive modality, yet today's Large Audio Language Models (LALMs) are offline, and streaming audio models each handle only a single t...
Children learn the meanings of words from a continuous, temporally structured stream of egocentric experience. Recent work shows that neural networks can also l...
Large language models는 점점 소프트웨어 공학 교육의 일부가 되고 있으며, 여기에는 경험적 소프트웨어 공학 및 eviden…
Large language models (LLMs) are increasingly proposed as clinical agents, yet static, single-turn benchmarks cannot capture how a model dynamically delivers ca...
우리는 강력하지만 일반적인 비전 파운데이션 모델을 특수 과학 분야에 적용하기 위해 라벨이 없는 접근 방식을 제안한다. 표준 감독식 파인튜닝은 …
We investigate whether methods of human mathematics pedagogy can guide the training of language models toward arithmetic reasoning. Building on the GASING metho...
The Nancy Grace Roman Space Telescope (Roman), set for launch as early as September 2026, will conduct wide-field infrared imaging surveys with unprecedented sp...
Feed-forward 3D Gaussian Splatting 방법은 포즈가 지정된 이미지든 포즈가 없는 이미지든 단일 전방 패스로 장면을 재구성하지만, 현재 접근 방식은 하나의 Gauss...
Modern GPUs have Tensor Cores (TCs) capable of extremely high-throughput matrix operations, yet graph algorithms remain difficult to accelerate because of their...
Language-guided photo retouching aims to adjust color and tone while preserving geometry and texture. Recently, diffusion-based retouching shows a superior visu...
Conventional Generative Adversarial Networks (GANs) for Single Image Super-Resolution (SISR) often struggle with hallucinated artifacts, largely because standar...
AI 에이전트가 API를 호출하고 validation error가 발생하면, 무엇이 잘못됐는지에 대한 정보만으로는 충분하지 않습니다 — 다음에 무엇을 해야 하는지가 필요합니다. self‑reflective API는 validation error 시, …
통신 분야가 제로 터치 관리와 새로운 O-RAN 및 AI-RAN 프레임워크를 수용함에 따라, 현대 통신 네트워크는 이제 …
맥락: 어떤 소스 라인이 언제 삭제될지를 예측하는 것은 유지보수, 기술 부채, 그리고 리뷰 우선순위 지정에 중요합니다. 기존 MSR 접근법…
프로그래밍용 AI 도구는 이제 단순히 autocomplete나 chat assistants에 그치지 않는다: 이들은 process, roles, artifacts와 함께 development frameworks로 스스로 조직된다.
본 논문에서는 네트워크의 크기가 짝수인지, 혹은 보다 일반적으로 어떤 고정된 수와 동등한지(동치인지를) 지역적으로 인증하는 문제를 고려한다. 패리티 p...
Pearl은 고위 프로필 AI 산업의 지지를 받은 Layer-1 블록체인으로, 네트워크를 동시에 보호하는 Proof-of-Useful-Work (PoUW) 프로토콜을 마케팅합니다 ...
Directed Acyclic Graph (DAG) based BFT protocols have demonstrated the capability to achieve significantly high throughput in practice. Recent advancements focu...
기후 적응을 위한 도시 레이아웃 최적화는 건물 밀도와 차가운 공기 환기의 균형을 맞추는 것을 필요로 합니다. 물리 기반 기후 시뮬레이션은 com...