Source

arXiv

1077 posts from this source

1 week ago · ai · - · -

[Paper] Sycophantic Praise: Evaluating Excessive Praise in Language Models

Sycophancy in language models is typically studied as excessive agreement or validation, while explicit praise and flattery have received comparatively little a...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Rethinking ISO 26262 for the Autonomous Driving Era: Strengthening Controllability with Transferability and Predictability

The ISO 26262 standard defines functional safety for road vehicles through risk assessments based on Severity, Exposure, and Controllability, grounded in a huma...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Skill-3D: Scene-Aware Skill Evolution for Agentic 3D Spatial Reasoning

This paper explores agentic 3D spatial understanding, i.e., MLLM agents performing 3D reasoning through tool use. Existing methods often misuse tools and exhibi...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] The Lipreading Gap: Do VSR Models Perceive Visual Speech Like Humans?

Visual speech recognition (VSR) models now surpass human lipreaders on benchmarks, but do such gains establish human-like visual speech perception? To explore t...

#research #paper #ai #nlp #computer-vision
1 week ago · ai · - · -

[Paper] Watching, Remembering, Reasoning: Human Visual Video Understanding and MLLMs

Video understanding is being rapidly transformed by multimodal large language models (MLLMs), as research moves from short clips to long, multimodal, and knowle...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] OpenGlass: Open-Source Smart Glasses for On-Device Event-Based Gesture Recognition

Smart eyewear enables unobtrusive, context-aware interaction through multimodal sensors and on-device intelligence, but is severely limited by power, memory, an...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Multi-Scale Deep Formula Discovery for Complex Systems with Neural Lambda Calculus

A fundamental problem in science is identifying underlying patterns of complex systems in the form of concise mathematical formulas. Current Artificial Intellig...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] The Masked Advantage: Exploring Access to Cultural Knowledge Through Local Languages in LLMs

Large language models are increasingly used to answer culturally grounded questions across languages, yet it remains unclear whether local cultural knowledge is...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] DisPOSE: Self-Supervised Multi-View 3D Human Pose Estimation Using Projected Multi-Probability Diffusion

Recovering 3D human poses for multiple individuals from different camera views is a fundamental bottleneck for analyzing interacting behaviors. Existing self-su...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Image-Based Prediction of In-Flight Particle Properties in Atmospheric Plasma Spraying

Atmospheric plasma spraying (APS) is a widely used coating process in which in-flight particle temperature and velocity strongly influence coating quality. Howe...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Sparsely Gated Small Linear Experts

Sparsity allows scaling model parameters without proportionally increasing computational cost. While mixture of experts (MoE) models are made increasingly spars...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Socratic‑SWE: Coding Agents That Self-Evolve Through Trace-Based Skills

LLM-driven software engineering agents have become a central testbed for real-world language-model capability, yet their training remains limited by the availab...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] A Comprehensive Analysis of Mathematical Reasoning in Humans and the DeepSeek‑R1 LLM

The emergence of 'Aha moments' in large language models, particularly DeepSeek-R1-0120, has raised the question of whether these systems genuinely reason or mer...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Reversible Foundations: Training a 120B-Scale Sparse MoE Through State-Preserving Scaling

This paper reports on training a hundred-billion-parameter sparse mixture of experts on a single eight-GPU node, end to end. LightningLM 0.1V is a recurrence-ba...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Proxy Benders Decomposition

Benders decomposition is a fundamental framework for solving large-scale mixed-integer optimization problems with complicating variables that, when fixed, yield...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] M³Exam: A Multimodal Memory Benchmark for Realistic User‑Agent Interaction

Language agents are increasingly deployed over accumulating multimodal information, yet existing benchmarks assume a human-human form with sparse visuals and st...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] RealDocBench: A Benchmark for Field‑Level QA and Layout Understanding of Real-World Regulated Documents

Document parsing systems are increasingly deployed in high-stakes, regulated workflows such as mortgage underwriting, financial reporting, supply-chain logistic...

#research #paper #ai #computer-vision
1 week ago · ai · - · -

[Paper] Generative Modeling of Discrete Latent Structures via Dynamic Policy Gradients

Many scientific problems require inferring unobserved mechanistic latent states from indirect observations. While classical approaches, including expectation ma...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Mind the Gap: Resolving Performance Bottlenecks in Video Instance Segmentation

In Video Instance Segmentation (VIS), classification, segmentation, and tracking objectives are jointly evaluated, but their individual contributions to perform...

#research #paper #ai #computer-vision
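1 week ago · software · - · -

[Paper] Is U.S. Defense Acquisition Ready to Adopt AI-Enabled Capabilities? Assessing the Department of Defense Software Acquisition Pathway Through Scenario-Based Policy Analysis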

As AI systems transition from experimental prototypes to mission-critical tools, their dependence on dynamic data, evolving models, and governance raises questi...

#research #paper #software
1 week ago · ai · - · -

[Paper] Online Pandora's Box for Contextual LLM Cascading

Motivated by Large Language Model (LLM) cascading, we propose an online contextual Pandora's Box model for adaptively querying and selecting LLM APIs. In each p...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] The Effect of Synthetic Lesion MR Images on Automated Focal Cortical Dysplasia Detection Under Data Scarcity

Background and Purpose: Automated detection of focal cortical dysplasia (FCD) requires large volumes of voxelwise lesion-delineated MRI data, which are difficul...

#research #paper #ai #machine-learning #computer-vision
1 week ago · ai · - · -

[Paper] Are Coding Agents Cheating Us? Detecting and Preventing Cheating with Randomized Tests and Restricted Evaluation

A growing failure mode in agent evaluation and training is that models can achieve high evaluation scores by exploiting shortcuts instead of solving the intende...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Beyond Backscatter: InSAR Coherence from Detected SAR Images

In this work, we propose a deep learning framework for coherence regression directly from detected SAR images, without the need for accurate coregistration. A R...

#research #paper #ai #computer-vision
1 week ago · software · - · -

[Paper] On the Shoulders of Giants: Strengthening Automated Smart Contract Auditing with the GiAnt Corpus

High-quality smart contract auditing datasets are crucial for evaluating security tools and advancing smart contract security research. Two major limitations of...

#research #paper #software
1 week ago · ai · - · -

[Paper] Combinatorial Landscape Analysis for Dominating Set and Vertex Coloring

We analyze the two combinatorial problems of Dominating Set and Vertex Coloring regarding what kind of local optima are present for various instances. For a var...

#research #paper #ai
1 week ago · ai · - · -

[Paper] DirectAudioEdit: Text-Guided Inversion-Free Audio Editing via Diffusion Prediction Contrast

Text-guided audio editing aims to modify the language-specified acoustic content while preserving edit-irrelevant source components. Existing training-free meth...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] LLM-Guided Evolution of Medical Decision-Making Pipelines

Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Hierarchical Certified Semantic Commitment for Byzantine-Resilient LLM Agent Collaboration

Byzantine collaboration among large-language-model agents requires a finality-control primitive: given delivered stochastic, structured natural-language proposa...

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] QBugLM: An Agentic Benchmark Framework for LLM-Based Quantum Software Debugging

Quantum software bugs often yield silent, incorrect outputs rather than explicit errors, making them particularly difficult to detect and repair with convention...

#research #paper #software
1 week ago · ai · - · -

[Paper] SV-Detect: Detecting AI-Generated Text with Steering Vectors

Detecting machine-generated text is especially difficult under distribution shift, such as transfer across domains, source models, and editing attacks. We propo...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Aligning Acoustic Cues in Audio Language Models for Speech Emotion Recognition

Instruction-following audio language models (ALMs) can be augmented with explicit acoustic cues, yet it remains unclear whether such cues are used in a grounded...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Phun‑Bench: Evaluating Chinese Phonological Understanding in LLMs

Language is a vehicle for thought, intricately tied to sounds, symbols, and meaning. However, most large language model (LLM) research focuses on meaning (seman...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] SWE-Explore: A Repository Exploration Benchmark for Coding Agents

Repository-level coding benchmarks such as SWE-bench have driven a rapid surge in the capabilities of coding agents. Yet they usually treat coding tasks as a ho...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] Whales Ahead of Evolution: Swarm Intelligence Maximizes Memory in Connectome Reservoirs

Reservoir computing exploits the fixed dynamics of a recurrent network for temporal processing, requiring only a trained linear readout. Biological neural conne...

#research #paper #ai #machine-learning
1 week ago · devops · - · -

[Paper] Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends

Serial LLM inference backends -- such as Ollama -- process requests one at a time under FCFS admission, causing Head-of-Line Blocking (HOLB) under mixed workloa...

#research #paper #devops
1 week ago · ai · - · -

[Paper] KIT's Submission to IWSLT 2026 Cross-Lingual Voice Cloning

Cross-lingual voice cloning aims to generate speech in a target language while preserving speaker identity from a source-language reference. This task is centra...

#research #paper #ai #nlp
1 week ago · ai · - · -

[Paper] When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

Large Language Models (LLMs) are increasingly used in healthcare for tasks such as clinical question answering, diagnosis support, and report summarization. Des...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] MMAE: A Massive Multitask Audio Editing Benchmark

We introduce MMAE, a Massive Multitask Audio Editing benchmark, serving as the first comprehensive evaluation testbed designed for general-purpose instruction-b...

#research #paper #ai #nlp
1 week ago · software · - · -

[Paper] A Causal Probabilistic Framework for Perception-Based Closed‑Loop Simulation in Autonomous Driving

Software-in-the-loop (SIL) simulation is a cornerstone for the validation of modern automotive safety functions. However, many current frameworks utilize ideal ...

#research #paper #software
1 week ago · ai · - · -

[Paper] A Symbolic Regression Approach to Solving Equations Without Data

Many equations arising in science currently cannot be solved by available analytical techniques and are therefore solved numerically, without yielding explicit ...

#research #paper #ai
1 week ago · software · - · -

[Paper] MalSkillBench: A Runtime Verification Benchmark for Malicious Agent Skills

AI coding agents such as Claude Code and Gemini CLI increasingly extend themselves with third-party skills: markdown packages bundling natural-language instruct...

#research #paper #software
1 week ago · ai · - · -

[Paper] MetaConfigurator: AI-Assisted RDF Authoring from JSON Data

Scientific workflows increasingly generate structured JSON data that is easy to exchange but difficult to interpret consistently across systems due to lacking s...

#research #paper #ai #machine-learning
1 week ago · software · - · -

[Paper] Porting Declarative UIs to HarmonyOS: A Heuristic-Based LLM Approach

As an emerging operating system, HarmonyOS has a significant demand for software migration from platforms such as Android and iOS, where the User Interface (UI)...

#research #paper #software
1 week ago · devops · - · -

[Paper] Predictive Autoscaling in Cloud-Native and Federated Cloud‑Edge Environments: A Taxonomy and Future Challenges

Autoscaling is a key capability in cloud-native systems, where dynamic workloads, heterogeneous environments, and latency-sensitive applications require efficie...

#research #paper #devops
1 week ago · devops · - · -

[Paper] PCCL: A Process-Group-Aware, Scalable, General-Purpose Collective Algorithm Synthesizer

Distributed machine learning has become increasingly important due to the massive scale of large-scale generative models. Both model parameters and data are dis...

#research #paper #devops
1 week ago · devops · - · -

[Paper] A Mission-Level Runtime Assurance Framework for Autonomous Driving

This paper studies runtime safety for autonomous driving when high-level driving commands become faulty or unreliable. Unlike conventional runtime-safety approa...

#research #paper #devops
1 week ago · ai · - · -

[Paper] Declarative Skills for Knowledge-Grounded Tool-Use Workflows

We study orchestration mechanisms for tool-using AI agents in realistic customer-service workflows over an unstructured knowledge base. We argue that declarativ...

#research #paper #ai #machine-learning
