Source

arXiv

1025 posts from this source

정렬:

1주 전 · devops · - · -

[Paper] Low-Bandwidth 모델에서의 직사각형 행렬 곱셈

우리는 분산 컴퓨팅의 저대역폭 모델에서 직사각형 행렬 곱셈을 연구한다. n개의 컴퓨터가 있으며, 초기 입력 행렬은 분산되어 있다.

#research #paper #devops
1주 전 · ai · - · -

[논문] Ekka: LLM 추론 시 무음 오류 자동 진단

LLM serving frameworks are quickly evolving with a complex software stack and a vast number of optimizations. The rapid development process can introduce silent...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] Multi‑SPIN: 엣지에서 협력 토큰 생성을 위한 다중 접근 사전 추론

Speculative inference (SPIN) was originally developed as an efficient architecture to accelerate Large Language Models (LLMs). In this work, we propose its dist...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 딥 강화학습을 활용한 암호화폐 시장의 동적 다중 페어 트레이딩 전략

This study aims to determine whether the application of Deep Reinforcement Learning (DRL) as a specialized execution overlay can enhance pair trading in highly ...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[Paper] ParetoPilot: Zero‑Surrogate 오프라인 다목표 최적화 via Infer‑Perturb‑Guide Diffusion

Offline multi-objective optimization (Offline MOO)은 비용이 많이 드는 환경 상호작용 없이 static datasets를 기반으로 새로운 Pareto-optimal 설계를 발견하는 것을 목표로 합니다.

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] D^2SD: 이중 확산 초안 모델로 추측 디코딩 가속화

Speculative decoding accelerates autoregressive large language model inference by drafting multiple tokens and verifying them in a single target-model forward p...

#research #paper #ai #machine-learning
1주 전 · devops · - · -

[논문] FlexNPU: 동적 LLM 프리필·디코드 공동 배치를 위한 투명 NPU 가상화

Modern AI serving increasingly relies on NPUs for conventional inference and large language model serving. However, current NPU deployments commonly expose phys...

#research #paper #devops
1주 전 · devops · - · -

[논문] ACEAPEX: 인코딩 시 절대 오프셋 해결로 병렬 LZ77 디코딩

LZ77-based codecs exhibit a fundamental sequential bottleneck in decoding: each back-reference depends on previously decompressed data, preventing multi-core sc...

#research #paper #devops
1주 전 · ai · - · -

[논문] 라이다 의미 장면 완성을 위한 간단한 향상 방안 탐구

This paper investigates 'free lunch' strategies to boost the performance of lidar semantic scene completion (SSC) without requiring complex architectural redesi...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] SimuScene: 단일 이미지에서 시뮬레이션용 3D 장면을 구성·재구성

Reconstructing interactive, simulation-ready 3D scenes from a single image is a critical bottleneck for robotic manipulation. While recent single-image lifters ...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] 뉴런 집단, 규모에 따라 선택성 차이 나타남

We investigate whether neuron populations within neural networks evolve predictably with scale, extending scaling laws beyond macroscopic observables such as lo...

#research #paper #ai #machine-learning #nlp #computer-vision
1주 전 · ai · - · -

[논문] PixVOD: 픽셀 분산 직접 시각 오도메트리 및 깊이 추정

Images composed of 2D pixel arrays are the standard input to computer vision algorithms, yet many underlying computations can be distributed across pixels. Tran...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] NewtPhys: 기초 모델이 뉴턴 물리학을 이해할까?

Previous work has evaluated physics reasoning in foundation models using synthetic or semi-synthetic scenes and visual question-answering tasks. However, these ...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[Paper] Quadratic integrate-and-fire neurons은 덜 파편화된 loss landscapes를 보이며 spike-based gradient descent에서 leaky integrate-and-fire neurons보다 우수한 성능을 보인다

spiking neural networks을 훈련시키는 능력은 biological neural networks를 모델링하고 neuromorphic computing을 수행하는 데 필수적입니다. 그러나, 확장성을 위해…

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 희소 도로 관측을 활용한 유전 최적화 기반 도시 교통 시뮬레이션 보정

Urban traffic simulation is a critical tool for infrastructure planning, including the placement of electric vehicle charging stations. However, realistic traff...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[Paper] Orthogonal-Easy-Axis Magnetic Tunnel Junction에 의해 가능해진 부호 스파이킹 뉴런

부호 스파이킹 뉴런은 표준 스파이킹 뉴런보다 더 풍부한 정보를 전달합니다. 이 연구는 부호를 위한 컴팩트한 자기 터널 접합(MTJ) 기반 뉴런을 제안합니다.

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[Paper] Equilibrium Propagation을 이용한 ImageNet에서 Predictive Coding Network 훈련

Equilibrium Propagation (EP)은 물리 기반 훈련 프레임워크로, 주로 연속 Hopfield 네트워크를 포함한 에너지 기반 모델에 사용되어 왔습니다.

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[Paper] 단기 시냅스 가소성이 목표 조건부 역학을 안정화시킨다: 다단계 목표 지향 행동 계획을 위한 PFC 영감형 Reservoir Model

전전두엽 피질(PFC)은 행동 계획을 위해 목표 정보를 유지하지만, recurrent circuits가 행동 시간에 걸쳐 이를 행동에 사용할 수 있는 형태로 어떻게 보존하는지는…

#research #paper #ai
1주 전 · ai · - · -

[논문] PrimeSVT: 스파이킹 비전 트랜스포머를 위한 메모리 인식 자동 프루닝 프레임워크와 우선순위 압축 정책

The large sizes of Spiking Vision Transformers (SViTs) still hinder their embedded implementation, highlighting the need for model compression. State-of-the-art...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 명시적 단위 거리 하한 증명 최적화

The 2026 disproof of Erdős's unit-distance conjecture and Sawin's subsequent explicit quantitative refinement show that the maximum number u(n) of unit distance...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] PSViT: 스파이킹 비전 트랜스포머의 구조적 가지치기 방법론

Spiking Vision Transformer (SViT) models are promising low-power ViT models for solving vision-based tasks with state-of-the-art performance. However, their lar...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 정적 사전 가정 탈피: 대규모 개미 군집 최적화를 위한 동적 신경 가이드

Neural-guided Ant Colony Optimization (ACO) suffers from a fundamental training-inference misalignment: policies are typically trained to generate static priors...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 블렌더 사고: 비전·언어 모델 기반 단계적 실행 역그래픽스

Inverse graphics is a longstanding and highly underconstrained problem that seeks to reconstruct images as editable 3D scenes which can be rendered, relit, and ...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] 다중모달 LLM‑판사에서 인지 판단 편향을 인지 교란과 보상 모델링으로 완화

Recent multimodal large language models have demonstrated strong reasoning ability, yet their reliability as automated evaluators remains limited by a critical ...

#research #paper #ai #machine-learning #computer-vision
1주 전 · ai · - · -

[논문] RoboDream: 확장 가능한 로봇 데이터 합성을 위한 구성 세계 모델

Scaling robot learning requires large-scale, diverse demonstrations, yet real-world data collection via teleoperation remains prohibitively expensive and time-c...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] ProtoAda: 프로토타입 기반 적응 어댑터 확장·기하학적 통합으로 다중모달 지속 인스트럭션 튜닝

Multimodal Large Language Models (MLLMs) achieve strong performance through instruction tuning, but real-world deployment requires them to continually acquire n...

#research #paper #ai #machine-learning #computer-vision
1주 전 · ai · - · -

[논문] 무에서 영웅까지: 훈련 없이 세계 모델에서 맞춤 개념 생성

Autoregressive world models have emerged as a powerful paradigm for interactive video generation, allowing users to navigate dynamically generated environments ...

#research #paper #ai #computer-vision
1주 전 · ai · - · -

[논문] AdaCodec: 비디오 MLLM을 위한 예측 시각 코드

Video is temporally redundant: adjacent frames usually share most objects, background, and layout. Yet existing video multimodal large language models (video ML...

#research #paper #ai #machine-learning #nlp #computer-vision
1주 전 · ai · - · -

[논문] ClinEnv: 에이전트를 위한 인터랙티브 다단계 장기 전자건강기록 환경

Clinical practice is not the selection of an answer from enumerated options: a physician gathers heterogeneous information incrementally and commits to sequenti...

#research #paper #ai #machine-learning #nlp
1주 전 · ai · - · -

[Paper] IntraShuffler: A Privacy Preserving Framework for Heterogeneous DP Federated Learning

Heterogeneous Differential Privacy (HDP) in Federated Learning (FL) allows clients to select individual privacy budgets (varepsilon_i) according to institutiona...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 신뢰 추론을 통한 허용적 안전: 검증 가능한 신념공간 신경 안전 필터로 보장된 인터랙티브 로봇학

Autonomous robots that interact with people must make safe and efficient decisions under human-induced uncertainty, such as their preferences, goals, competency...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] 레이어에서 서브모듈까지: 교체 기반 LLM 압축의 세분화 재고

Post-training compression of Large Language Models (LLMs) removes entire architectural components, either deleting them or replacing them with fitted modules. E...

#research #paper #ai #machine-learning #nlp
1주 전 · ai · - · -

[논문] 깊이 모호성 모델링: 플라잉 포인트 없는 깊이 추정을 위한 혼합 밀도 표현

Despite advances in depth estimation, flying points remain a persistent failure mode: near object boundaries, depth estimators often predict spurious 3D points ...

#research #paper #ai #machine-learning #computer-vision
1주 전 · ai · - · -

[논문] SimSD: 확산 언어 모델을 위한 간단한 추측 디코딩

Diffusion large language models (dLLMs) have recently emerged as a promising alternative to autoregressive (AR) LLMs, offering faster inference through parallel...

#research #paper #ai #machine-learning #nlp
1주 전 · ai · - · -

[논문] 적응형 에이전트의 행동 궤적 추적

Text files such as skill files, memory files, and behavioral configuration files play a central role in defining how modern agents act. Through edits by humans ...

#research #paper #ai #machine-learning
1주 전 · ai · - · -

[논문] SafeSteer: 효율적인 안전 정렬을 위한 지역화 온‑정책 증류

Aligning Large Language Models (LLMs) with human values often degrades their general capabilities, termed the alignment tax. Existing methods mitigate this by b...

#research #paper #ai #machine-learning #nlp
1주 전 · ai · - · -

[논문] 하이퍼파라미터 친화적 최적화는 왜 안 될까? 긴 꼬리 인식을 위한 단조 적응형 노름 재스케일링 접근법

Long-tailed recognition poses a significant challenge for deep learning. The two-stage decoupling paradigm, which separates representation learning from classif...

#research #paper #ai #machine-learning #computer-vision
1주 전 · ai · - · -

[논문] 신뢰성 확보 전 에이전시 시스템 모니터링

Agentic systems entering production typically operate as partially integrated assemblies where structural defects, not task-level errors, dominate the failure l...

#research #paper #ai #machine-learning
1주 전 · software · - · -

[논문] 어둠 속을 탐색하며: 구성 요소에 대한 공유된 이해가 중요한 이유

By listing the components included in an application, Software Bills of Materials (SBOMs) are intended to support the timely identification of vulnerable compon...

#research #paper #software
1주 전 · ai · - · -

[논문] 오류는 모두 같지 않다: 대형 언어 모델 추론에서 오류 전파에 대한 체계적 연구

Large language models (LLMs) are increasingly integrated into high-performance computing (HPC) workflows, accelerating scientific discovery through diverse pers...

#research #paper #ai #machine-learning
1주 전 · devops · - · -

[논문] 하이브리드 시스템을 활용한 분자동역학 전략: LAMMPS 활용 사례

The complexity of biomolecular simulations has substantially increased the demand for High-Performance Computing (HPC) infrastructures, particularly in molecula...

#research #paper #devops
1주 전 · devops · - · -

[논문] EES‑CND: 드리프트 인식 결함 허용 엣지·클라우드 서비스 배치를 위한 협업 신경망 의사결정

The edge-cloud paradigm improves service delivery by orchestrating resources across edge nodes and cloud data centres. These environments consist of heterogeneo...

#research #paper #devops
1주 전 · devops · - · -

[논문] TAPAAL SMC: 확률적 타임드 아크 페트리넷의 통계적 모델 검증

Timed-Arc Petri net (TAPN) is a timed extension of the classical Petri net model where tokens have their age and input arcs are associated with time intervals r...

#research #paper #devops
1주 전 · software · - · -

[논문] 복제 패키지 품질 평가를 위한 주체적 접근

Reproducibility in empirical software engineering relies on complete, accessible, and reusable research artifacts, yet artifact evaluation remains largely manua...

#research #paper #software
1주 전 · ai · - · -

[논문] LLM을 활용한 알고리즘 개발: 텐서 네트워크 수축 순서 최적화 사례 연구

We consider LLM-based algorithm development through a case study on contractionorder optimisation for tensor networks with OpenEvolve. We pay particular attenti...

#research #paper #ai #machine-learning
1주 전 · software · - · -

[논문] 신뢰 기반 코드 리뷰: LLM이 만든 다파일 변경에 대한 리뷰 워크플로우 참여 설계 연구

Background: Developers increasingly review multi-file code changes generated by LLM-based agents, yet no validated end-to-end workflow or IDE tooling design exi...

#research #paper #software
1주 전 · devops · - · -

[논문] 비확장 오버헤드 제거로 Amdahl 한계 초월 LLM 추론 스케일링

Deployers of online LLM services usually seek to maximize cluster-wide performance given a fixed number of GPUs. Tensor parallelism (TP) is necessary to fit mod...

#research #paper #devops
1주 전 · software · - · -

[논문] 프로젝트 특성별 ML 전용 및 일반 파이썬 코드 냄새 비교

Machine learning systems consist of general-purpose code as well as machine-learning-specific code. While ML-specific code smells have been identified, their co...

#research #paper #software

Newer posts

Older posts