research — Page 71

3 weeks ago · ai

[Paper] Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs

Diffusion Large Language Models (dLLMs) offer fast, parallel token generation, but their standalone use is plagued by an inherent efficiency-quality tradeoff. W...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] Distilling to Hybrid Attention Models via KL-Guided Layer Selection

Distilling pretrained softmax attention Transformers into more efficient hybrid architectures that interleave softmax and linear attention layers is a promising...

#research #paper #ai #machine-learning #nlp
3 weeks ago · ai

[Paper] LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving

Simulators can generate virtually unlimited driving data, yet imitation learning policies in simulation still struggle to achieve robust closed-loop performance...

#research #paper #ai #machine-learning #computer-vision
3 weeks ago · ai

[Paper] Shallow Neural Networks Learn Low-Degree Spherical Polynomials with Learnable Channel Attention

We study the problem of learning a low-degree spherical polynomial of degree ell_0 = Θ(1) ge 1 defined on the unit sphere in RR^d by training an over-parameteri...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models

Large vision-language models (VLMs) typically process hundreds or thousands of visual tokens per image or video frame, incurring quadratic attention cost and su...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Vision-language models (VLM) excel at general understanding yet remain weak at dynamic spatial reasoning (DSR), i.e., reasoning about the evolvement of object g...

#research #paper #ai #computer-vision
3 weeks ago · ai

[Paper] Advancing Multimodal Teacher Sentiment Analysis:The Large-Scale T-MED Dataset & The Effective AAM-TSA Model

Teachers' emotional states are critical in educational scenarios, profoundly impacting teaching efficacy, student engagement, and learning achievements. However...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] Step-DeepResearch Technical Report

As LLMs shift toward autonomous agents, Deep Research has emerged as a pivotal metric. However, existing academic benchmarks like BrowseComp often fail to meet ...

#research #paper #ai #nlp
3 weeks ago · devops

[Paper] WOC: Dual-Path Weighted Object Consensus Made Efficient

Modern distributed systems face a critical challenge: existing consensus protocols optimize for either node heterogeneity or workload independence, but not both...

#research #paper #devops
3 weeks ago · ai

[Paper] SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization

Maintaining large-scale, multilingual codebases hinges on accurately localizing issues, which requires mapping natural-language error descriptions to the releva...

#research #paper #ai #machine-learning
3 weeks ago · ai

[Paper] Coherence in the brain unfolds across separable temporal regimes

Coherence in language requires the brain to satisfy two competing temporal demands: gradual accumulation of meaning across extended context and rapid reconfigur...

#research #paper #ai #nlp
3 weeks ago · ai

[Paper] Snapshot 3D image projection using a diffractive decoder

3D image display is essential for next-generation volumetric imaging; however, dense depth multiplexing for 3D image projection remains challenging because diff...

#research #paper #ai #computer-vision

Newer posts

Older posts