research — Page 63

1 month ago · ai · - · -

[Paper] Value-Aware Numerical Representations for Transformer Language Models

Transformer-based language models often achieve strong results on mathematical reasoning benchmarks while remaining fragile on basic numerical understanding and...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation

Code generation tasks aim to automate the conversion of user requirements into executable code, significantly reducing manual development efforts and enhancing ...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] SAM3-DMS: Decoupled Memory Selection for Multi-target Video Segmentation of SAM3

Segment Anything 3 (SAM3) has established a powerful foundation that robustly detects, segments, and tracks specified targets in videos. However, in its origina...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation

3D pose estimation from sparse multi-views is a critical task for numerous applications, including action recognition, sports analysis, and human-robot interact...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Modern video generative models based on diffusion models can produce very realistic clips, but they are computationally inefficient, often requiring minutes of ...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Empathy Applicability Modeling for General Health Queries

LLMs are increasingly being integrated into clinical workflows, yet they often lack clinical empathy, an essential aspect of effective doctor-patient communicat...

#research #paper #ai #nlp
1 month ago · software · - · -

[Paper] How well LLM-based test generation techniques perform with newer LLM versions?

The rapid evolution of Large Language Models (LLMs) has strongly impacted software engineering, leading to a growing number of studies on automated unit test ge...

#research #paper #software
1 month ago · ai · - · -

[Paper] LLMs can Compress LLMs: Adaptive Pruning by Agents

As Large Language Models (LLMs) continue to scale, post-training pruning has emerged as a promising approach to reduce computational costs while preserving perf...

#research #paper #ai #machine-learning #nlp #computer-vision
1 month ago · ai · - · -

[Paper] Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design

Structure-based and ligand-based computational drug design have traditionally relied on disjoint data sources and modeling assumptions, limiting their joint use...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection

Large Language Model (LLM) routers dynamically select optimal models for given inputs. Existing approaches typically assume access to ground-truth labeled data,...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benc...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection

Multi-Task Learning (MTL) combined with Low-Rank Adaptation (LoRA) has emerged as a promising direction for parameter-efficient deployment of Large Language Mod...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] Automating Supply Chain Disruption Monitoring via an Agentic AI Approach

Modern supply chains are increasingly exposed to disruptions from geopolitical events, demand shocks, trade restrictions, to natural disasters. While many of th...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal ...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Multi-agent systems have evolved into practical LLM-driven collaborators for many applications, gaining robustness from diversity and cross-checking. However, m...

#research #paper #ai #machine-learning #nlp
1 month ago · ai · - · -

[Paper] SCE-SLAM: Scale-Consistent Monocular SLAM via Scene Coordinate Embeddings

Monocular visual SLAM enables 3D reconstruction from internet video and autonomous navigation on resource-constrained platforms, yet suffers from scale drift, i...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Self-Supervised Animal Identification for Long Videos

Identifying individual animals in long-duration videos is essential for behavioral ecology, wildlife monitoring, and livestock management. Traditional methods r...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] LiteEmbed: Adapting CLIP to Rare Classes

Large-scale vision-language models such as CLIP achieve strong zero-shot recognition but struggle with classes that are rarely seen during pretraining, includin...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Image2Garment: Simulation-ready Garment Generation from a Single Image

Estimating physically accurate, simulation-ready garments from a single image is challenging due to the absence of image-to-physics datasets and the ill-posed n...

#research #paper #ai #computer-vision
1 month ago · ai · - · -

[Paper] Exploring Fine-Tuning for Tabular Foundation Models

Tabular Foundation Models (TFMs) have recently shown strong in-context learning capabilities on structured data, achieving zero-shot performance comparable to t...

#research #paper #ai #machine-learning
1 month ago · ai · - · -

[Paper] Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation

Word Sense Disambiguation (WSD) has been widely evaluated using the semantic frameworks of WordNet, BabelNet, and the Oxford Dictionary of English. However, for...

#research #paper #ai #nlp
1 month ago · ai · - · -

[Paper] Identifying Models Behind Text-to-Image Leaderboards

Text-to-image (T2I) models are increasingly popular, producing a large share of AI-generated images online. To compare model quality, voting-based leaderboards ...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

While GUI agents have shown strong performance under explicit and completion instructions, real-world deployment requires aligning with users' more complex impl...

#research #paper #ai #machine-learning #computer-vision
1 month ago · ai · - · -

[Paper] LLM for Large-Scale Optimization Model Auto-Formulation: A Lightweight Few-Shot Learning Approach

Large-scale optimization is a key backbone of modern business decision-making. However, building these models is often labor-intensive and time-consuming. We ad...

#research #paper #ai #machine-learning
