research — Page 21

Sort:

1 week ago · ai · - · -

[Paper] MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games

We present a scalable methodology for evaluating language models in multi-turn interactions, using a suite of collaborative games that require effective communi...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] Task-Centric Acceleration of Small-Language Models

Small language models (SLMs) have emerged as efficient alternatives to large language models for task-specific applications. However, they are often employed in...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] ArgLLM-App: An Interactive System for Argumentative Reasoning with Large Language Models

Argumentative LLMs (ArgLLMs) are an existing approach leveraging Large Language Models (LLMs) and computational argumentation for decision-making, with the aim ...

#research #paper #ai #machine-learning #nlp
1 week ago · devops · - · -

[Paper] Advanced Scheduling Strategies for Distributed Quantum Computing Jobs

Scaling the number of qubits available across multiple quantum devices is an active area of research within distributed quantum computing (DQC). This includes q...

#research #paper #devops
1 week ago · ai · - · -

[Paper] CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning

Mobile Agents can autonomously execute user instructions, which requires hybrid-capabilities reasoning, including screen summary, subtask planning, action decis...

#research #paper #ai #machine-learning #nlp
1 week ago · ai · - · -

[Paper] AgenticOCR: Parsing Only What You Need for Efficient Retrieval-Augmented Generation

The expansion of retrieval-augmented generation (RAG) into multimodal domains has intensified the challenge for processing complex visual documents, such as fin...

#research #paper #ai #nlp #computer-vision
1 week ago · software · - · -

[Paper] Context-Aware Functional Test Generation via Business Logic Extraction and Adaptation

Functional testing is essential for verifying that the business logic of mobile applications aligns with user requirements, serving as the primary methodology f...

#research #paper #software
1 week ago · ai · - · -

[Paper] CIRCLE: A Framework for Evaluating AI from a Real-World Lens

This paper proposes CIRCLE, a six-stage, lifecycle-based framework to bridge the reality gap between model-centric performance metrics and AI's materialized out...

#research #paper #ai #machine-learning
1 week ago · ai · - · -

[Paper] Data Driven Optimization of GPU efficiency for Distributed LLM Adapter Serving

Large Language Model (LLM) adapters enable low-cost model specialization, but introduce complex caching and scheduling challenges in distributed serving systems...

#research #paper #ai #machine-learning #nlp
1 week ago · software · - · -

[Paper] LeGend: A Data-Driven Framework for Lemma Generation in Hardware Model Checking

Property checking of RTL designs is a central task in formal verification. Among available engines, IC3/PDR is a widely used backbone whose performance critical...

#research #paper #software
1 week ago · software · - · -

[Paper] The Vocabulary of Flaky Tests in the Context of SAP HANA

Background. Automated test execution is an important activity to gather information about the quality of a software project. So-called flaky tests, however, neg...

#research #paper #software
1 week ago · ai · - · -

[Paper] Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Serverless computing simplifies cloud deployment but introduces new challenges in managing service latency and carbon emissions. Reducing cold-start latency req...

#research #paper #ai #machine-learning

Newer posts

Older posts