research — Page 145

1 month ago · ai

[Paper] Martingale Score: An Unsupervised Metric for Bayesian Rationality in LLM Reasoning

Recent advances in reasoning techniques have substantially improved the performance of large language models (LLMs), raising expectations for their ability to p...

#research #paper #ai #machine-learning #nlp
1 month ago · ai

[Paper] Model-Based Diagnosis with Multiple Observations: A Unified Approach for C Software and Boolean Circuits

Debugging is one of the most time-consuming and expensive tasks in software development and circuit design. Several formula-based fault localisation (FBFL) meth...

#research #paper #ai #machine-learning
1 month ago · ai

[Paper] Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules

Diffusion large language models (dLLMs) offer a promising alternative to autoregressive models, but their practical utility is severely hampered by slow, iterat...

#research #paper #ai #nlp
1 month ago · ai

[Paper] OptPO: Optimal Rollout Allocation for Test-time Policy Optimization

Test-time policy optimization enables large language models (LLMs) to adapt to distribution shifts by leveraging feedback from self-generated rollouts. However,...

#research #paper #ai #machine-learning #nlp
1 month ago · ai

[Paper] Think in Parallel, Answer as One: Logit Averaging for Open-Ended Reasoning

Majority voting has proven effective for close-ended question answering by aggregating parallel reasoning traces. However, it is not directly applicable to open...

#research #paper #ai #nlp
1 month ago · ai

[Paper] Bangla Hate Speech Classification with Fine-tuned Transformer Models

Hate speech recognition in low-resource languages remains a difficult problem due to insufficient datasets, orthographic heterogeneity, and linguistic variety. ...

#research #paper #ai #nlp
1 month ago · devops

[Paper] Designing FAIR Workflows at OLCF: Building Scalable and Reusable Ecosystems for HPC Science

High Performance Computing (HPC) centers provide advanced infrastructure that enables scientific research at extreme scale. These centers operate with hardware ...

#research #paper #devops
1 month ago · software

[Paper] Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior

Code-generating LLMs are trained largely on static artifacts (source, comments, specifications) and rarely on materializations of run-time behavior. As a result...

#research #paper #software
1 month ago · ai

[Paper] Exploring Definitions of Quality and Diversity in Sonic Measurement Spaces

Digital sound synthesis presents the opportunity to explore vast parameter spaces containing millions of configurations. Quality diversity (QD) evolutionary alg...

#research #paper #ai
1 month ago · software

[Paper] 'Can you feel the vibes?': An exploration of novice programmer engagement with vibe coding

Emerging alongside generative AI and the broader trend of AI-assisted coding, the term 'vibe coding' refers to creating software via natural language prompts ra...

#research #paper #software
1 month ago · software

[Paper] Integrative Analysis of Risk Management Methodologies in Data Science Projects

Data science initiatives frequently exhibit high failure rates, driven by technical constraints, organizational limitations and insufficient risk management pra...

#research #paper #software
1 month ago · ai

[Paper] Empirical Assessment of the Perception of Software Product Line Engineering by an SME before Migrating its Code Base

Migrating a set of software variants into a software product line (SPL) is an expensive and potentially challenging endeavor. Indeed, SPL engineering can signif...

#research #paper #ai #machine-learning
