research — Page 79

2 months ago · ai · - · -

[Paper] ExposeAnyone: Personalized Audio-to-Expression Diffusion Models Are Robust Zero-Shot Face Forgery Detectors

Detecting unknown deepfake manipulations remains one of the most challenging problems in face forgery detection. Current state-of-the-art approaches fail to gen...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] VINO: A Unified Visual Generator with Interleaved OmniModal Context

We present VINO, a unified visual generator that performs image and video generation and editing within a single framework. Instead of relying on task-specific ...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] DARC: Drum accompaniment generation with fine-grained rhythm control

In music creation, rapid prototyping is essential for exploring and refining ideas, yet existing generative tools often fall short when users require both struc...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes

We introduce Talk2Move, a reinforcement learning (RL) based diffusion framework for text-instructed spatial transformation of objects within scenes. Spatially m...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Meta-Learning Guided Pruning for Few-Shot Plant Pathology on Edge Devices

Farmers in remote areas need quick and reliable methods for identifying plant diseases, yet they often lack access to laboratories or high-performance computing...

#research #paper #ai #machine-learning #computer-vision
2 months ago · ai · - · -

[Paper] Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

This work introduces Falcon-H1R, a 7B-parameter reasoning-optimized model that establishes the feasibility of achieving competitive reasoning performance with s...

#research #paper #ai #machine-learning
2 months ago · software · - · -

[Paper] Question Answering for Multi-Release Systems: A Case Study at Ciena

Companies regularly have to contend with multi-release systems, where several versions of the same software are in operation simultaneously. Question answering ...

#research #paper #software
2 months ago · ai · - · -

[Paper] Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding

Recent works propose extending 3DGS with semantic feature vectors for simultaneous semantic segmentation and image rendering. However, these methods often treat...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling

Toxicity detection is inherently subjective, shaped by the diverse perspectives and social priors of different demographic groups. While "pluralistic" modelin...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] BEDS: Bayesian Emergent Dissipative Structures

We present BEDS (Bayesian Emergent Dissipative Structures), a theoretical framework that unifies concepts from non-equilibrium thermodynamics, Bayesian inferenc...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Hunting for 'Oddballs' with Machine Learning: Detecting Anomalous Exoplanets Using a Deep-Learned Low-Dimensional Representation of Transit Spectra with Autoencoders

This study explores the application of autoencoder-based machine learning techniques for anomaly detection to identify exoplanet atmospheres with unconventional...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Environment-Adaptive Covariate Selection: Learning When to Use Spurious Correlations for Out-of-Distribution Prediction

Out-of-distribution (OOD) prediction is often approached by restricting models to causal or invariant covariates, avoiding non-causal spurious associations that...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Estimating Text Temperature

Autoregressive language models typically use a temperature parameter at inference to shape the probability distribution and control the randomness of the text gen...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] Fusion2Print: Deep Flash-Non-Flash Fusion for Contactless Fingerprint Matching

Contactless fingerprint recognition offers a hygienic and convenient alternative to contact-based systems, enabling rapid acquisition without latent prints, pre...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] DatBench: Discriminative, Faithful, and Efficient VLM Evaluations

Empirical evaluation serves as the primary compass guiding research progress in foundation models. Despite a large body of work focused on training frontier vis...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Prithvi-Complimentary Adaptive Fusion Encoder (CAFE): unlocking full-potential for flood inundation mapping

Geo-Foundation Models (GFMs) have proven effective in diverse downstream applications, including semantic segmentation, classification, and regression tasks. H...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents

As Large Language Model (LLM) agents are increasingly tasked with high-stakes autonomous decision-making, the transparency of their reasoning processes has beco...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Game of Coding: Coding Theory in the Presence of Rational Adversaries, Motivated by Decentralized Machine Learning

Coding theory plays a crucial role in enabling reliable communication, storage, and computation. Classical approaches assume a worst-case adversarial model and ...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Placement Semantics for Distributed Deep Learning: A Systematic Framework for Analyzing Parallelism Strategies

Training large language models requires distributing computation across many accelerators, yet practitioners select parallelism strategies (data, tensor, pipeli...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Temporal Kolmogorov-Arnold Networks (T-KAN) for High-Frequency Limit Order Book Forecasting: Efficiency, Interpretability, and Alpha Decay

High-frequency trading (HFT) environments are characterised by large volumes of limit order book (LOB) data, which is notoriously noisy and non-linear. Alpha de...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] 360DVO: Deep Visual Odometry for Monocular 360-Degree Camera

Monocular omnidirectional visual odometry (OVO) systems leverage 360-degree cameras to overcome field-of-view limitations of perspective VO systems. However, ex...

#research #paper #ai #computer-vision
2 months ago · ai · - · -

[Paper] Differential Privacy for Transformer Embeddings of Text with Nonparametric Variational Information Bottleneck

We propose a privacy-preserving method for sharing text data by sharing noisy versions of their transformer embeddings. It has been shown that hidden representa...

#research #paper #ai #machine-learning
2 months ago · ai · - · -

[Paper] Classifying several dialectal Nawatl varieties

Mexico is a country with a large number of indigenous languages, among which the most widely spoken is Nawatl, with more than two million people currently speak...

#research #paper #ai #nlp
2 months ago · ai · - · -

[Paper] SortWaste: A Densely Annotated Dataset for Object Detection in Industrial Waste Sorting

The increasing production of waste, driven by population growth, has created challenges in managing and recycling materials effectively. Manual waste sorting is...

#research #paper #ai #computer-vision
