arXiv Research Digest

May 18, 2026 • 125 papers across 5 interests

🔬

Efficient ML / Edge AI

🟢 Applied

Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation

💡 This research faster predictions in language AI.

Billion-parameter Vision-Language-Action (VLA) policies have recently shown impressive performance in robotic manipulation . VLA-AD augments teacher-provided 7-DoF action targets with high-level semantic guidance . At test time, the student policy runs independently with neither the VLA teacher nor the VLM required .

Abstract ↗ PDF ↗

🟢 Applied

Surrogate Neural Architecture Codesign Package (SNAC-Pack)

💡 This research optimizes machine learning.

Surrogate Neural Architecture Codesign Package is an open-source AutoML framework for hardware-aware neural architecture codesign and end-to-end FPGA deployment . SNAC-Pack runs a multi-objective global search with Optuna and NSGA-II, loading trials to a shared SQLite store . A hardware surrogate model outputs per-trial resource and latency estimates, avoiding the synthesis cost that would otherwise dominate the search loop .

Abstract ↗ PDF ↗

🟢 Applied

AgriMind: An Ensemble Deep Learning Framework for Multi-Class Plant Disease Classification

💡 This research makes more efficient computer vision.

AgriMind is an ensemble of ResNet50, EfficientNet-B0, and DenseNet121 trained on 20,638 PlantVillage images across 15 pepper, potato, and tomato disease classes . Individual models hit 96--97% on the held-out test set, but averaging their softmax output pushes the ensemble to 99.23% .

Abstract ↗ PDF ↗

🟢 Applied

ITGPT: Generative Pretraining on Irregular Timeseries

💡 This research explores techniques in language AI.

Timeseries regression models often struggle to leverage large volumes of labeled multimodal data . This is common in domains like healthcare and predictive maintenance, where data are collected from unreliable sources . Transformer-based large language models have proven effective on structured data such as text .

Abstract ↗ PDF ↗

🟢 Applied

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

💡 This research explores techniques in language AI.

Memory systems often organize user-agent interactions as retrievable external memory . RecMem stores incoming interactions in a subconscious memory layer and encode them using lightweight embedding models for retrieval . LLMs are only invoked to extract episodic and semantic memory when sustained recurrence are observed for semantically similar interactions .

Abstract ↗ PDF ↗

🟢 Applied

Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking

💡 This research optimizes computer vision.

Reinforcement learning (RL) allows vision-language-action policies to generalize beyond their training distribution by optimizing directly for task success . Gradient cost dominates because much of this computation is spent on phases that contribute little to learning . Probabilistic Chunk Masking (PCM) is a drop-in modification to GRPO that allocates gradient computation to a small, probabilistically selected subset of chunks per trajectory .

Abstract ↗ PDF ↗

🟢 Applied

Accelerated Gradient Descent for Faster Convergence with Minimal Overhead

💡 This research optimizes machine learning.

CT-AGD (Curvature-Tuned Accelerated Gradient Descent) is a general boosting procedure that accelerates first-order methods by explicitly capturing the local curvature using finite-difference quotients . It has a comparable storage and computational overhead as adaptive gradient methods such as Adam .

Abstract ↗ PDF ↗

🟡 Advanced

Layer Equivalence Is Not a Property of Layers Alone: How You Test Redundancy Changes What You Find

💡 This research making models smaller for machine learning.

Replacement asks whether one layer's map can substitute for another's in place . Interchange asks whether two layers approximately commute when their positions are swapped . On a Pythia training trajectory (410M and 1.4B), replacement-interchange gap grows from initialization to convergence .

Abstract ↗ PDF ↗

🟢 Applied

FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast

💡 This research improves language AI.

FORGE (Failure-Optimized Reflective Graduation and Evolution) is a staged, population-based protocol that evolves prompt-injected natural-language memory for hierarchical ReAct agents . FORGE improves average evaluation return by 1.7-7.7$\times$ over zero-shot and by 29-72% over Reflexion .

Abstract ↗ PDF ↗

🟢 Applied

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

💡 This research optimizes language AI.

Asteria is a runtime system designed to remove this bottleneck by separating second-order optimization logic from the critical GPU training path . Asteria uses training hooks to prepare shadow states in advance, allowing expensive inverse-root computations to proceed asynchronously on the host while GPU computations continue . It uses a bounded-staleness protocol that limits synchronization frequency while preserving optimizer effectiveness through topology-aware coordination .

Abstract ↗ PDF ↗

🟢 Applied

SGR: A Stepwise Reasoning Framework for LLMs with External Subgraph Generation

💡 This research faster predictions in language AI.

Large Language Models (LLMs) have demonstrated strong capabilities across diverse NLP applications, such as translation, text generation, and question answering . SGR builds query-specific subgraphs from external knowledge bases and uses their semantic structure to support multi-step inference .

Abstract ↗ PDF ↗

🟢 Applied

DebiasRAG: A Tuning-Free Path to Fair Generation in Large Language Models through Retrieval-Augmented Generation

💡 This research achieves better language AI.

Large language models (LLMs) are prone to prejudiced responses involving race, gender, and age . DebiasRAG leverages self-diagnosed bias contexts relevant to the query through regular retrieval, where the bias contexts are prepared offline . Given the query-specific bias contexts, DebiasAG reversely produces debiasing contexts, which are provided as additional fairness constraints .

Abstract ↗ PDF ↗

🟢 Applied

Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection

💡 This research automatically finding language AI.

Machine-generated texts (MGTs) pose risks such as disinformation and phishing, underscoring the need for reliable detection . Metric-based methods extract statistically distinguishable features of MGTs are often more practical than complex models that are prone to overfitting .

Abstract ↗ PDF ↗

🟢 Applied

ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation

💡 This research makes more efficient language AI.

The rise of AI-generated images poses growing challenges for digital authenticity . ReAlign is a novel framework that distills reasoning texts generated by a GRPO-optimized LLM into a lightweight AIGI detector via contrastive learning .

Abstract ↗ PDF ↗

🟢 Applied

IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation

💡 This research explores techniques in computer vision.

Reconstructing coherent 3D geometry and appearance from unposed multi-view images is a fundamental yet challenging problem in computer vision . We propose IVGT, an Implicit Visual Geometry Transformer that implicitly models continuous and coherent geometry from pose-free images . IVGT learns a continuous neural scene representation in a canonical coordinate system .

Abstract ↗ PDF ↗

🟡 Advanced

A Unified Generative-AI Framework for Smart Energy Infrastructure: Intelligent Gas Distribution, Utility Billing, Carbon Analytics, and Quantum-Inspired Optimisation

💡 This research optimizes machine learning.

The accelerating convergence of smart metering, artificial intelligence, and quantum-inspired combinatorial optimisation is reshaping how energy utilities manage physical infrastructure, customer engagement, and environmental accountability .

Abstract ↗ PDF ↗

🟢 Applied

Argus: Evidence Assembly for Scalable Deep Research Agents

💡 This research achieves better machine learning.

Argus is an agentic system in which a Searcher and a Navigator cooperate to treat deep research as assembling a jigsaw from complementary evidence pieces . The Searcher collects evidence traces for a given sub-query through ReAct-style interaction . The Navigator maintains a shared evidence graph, verifying which pieces are still missing, dispatching Searchers to gather them, and reasoning over the completed graph to produce a source-traced final answer . Argus gains 5.5 points

Abstract ↗ PDF ↗

🟢 Applied

Fully Open Meditron: An Auditable Pipeline for Clinical LLMs

💡 This research explores techniques in language AI.

Clinical decision support systems (CDSS) require scrutable, auditable pipelines that enable rigorous, reproducible validation . Most "open" models are open-weight only, withholding data provenance, curation procedures, and generation pipelines that determine model behavior . Fully Open (FO) models, which expose the complete training stack end-to-end, do not currently exist in medicine .

Abstract ↗ PDF ↗

🟢 Applied

A Scalable Nonparametric Continuous-Time Survival Model through Numerical Quadrature

💡 This research explores techniques in machine learning.

QSurv is a scalable deep learning framework that enables nonparametric continuous-time modeling without relying on time discretization or restrictive distributional assumptions . We propose a training objective based on Gauss-Legendre numerical quadrature, which approximates the cumulative hazard with high-order accuracy . QSurv achieves competitive predictive performance with advantages in instantaneous hazard function estimation .

Abstract ↗ PDF ↗

🟡 Advanced

Confirming Correct, Missing the Rest: LLM Tutoring Agents Struggle Where Feedback Matters Most

💡 This research explores techniques in language AI.

Effective tutoring requires distinguishing optimal, valid but suboptimal, and incorrect student solutions . We present benchmark of seven LLM feedback agents in propositional logic using knowledge-graph-derived ground truth across 10,836 solution--feedback pairs and three feedback conditions .

Abstract ↗ PDF ↗

🟢 Applied

Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP

💡 This research improves language AI.

CybORG CAGE-2 is a cyber defense environment modeled as a Partially Observable Markov Decision Process . Programmatic state abstraction delivers the largest returns per token spent . Distributing deliberation tools across a hierarchy degrades performance relative to hierarchy alone for all five model families .

Abstract ↗ PDF ↗

🟢 Applied

Optimized Three-Dimensional Photovoltaic Structures with LLM guided Tree Search

💡 This research introduces a new approach to language AI.

A case study for how AI coding systems can be used to generate novel scientific hypotheses . We combine a generic coding agent (Google's AntiGravity) with an LLM-driven tree search algorithm to autonomously generate high-efficiency three-dimensional photovoltaic structures .

Abstract ↗ PDF ↗

🟢 Applied

MAgSeg: Segmentation of Agricultural Landscapes in High-Resolution Satellite Imagery using Multimodal Large Language Models

💡 This research explores techniques in language AI.

Agricultural landscape segmentation in the Global South is challenging as it is characterized by fragmented plots, high intra-class variance, and a scarcity of labeled training data . MAgSeg is an architecturally efficient approach that enables standard MLLMs to perform segmentation of complex smallholder agricultural landscapes from high-resolution satellite imagery, without requiring auxiliary vision decoders .

Abstract ↗ PDF ↗

🟢 Applied

Second-Order Multi-Level Variance Correction for Modality Competition in Multimodal Models

💡 This research optimizes computer vision.

Autoregressive next-token training offers a unified formulation for image generation and text understanding . But it also creates strong modality competition that destabilizes optimization and limits large-batch scaling . We propose a second-order optimization framework with Multi-Level Variance Correction .

Abstract ↗ PDF ↗

🟢 Applied

SwAIther-Precip: Lead-Time-Aware Bias Correction Enables Kilometer-Scale Downscaling of Global AI Precipitation Forecasts over Switzerland

💡 This research explores techniques in machine learning.

Global AI weather models can produce skillful medium-range forecasts, but their native 0.25 degrees resolution limits direct use for local hazard applications . We introduce SwAIther-Precip, a lead-time-aware downscaling framework that converts coarse-resolution AIFS forecasts into probabilistic km-scale precipitation fields over Switzerland .

Abstract ↗ PDF ↗

🔬

Privacy-Preserving ML

🟢 Applied

The Privacy Price of Tail-Risk Learning: Effective Tail Sample Size in Differentially Private CVaR Optimization

💡 This research protecting data privacy in privacy-preserving AI.

Differential privacy changes the effective sample size governing CVaR learning . For tail mass $τ$, the privacy-relevant sample size is not $n$, but $nτ$ For convex Lipschitz learning, modular upper and lower reductions show that CVAR-specific privacy term necessarily scales as $1/2 .

Abstract ↗ PDF ↗

🟢 Applied

FedEDAuth -- Federated Embedding Distribution Authentication for Counterfeit IC Detection

💡 This research protecting data privacy in privacy-preserving AI.

Federated Embedding Distribution Authentication (FedEDAuth) is a lightweight, embedding level client authentication framework that detects and filters malicious participants before model aggregation . FedEDAuth leverages reference embedding distributions derived from a golden dataset and evaluates clients using outlier analysis, mean shift measurements, and micro-cluster behavior .

Abstract ↗ PDF ↗

🟢 Applied

Centralized vs Decentralized Federated Learning: A trade-off performance analysis

💡 This research protecting data privacy in privacy-preserving AI.

Federated Learning (FL) has emerged as a promising paradigm for collaborative model training across distributed edge devices while preserving data privacy . Storing this amount of data centrally is challenging due to issues like limited communication, privacy, and regulations . Choosing the right FL architecture depends on the application's needs .

Abstract ↗ PDF ↗

🟢 Applied

Privacy is Fungibility: Why Endogenous Tokens Are Not Money

💡 This research proposes a method for privacy-preserving AI.

In this paper, we make a case that endogenous tokens such as cryptoassets are not money . We define and classify tokens found on public, permissionless ledgers . We then discuss the work of Kahn et al in Money is Privacy on cash versus simplified credit .

Abstract ↗ PDF ↗

🟢 Applied

Federated Imputation under Heterogeneous Feature Spaces

💡 This research distributed machine learning across privacy-preserving AI.

Federated Learning (FL) enables collaborative training across decentralized clients, but most methods assume aligned feature schemas . In these heterogeneous feature spaces, parameter-averaging methods transfer little information across weakly overlapping or disjoint feature groups . We propose a federated imputation framework that separates structural feature unavailability from conventional missingness . This enables indirect cross-client knowledge transfer, even when features are never jointly observed locally .

Abstract ↗ PDF ↗

🟢 Applied

PCDM: A Diffusion-Based Data Poisoning Attack Against Federated Learning Systems

💡 This research distributed machine learning across privacy-preserving AI.

Federated learning (FL) is vulnerable to data poisoning attacks due to its distributed nature . In this paper, we propose a diffusion-based data poisoning framework against FL systems . We conduct the most systematic and broad experimental evaluation for FL poisoning attacks against various defenses .

Abstract ↗ PDF ↗

🟢 Applied

Practical Validity Conditions for Byzantine-Tolerant Federated Learning

💡 This research distributed machine learning across privacy-preserving AI.

Robust aggregation is the core operation in Byzantine-tolerant federated learning . To ensure the quality of aggregation independently of data distribution or attacks, validity conditions are needed . The widespread convex validity requires the output to lie in the convex hull of the honest vectors .

Abstract ↗ PDF ↗

🟢 Applied

PersonaFingerprint: Measuring Persona Inference on Modern Websites with LLM-Driven Browsing

💡 This research protecting data privacy in language AI.

Website Fingerprinting has traditionally focused on inferring which website a user visits from encrypted traffic metadata such as packet sizes and timing . An adversary can infer a user's persona using only packet-length and inter-arrival-time sequences . We formalize persona fingerprinting under both closed-set and open-world settings .

Abstract ↗ PDF ↗

🟢 Applied

Attention Dispersion in Dynamic Graph Transformers: Diagnosis and a Transferable Fix

💡 This research explores techniques in machine learning.

Transformer-based architectures have become the dominant paradigm for Continuous-Time Dynamic Graph (CTDG) learning . However, existing Transformers fail to focus on critical nodes even when they are present in the input, as temporal shift weakens attention contrast and produces overly dispersed attention distributions . This diagnosis suggests a simple fix: replace standard attention with differential attention .

Abstract ↗ PDF ↗

🟢 Applied

When and Why Adversarial Training Improves PINNs: A Neural Tangent Kernel Perspective

💡 This research creating new content with machine learning.

Physics-informed neural networks (PINNs) are powerful surrogates for differential equations but are notoriously difficult to train due to spectral bias, stiffness, and poor accuracy . Adversarial training based on generative adversarial networks (GANs) has recently gained surprisingly strong empirical results in improving training .

Abstract ↗ PDF ↗

🟢 Applied

Context-aware Entity-Relation Extraction for Threat Intelligence Knowledge Graphs

💡 This research enhances language AI.

Cybersecurity Knowledge Graphs unify diverse Cyber Threat Intelligence sources into structured, queryable formats . CTiKG incorporates hybrid NLP models that leverage SecureBERT+ contextual embeddings and expert knowledge from a domain ontology . Experiments on the DNRTI-AUG-STIX2 dataset demonstrate significant improvements over state-of-the-art baselines .

Abstract ↗ PDF ↗

🟢 Applied

A Multi-Layer Cloud-IDS Pipeline with LLM and Adaptive Q-Learning Calibration

💡 This research automatically finding language AI.

The system secures three distinct layers: network, host, and hypervisor . Machine learning models at each layer detect known attack patterns, while prediction confidence distinguishes reliable decisions from uncertain outcomes . The proposed system learns adaptive thresholds and reduces LLM escalation by 58.78% .

Abstract ↗ PDF ↗

🟢 Applied

A Generative AI Framework for Intelligent Utility Billing CO 2 Analytics and Sustainable Resource Optimisation

💡 This research proposes a method for machine learning.

Distribution utilities now expected to deliver bills that customers can actually read attach a defensible carbon number to every kWh sold and schedule load against grid stress and emissions constraints . We propose an end-to-end framework that unifies four production-grade capabilities under one architectural roof .

Abstract ↗ PDF ↗

🟢 Applied

AI-Mediated Communication Can Steer Collective Opinion

💡 This research creating new content with language AI.

Generative artificial intelligence (AI) is increasingly integrated into online platforms where humans exchange opinions . We show biases introduced by AI in human-to-human communication can be amplified through the network and shift collective opinion in their direction . In light of these findings, we investigate whether such biases are controllable by online platforms .

Abstract ↗ PDF ↗

🟢 Applied

Dynamics-Level Watermarking of Flow Matching Models with Random Codes

💡 This research creating new content with machine learning.

We introduce a dynamics-level approach to watermarking generative models . We embed the watermark directly into the learned continuous dynamics -- the velocity field of a flow matching model . We formulate this as random coding over a continuous channel: a key-dependent perturbation is added during training .

Abstract ↗ PDF ↗

🟢 Applied

Universal Magnetic Structure Prediction from Atomic Coordinates with Near-Experimental Accuracy

💡 This research explores techniques in machine learning.

Magnetic order is a fundamental property of materials, governing collective behavior and enabling a broad range of functionalities . Yet magnetic structure remains difficult to determine: experiments are costly and specialized . First-principles methods often struggle with the noncollinear and incommensurate orders found in real materials .

Abstract ↗ PDF ↗

🟢 Applied

LymphNode: A Plug-and-Play Access Control Method for Deep Neural Networks

💡 This research tackles the problem of edge computing.

Deep Neural Networks (DNNs) are high-value intellectual property (IP) yet deploying them to edge environments exposes them to unrestricted oracle access . Existing defenses fail to address this practically: passive watermarking only offers post-hoc provenance, while active defenses impose prohibitive latency or require persistent access to sensitive training data .

Abstract ↗ PDF ↗

🟢 Applied

Artificial Aphasias in Lesioned Language Models

💡 This research explores techniques in language AI.

Aphasias reveal the functional organization of human language by providing causal links between affected brain regions and specific symptom profiles . We introduce an aphasia-inspired technique to characterize emergent functional organizations of language models (LMs) We measure the effects of this intervention against clinical aphasias symptoms as diagnosed by the Text Aphasia Battery (TAB)

Abstract ↗ PDF ↗

🟢 Applied

Hypothesis-driven construction of mesoscopic dynamics

💡 This research proposes a method for machine learning.

Traditional scientific modeling typically begins with fixed, instance-wise effective equations and then carries out equation-specific analysis and computation . We propose an alternative paradigm by learning mesoscopic dynamics within a mathematically constrained hypothesis class .

Abstract ↗ PDF ↗

🟢 Applied

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

💡 This research proposes a method for language AI.

We examine one dimension of AI governance: how to monitor and audit AI-enabled products and services throughout the AI development lifecycle . We propose techniques that enable developers to perform offline auditing and online (runtime) monitoring of product-specific (temporally extended) behavioral constraints .

Abstract ↗ PDF ↗

🟢 Applied

Imitation learning for clinical decision support in pediatric ECMO

💡 This research tackles the problem of machine learning.

Pediatric critical care is a dynamic, high-stakes process involving constant monitoring and adjustments in life-saving treatments . Modeling these interventions is crucial for effective decision support . We frame clinical decision-making as learning to act from trajectories, i.e., imitation learning .

Abstract ↗ PDF ↗

🟡 Advanced

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control

💡 This research explores techniques in language AI.

Bayesian Amnesic Piecewise-Robust SAC unifies Bayesian Online Change Detection (BOCD) with robust ensemble RL . The BAPR operator is a combination of mode-conditional Bellman operators weighted by a frozen belief distribution .

Abstract ↗ PDF ↗

🟢 Applied

From Backup Restoration to Minimum Viable Factory Recovery: A Systematization of Ransomware Recovery in Manufacturing Systems

💡 This research explores techniques in machine learning.

Ransomware recovery in critical manufacturing infrastructure is not only a backup-restoration problem, but a critical-infrastructure continuity and interdependency problem . After ransomware, a plant may remain unable to schedule work, authenticate operators or release product, reconnect OT assets, or coordinate suppliers .

Abstract ↗ PDF ↗

🟡 Advanced

Entropic Auto-Encoding via Implicit Free-Energy Minimization

💡 This research optimizes machine learning.

Entropic Autoencoders (EAEs) inherently suffer from posterior collapse, a failure mode in which latent variables are effectively ignored . EAEs mitigate posterior collapse by learning non-Gaussian, multimodal latent distributions that yield diverse, data-consistent generations .

Abstract ↗ PDF ↗

🟢 Applied

Skew-adaptive conformal prediction

💡 This research forecasting machine learning.

We develop a skew-adaptive extension of split conformal prediction for regression . The method starts from an asymmetric interval family centered at a point prediction and uses the gauge approach to deduce the conformity score induced by this family . We also develop a calibration-sample-based estimator for comparing the expected relative future width of skewness intervals .

Abstract ↗ PDF ↗

🔬

Creative AI / Emotion

🟢 Applied

Position: AI as Part of Self -- Extending the Mind Requires Cognitive Co-Regulation

💡 This research achieves better edge computing.

Contemporary AI increasingly participates in attention allocation, reasoning, synthesis, and decision-making, shaping the very cognitive processes through which humans form beliefs, make decisions, and constitute their sense of self . We identify the risks of unstructured delegation: deskilling, automation bias, transfer of epistemic authority, and oracle-style centralization of knowledge .

Abstract ↗ PDF ↗

🟢 Applied

SLIP & ETHICS: Graduated Intervention for AI Emotional Companions

💡 This research presents techniques for emotion AI.

AI emotional companions face a safety-rapport paradox: restrictive safeguards can damage supportive alliance, while permissive systems risk user harm . SLIP (Staged Layers of Intervention Protocol) is a four-stage graduated methodology deriving interventions from structured qualitative indicators .

Abstract ↗ PDF ↗

🟢 Applied

Designing for Robot Wranglers: A Synthesis of Literature and Practice

💡 This research presents techniques for machine learning.

Robots are increasingly present in human spaces, such as for conducting deliveries in hospitals, interacting with visitors at museums, and stocking items in warehouses . To ensure the seamless integration of robots into these spaces, a new role in human-robot interaction is emerging - the robot wrangler .

Abstract ↗ PDF ↗

🟢 Applied

Designing Datacenter Power Delivery Hierarchies for the AI Era

💡 This research speeds up computer vision.

Demand for AI accelerators is rapidly increasing rack power density, with projections approaching 1MW per deployment by 2027 . Power utilization is particularly important as grid power capacity is a scarce resource in the AI era . Designing an efficient power delivery hierarchy for the long run is difficult .

Abstract ↗ PDF ↗

🟢 Applied

Evaluating Design Video Generation: Metrics for Compositional Fidelity

💡 This research creating new content with machine learning.

Generative video models are increasingly used in design animation tasks . Unlike natural video generation, design animation imposes structured constraints . Specific components shall animate with prescribed motion types, directions, speed and timing . Non-animated regions must remain stable and layout structure must be preserved .

Abstract ↗ PDF ↗

🟢 Applied

ARIA: A Diagnostic Framework for Music Training Data Attribution

💡 This research reduces speech processing.

Training data attribution (TDA) for music generation must answer two questions that copyright analysis requires . Existing methods reduce influence to a single scalar, without revealing which musical aspects are dominant in that influence . We propose ARIA framework that decomposes attribution along musical aspects (five for symbolic music, three for audio)

Abstract ↗ PDF ↗

🟢 Applied

GenShield: Unified Detection and Artifact Correction for AI-Generated Images

💡 This research automatically finding computer vision.

Diffusion-based image synthesis has made AI-generated images (AIGI) increasingly photorealistic, raising concerns about authenticity in applications such as misinformation detection, digital forensics, and content moderation . We propose GenShield, a unified autoregressive framework that jointly performs explainable AIGI detection and controllable artifact correction .

Abstract ↗ PDF ↗

🟢 Applied

GEMS -- Guided Evolutionary Molecule Design for Sustainable Chemicals

💡 This research explores techniques in computer vision.

Machine learning (ML) methods have been developed to aid with de novo molecule design . Data on the environmental impacts of chemical compounds are sparse, resulting in low-fidelity ML oracles and unreliable candidate proposals . We present GEMS-an interactive visual analytics tool that enables domain experts to directly collaborate with a genetic algorithm for molecule design. Users can integrate their expert knowledge to guide the evolutionary process by modifying the scoring function and molecule population without programming knowledge .

Abstract ↗ PDF ↗

🟢 Applied

Synchronized Realities: Towards Magic Mobile Experiences through Aligned AR

💡 This research creating new content with machine learning.

In virtual reality environments, the alignment of perceptual modalities is crucial for immersion and presence . In the AR domain, it is difficult to create such alignments because elements in the physical world are often beyond the user's control . Recent advances in generative AI enable on-demand content creation, enabling highly reactive AR experiences .

Abstract ↗ PDF ↗

🟢 Applied

Property-Guided LLM Program Synthesis for Planning

💡 This research explores techniques in language AI.

LLMs have shown impressive success in program synthesis, discovering programs that surpass prior solutions . Instead of scoring programs after evaluation, we check whether a candidate satisfies a formally defined property . When the property is violated, we stop evaluation early and provide the LLM with a concrete counterexample showing how the program failed . This feedback drastically reduces both the number of program generations and the evaluation cost .

Abstract ↗ PDF ↗

🟢 Applied

Generative Long-term User Interest Modeling for Click-Through Rate Prediction

💡 This research enhances machine learning.

Modeling long-term user interests with massive historical user behaviors enhances click-through rate (CTR) prediction performance in advertising and recommendation systems . GenLI consists of an interest generation module (IGM), a behavior retrieval module (BRM), and an interest fusion module (IFM)

Abstract ↗ PDF ↗

🟢 Applied

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

💡 This research explores techniques in language AI.

Large Vision-Language Models (LVLMs) have shown significant progress in video understanding, but they face substantial challenges in tasks requiring precise spatiotemporal localization at the instance level . Existing methods primarily rely on text prompts for human-model interaction, but these prompts struggle to provide precise spatial and temporal references . VideoSeeker seamlessly integrates agentic reasoning with instance-level video understanding tasks .

Abstract ↗ PDF ↗

🟢 Applied

Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making

💡 This research creating new content with machine learning.

Ada-Diffuser is a causal diffusion model that learns the temporal structure of observed interactions and the underlying latent dynamics simultaneously . The model leverages these dynamics for planning and control tasks . It has a modular design that supports both planning and policy learning tasks .

Abstract ↗ PDF ↗

🟢 Applied

Reasoners or Translators? Contamination-aware Evaluation and Neuro-Symbolic Robustness in Tax Law

💡 This research enhances language AI.

Recent advances in large language models (LLMs) have significantly enhanced automated legal reasoning . Yet, it remains unclear whether their performance reflects genuine legal reasoning ability or artifacts of data contamination . We show that performance can be inflated by contamination .

Abstract ↗ PDF ↗

🟢 Applied

XSearch: Explainable Code Search via Concept-to-Code Alignment

💡 This research explores techniques in machine learning.

Semantic code search has been widely adopted in both academia and industry . These approaches embeds natural-language queries and code snippets into a shared embedding space and retrieve results based on vector similarity . But these approaches often suffer from poor explainability and generalization . We propose XSearch, an intrinsically explainable code search framework .

Abstract ↗ PDF ↗

🟢 Applied

Constrained latent state modeling: A unifying perspective on representation learning under competing constraints

💡 This research presents techniques for machine learning.

Learning latent representations from complex data is central to modern machine learning . In such settings, representations are better understood as latent states capturing underlying system dynamics . Yet current approaches remain fragmented, relying on distinct assumptions about what these states should represent . We propose constrained latent state modeling (CLSM) as a unifying perspective .

Abstract ↗ PDF ↗

🟢 Applied

Beyond Content: A Comprehensive Speech Toxicity Dataset and Detection Framework Incorporating Paralinguistic Cues

💡 This research understanding emotions in speech processing.

Current toxic speech datasets are predominantly text-based, limiting the development of models that can capture paralinguistic cues . We present ToxiAlert-Bench, a large-scale audio dataset with over 30,000 audio clips annotated with seven major toxic categories and twenty fine-grained toxic labels .

Abstract ↗ PDF ↗

🟢 Applied

Driving Through the Network: Performance and Workload Under Latency and Video Impairment

💡 This research explores techniques in machine learning.

We report a fixed-base driving-simulator study with a 2x2 manipulation of added latency (100/300 ms) and bitrate (500/2000 kbit/s) We measured effective glass-to-glass (G2G) latency per condition . Physiological measures (heart rate, RR interval, heart rate, skin conductance) exhibited sub-additive interactions, whereas performance and oculomotor interactions were small or non-significant .

Abstract ↗ PDF ↗

🟢 Applied

Prospective multi-pathogen disease forecasting using autonomous LLM-guided tree search

💡 This research presents techniques for language AI.

Probabilistic forecasting of infectious diseases is crucial for public health but relies on labor-intensive manual model curation by expert modeling teams . Here, we present an autonomous system using Large Language Model (LLM)-guided tree search to generate, evaluate, and optimize executable forecasting software .

Abstract ↗ PDF ↗

🟢 Applied

Inside Baseball: The Automated Ball-Strike System as an Object Lesson in Technological Rule Enforcement

💡 This research explores techniques in computer vision.

Major League Baseball's seven-year experimentation with the Automated Ball-Strike System (ABS) shows how even seemingly straightforward rules require a complex translation process to operationalize via technological systems . ABS is envisioned to call balls and strikes accurately: a seemingly straightforward use of technology to objectively determine the distance between a pitch and the strike zone .

Abstract ↗ PDF ↗

🟢 Applied

An Algebraic Exposition of the Theory of Dyadic Morality

💡 This research explores techniques in machine learning.

This paper provides an algebraic exposition of the theory of dyadic morality (TDM) We formalize TDM using structural causal modeling (SCM) notation . This algebraic formalization enables neurosymbolic AI systems to compute morality in a way that is both mathematically rigorous and faithful to human moral cognition .

Abstract ↗ PDF ↗

🟢 Applied

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers

💡 This research faster predictions in machine learning.

Flow matching and Schrödinger bridges define probability paths, yet their inference grids are usually heuristic or inherited from one-endpoint diffusion . We derive a conditional-marginal entropy-rate objective for bridge-aware discretization . We use it to build a training-free entropic inference-time scheduler from first principles .

Abstract ↗ PDF ↗

🟢 Applied

ShopGym: An Integrated Framework for Realistic Simulation and Scalable Benchmarking of E-Commerce Web Agents

💡 This research explores techniques in machine learning.

The field lacks a scalable way to construct evaluation settings that are realistic, diverse, controllable, inspectable, and reproducible . We introduce ShopGym, an integrated framework for realistic simulation and scalable benchmarking of e-commerce web agents . We validate the framework through graph-based structural analysis and agent-based behavioral evaluation .

Abstract ↗ PDF ↗

🟢 Applied

Sign-Separated Finite-Time Error Analysis of Q-Learning

💡 This research presents techniques for language AI.

This paper develops a sign-separated finite-time error analysis for constant step-size Q-learning . The analysis identifies a max-induced asymmetry in error dynamics . Negative errors admit an optimal-policy lower comparison, the authors say .

Abstract ↗ PDF ↗

🟢 Applied

Multi-level Self-supervised Pretraining on Compositional Hierarchical Graph for Molecular Property Prediction

💡 This research running AI locally on devices for computer vision.

Self-supervised pretraining on molecular graphs has emerged as a promising approach for molecular property prediction . Most existing methods operate at a single structural granularity and treat bond information as auxiliary edge attributes rather than as an independent semantic layer . We propose MolCHG, a multi-level self-supervisory pretraining framework built upon a novel Compositional Hierarchical Graph .

Abstract ↗ PDF ↗

🔬

Lightweight Systems

🟢 Applied

Heterogeneous SoC Integrating an Open-Source Recurrent SNN Accelerator for Neuromorphic Edge Computing on FPGA

💡 This research makes more efficient edge computing.

Spiking Neural Networks (SNNs) are capable of mimicking the spike-based data processing typical of biological neurons . The spread of digital neuromorphic hardware is slowed down by the prohibitive costs that the silicon tape out of circuits brings . FPGAs could represent a viable alternative, offering a flexible and cost-effective platform .

Abstract ↗ PDF ↗

🟢 Applied

Evaluating Container Orchestration for Neuromorphic Workloads in Virtual Edge Environments

💡 This research makes more efficient computer vision.

Neuromorphic computing offers an energy-efficient alternative to conventional machine learning through event-driven computation . The growing adoption of edge computing has created an increasing need for workloads capable of operating under strict resource and energy constraints .

Abstract ↗ PDF ↗

🟢 Applied

Embedded Made Easy -- Rethinking Embedded + Cloud Software Development (WIP)

💡 This research running AI on low-power devices for computer vision.

The process of engineering and deploying applications in the edge/embedded space is massively complicated by the non-homogeneous nature of the software stack and the complexity of diagnostics & debugging . This paper presents a work-in-progress vision for a unified language and runtime system that allows applications to scale seamlessly across the edge and cloud . Using a single language and runtimes, applications can be developed and tested in a single environment, and then deployed to any component of the system .

Abstract ↗ PDF ↗

🟢 Applied

Memristor Technologies for Dynamic Vision Sensors: A Critical Assessment and Research Roadmap

💡 This research running AI locally on devices for computer vision.

Edge-AI deployment is bottlenecked by data-movement energy . Pair event-driven vision sensors with in-memory analog compute could lift that ceiling by orders of magnitude .

Abstract ↗ PDF ↗

🟢 Applied

Sustainable Graph Analytics Workload Scheduling with Evolutionary Reinforcement Learning in Edge-Cloud Systems

💡 This research running AI locally on devices for edge computing.

Graph analytics powers smart cities, cyber-physical infrastructure, IoT security, and large-scale social networks . Execution in heterogeneous edge-cloud environments results in higher energy use and carbon emission footprint . MERSEM integrates evolutionary search with reinforcement learning to solve the problem of graph workload allocation and scheduling .

Abstract ↗ PDF ↗

🟢 Applied

PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

💡 This research enhances language AI.

Speculative decoding can significantly accelerate LLM inference, especially given that its cloud-edge collaborative deployment offers cloud workload offloading, offline robustness, and privacy enhancement . PipeSD outperforms state-of-the-art baselines, achieving 1.16x speedup and reducing energy consumption .

Abstract ↗ PDF ↗

🟢 Applied

Scale: Deep Reinforcement Learning for Container Scheduling in Serverless Edge Computing

💡 This research makes more efficient computer vision.

Serverless computing has emerged as a promising computing paradigm for edge computing . Efficiently allocating requests to containers is therefore critical to reduce resource over provisioning and unnecessary data movement . This paper proposes Scale, a Service Level Objective aware container scheduling and resource allocation framework .

Abstract ↗ PDF ↗

🟢 Applied

Distributed Statistical Zero-Knowledge Proofs via Sumcheck

💡 This research tackles the problem of edge computing.

We study distributed zero-knowledge proofs, introduced by Bick, Kol, and Oshman (SODA 2022) While distributed interactive proofs have advanced rapidly, distributed interactive proof techniques remain limited and mostly problem-specific . Our main contribution is a distributed zero knowledge implementation of Sumcheck . For non-k-colorability, we obtain an $O(n)$-round distributed statistical zero knowledge proof deciding whether a graph is not k-colorable, for any constant k,

Abstract ↗ PDF ↗

🟢 Applied

The Distributed Complexity Landscape on Trees Depends on the Knowledge About the Network Size

💡 This research explores techniques in machine learning.

One of the central models in distributed computing is Linial's LOCAL model [SIAM J. Comp. 1992]. LCLs are graph problems whose valid solutions can be characterized by a finite set of allowed constant-radius neighborhoods . The last decade has seen major progress in understanding their complexity .

Abstract ↗ PDF ↗

🟢 Applied

Evolving Layer-Specific Scalar Functions for Hardware-Aware Transformer Adaptation

💡 This research reduces computer vision.

Vision Transformers (ViTs) achieve state-of-the-art performance on challenging vision tasks, but deployment on edge devices is severely hindered by the computational complexity and global reduction bottleneck imposed by layer normalization . Recent methods attempt to bypass this by replacing normalization layers with hardware-friendly scalar approximations .

Abstract ↗ PDF ↗

🟢 Applied

Constitutional Governance in Metric Spaces

💡 This research explores techniques in edge computing.

The constitution assigns, per amendable component including itself, a metric space, aggregation rule, and supermajority threshold . Amendments proceed by members voting with their ideal elements, followed by members submitting public proposals carrying supermajority public support under the revealed votes . Public proposals can be sourced from deliberation among members, vote aggregation, or AI mediation .

Abstract ↗ PDF ↗

🟢 Applied

ADS-IMC: Accelerating Data Sorting with In-Memory Computation

💡 This research explores techniques in edge computing.

Sorting is a fundamental operation across numerous computational domains . The proposed architecture achieves a significant 3.4x reduction in latency compared to memristor-based IMC sorting . To our knowledge, this work represents the first exploration of in-memory sorting using 6T SRAM .

Abstract ↗ PDF ↗

🟢 Applied

A GPU Accelerated Temporal Window-Based Random Walk Sampler

💡 This research explores techniques in edge computing.

Temporal random walks are widely used to analyze time-stamped interactions in domains such as microservices, finance, and online platforms . Tempest combines a GPU-native dual-index organization over a shared edge store with a hierarchical cooperative scheduler that dispatches walks at thread, warp, or block granularity based on per-step node convergence .

Abstract ↗ PDF ↗

🟢 Applied

APWA: A Distributed Architecture for Parallelizable Agentic Workflows

💡 This research explores techniques in language AI.

Autonomous multi-agent systems hit critical reasoning, coordination and scaling bottlenecks as the size and complexity of their tasks grow . APWA facilitates parallel execution by decomposing workflows into non-interfering subproblems that can be processed using independent resources without cross-communication .

Abstract ↗ PDF ↗

🟢 Applied

Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids

💡 This research presents techniques for edge computing.

Mat2Boundary is a DSL and compiler for boundary computations that models a broad class of boundary-conditions as affine sparse linear operators . This abstraction unifies halo copying, circular and symmetric mappings, zero padding, block-edge synchronization, and user-defined interpolation .

Abstract ↗ PDF ↗

🟢 Applied

Malleable Molecular Dynamics Simulations with GROMACS and DMR

💡 This research explores techniques in machine learning.

Dynamic Management of Resources middleware enables MPI process malleability in Slurm via a simple API decoupled from scheduler internals . We evaluate this design on the MareNostrum~5 supercomputer .

Abstract ↗ PDF ↗

🟢 Applied

Semi-Synchronous Exploration in Dynamic Graphs

💡 This research running AI locally on devices for edge computing.

We study the fundamental problem of graph exploration in dynamic graphs using mobile agents . For a graph with $n$ nodes and $k$ agents, we show that exploration is impossible if the adversary can deactivate at least $ \left\lceil \frac{k}{n-2} \right\rceil - 1$ agents per round . We further establish that achieving exploration at this threshold requires agents to have both $1$-hop visibility and global communication .

Abstract ↗ PDF ↗

🟢 Applied

EMA: Efficient Model Adaptation for Learning-based Systems

💡 This research optimizes computer vision.

Machine learning (ML) is increasingly applied to optimize system performance in tasks such as resource management and network simulation . Unlike traditional ML tasks, networked systems often operate in heterogeneous, long-running, and dynamic environment states . EMA takes a system-driven, data-centric approach that accommodates diverse system and model designs .

Abstract ↗ PDF ↗

🟢 Applied

Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning

💡 This research explores techniques in language AI.

Much of the world's most valuable information is private, especially in highly regulated sectors such as healthcare and finance, where data include patient histories or customer communications . Unlocking this data could represent a major leap forward, enabling LLMs with deeper domain expertise and stronger real-world utility .

Abstract ↗ PDF ↗

🟢 Applied

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

💡 This research explores techniques in machine learning.

Asynchronous stochastic gradient descent (ASGD) is a standard way to exploit heterogeneous compute resources in distributed learning . We show theoretically that the method is biased toward a frequency-weighted average of the local objectives rather than the desired global objective . We recover the correct objective by rescaling worker-specific stepsizes in proportion to their computation times, so that each worker contributes the same aggregate learning rate over a cycle .

Abstract ↗ PDF ↗

🟢 Applied

TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

💡 This research improves edge computing.

Generative recommendation (GR) has emerged as a promising paradigm that replaces fragmented, scenario-specific architectures with unified Transformer-based models . But deploying GR at scale on Ascend NPUs faces fundamental system-level challenges .

Abstract ↗ PDF ↗

🟢 Applied

Swarm Network-as-a-Service (SNaaS)

💡 This research proposes a method for edge computing.

Swarm Network-as-a-Service (SNaaS) leverages fleets of drones to provide on-demand connectivity at scale . SNaaS explicitly models drone-to-device and drone interactions as composable services . A dedicated enforcement module monitors queue stability and SLA latency, adaptively reconfiguring the swarm when violations occur .

Abstract ↗ PDF ↗

🟢 Applied

GenAI-Driven Approach to RISC-V Supply Chain Exploration

💡 This research proposes a method for language AI.

This paper presents an LLM-empowered workflow for RISC-V supply chain analysis . The proposed approach addresses the challenges of heterogeneous and unstructured supply chain data by leveraging LLMs for textual understanding and VLMs for extracting information from visual artifacts such as diagrams and tables .

Abstract ↗ PDF ↗

🟢 Applied

A detailed algorithmic study on a reuse-aware, near memory, all-digital Ising machine

💡 This research optimizes computer vision.

Nature-inspired computing approaches have gained significant attention for solving difficult optimization problems . SACHI is an all-digital Ising architecture implemented by repurposing the L1 cache of a CPU using SRAM-based processing-in-memory techniques .

Abstract ↗ PDF ↗

🟢 Applied

Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms

💡 This research explores techniques in edge computing.

Charm++ is a parallel programming system which has demonstrated the utility of overdecomposition for many applications and in multiple contexts . The emergence of GPGPUs as a dominant compute component has created some real and perceived challenges for this paradigm .

Abstract ↗ PDF ↗

🔬

Offline-First / Local AI

🟢 Applied

FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy

💡 This research optimizes machine learning.

FocalPolicy is a foresight-aware visuomotor policy that combines Frequency-Optimized Chunking with Locally Anchored flow matching . It combines time-domain alignment within the proximal actions while regularizing frequency-domain structure over multiple future action chunks to improve cross-chunk coherence .

Abstract ↗ PDF ↗

🟢 Applied

Unsupervised Domain Shift Detection with Interpretable Subspace Attribution

💡 This research presents techniques for edge computing.

We developed a tool for detecting subtle differences in the probability distributions of datasets . We identify these shifts using an algorithm designed to detect localised density anomalies in high-dimensional feature spaces . If an anomaly is present, we then identify the feature subspace in which the anomaly is most pronounced . This allows us to trace the domain shift to a small set of features, making the shift interpretable .

Abstract ↗ PDF ↗

🟢 Applied

Ti-iLSTM: A TinyDL Approach for Logic-Level Anomaly Detection in Industrial Water Treatment Systems

💡 This research explores techniques in computer vision.

Industrial Water Treatment Systems exposed to cyber threats that can manipulate process behaviour without creating obvious devices outliers . Logic-layer deception anomalies can preserve numerically plausible measurements while breaking expected cause-and-effect relationships .

Abstract ↗ PDF ↗

🟢 Applied

Navigating Potholes with Geometry-Aware Sharpness Minimization

💡 This research explores techniques in computer vision.

Sharpness-aware minimization (SAM) encourages flat minima by perturbing parameters along directions of high loss curvature . But SAM treats all directions uniformly, ignoring the underlying loss geometry . LLQR+SAM combines SAM with a learned preconditioner obtained from the recently proposed LLQr framework .

Abstract ↗ PDF ↗

🟢 Applied

Multi-Fidelity Flow Matching: Cascaded Refinement of PDE Solutions

💡 This research explores techniques in machine learning.

Multi-Fidelity Flow Matching (MFFM) is a cascade refinement framework for parametric PDE solutions . Source distribution in conditional flow matching is a design parameter that can be calibrated to data, not a default isotropic prior . Conditioning makes residual refinement problem substantially easier than unconditional field generation .

Abstract ↗ PDF ↗

🟢 Applied

MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement

💡 This research explores techniques in language AI.

Model-Induced Noise Decoupling (MIND) is a theoretically grounded framework addressing model-induced label noise . We demonstrate that the high-dimensional noise manifold can be decoupled into tractable, subspace-dependent components via Latent Manifold Disentanglement . MIND significantly outperforms state-of-the-art methods on complex benchmarks .

Abstract ↗ PDF ↗

🟢 Applied

Looped SSMs: Depth-Recurrence and Input Reshaping for Time Series Classification

💡 This research explores techniques in machine learning.

State Space Models (SSMs) are inherently recurrent along the sequence dimension . Yet depth-recurrence - reusing the same block repeatedly across layers - has not been explored in this model family . We show that a looped SSM with $k$ parameters iterated $L$ times consistently closely matches or outperforms a standard SSM . We further show that input reshaping is an equally neglected design axis .

Abstract ↗ PDF ↗

🟢 Applied

Judge Circuits

💡 This research explores techniques in language AI.

LLM-as-a-judge has become the dominant paradigm for grading model outputs at scale . Yet the same model assigns systematically different scores when its output format changes . Using Position-aware Edge attribution Patching, we causally investigate the internal mechanism in Gemma-3, Qwen2.5, and Llama-3.5 .

Abstract ↗ PDF ↗

🟢 Applied

Clock-state olfactory search in turbulent flows using Q-learning: The geometry of plume recovery

💡 This research explores techniques in machine learning.

Using tabular Q-learning to train an olfactory search agent with a minimal memory of past observations . This agent learns an interpretable strategy to recover the plume which combines well-known behaviors observed in insects: surging, casting, and a return downwind .

Abstract ↗ PDF ↗

🟢 Applied

Shapley Neuron Values for Continual Learning: Which Neurons Matter Most?

💡 This research tackles the problem of computer vision.

Shapley Neuron Valuation (SNV) quantifies Neuron importance in continual learning . SNV selectively freezes important Neurons while keeping others plastic, enabling buffer-free continual learning without expanding architecture. SNV improves accuracy by +2.88% in the class incremental learning and +6.46% in task incremental learning scenarios .

Abstract ↗ PDF ↗

🟢 Applied

StippleDiffusion: Capacity-Constrained Stippling using Controlled Diffusion

💡 This research optimizes computer vision.

Stipple patterns, point sets whose local density tracks a target image, are traditionally produced by per-density iterative optimizers, which are slow, non-differentiable . We present the first diffusion-based sampler that simultaneously satisfies a learned local point-distribution prior and a continuous, image-defined capacity constraint at inference .

Abstract ↗ PDF ↗

🟢 Applied

Continual Learning of Domain-Invariant Representations

💡 This research optimizes computer vision.

Continual learning (CL) aims to train models sequentially over multiple domains without forgetting previously learned knowledge . Existing CL methods optimize for in-domain performance and are prone to learning spurious, domain-specific cues (``shortcut learning'')

Abstract ↗ PDF ↗

🟢 Applied

GOMA: Toward Structure-Driven Multimodal Alignment from a Graph Signal Smoothing Perspective

💡 This research running AI locally on devices for computer vision.

Graph-Optimized Multimodal Alignment (GOMA) is a structure-driven post-alignment framework . GOMA decouples three key design choices: where messages should flow, how multimodal evidence should propagate, and which smoothing depth should be retained . It learns modality-aware propagation operators, performs finite-step coupled smoothing without diagonal cross-modal shortcuts .

Abstract ↗ PDF ↗

🟢 Applied

Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

💡 This research proposes a method for speech processing.

We propose a scalable neuromorphic architecture based on spiking dynamics emerging from the autonomous time-continuous evolution of clockless (asynchronous) digital circuits . Implemented on commercially available field-programmable gate arrays, our system implements networks of interacting Boolean spiking neurons with configurable excitatory and inhibitory synaptic weights .

Abstract ↗ PDF ↗

🟢 Applied

A numerical study into neural network surrogate model performance for uncertainty propagation

💡 This research reduces machine learning.

Neural network surrogate models have emerged as a promising approach to model solution fields for boundary value problems encountered in physical modeling . Stochastic problems represent an area of interest because of the potential to significantly reduce the repeated evaluation of expensive forward models via traditional numerical solvers when conducting parametric analysis .

Abstract ↗ PDF ↗

🟢 Applied

SAFE Quantum Machine Learning with Variational Quantum Classifiers

💡 This research proposes a method for machine learning.

We propose a variational quantum classifier operating on high dimensional deep representations via amplitude encoding, stabilized by a learnable classical pre encoding layer . Model reliability is assessed using SAFE-AI metrics derived from the Cramer von Mises divergence .

Abstract ↗ PDF ↗

🟢 Applied

Explainable AI Isn't Enough! Rethinking Algorithmic Contestability

💡 This research automatically finding machine learning.

Machine learning systems increasingly make life-changing decisions about individuals . How can individuals respond to negative decisions made by these opaque systems? We propose an operational definition of contestability as a complement to recourse .

Abstract ↗ PDF ↗

🟢 Applied

Mind Dreamer: Untethering Imagination via Active Latent Intervention on Latent Manifolds

💡 This research explores techniques in language AI.

Model-Based Reinforcement Learning (MBRL) leverages latent imagination for sample efficiency, yet remains constrained by Historical Tethering . We propose Mind Dreamer (MD), a framework that operationalizes Active Latent Intervention (ALI) to transcend Markovian continuity . MD reformulates discovery as the minimization of a global Relay Manifold Expected Free Energy (R-EFE)

Abstract ↗ PDF ↗

🟢 Applied

Variational Autoregressive Networks with probability priors

💡 This research proposes a method for machine learning.

Monte Carlo methods are essential across diverse scientific fields, yet their efficiency is frequently hampered by slowing down . In this paper, we demonstrate that incorporating physical priors into the model significantly enhances performance . Building upon existing strategies that integrate spin-spin interactions, we propose a framework that utilizes a prior probability distribution as a starting point for training .

Abstract ↗ PDF ↗

🟡 Advanced

Adaptive Outer-Loop Control of Quadrotors via Reinforcement Learning

💡 This research introduces a new approach to machine learning.

Deep Reinforcement Learning (DRL) for quadrotor flight control typically relies on Domain Randomization for sim-to-real transfer . We propose a novel adaptive control architecture that actively perceives and reacts to instantaneous perturbations . Real-world validations on a Crazyflie micro-quadrotor demonstrate that our adaptive controller significantly outperforms baselines .

Abstract ↗ PDF ↗

🟡 Advanced

Testing properties of trees in graphical models with covariance queries

💡 This research explores techniques in machine learning.

We consider the problem of testing properties of graphs underlying high-dimensional graphical models . We adopt the model of covariance queries introduced by Lugosi, Truszkowski, Velona, and Zwiernik (2021) We study the case when the underlying graph is a tree .

Abstract ↗ PDF ↗

🟢 Applied

Entropy-Based Characterisation of the Polarised Regime in Latent Variable Models

💡 This research explores techniques in machine learning.

Variational Autoencoders (VAEs) often exhibit a polarised regime in which latent variables separate into active, passive, and mixed subsets . Existing criteria for identifying active dimensions depend on a Gaussian prior, limiting their applicability to variational models and specific priors . We propose a simple information-theoretic classification based on the entropy of the mean representation . We show theoretically how this entropy couples to KL minimisation through entropy--variance bounds .

Abstract ↗ PDF ↗

🟡 Advanced

Imperfect World Models are Exploitable

💡 This research introduces a new approach to machine learning.

We propose a novel definition of model exploitation in reinforcement learning . Informally, a world model is exploitable if it implies that one policy should be strictly preferred over another while the environment's true transition model implies the reverse . We analogize our definition with a prior characterization of reward hacking .

Abstract ↗ PDF ↗

🟢 Applied

A Retrieval-Enhanced Transformer for Multi-Step Port-of-Call Sequence Prediction in Global Liner Shipping

💡 This research tackles the problem of machine learning.

Connectivity-Constrained and Retrieval-Enhanced (CCRE) deep learning framework proposes a deep-learning framework . CCRE achieves a 72.3% first-destination accuracy and a 61.4% average three-step accuracy, outperforming baselines like CatBoost and LSTM .

Abstract ↗ PDF ↗

🟢 Applied

LoCO: Low-rank Compositional Rotation Fine-tuning

💡 This research makes more efficient language AI.

Parameter-efficient fine-tuning (PEFT) has emerged as a critical technique for adapting large-scale foundation models across natural language processing and computer vision . We introduce a novel PEFT method that constructs orthogonal transformations through low-rank skew-symmetric matrices and compositional rotation chains . We propose an approximation scheme that enables fully parallel computation of compositional rotations .

Abstract ↗ PDF ↗