arXiv Research Digest

April 28, 2026 • 125 papers across 5 interests

🔬

Efficient ML / Edge AI

🟢 Applied

Structural Pruning of Large Vision Language Models: A Comprehensive Study on Pruning Dynamics, Recovery, and Data Efficiency

💡 This research reduces language AI.

Large Vision Language Models (LVLMs) demonstrate impressive capabilities, but their substantial computational and memory requirements pose deployment challenges on resource-constrained edge devices . We study a complementary route: compressing existing LVLMs by applying structured pruning to the language model backbone, followed by lightweight recovery training . We assess the feasibility of conducting recovery training with only a small fraction of the available data .

Abstract ↗ PDF ↗

🟢 Applied

DepthKV: Layer-Dependent KV Cache Pruning for Long-Context LLM Inference

💡 This research makes more efficient language AI.

Long-context reasoning is a critical capability of large language models, enabling applications such as long-document understanding, summarization, and code generation . However, efficient autoregressive inference relies on the key-value (KV) cache, whose memory footprint grows linearly with sequence length . To mitigate this overhead, KV cache pruning methods discard cached tokens with low attention scores during inference .

Abstract ↗ PDF ↗

🟢 Applied

CF-VLA: Efficient Coarse-to-Fine Action Generation for Vision-Language-Action Policies

💡 This research faster predictions in computer vision.

Flow-based vision-language-action (VLA) policies offer strong expressivity for action generation, but suffer from a fundamental inefficiency . Multi-step inference is required to recover action structure from uninformative Gaussian noise . We propose CF-VLA, a coarse-to-fine two-stage formulation that restructures action generation into a coarse initialization step that constructs an action-aware starting point .

Abstract ↗ PDF ↗

🟢 Applied

Deployment-Aligned Low-Precision Neural Architecture Search for Spaceborne Edge AI

💡 This research optimizes computer vision.

Designing deep networks that meet latency and accuracy constraints on edge accelerators increasingly relies on hardware-aware optimization . We evaluate the proposed framework on vessel segmentation for spaceborne maritime monitoring, targeting the Intel Movidius Myriad X Visual Processing Unit .

Abstract ↗ PDF ↗

🟢 Applied

Less Is More: Engineering Challenges of On-Device Small Language Model Integration in a Mobile Application

💡 This research achieves better language AI.

On-device Small Language Models promise fully offline, private AI experiences for mobile users . But is this promise achievable in practice? This paper presents a case study documenting the engineering challenges of integrating SLMs into a production Android word-guessing game .

Abstract ↗ PDF ↗

🟡 Advanced

Efficient learning by implicit exploration in bandit problems with side observations

💡 This research explores techniques in machine learning.

We consider online learning problems under a partial observability model capturing situations where the information conveyed to the learner is between full information and bandit feedback . For this setting, we propose the first algorithm that enjoys near-optimal regret guarantees without having to know the observation system before selecting its actions . We also define a new partial information setting that models online combinatorial optimization problems .

Abstract ↗ PDF ↗

🟢 Applied

GradMAP: Gradient-Based Multi-Agent Proximal Learning for Grid-Edge Flexibility

💡 This research proposes a method for edge computing.

GradMAP trains independent neural-network policies for each agent without any parameter sharing . Each agent uses only its own local observation for online decision-making without communication . GradMAP embeds a differentiable three-phase AC power-flow model in a primal-dual learning loop .

Abstract ↗ PDF ↗

🟢 Applied

Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks

💡 This research makes more efficient machine learning.

Functional Task Networks (FTN) uses a high dimensional, self-organizing binary mask over a large population of small but deep networks . The mask is produced by a three-stage procedure: (1) gradient descent on a continuous mask identifies task-relevant neurons, (2) a smoothing kernel biases the result toward spatial contiguity, and k-winner-take-all binarizes the resulting group at a fixed capacity budget .

Abstract ↗ PDF ↗

🟢 Applied

Point-MF: One-step Point Cloud Generation from a Single Image via Mean Flows

💡 This research achieves better computer vision.

Single-image point cloud reconstruction must infer complete 3D geometry, including occluded parts, from a single RGB image . While diffusion-based reconstructors achieve high accuracy, they typically require many denoising iterations, resulting in slow and expensive inference . We propose Point-MF, a Mean-Flow-based framework for low-NFE single image reconstruction . It operates directly in point-cloud space to learn the mean velocity field and enables one-step reconstruction .

Abstract ↗ PDF ↗

🟢 Applied

Enhancing molecular dynamics with equivariant machine-learned densities

💡 This research explores techniques in machine learning.

DenSNet is a density-first approach to machine-learned electronic structure . It learns the Hohenberg--Kohn map from nuclear configurations to the ground-state electron density . The approach uses an SE(3)-equivariant neural network to predict density coefficients of a flexible atom-centered Gaussian basis .

Abstract ↗ PDF ↗

🟢 Applied

Aligned Multi-View Scripts for Universal Chart-to-Code Generation

💡 This research explores techniques in computer vision.

Chart-to-code generation converts a chart image into an executable plotting script . The same chart can be expressed by semantically equivalent scripts in different plotting languages . CharLuMA augments the multimodal projector with a language-conditioned mixture of low-rank subspaces .

Abstract ↗ PDF ↗

🟡 Advanced

Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models

💡 This research tackles the problem of language AI.

Layerwise Convergence Fingerprinting (LCF) is a tuning-free runtime monitor that treats the inter-layer hidden-state trajectory as a health signal . LCF aggregates via Ledoit-Wolf shrinkage and thresholds via leave-one-out calibration on 200 clean examples, with no reference model, trigger knowledge, or retraining. LCF reduces mean backdoor attack success rate (ASR) below 1% on Qwen2.5-7B and Gemma

Abstract ↗ PDF ↗

🟢 Applied

TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering

💡 This research explores techniques in computer vision.

Despite advances in text-to-image generation, models still struggle to accurately render prompt-specified text with correct spatial layout . This challenge is driven not only by the lack of datasets that align prompts with the exact text and layout expected in the image . To address these issues, we introduce TextGround4M, a large-scale dataset of over 4 million prompt-image pairs .

Abstract ↗ PDF ↗

🟢 Applied

SPLIT: Separating Physical-Contact via Latent Arithmetic in Image-Based Tactile Sensors

💡 This research speeds up computer vision.

SPLIT is a novel method for simulating image-based tactile sensors . Central to our approach is a latent space arithmetic strategy that explicitly disentangles contact geometry from sensor-specific optical properties . Unlike methods that require recalibration for every new unit, this disentanglement allows SPLIT to adapt to diverse DIGIT backgrounds .

Abstract ↗ PDF ↗

🟢 Applied

Kwai Summary Attention Technical Report

💡 This research explores techniques in language AI.

Long-context ability has become one of the most important iteration direction of next-generation Large Language Models . The standard softmax attention exhibits quadratic time complexity with respect to sequence length . As the sequence length increases, this incurs substantial overhead in long-context settings, leading the training and inference costs of extremely long sequences deteriorate rapidly .

Abstract ↗ PDF ↗

🟢 Applied

DYMAPIA: A Multi-Domain Framework for Detecting AI-based Video Manipulation

💡 This research automatically finding computer vision.

DYMAPIA is a multi-domain Deepfake detection framework that fuses spatial, spectral, and temporal cues to capture subtle traces of manipulation in visual data . System builds dynamic anomaly masks by combining evidence from Fourier spectra, local texture descriptors, edge irregularities, and optical flow consistency .

Abstract ↗ PDF ↗

🟢 Applied

MIPIC: Matryoshka Representation Learning via Self-Distilled Intra-Relational and Progressive Information Chaining

💡 This research presents techniques for language AI.

Representation learning is fundamental to NLP, but building embeddings that work well at different computational budgets is challenging . We propose MIPIC (Matryoshka Representation Learning via Self-Distilled Intra-Relational Alignment and Progressive Information Chaining)

Abstract ↗ PDF ↗

🟢 Applied

The Last Human-Written Paper: Agent-Native Research Artifacts

💡 This research explores techniques in machine learning.

Scientific publication compresses a branching, iterative research process into a linear narrative, discarding the majority of what was discovered along the way . We introduce the Agent-Native Research Artifact (Ara) protocol that replaces the narrative paper with machine-executable research package . Ara raises question-answering accuracy from 72.4% to 93.7% on PaperBench and RE-bench .

Abstract ↗ PDF ↗

🟢 Applied

XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation

💡 This research running AI locally on devices for language AI.

Graph-based Retrieval-Augmented Generation (GraphRAG) extends traditional RAG by using knowledge graphs to give large language models a structured, semantically coherent context . GraphRAG reasoning process remains a black-box, limiting our ability to understand how specific pieces of structured knowledge influence the final output .

Abstract ↗ PDF ↗

🟢 Applied

Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models

💡 This research explores techniques in language AI.

Large Language Models (LLMs) have recently been explored as fine-grained zero-shot re-rankers by leveraging attention signals to estimate document relevance . RouteHead is a query-dependent head selection method for attention-based re-ranking with LLMs .

Abstract ↗ PDF ↗

🟢 Applied

Majorization-Guided Test-Time Adaptation for Vision-Language Models under Modality-Specific Shift

💡 This research explores techniques in language AI.

Vision-language models transfer well in zero-shot settings but at deployment the visual and textual branches often shift asymmetrically . Under this condition, entropy-based test-time adaptation can sharpen the fused posterior while increasing error . We propose MG-MTTA, which keeps the backbone frozen and updates only a lightweight gate or adapter .

Abstract ↗ PDF ↗

🟢 Applied

Improving Vision-language Models with Perception-centric Process Reward Models

💡 This research improves language AI.

Recent advancements in reinforcement learning with verifiable rewards have significantly improved the complex reasoning ability of vision-language models . However, outcome-level supervision is too coarse to diagnose and correct errors within the reasoning chain . To this end, we propose Perceval, a process reward model (PRM) that enables token-level error grounding .

Abstract ↗ PDF ↗

🟢 Applied

MEG-RAG: Quantifying Multi-modal Evidence Grounding for Evidence Selection in RAG

💡 This research tackles the problem of language AI.

Multi-modal Evidence Grounding (MEG) is a metric that quantifies the contribution of retrieved evidence . Unlike standard confidence measures, MEG utilizes Semantic Certainty Anchoring, focusing on high-IDF information-bearing tokens that better capture the semantic core of the answer . MEG-RAG consistently outperforms strong baselines and demonstrates robust generalization across different teacher models .

Abstract ↗ PDF ↗

🟢 Applied

GSC-QEMit: A Telemetry-Driven Hierarchical Forecast-and-Bandit Framework for Adaptive Quantum Error Mitigation

💡 This research explores techniques in edge computing.

Quantum error mitigation (QEM) is essential for extracting reliable results from near-term quantum devices . Practical deployments must balance mitigation strength against runtime overhead under time-varying noise . GSC-QEMit is a telemetry-driven framework for mitigation that switches between lightweight suppression and heavier intervention as drift evolves .

Abstract ↗ PDF ↗

🟢 Applied

Extreme bandits

💡 This research makes more efficient machine learning.

In many areas of medicine, security, and life sciences, we want to allocate limited resources to different sources in order to detect extreme values . In this paper, we study an efficient way to allocate resources sequentially under limited feedback . The most commonly optimized property is the regret with respect to the maximum mean reward .

Abstract ↗ PDF ↗

🔬

Privacy-Preserving ML

🟢 Applied

X-NegoBox: An Explainable Privacy-Budget Negotiation Framework for Secure Peer-to-Peer Energy Data Exchange

💡 This research explores techniques in privacy-preserving AI.

The decentralization of modern energy systems is transforming consumers into prosumers who continuously exchange data with aggregators, peers, and market operators . While such data is essential for peer-to-peer trading, demand response, and distributed forecasting, it can reveal sensitive household patterns and introduce privacy risks .

Abstract ↗ PDF ↗

🟢 Applied

Agentic Witnessing: Pragmatic and Scalable TEE-Enabled Privacy-Preserving Auditing

💡 This research protecting data privacy in language AI.

The system is composed of three agents: a Verifier (who wants to check properties of a dataset), a Prover (who owns the dataset) and an Auditor . The Verifier is allowed to ask a limited number of simple binary true/false questions to the auditor . The Auditor uses the Model Context Protocol (MCP) to dynamically inspect the target dataset .

Abstract ↗ PDF ↗

🟢 Applied

A Survey on Split Learning for LLM Fine-Tuning: Models, Systems, and Privacy Optimizations

💡 This research protecting data privacy in language AI.

Fine-tuning unlocks large language models for specialized applications, but its high computational cost often puts it out of reach for resource-constrained organizations . Data privacy concerns make sharing sensitive information with third parties risky . A promising solution to this problem is split learning, which divides the model between clients and a server .

Abstract ↗ PDF ↗

🟡 Advanced

Information-Theoretic Distributed Point Functions with Shorter Keys

💡 This research explores techniques in machine learning.

A t-private n-server Information-Theoretic Distributed Point Function allows one to convert any point function f_{alpha,beta}(x): [N] -> G into n shares (secret keys) This paper constructs a novel share conversion based on the private information retrieval (PIR) of Ghasemi, Kopparty and Sudan .

Abstract ↗ PDF ↗

🟢 Applied

Computational Design and Experimental Validation of Photoactive PARP1 Inhibitors

💡 This research optimizes machine learning.

Light-activated drugs are a promising way to treat localized diseases for which existing treatments have severe side effects . Here we used computational techniques to find a set of promising candidates for the photoactive inhibition of the poly(ADP-ribose) polymerase 1 (PARP1) cancer target .

Abstract ↗ PDF ↗

🟢 Applied

SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation

💡 This research enhances language AI.

SAGE (Sparse Adaptive Guidance) is a novel LLM-based generation framework that enforces sparse and dynamic dependency guidance . SAGE discretizes features into value-aware pseudo-features and constructs a mutual information-based sparse dependency graph . This graph adaptively guides generation through explicit context selection or implicit logit correction .

Abstract ↗ PDF ↗

🟢 Applied

Latent-Hysteresis Graph ODEs: Modeling Coupled Topology-Feature Evolution via Continuous Phase Transitions

💡 This research reduces edge computing.

Graph neural ordinary differential equations extend graph learning from discrete message-passing layers to continuous-time representation flows . While it supports adaptive long-range propagation, we show that Graph ODEs with strictly positive irreducible mixing operators face an inherent monostability trap . We propose HGODE (HGODE) which couples feature evolution with a latent topological potential driven by a learned pairwise force .

Abstract ↗ PDF ↗

🟢 Applied

Resolving Conflicts Between RTOS Timekeeping and Uninterruptable Trusted Computing

💡 This research explores techniques in machine learning.

Trusted Execution Environments (TEEs) on low-power microcontrollers (e.g., ARM TrustZone-M) enable isolation of Secure and Non-Secure software but still require both worlds to share resources, including interrupt controllers . Many RTOS-s rely on periodic interrupts (SysTicks) to advance their own notion of time (time-keeping), but the delivery of this interrupt is essential for preserving real-time behavior . On the other hand, the

Abstract ↗ PDF ↗

🟢 Applied

BitRL: Reinforcement Learning with 1-bit Quantized Language Models for Resource-Constrained Edge Deployment

💡 This research running AI locally on devices for language AI.

The deployment of intelligent reinforcement learning (RL) agents on resource-constrained edge devices remains a challenge . We introduce BitRL, a framework for building RL agents using 1-bit quantized language models . BitRL achieves 10-16x memory reduction and 3-5x energy efficiency improvements over full-precision baselines .

Abstract ↗ PDF ↗

🟢 Applied

Dynamic Cyber Ranges

💡 This research explores techniques in language AI.

As LLM-driven agents advance in cybersecurity, Jeopardy CTF benchmarks are approaching saturation . Cyber ranges, the natural next evaluation frontier, offer diminishing resistance under current design . To counteract this trend, we propose Dynamic Cyber Ranges .

Abstract ↗ PDF ↗

🟢 Applied

Detecting Avalanche Effect in Adversarial Settings: Spotting the Encryption Loops in Ransomware

💡 This research explores techniques in machine learning.

CipherXRay is inspired by avalanche effect, but it only checks whether a "ripple effect" of avalanche effect exists, allowing a straightforward counterattack to succeed . In this work, we present a new approach that checks the avalanche effect itself .

Abstract ↗ PDF ↗

🟢 Applied

AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

💡 This research explores techniques in language AI.

Autonomous AI agents extend large language models into full runtime systems that load skills, ingest external content, maintain memory, plan multi-step actions, and invoke privileged tools . This paper presents AgentWard, a lifecycle-oriented, defense-in-depth architecture that organizes protection across these five stages .

Abstract ↗ PDF ↗

🟢 Applied

ARCANE: Cross-Campaign Attacker Re-identification via Passive Beacon Telemetry -- A Bayesian Network Framework for Longitudinal Cyber Attribution

💡 This research improves machine learning.

Current cyber attribution approaches typically operate on a per-incident basis . We investigate whether cross-campaign attribution reduces ambiguity or whether structural limits persist under longitudinal data . We model adversary fingerprints as multi-dimensional feature vectors encoding behavioral, infrastructural, and temporal characteristics . We introduce ARCANE (Attacker Re-identification via Cross-campaign Attribution Network) framework .

Abstract ↗ PDF ↗

🟢 Applied

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

💡 This research improves computer vision.

Meta-CoT is a paradigm that performs a two-level decomposition of any single-image editing operation with two key properties: (1) Decomposability. (2) Generalizability. We observe that any editing intention can be represented as a triplet - (task, target, required understanding ability). We find that training on these five meta-tasks, together with the other two elements of the triplet, is sufficient to achieve strong generalization across unseen

Abstract ↗ PDF ↗

🟢 Applied

Uncovering Latent Patterns in Social Media Usage and Mental Health: A Clustering-Based Approach Using Unsupervised Machine Learning

💡 This research explores techniques in computer vision.

The widespread adoption of social media has heightened interest in its psychological effects . This study seeks to address this by segmenting individuals according to their social media usage and psychological well-being . Data from 551 participants, collected via an online survey, were preprocessed using KNN imputation for missing values .

Abstract ↗ PDF ↗

🟢 Applied

DETOUR: A Practical Backdoor Attack against Object Detection

💡 This research optimizes computer vision.

Backdoor attacks on detection transformers for OD tasks rely on patch-wise triggers optimized at fixed locations with minimal perturbations . We propose DETOUR, a practical backdoor attack by using semantic triggers that are effective in real-world object detection systems .

Abstract ↗ PDF ↗

🟢 Applied

Fraud Detection in Cryptocurrency Markets with Spatio-Temporal Graph Neural Networks

💡 This research automatically finding machine learning.

Existing fraud detection mechanisms typically rely on machine learning methods that treat each financial asset (i.e., token) and its related transactions independently . However, market manipulation strategies are rarely isolated events, but are characterized by coordination, repetition, and frequent transfers among related assets . This suggests that relational structure constitutes an integral component of the signal .

Abstract ↗ PDF ↗

🟢 Applied

Hierarchical Behaviour Spaces

💡 This research explores techniques in machine learning.

Recent work in hierarchical reinforcement learning has shown success in scaling to billions of timesteps when learning over a set of predefined reward functions . We show that, instead of using a single reward function per option, the reward functions can be effectively used to induce a space of behaviours . We call this method Hierarchical Behaviour Spaces (HBS)

Abstract ↗ PDF ↗

🟢 Applied

Dialysis Risk Prediction and Treatment Effect Estimation for AKI patients using Longitudinal Electronic Health Records

💡 This research forecasting machine learning.

Progression to dialysis or end-stage renal disease is a rare but clinically important outcome . Clinicians need evidence on how medication exposures influence downstream risk . We constructed a fixed-window EHR cohort (90-day observation, 730-day prediction; N=81401) and modeled sequences of diagnoses, procedures, and medications with kidney laboratory trends .

Abstract ↗ PDF ↗

🟢 Applied

Stochastic simultaneous optimistic optimization

💡 This research explores techniques in edge computing.

We study the problem of global maximization of a function f given a finite number of evaluations perturbed by noise . We consider a very weak assumption on the function, namely that it is locally smooth (in some precise sense) with respect to some semi-metric, around one of its global maxima . StoSOO follows an optimistic strategy to iteratively construct upper confidence bounds over the hierarchical partitions of the function domain to decide which point to sample next .

Abstract ↗ PDF ↗

🟢 Applied

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

💡 This research optimizes edge computing.

In multi-objective reinforcement learning (MORL) one widely studied approach addresses this by training a single policy network conditioned on preference-weighted rewards . We propose using the RFRL's training objective as an auxiliary task to enhance MORL .

Abstract ↗ PDF ↗

🟢 Applied

Prior-Agnostic Robust Forecast Aggregation

💡 This research forecasting edge computing.

Previous work largely focuses on settings with a known binary state space, where the state is either 0 or 1 . We study prior-agnostic robust forecast aggregation in which the aggregator observes only experts' reports, yet is ignorant of the underlying joint information structure and the full prior, including the underlying state space .

Abstract ↗ PDF ↗

🟢 Applied

SceneSelect: Selective Learning for Trajectory Scene Classification and Expert Scheduling

💡 This research forecasting machine learning.

SceneSelect uses unsupervised clustering on interpretable geometric and kinematic features to discover a latent scene taxonomy . A highly decoupled classification module is trained to assign real-time inputs to these taxonomy categories . A plug-and-play scheduling policy automatically dispatches the trajectory sequence to the optimal expert predictor .

Abstract ↗ PDF ↗

🟢 Applied

MIMIC: A Generative Multimodal Foundation Model for Biomolecules

💡 This research presents techniques for machine learning.

Most foundation models in biology are trained within one modality or for a fixed forward task . We present MIMIC, a generative multimodal foundation model trained on our newly curated and aligned dataset, LORE . We link nucleic acid, protein, evolutionary, structural, regulatory, and semantic/contextual modalities within partially observed biomolecular states .

Abstract ↗ PDF ↗

🟢 Applied

GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems

💡 This research enhances language AI.

Graph-based anomaly detection methods show promise in protecting networks, but field lacks a standardized, reproducible environment to train these models and evaluate their efficacy . Gammaf is not a novel defense mechanism itself, but rather a comprehensive evaluation architecture designed to generate synthetic multi-agent interaction datasets and benchmark the performance of existing and future defense models .

Abstract ↗ PDF ↗

🔬

Creative AI / Emotion

🟢 Applied

Generative Design of a Gas Turbine Combustor Using Invertible Neural Networks

💡 This research makes more efficient edge computing.

The need to burn 100% H2 in high efficient gas turbines featuring low NOx combustion in premix mode require the complete redesign of the combustion system to ensure stable operation without any flashback . Generative design methods using latest AI technology will provide promising potential .

Abstract ↗ PDF ↗

🟢 Applied

From Players to Participants: Citizen Science and Video Games to Understand Cognition

💡 This research explores techniques in machine learning.

Citizen science is transforming how cognitive scientists study the human mind . By embedding experimental tasks into engaging, game-like experiences, researchers can reach large, diverse populations while collecting rich behavioral data outside the lab .

Abstract ↗ PDF ↗

🟢 Applied

Interoceptive machine framework: Toward interoception-inspired regulatory architectures in artificial intelligence

💡 This research proposes a method for machine learning.

This review proposes an integrative framework grounded on interoception and embodied AI-termed the interoceptive machine framework- that translates biologically inspired principles of internal-state regulation into computational architectures for adaptive autonomy . Interoception is conceived as the monitoring, integration, and regulation of internal signals .

Abstract ↗ PDF ↗

🟢 Applied

Putting a Face to the Issue: Fostering User Empathy of Open Source Software Developers With PersonaFlow

💡 This research explores techniques in emotion AI.

PersonaFlow generates editable user personas from OSS repository artifacts and integrates them alongside issue reports . In a user study with 13 OSS developers, most reported shifts in how they understood users . More than half modified their responses by adding empathetic language .

Abstract ↗ PDF ↗

🟢 Applied

All That Glitters Is Not Audio: Rethinking Text Priors and Audio Reliance in Audio-Language Evaluation

💡 This research presents techniques for language AI.

Large Audio-Language Models show consistent performance gains across speech and audio benchmarks . High scores may not reflect true auditory perception . If a model can answer questions without processing the acoustic signal, the benchmark fails as a measure of auditory understanding .

Abstract ↗ PDF ↗

🟢 Applied

An event-based sequence modeling approach to recognizing non-triad chords with oversegmentation minimization

💡 This research explores techniques in speech processing.

Automatic chord recognition (ACR) extracts time-aligned chord labels from music audio recordings . ACR still struggles with oversegmentation, data scarcity, and imbalance, especially in recognizing complex chords such as non-triads . We reformulate ACR as a segment-level sequence-to-sequence prediction task .

Abstract ↗ PDF ↗

🟢 Applied

SolarTformer: A Transformer Based Deep Learning Approach for Short Term Solar Power Forecasting

💡 This research makes more efficient machine learning.

An attention-based deep learning model, inspired by transformer architecture, is used for short-term solar power forecasting . Unlike traditional models, SolarTformer leverages self-attention mechanisms to capture temporal and spatial variability in solar irradiance .

Abstract ↗ PDF ↗

🟢 Applied

Seeing Is No Longer Believing: Frontier Image Generation Models, Synthetic Visual Evidence, and Real-World Risk

💡 This research explores techniques in computer vision.

Frontier image generation has moved from artistic synthesis to synthetic visual evidence . This paper provides a source-grounded technical and policy analysis of synthetic visual risk . We argue for layered control: model-side restrictions, cryptographic provenance, visible labeling, platform friction, sector-grade verification, and incident response .

Abstract ↗ PDF ↗

🟢 Applied

Modeling Behavioral Intensity and Transitions for Generative Recommendation

💡 This research achieves better machine learning.

BITRec is a novel generative multi-behavior recommendation framework that introduces structured behavioral modeling through selective dependency activation . BITRec incorporates Hierarchical Behavior Aggregation (HBA), which explicitly models behavioral intensity differences through separated exploration and commitment pathways . Experiments on four large-scale datasets (RetailRocket, Taobao, Tmall, Insurance Dataset) achieve consistent improvements of 15-23% across multiple metrics .

Abstract ↗ PDF ↗

🟢 Applied

BandRouteNet: An Adaptive Band Routing Neural Network for EEG Artifact Removal

💡 This research explores techniques in machine learning.

Electroencephalography (EEG) is highly susceptible to artifact contamination, such as electrooculographic (EOG) interference, which severely degrades signal quality and hinders reliable interpretation in applications including neurological diagnosis, brain-computer interfaces (BCIs) and neurological diagnosis . We propose BandRouteNet, an adaptive frequency-aware neural network for EEG denoising .

Abstract ↗ PDF ↗

🟢 Applied

RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation

💡 This research explores techniques in language AI.

RefEvo is a dynamic multi-agent framework designed for agile and reliable reference modeling . It has three key innovations: A Dynamic Design Planner that autonomously decomposes design specifications and constructs tailored execution workflows based on semantic complexity . A Co-Evolutionary Verification Mechanism employs a Dialectical Arbiter to simultaneously rectify the model and verification logic against the specification .

Abstract ↗ PDF ↗

🟢 Applied

FastOMOP: A Foundational Architecture for Reliable Agentic Real-World Evidence Generation on OMOP CDM data

💡 This research explores techniques in language AI.

The Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) enabled harmonisation of electronic health records data of nearly one billion patients in 83 countries . But generating real-world evidence (RWE) from these repositories remains a manual process requiring clinical, epidemiological and technical expertise .

Abstract ↗ PDF ↗

🟢 Applied

Understanding the Limits of Automated Evaluation for Code Review Bots in Practice

💡 This research explores techniques in language AI.

Automated code review (ACR) bots are increasingly used in industrial software development . As adoption grows, a key challenge is how to evaluate the usefulness of bot-generated comments reliably and at scale . We examine the feasibility and limitations of automating the evaluation of LLM-powered ACR bots in an industrial setting .

Abstract ↗ PDF ↗

🟢 Applied

Envisioning Mobile Data Visualization Libraries for Digital Health

💡 This research explores techniques in computer vision.

A key limitation is suboptimal design of visualizations for small-screen devices . We argue that this gap is partly driven by a lack of specialized developer tools . We advocate for dedicated mobile visualization libraries tailored to personal health data and mobile contexts .

Abstract ↗ PDF ↗

🟢 Applied

DPRM: A Plug-in Doob h transform-induced Token-Ordering Module for Diffusion Language Models

💡 This research explores techniques in language AI.

Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice . Existing systems mainly use random masking or confidence-driven ordering . We introduce DPRM (Doob h-transform Process Reward Model), a plug-in token-ordering module for diffusion language models .

Abstract ↗ PDF ↗

🟢 Applied

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

💡 This research achieves better language AI.

Process Reward Models (PRMs) have achieved remarkable success in augmenting the reasoning capabilities of Large Language Models (LLMs) within static domains such as mathematics . However, their potential in dynamic data analysis tasks remains underexplored . We introduce DataPRM, a novel environment-aware generative process reward model that can serve as an active verifier .

Abstract ↗ PDF ↗

🟢 Applied

K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology

💡 This research tackles the problem of language AI.

The development of practical (multimodal) large language model assistants for Korean weather forecasters is hindered by the absence of a multidimensional, expert-level evaluation framework grounded in authoritative sources . To address this, we introduce K-MetBench, a diagnostic benchmark grounded in national qualification exams .

Abstract ↗ PDF ↗

🟢 Applied

Evaluating whether AI models would sabotage AI safety research

💡 This research explores techniques in language AI.

We evaluate the propensity of frontier models to sabotage or refuse to assist with safety research when deployed as AI research agents within a frontier AI company . We find no instances of unprompted sabotage across any model, with refusal rates close to zero for Mythos Preview and Opus 4.7 Preview, though all models sometimes only partially completed tasks .

Abstract ↗ PDF ↗

🟢 Applied

NeSyCat: A Monad-Based Categorical Semantics of the Neurosymbolic ULLER Framework

💡 This research running AI locally on devices for edge computing.

ULLER (Unified Language for LEarning and Reasoning) offers a unified first-order logic (FOL) syntax . We show that these seemingly disparate semantics are all instances of one categorical framework based on monads . This enables the modular addition of new semantics and systematic translations .

Abstract ↗ PDF ↗

🟢 Applied

Children's Online Safety Risks and Ethical Considerations in XR Games

💡 This research presents techniques for machine learning.

Emerging extended reality technologies are reshaping how children play, learn, and socialize . Recent news has highlighted safety concerns such as car accidents, lower judgment for real-world situations, and exposure to disturbing content like virtual rape . This research examines how XR game design may lead to online safety risks for children .

Abstract ↗ PDF ↗

🟢 Applied

Towards Lawful Autonomous Driving: Deriving Scenario-Aware Driving Requirements from Traffic Laws and Regulations

💡 This research explores techniques in language AI.

Driving in compliance with traffic laws and regulations is a basic requirement for human drivers . But autonomous vehicles can violate these requirements in diverse real-world scenarios . Conventional approaches use formal logic languages to specify behavioral constraints . We propose a novel pipeline that grounds LLM reasoning in a traffic scenario taxonomy through node-wise anchors .

Abstract ↗ PDF ↗

🟢 Applied

Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols

💡 This research explores techniques in language AI.

As LLM agents transition to autonomous digital coworkers, maintaining deterministic goal-directedness in non-linear multi-turn conversations emerged as an architectural bottleneck . We propose Self-Synthesizing Reasoning Protocols (SSRP), a metacognitive framework that implements a discrete separation between high-level architectural planning (Architect) and turn-by-turn procedural execution .

Abstract ↗ PDF ↗

🟢 Applied

Blur Effects on User Performance in Target-Pointing Tasks

💡 This research explores techniques in computer vision.

In projectors and head-mounted displays, an out-of-focus image appears blurred . Even when a display itself is in focus, computer operation may be hindered if the display is far from the user or if a user has poor visual acuity, because the user cannot see the screen clearly .

Abstract ↗ PDF ↗

🟢 Applied

Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus

💡 This research explores techniques in language AI.

Multiple myeloma is managed through sequential lines of therapy over years to decades, with each decision depending on cumulative disease history distributed across dozens to hundreds of heterogeneous clinical documents . Whether LLM-based systems can synthesise this evidence at a level approaching expert agreement has not been established .

Abstract ↗ PDF ↗

🟢 Applied

Measuring Successful Cooperation in Human-AI Teamwork: Development and Validation of the Perceived Cooperativity and Teaming Perception Scales

💡 This research explores techniques in language AI.

The Perceived Cooperativity Scale (PCS) and Teaming Perception Scale (TPS) are theoretically grounded scales . The PCS captures an agent's perceived cooperative capability and practice within a single interaction sequence . The TPS captures the emergent sense of teaming arising from mutual contribution and support . Both scales were adapted for human-human cooperation to enable cross-agent comparisons .

Abstract ↗ PDF ↗

🔬

Lightweight Systems

🟢 Applied

Exact, Efficient, and Reliable Multi-Objective and Multi-Constrained IoT Workflow Scheduling in Edge-Hub-Cloud Cyber-Physical Systems

💡 This research makes more efficient edge computing.

Emerging IoT-enabled cyber-physical applications demand low-latency, energy-efficient, and reliable execution across resource-constrained edge devices . Existing scheduling approaches fail to jointly address all these objectives and constraints . The proposed formulation jointly optimizes latency, energy, and reliability, while holistically addressing timing and resource constraints .

Abstract ↗ PDF ↗

🟢 Applied

Architectural Isolation as a Timing Safety Primitive for Edge AI Medical Devices: Controlled Experimental Evidence on a Shared-Silicon Platform

💡 This research explores techniques in machine learning.

A system can satisfy accuracy-based validation, maintain output stability (Safety-Threshold Exceedance Rate, STER, equal to zero) and still violate timing constraints under deployment load . These are structurally independent properties that current pre-market validation protocols often do not operationalize at the inference layer . Joint STER and latency verification is proposed as a candidate method for operationalizing U.S. FDA Draft Guidance FDA-2024-D-4488 robustness requirements at

Abstract ↗ PDF ↗

🟢 Applied

Hardware-Efficient Softmax and Layer Normalization with Guaranteed Normalization for Edge Devices

💡 This research tackles the problem of language AI.

In Transformer models, non-GEMM operations often dominate hardware cost due to their nonlinear nature . We propose a hardware-efficient Softmax and LayerNorm with Guaranteed Normalization for Edge devices . Compared to the state of the art, we achieve 11x and 14x reduction in area .

Abstract ↗ PDF ↗

🟢 Applied

Hardware-Efficient FPGA Implementation of Sigmoid Function Using Mixed-Radix Hyperbolic Rotation CORDIC

💡 This research makes more efficient edge computing.

Efficient hardware implementation of nonlinear activation functions is a crucial task in deploying artificial neural networks on resource-constrained and edge devices such as Field-Programmable Gate Arrays . The proposed approach leverages the mathematical relationship between the sigmoid and hyperbolic tangent functions .

Abstract ↗ PDF ↗

🟡 Advanced

Tessera: Secure, Near-Line-Rate Weight Streaming for UMA Edge Accelerators

💡 This research running AI locally on devices for language AI.

Tessera is a reference architecture for inline, cache-line granularity weight decryption on UMA edge accelerators . The design intercepts 64-byte AXI bursts, computing AES-256-CTR keystreams in parallel with DRAM fetches . This streams plaintext directly into isolated NPU SRAM, creating a transient memory footprint confined to the active tile .

Abstract ↗ PDF ↗

🟢 Applied

Network Edge Inference for Large Language Models: Principles, Techniques, and Opportunities

💡 This research running AI locally on devices for language AI.

Large language models (LLMs) have advanced rapidly, emerging as versatile tools across fields . However, performing LLM inference at the network edge remains challenging due to their large memory and compute demands . This survey outlines the challenges specific to LLM edge inference and provides a comprehensive overview of recent progress .

Abstract ↗ PDF ↗

🟢 Applied

A Task Decomposition and Planning Framework for Efficient LLM Inference in AI-Enabled WiFi-Offload Networks

💡 This research tackles the problem of language AI.

AI WiFi offload is emerging as a promising approach for providing large language model services to resource-constrained wireless devices . Unlike conventional edge computing, LLM inference over WiFi must jointly address heterogeneous model capabilities, wireless contention, uncertain task complexity .

Abstract ↗ PDF ↗

🟢 Applied

LaissezCloud: Continuous Resource Renegotiation for the Public Cloud

💡 This research explores techniques in machine learning.

Public clouds increasingly expose heterogeneous hardware, but allocation interface remains built around rigid on-demand and spot service classes . LaissezCloud keeps allocations contestable during execution: tenants and operators update bids online .

Abstract ↗ PDF ↗

🟢 Applied

Risk-Aware and Stable Edge Server Selection Under Network Latency SLOs

💡 This research presents techniques for edge computing.

Each candidate server is characterised by predictive mean and uncertainty summaries of network latency . Risk is evaluated using a tight Normal approximation complemented by a conservative Cantelli bound . percentile-based scoring coupled with hysteresis stabilizes decisions and suppresses oscillatory switching under short-lived network fluctuations .

Abstract ↗ PDF ↗

🟢 Applied

Maximizing Memory-Level Parallelism via Integrated Stochastic Logic-in-Memory Architectures

💡 This research explores techniques in edge computing.

In-memory architectures have emerged as a complementary solution to conventional von Neumann systems . This study proposes a parallel in-memory stochastic computing (SC) architecture that implements an end-to-end computation pipeline .

Abstract ↗ PDF ↗

🟢 Applied

SPAC: Automating FPGA-based Network Switches with Protocol Adaptive Customization

💡 This research explores techniques in machine learning.

SPAC (Switch and Protocol Adaptive Customization) automates generation of FPGA-based network switches co-optimized for custom protocols and application-specific traffic patterns . SPAC delivers latency reductions ranging from 7.8% to 38.4% across various tasks .

Abstract ↗ PDF ↗

🟢 Applied

AGNT2: Autonomous Agent Economies on Interaction-Optimized Layer 2 Infrastructure

💡 This research optimizes machine learning.

AGNT2 is a three-tier stack purpose-built for agent and microservice coordination on-chain . Autonomous AI agents generate high-frequency, semantically rich service invocations among mutually untrusting principals . Existing chains treat those interactions as generic calldata forcing identity, escrow, dependency ordering, and session state to be encoded above execution layer at the wrong cost point .

Abstract ↗ PDF ↗

🟢 Applied

Déjà Vu Packing: Optimizing FPGA Logic Clustering Runtime via Pattern Memoization

💡 This research explores techniques in machine learning.

The packing stage constitutes 58% and 94% of the entire Versatile Place and Route (VPR) flow runtime on average when mapping a wide variety of benchmarks to the AMD 7-series-like and Altera Stratix-10-like VTR architecture captures .

Abstract ↗ PDF ↗

🟢 Applied

SpotVista: Availability-Aware Recommendation System for Reliable and Cost-Efficient Multi-Node Spot Instances

💡 This research explores techniques in machine learning.

Cloud vendors offer discounted spot instances to maximize surplus resource utilization, but these instances are subject to the risk of sudden interruption . Traditional pricing datasets have been employed to predict this risk, yet recent policy changes by cloud vendors have diminished their effectiveness . Paper proposes SpotVista, a system that recommends a resource pool of reliable and cost-efficient multi-node spot instances .

Abstract ↗ PDF ↗

🟢 Applied

Incisor: Ex Ante Cloud Instance Selection for HPC Jobs

💡 This research presents techniques for language AI.

Incisor is a cloud HPC job submission system for the ex ante instance selection problem: choosing suitable hardware in the challenging but common setting where only the executable, inputs, and invocation commands are available at submission time . In practice, this task is manual and expertise-intensive, requiring users to combine incomplete knowledge of rapidly evolving cloud offerings with workload-specific intuition, static analysis, and systems reasoning to infer hardware constraints .

Abstract ↗ PDF ↗

🟢 Applied

FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost

💡 This research forecasting machine learning.

FreeScale reduces computational bubbles caused by severe stragglers and slow blocking communications . FreeScale achieves up to 90.3% reduction in computational bubbles when applied to real-world workloads running on 256 H100 GPUs .

Abstract ↗ PDF ↗

🟢 Applied

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

💡 This research reduces computer vision.

KubePACS is a Kubernetes-native spot instance provisioning system that constructs node pools optimized for both cost and performance while guaranteeing high availability . Spot instances are widely adopted due to steep discounts compared to on-demand pricing .

Abstract ↗ PDF ↗

🟢 Applied

Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference

💡 This research achieves better language AI.

Large Language Models (LLMs) have achieved strong performance across natural language and multimodal tasks, yet their practical deployment remains constrained by inference latency and kernel launch overhead . This paper presents a hybrid runtime framework that combines Just-In-Time compilation with CUDA Graph execution to reduce launch overhead while preserving runtime flexibility during decoding .

Abstract ↗ PDF ↗

🟢 Applied

A comprehensive evaluation of spatial co-execution on GPUs using MPS and MIG technologies

💡 This research achieves better computer vision.

Multi-Process Service (MPS) improves performance by up to 30% and reduces energy by about 20% . However, under memory contention, MPS suffers severe degradation, worsening performance by around 30% . MIG's full hardware isolation resolves memory contention leading to more consistent improvements .

Abstract ↗ PDF ↗

🟢 Applied

Microarchitectural Co-Optimization for Sustained Throughput of RISC-V Multi-Lane Chaining Vector Processors

💡 This research achieves better machine learning.

Modern RISC vector processors rely on the synergy of multi-lane parallelism and chaining to achieve high sustained throughput . But achieved performance often falls substantially short of the theoretical performance bound due to microarchitectural inefficiencies . In this work, we take the open-source RVV processor Ara as the target platform and analyze the sources of its sustained-throughput loss and optimize the design accordingly .

Abstract ↗ PDF ↗

🟢 Applied

Exploiting pre-optimized kernels with polyhedral transformations for CGRA compilation

💡 This research makes more efficient machine learning.

Modern computing workloads commonly involve matrix-matrix multiplication (mmul) as a core computing pattern . Coarse-Grained Reconfigurable Arrays (CGRAs) can flexibly and efficiently support it . Here, we introduce a specialized mmul CGRA kernel schedule, parametrizable across different CGRA sizes .

Abstract ↗ PDF ↗

🟢 Applied

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference

💡 This research explores techniques in machine learning.

Lookup-table (LUT) based neural networks can deliver ultra-low latency and excellent hardware efficiency on FPGAs by mapping arithmetic operations directly onto the logic primitives . However, state-of-the-art LUT-aware training (LAT) approaches remain difficult to use in practice . They are often orders of magnitude slower to train than conventional networks and require non-trivial manual tuning .

Abstract ↗ PDF ↗

🟢 Applied

AutoINV: Automated Invariant Generation Framework for Formal Verification on High-Level Synthesis Designs

💡 This research reduces machine learning.

High-level synthesis (HLS) transforms an algorithmic description of hardware from a higher abstraction (e.g., C/C) into a register-transfer level (RTL) design . Large size of the generated RTL often causes model checking to struggle to conclude within reasonable time or resource limits . We propose utilizing the high-level design features from the HLS flow to construct a set of helper assertions aimed at guiding the model checker and accelerating the verification process .

Abstract ↗ PDF ↗

🟢 Applied

GR-Evolve: Design-Adaptive Global Routing via LLM-Driven Algorithm Evolution

💡 This research achieves better language AI.

GR-Evolve is a code evolution framework that leverages an agentic large language model to modify global routing source code using QoR-driven feedback . The framework equips the LLM with persistent contextual knowledge of open-source global routers along with an integrated toolchain for evaluation within the OpenROAD infrastructure . We demonstrate up to 8.72% reduction in post-detailed-routing wirelength over existing baseline routers .

Abstract ↗ PDF ↗

🟢 Applied

Accelerating Intra-Node GPU-to-GPU Communication Through Multi-Path Transfers with CUDA Graphs

💡 This research enhances edge computing.

CUDA Graph-based multi-path communication approach achieves up to a 2.95x bandwidth improvement, compared to the single-path UCX (UCT::CUDA-IPC) approach . To the best of our knowledge, our proposed approach is the first to seamlessly integrate CUDA graphs into UCX .

Abstract ↗ PDF ↗

🔬

Offline-First / Local AI

🟢 Applied

Few-Shot Cross-Device Transfer for Quantum Noise Modeling on Real Hardware

💡 This research explores techniques in edge computing.

In the noisy intermediate-scale quantum (NISQ) regime, quantum devices contain hardware-specific noise sources which restrict device-invariant error mitigation strategies . We explore transfer learning approaches to apply noise models learned on one quantum device to a different device with the help of a small amount of data .

Abstract ↗ PDF ↗

🟢 Applied

GeoEdit: Local Frames for Fast, Training-Free On-Manifold Editing in Diffusion Models

💡 This research tackles the problem of machine learning.

Diffusion models are a leading paradigm for data generation, but training-free editing typically re-runs the full denoising trajectory for every edit strength . To address this issue, we instead edit near the data manifold, where small local updates can replace repeated re-synthesis .

Abstract ↗ PDF ↗

🟢 Applied

Geometry-Aware Offline-to-Online Learning in Linear Contextual Bandits

💡 This research explores techniques in machine learning.

We study offline-to-online learning in linear contextual bandits with biased offline regression data . We propose Ellipsoidal-MINUCB, which combines a standard online branch with an offline-informed pooled branch and uses offline information only when it tightens uncertainty .

Abstract ↗ PDF ↗

🟢 Applied

Explaining Temporal Graph Predictions With Shapley Values

💡 This research forecasting edge computing.

Temporal Graph Neural Networks (TGNNs) have become increasingly popular in recent years due to their superior predictive performance . However, how these models utilize the information to make predictions is rather unexplored, leading to potentially faulty or biased models . This work introduces two novel model-agnostic explainers for local explanations of TGNNs based on Shapley and Owen values .

Abstract ↗ PDF ↗

🟢 Applied

Complexity of Linear Regions in Self-supervised Deep ReLU Networks

💡 This research explores techniques in machine learning.

Self-Supervised Learning (SSL) differs in that it directly optimises the representation space using a loss function to enhance the model's performance across multiple downstream tasks . We demonstrate that the evolution of linear regions correlates with the representation quality by using SplineCam to extract two-dimensional polytopes near the data distribution . Contrastive methods rapidly expand regions over time, whereas self-distillation methods tend to consolidate by merging neighbouring regions .

Abstract ↗ PDF ↗

🟢 Applied

An Aircraft Upset Recovery System with Reinforcement Learning

💡 This research enhances edge computing.

The PARS model employs an advanced reinforcement learning (RL) architecture, incorporating a cutting-edge soft-actor critic (SAC) model and hyper-parameter optimization methods . Negative-g punishments and other handcrafted features are also taken into account by the system .

Abstract ↗ PDF ↗

🟢 Applied

ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data

💡 This research makes more efficient machine learning.

The continuous advancement of autonomous driving introduces challenges across multiple disciplines to ensure safe and efficient driving . One promising approach is the use of crowdsourced data from a vehicle fleet, representing road topology and lane-level features . We adopt a Detection Transformer (DETR)-based approach .

Abstract ↗ PDF ↗

🟢 Applied

Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion

💡 This research explores techniques in computer vision.

Diffusion Templates is a unified and open plugin framework that decouples base-model inference from controllable capability injection . The framework is organized around three components: Template models that map arbitrary task-specific inputs to an intermediate capability representation . All resources will be open sourced, including code, models and datasets .

Abstract ↗ PDF ↗

🟢 Applied

AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents

💡 This research explores techniques in language AI.

Embodied AI agents increasingly rely on large language models (LLMs) for planning, yet per-step LLM calls impose severe latency and cost . In AgenticCache, each agent queries a runtime cache of frequent plan transitions . Cache-based plan reuse offers a practical path to low-latency, low-cost embodied agents .

Abstract ↗ PDF ↗

🟢 Applied

A Limit Theory of Foundation Models: A Mathematical Approach to Understanding Emergent Intelligence and Scaling Laws

💡 This research explores techniques in edge computing.

Study attempts to formalize emergent intelligence from the perspective of limit theory . Intelligence emerges as transition from finite to effectively infinite knowledge, authors say . They say intelligence originates from the existence of a parameter-limit architecture .

Abstract ↗ PDF ↗

🟡 Advanced

FedSLoP: Memory-Efficient Federated Learning with Low-Rank Gradient Projection

💡 This research distributed machine learning across privacy-preserving AI.

Federated learning enables a population of clients to collaboratively train machine learning models without exchanging raw data . Standard algorithms such as FedAvg suffer from slow convergence and high communication and memory costs . FedSLoP is guaranteed to converge to a first-order stationary point at a rate of $O(1/\sqrt{NT)$.

Abstract ↗ PDF ↗

🟢 Applied

Adaptive-Distribution Randomized Neural Networks for PDEs: A Low-Dimensional Distribution-Learning Framework

💡 This research explores techniques in machine learning.

Randomized neural networks (RaNNs) are attractive for partial differential equations (PDEs) because they replace expensive end-to-end training with a linear least-squares solve over randomized hidden features . AD-RaNN parameterizes the hidden-feature sampling distribution by a low-dimensional vector p and optimizes only p . The method uses a two-stage strategy: ridge-regularized reduced training for stable distribution-parameter optimization .

Abstract ↗ PDF ↗

🟢 Applied

Advancing Ligand-based Virtual Screening and Molecular Generation with Pretrained Molecular Embedding Distance

💡 This research makes more efficient computer vision.

Molecular similarity plays a central role in ligand-based drug discovery, such as virtual screening, analog searching and goal-directed molecular generation . Traditional similarity measures, ranging from fingerprint-based Tanimoto coefficients to 3D shape overlays, are often computationally expensive at scale or rely on hand-crafted molecular descriptors .

Abstract ↗ PDF ↗

🟡 Advanced

Scaling Properties of Continuous Diffusion Spoken Language Models

💡 This research explores techniques in language AI.

Speech-only spoken language models (SLMs) lag behind text and text-speech models in performance . Recent discrete autoregressive (AR) SLMs indicate significant computational and data demands to match text models . Since discretizing continuous speech for AR creates bottlenecks, we explore whether continuous diffusion (CD) SLM is more viable .

Abstract ↗ PDF ↗

🟢 Applied

An Automatic Ground Collision Avoidance System with Reinforcement Learning

💡 This research enhances machine learning.

This article evaluates an artificial intelligence (AI)-based Automatic Ground Collision Avoidance System (AGCAS) designed for advanced jet trainers to enhance operational effectiveness . The system utilizes line-of-sight queries on a terrain server to ensure precise and efficient collision avoidance .

Abstract ↗ PDF ↗

🟢 Applied

Certified geometric robustness -- Super-DeepG

💡 This research tackles the problem of computer vision.

Image processing functions are often required to be insensitive to small geometric perturbations such as rotation, scaling, shearing or translation . Safety-critical applications are required to perform as expected in normal operations . By doing so, Super-DeepG achieves both precision and computational efficiency .

Abstract ↗ PDF ↗

🟢 Applied

PathMoG: A Pathway-Centric Modular Graph Neural Network for Multi-Omics Survival Prediction

💡 This research proposes a method for machine learning.

Cancer survival prediction from multi-omics data remains challenging because prognostic signals are high-dimensional, heterogeneous, and distributed across interacting genes and pathways . PathMoG reorganizes genome-scale inputs into 354 KEGG-informed pathway modules, introduces a Hierarchical Omics Modulation module to condition gene-expression representations on mutation, copy number variation, pathway, and clinical context .

Abstract ↗ PDF ↗

🟡 Advanced

Primitive Recursion without Composition: Dynamical Characterizations, from Neural Networks to Polynomial ODEs

💡 This research explores techniques in machine learning.

All three operate over the continuum--real-valued states evolved by real-valued dynamics--even when the target functions are discrete . We prove primitive recursion admits equivalent characterizations in all three frameworks . No fixed polynomial map can round uniformly to the nearest integer or realize exact phase selection .

Abstract ↗ PDF ↗

🟢 Applied

Unveiling the Backdoor Mechanism Hidden Behind Catastrophic Overfitting in Fast Adversarial Training

💡 This research enhances computer vision.

Fast Adversarial Training (FAT) has attracted significant attention due to its efficiency in enhancing neural network robustness against adversarial attacks . However, FAT is prone to catastrophic overfitting (CO) wherein models overfit to the specific attack used during training and fail to generalize to others . In this work, we innovatively interpret CO through the lens of backdoor . We conceptualize CO as a weak trigger variant of unlearnable tasks .

Abstract ↗ PDF ↗

🟢 Applied

Semantic Segmentation for Histopathology using Learned Regularization based on Global Proportions

💡 This research explores techniques in computer vision.

Variational Segmentation from Label Proportions (VSLP) is a two-stage framework that infers dense segmentations from global label proportions without any pixel-level annotations . We validate our approach on two public datasets, achieving superior performance over existing weakly supervised and unsupervised methods .

Abstract ↗ PDF ↗

🟢 Applied

Perfecting Aircraft Maneuvers with Reinforcement Learning

💡 This research explores techniques in machine learning.

This paper evaluates an advanced jet trainer's utilization of artificial intelligence (AI)-based aircraft aerobatic maneuvers . A multitude of aircraft maneuvers have been simulated using reinforcement learning (RL) agents, which will serve as a training tool for future pilots .

Abstract ↗ PDF ↗

🟢 Applied

New non-Euclidean neural quantum states from additional types of hyperbolic recurrent neural networks

💡 This research explores techniques in machine learning.

In this work, we extend the class of non-Euclidean neural quantum states (NQS) to new variants including Poincaré RNN and Lorentz GRU . In particular, using larger systems consisting of 100 spins, we found that all four hyperbolic RNN/GRU NQS variants always outperformed their respective Euclidean counterparts .

Abstract ↗ PDF ↗

🟢 Applied

Mitigating Error Amplification in Fast Adversarial Training

💡 This research enhances computer vision.

Fast Adversarial Training (FAT) has proven effective in enhancing model robustness by encouraging networks to learn perturbation-invariant representations . But FAT often suffers from catastrophic overfitting (CO) where the model overfits to the training attack and fails to generalize to unseen ones .

Abstract ↗ PDF ↗

🟢 Applied

Self-Abstraction Learning for Effective and Stable Training of Deep Neural Networks

💡 This research explores techniques in machine learning.

Self-Abstraction Learning (SAL) is a hierarchical framework for deep learning . In SAL, networks are arranged by structural complexity, where the simplest topmost network is trained first and its hidden and output layers serve as guidance for the successively more complex networks below . This top-down sequential guidance effectively mitigates optimization issues, enabling stable training of deep architectures .

Abstract ↗ PDF ↗

🟢 Applied

Model-Free Inference of Investor Preferences: A Relative Entropy IRL Approach

💡 This research presents techniques for machine learning.

Relative Entropy Inverse Reinforcement Learning (RE-IRL) is employed to account for environments where transition probabilities are unknown or inaccessible . To address the challenge of data sparsity, we utilize a $K$-nearest neighbor approach to estimate the observed behavior policy . We propose a statistical testing framework to evaluate the validity of the estimated results .

Abstract ↗ PDF ↗