arXiv Research Digest

May 30, 2026 • 125 papers across 5 interests

🔬

Efficient ML / Edge AI

🟢 Applied

Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation

💡 VLA-AD uses high-level semantic guidance to enhance teacher-provided action targets, enabling the student policy to operate without external assistance.

VLA-AD augments teacher-provided 7-DoF action targets with high-level semantic guidance for robot manipulation, allowing the student policy to run independently without requiring the VLA teacher or VLM.

languageroboticsvisionactionpolicy

Abstract ↗ PDF ↗

Surrogate Neural Architecture Codesign Package (SNAC-Pack)

Surrogate Neural Architecture Codesign Package is an open-source AutoML framework for hardware-aware neural architecture codesign and end-to-end FPGA deployment . SNAC-Pack runs a multi-objective global search with Optuna and NSGA-II, loading trials to a shared SQLite store . A hardware surrogate model outputs per-trial resource and latency estimates, avoiding the synthesis cost that would otherwise dominate the search loop .

Abstract ↗ PDF ↗

🟢 Applied

AgriMind: An Ensemble Deep Learning Framework for Multi-Class Plant Disease Classification

💡 AgriMind combines three deep learning models to improve plant disease detection accuracy, showing that averaging their outputs yields 99.23% on the test set.

AgriMind is an ensemble deep learning framework that combines three models to achieve high accuracy in plant disease classification.

plantmachinedeep-learningclassification

Abstract ↗ PDF ↗

🟢 Applied

ITGPT: Generative Pretraining on Irregular Timeseries

💡 ITGPT is a specialized machine learning model designed for timeseries regression that can leverage large volumes of labeled multimodal data for structured data like text.

ITGPT is a Transformer-based large language model designed for timeseries regression that can leverage large volumes of labeled multimodal data for structured data like text.

timeserieslarge language modelmultimodal datastructured data

Abstract ↗ PDF ↗

🟢 Applied

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

💡 RecMem is a memory system for LLM agents that stores interactions in a subconscious layer and uses embedding models for retrieval.

RecMem is a memory system for LLM agents that stores interactions in a subconscious layer and uses embedding models for retrieval.

memoryllmagent

Abstract ↗ PDF ↗

🟢 Applied

Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking

💡 This technique helps reduce the computational cost of training reinforcement learning policies by focusing gradient computation on a smaller, probabilistically selected subset of policy phases per trajectory.

This method uses probabilistic chunk masking to reduce gradient computation in reinforcement learning by selecting a small subset of policy phases per trajectory.

rlpolicy optimizationgradient reductionvla

Abstract ↗ PDF ↗

Accelerated Gradient Descent for Faster Convergence with Minimal Overhead

CT-AGD (Curvature-Tuned Accelerated Gradient Descent) is a general boosting procedure that accelerates first-order methods by explicitly capturing the local curvature using finite-difference quotients . It has a comparable storage and computational overhead as adaptive gradient methods such as Adam .

Abstract ↗ PDF ↗

🟢 Applied

Layer Equivalence Is Not a Property of Layers Alone: How You Test Redundancy Changes What You Find

💡 This study shows that replacing one layer's map with another's can cause the system to fail to learn, even though both layers are equivalent.

This study shows that replacing one layer's map with another's can cause the system to fail to learn, even though both layers are equivalent.

layerequivalencereplacingmachine-learningpythiatraining

Abstract ↗ PDF ↗

🟢 Applied

FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast

💡 FORGE is a protocol for evolving agent memory in a staged, population-based way that improves evaluation return.

FORGE is a protocol for evolving agent memory in a staged, population-based way that improves evaluation return.

memoryagentevolution

Abstract ↗ PDF ↗

🟢 Applied

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

💡 Asteria is a system that optimizes machine learning models by separating second-order optimization from the GPU training path, allowing for asynchronous processing of expensive computations.

Asteria separates second-order optimization from GPU training by using training hooks to prepare shadow states in advance, allowing expensive inverse-root computations to proceed asynchronously on the host while GPU computations continue.

machine-learningtrainingoptimization

Abstract ↗ PDF ↗

🟢 Applied

SGR: A Stepwise Reasoning Framework for LLMs with External Subgraph Generation

💡 SGR helps LLMs understand and reason about complex queries by extracting semantic information from external knowledge sources and using that structure to guide multi-step reasoning.

SGR is a stepwise reasoning framework that uses external knowledge bases to generate subgraphs and supports multi-step inference for LLMs.

llmreasoningknowledge basessubgraph generation

Abstract ↗ PDF ↗

🟢 Applied

DebiasRAG: A Tuning-Free Path to Fair Generation in Large Language Models through Retrieval-Augmented Generation

💡 DebiasRAG uses self-diagnosed bias contexts from the query through retrieval, then reversely produces debiasing contexts to address potential bias in LLMs.

DebiasRAG addresses LLM bias through self-diagnosed bias contexts from the query through retrieval, then reversely produces debiasing contexts to address potential bias.

llmbiasfairness

Abstract ↗ PDF ↗

🟢 Applied

Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection

💡 This approach uses statistical features to identify MGTs, avoiding the dangers of overfitting that complex models can face.

This method detects machine-generated texts by using metric-based features rather than complex models, addressing the risk of overfitting.

text_detectionmachine_learningmetrics

Abstract ↗ PDF ↗

ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation

The rise of AI-generated images poses growing challenges for digital authenticity . ReAlign is a novel framework that distills reasoning texts generated by a GRPO-optimized LLM into a lightweight AIGI detector via contrastive learning .

Abstract ↗ PDF ↗

🟢 Applied

IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation

💡 IVGT is a computer vision technique that uses a neural network to learn continuous and coherent 3D geometry from unposed multi-view images.

IVGT is a computer vision technique that uses a neural network to learn continuous and coherent 3D geometry from unposed multi-view images.

computer visionneural network3d reconstructionmulti-view

Abstract ↗ PDF ↗

A Unified Generative-AI Framework for Smart Energy Infrastructure: Intelligent Gas Distribution, Utility Billing, Carbon Analytics, and Quantum-Inspired Optimisation

The accelerating convergence of smart metering, artificial intelligence, and quantum-inspired combinatorial optimisation is reshaping how energy utilities manage physical infrastructure, customer engagement, and environmental accountability .

Abstract ↗ PDF ↗

🟢 Applied

Argus: Evidence Assembly for Scalable Deep Research Agents

💡 Argus is a research system that uses a Searcher and Navigator to collect evidence and produce a verified final answer for deep research.

Argus is an agentic system that uses a Searcher and Navigator to cooperate and assemble evidence for deep research by verifying missing pieces and producing a source-traced final answer.

researchagentsevidence

Abstract ↗ PDF ↗

🟢 Applied

Fully Open Meditron: An Auditable Pipeline for Clinical LLMs

💡 FO models are medical AI systems that are fully open and auditable, enabling reproducible validation for clinical decision support.

Fully Open (FO) models are auditable medical AI systems that expose complete training stacks for rigorous validation.

llmmedicalaihealthcareclinicalvalidation

Abstract ↗ PDF ↗

A Scalable Nonparametric Continuous-Time Survival Model through Numerical Quadrature

QSurv is a scalable deep learning framework that enables nonparametric continuous-time modeling without relying on time discretization or restrictive distributional assumptions . We propose a training objective based on Gauss-Legendre numerical quadrature, which approximates the cumulative hazard with high-order accuracy . QSurv achieves competitive predictive performance with advantages in instantaneous hazard function estimation .

Abstract ↗ PDF ↗

🟢 Applied

Confirming Correct, Missing the Rest: LLM Tutoring Agents Struggle Where Feedback Matters Most

💡 This study evaluates how different LLM agents handle various types of solutions and feedback conditions in a logic problem setting.

A benchmark of seven LLM feedback agents for solving propositional logic problems across 10,836 solution--feedback pairs.

llmeducationlogicfeedbackground truthsolutions

Abstract ↗ PDF ↗

🟢 Applied

Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP

💡 The research shows that adding a hierarchy to a programmatic state abstraction degrades performance compared to using only the hierarchy alone.

This study examines how distributing deliberation tools across a hierarchy affects performance in cyber defense environments.

policy optimizationaicybersecuritymachine learningprogrammatic state abstraction

Abstract ↗ PDF ↗

🟢 Applied

Optimized Three-Dimensional Photovoltaic Structures with LLM guided Tree Search

💡 This case study demonstrates how AI coding systems can be used to autonomously generate high-efficiency three-dimensional photovoltaic structures.

This case study uses AI coding systems to generate novel scientific hypotheses about three-dimensional photovoltaic structures.

aiphotovoltaicscodingscientific researchthree-dimensional structures

Abstract ↗ PDF ↗

🟢 Applied

MAgSeg: Segmentation of Agricultural Landscapes in High-Resolution Satellite Imagery using Multimodal Large Language Models

💡 MAgSeg is a method for agricultural landscape segmentation that uses MLLMs and doesn't require auxiliary vision decoders.

MAgSeg uses MLLMs to segment agricultural landscapes from satellite imagery without auxiliary vision decoders.

deep-learningllmmachine-learningagriculturesatellitesegmentation

Abstract ↗ PDF ↗

🟢 Applied

Second-Order Multi-Level Variance Correction for Modality Competition in Multimodal Models

💡 This paper addresses a problem in multimodal models where different modalities (images and text) compete for attention, which can destabilize optimization and limit large-batch scaling.

The paper proposes a second-order optimization framework with Multi-Level Variance Correction for multimodal models, addressing modality competition that can destabilize optimization and limit large-batch scaling.

optimizationautoregressivenext-tokenmultimodalvariance-corrected

Abstract ↗ PDF ↗

SwAIther-Precip: Lead-Time-Aware Bias Correction Enables Kilometer-Scale Downscaling of Global AI Precipitation Forecasts over Switzerland

Global AI weather models can produce skillful medium-range forecasts, but their native 0.25 degrees resolution limits direct use for local hazard applications . We introduce SwAIther-Precip, a lead-time-aware downscaling framework that converts coarse-resolution AIFS forecasts into probabilistic km-scale precipitation fields over Switzerland .

Abstract ↗ PDF ↗

🔬

Privacy-Preserving ML

The Privacy Price of Tail-Risk Learning: Effective Tail Sample Size in Differentially Private CVaR Optimization

Differential privacy changes the effective sample size governing CVaR learning . For tail mass $τ$, the privacy-relevant sample size is not $n$, but $nτ$ For convex Lipschitz learning, modular upper and lower reductions show that CVAR-specific privacy term necessarily scales as $1/2 .

Abstract ↗ PDF ↗

🟢 Applied

FedEDAuth -- Federated Embedding Distribution Authentication for Counterfeit IC Detection

💡 FedEDAuth is a client authentication framework that detects and filters malicious participants before model aggregation using outlier analysis and micro-cluster behavior.

FedEDAuth is a lightweight client authentication framework that detects and filters malicious participants before model aggregation using outlier analysis and micro-cluster behavior.

machine learningauthenticationcounterfeit detection

Abstract ↗ PDF ↗

Centralized vs Decentralized Federated Learning: A trade-off performance analysis

Federated Learning (FL) has emerged as a promising paradigm for collaborative model training across distributed edge devices while preserving data privacy . Storing this amount of data centrally is challenging due to issues like limited communication, privacy, and regulations . Choosing the right FL architecture depends on the application's needs .

Abstract ↗ PDF ↗

🟢 Applied

Privacy is Fungibility: Why Endogenous Tokens Are Not Money

💡 Endogenous tokens are not money - simplified credit

Endogenous tokens are not money - simplified credit

cryptotokenmoney

Abstract ↗ PDF ↗

🟢 Applied

Federated Imputation under Heterogeneous Feature Spaces

💡 A machine learning framework that allows collaborative training across decentralized clients, even when features are not jointly observed locally.

A federated learning framework that separates structural feature unavailability from conventional missingness to enable indirect cross-client knowledge transfer in heterogeneous feature spaces.

federated learningimputationheterogeneous feature spaces

Abstract ↗ PDF ↗

🟢 Applied

PCDM: A Diffusion-Based Data Poisoning Attack Against Federated Learning Systems

💡 This paper proposes a new defense against data poisoning attacks in federated learning systems.

This paper proposes a diffusion-based data poisoning framework for federated learning systems.

federated learningdata poisoningdiffusiondefense

Abstract ↗ PDF ↗

🟢 Applied

Practical Validity Conditions for Byzantine-Tolerant Federated Learning

💡 This research provides practical conditions for making federated learning more robust to malicious data, ensuring the output stays within the convex hull of honest data.

This paper discusses practical validity conditions for Byzantine-tolerant federated learning, ensuring robust aggregation by bounding the output within the convex hull of honest data.

byzantinefederatedconvexityrobustnesslearning

Abstract ↗ PDF ↗

PersonaFingerprint: Measuring Persona Inference on Modern Websites with LLM-Driven Browsing

Website Fingerprinting has traditionally focused on inferring which website a user visits from encrypted traffic metadata such as packet sizes and timing . An adversary can infer a user's persona using only packet-length and inter-arrival-time sequences . We formalize persona fingerprinting under both closed-set and open-world settings .

Abstract ↗ PDF ↗

🟢 Applied

Attention Dispersion in Dynamic Graph Transformers: Diagnosis and a Transferable Fix

💡 Transformer-based systems struggle to focus on important nodes in dynamic graphs because time shifts weaken the contrast between attention, making it too spread out.

CTDG transformers fail to focus on critical nodes due to temporal shift weakening attention contrast, suggesting a simple fix of replacing standard attention with differential attention.

graphtransformerattentiondifferential

Abstract ↗ PDF ↗

🟢 Applied

When and Why Adversarial Training Improves PINNs: A Neural Tangent Kernel Perspective

💡 PINNs are like mathematical models that approximate real-world differential equations, but they are hard to train because they can be unstable and inaccurate. GAN-based adversarial training helps improve these models by using generative adversarial networks to create more stable and accurate surrogates.

PINNs are powerful surrogates for differential equations but are difficult to train due to spectral bias, stiffness, and poor accuracy. GAN-based adversarial training has shown strong results in improving training.

differential equationsneural networksadversarial traininggenerative adversarial networks

Abstract ↗ PDF ↗

🟢 Applied

Context-aware Entity-Relation Extraction for Threat Intelligence Knowledge Graphs

💡 This system helps organize and query diverse cyber threat intelligence sources into structured formats, which is essential for cybersecurity research and security operations.

A context-aware entity-relation extraction system for threat intelligence knowledge graphs that leverages hybrid NLP models and domain ontology to organize diverse cybersecurity sources.

machine-learningcybersecurityknowledge-graphsthreat-intelligence

Abstract ↗ PDF ↗

🟢 Applied

A Multi-Layer Cloud-IDS Pipeline with LLM and Adaptive Q-Learning Calibration

💡 This system secures three distinct layers of cloud infrastructure using machine learning models at each layer to detect known attack patterns and distinguish reliable decisions from uncertain outcomes.

The system secures three distinct layers of cloud infrastructure using machine learning models at each layer to detect known attack patterns and distinguish reliable decisions from uncertain outcomes.

machine-learningcloudsecurity

Abstract ↗ PDF ↗

🟢 Applied

A Generative AI Framework for Intelligent Utility Billing CO 2 Analytics and Sustainable Resource Optimisation

💡 This system helps utilities track carbon emissions and optimize resource use in a unified way.

A new framework integrates four production-grade capabilities into one system for distribution utilities.

carbonenergyutilitiesgrid

Abstract ↗ PDF ↗

🟢 Applied

AI-Mediated Communication Can Steer Collective Opinion

💡 AI can create biases in human communication, which can be amplified through the network and shift collective opinion.

AI can create biases in human communication, which can be amplified through the network and shift collective opinion.

aicommunicationopinion

Abstract ↗ PDF ↗

🟢 Applied

Dynamics-Level Watermarking of Flow Matching Models with Random Codes

💡 This method embeds the key-dependent perturbation directly into the learned continuous dynamics of a flow matching model, allowing for dynamic-level watermarking.

This method embeds the key-dependent perturbation directly into the learned continuous dynamics of a flow matching model, allowing for dynamic-level watermarking.

generative modelsflow matchingwatermarkingrandom codesdynamicscontinuous

Abstract ↗ PDF ↗

🟢 Applied

Universal Magnetic Structure Prediction from Atomic Coordinates with Near-Experimental Accuracy

💡 Understanding magnetic order in materials is difficult because real materials often have complex magnetic structures that are hard to determine through experiments or specialized first-principles calculations.

Predicting magnetic order in materials remains challenging due to noncollinear and incommensurate magnetic structures, which are difficult to determine through experiments or specialized first-principles methods.

magneticmaterialsphysics

Abstract ↗ PDF ↗

🟢 Applied

LymphNode: A Plug-and-Play Access Control Method for Deep Neural Networks

💡 LymphNode is a practical defense against unrestricted oracle access in deep neural networks by using plug-and-play watermarking techniques.

LymphNode is a practical defense against unrestricted oracle access in deep neural networks by using plug-and-play watermarking techniques.

securitywatermarkingdeep neural networksaccess controledge computingprivacy

Abstract ↗ PDF ↗

🟢 Applied

Artificial Aphasias in Lesioned Language Models

💡 This research provides a method to characterize how language models function, using a technique inspired by aphasias to measure their organizational patterns.

This paper introduces a method to analyze language model organization using a technique inspired by aphasias, measuring how the model's functional patterns correlate with symptoms.

machine learninglanguagelinguisticsneurologymodeling

Abstract ↗ PDF ↗

🟢 Applied

Hypothesis-driven construction of mesoscopic dynamics

💡 This method uses mathematical constraints and assumptions to construct the model, making it more principled and less dependent on empirical data.

We propose a hypothesis-driven approach to modeling mesoscopic dynamics by learning from a mathematically constrained hypothesis class rather than using fixed equations.

dynamicsphysicsmodeling

Abstract ↗ PDF ↗

🟢 Applied

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

💡 The paper looks at how to monitor and audit AI systems throughout their development lifecycle.

This paper examines AI governance techniques for monitoring and auditing AI-enabled products.

aiformal methodsmonitoringaudit

Abstract ↗ PDF ↗

🟢 Applied

Imitation learning for clinical decision support in pediatric ECMO

💡 Learning to act from the trajectories of interventions to support pediatric critical care decisions

Imitating clinical interventions to support pediatric critical care decisions

ecmopediatriccritical care

Abstract ↗ PDF ↗

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control

Bayesian Amnesic Piecewise-Robust SAC unifies Bayesian Online Change Detection (BOCD) with robust ensemble RL . The BAPR operator is a combination of mode-conditional Bellman operators weighted by a frozen belief distribution .

Abstract ↗ PDF ↗

🟢 Applied

From Backup Restoration to Minimum Viable Factory Recovery: A Systematization of Ransomware Recovery in Manufacturing Systems

💡 Manufacturing systems are interconnected and depend on each other; a ransomware attack can disrupt operations, making recovery essential for maintaining production continuity.

Ransomware recovery in manufacturing systems is about restoring critical operations after a cyberattack, maintaining interdependencies between components and ensuring the plant can resume work, authenticate operators, and release products.

cybersecuritymanufacturinginfrastructurerecovery

Abstract ↗ PDF ↗

🟢 Applied

Entropic Auto-Encoding via Implicit Free-Energy Minimization

💡 Entropic Autoencoders are a solution to posterior collapse in generative models by learning non-Gaussian, multimodal latent distributions that produce diverse, data-consistent outputs.

Entropic Autoencoders are a solution to posterior collapse in generative models by learning non-Gaussian, multimodal latent distributions that produce diverse, data-consistent outputs.

generativeautoencoderposteriorcollapselatentdistribution

Abstract ↗ PDF ↗

🟢 Applied

Skew-adaptive conformal prediction

💡 This method extends split conformal prediction to include skew-adaptive features, allowing for more accurate regression prediction.

This method extends split conformal prediction to include skew-adaptive features, allowing for more accurate regression prediction.

classificationconformal predictionskewregression

Abstract ↗ PDF ↗

🔬

Creative AI / Emotion

🔴 Theory-Heavy

Position: AI as Part of Self -- Extending the Mind Requires Cognitive Co-Regulation

💡 AI is increasingly participating in human cognitive processes, shaping how we form beliefs and make decisions.

AI increasingly participates in human cognitive processes, shaping how we form beliefs and make decisions.

aicognitiondecision-making

Abstract ↗ PDF ↗

🟢 Applied

SLIP & ETHICS: Graduated Intervention for AI Emotional Companions

💡 AI companions face a safety paradox where restrictive measures can hurt the supportive relationship, while permissive ones can cause harm. SLIP addresses this by gradually increasing protection levels.

SLIP is a four-stage graduated intervention method for AI companions starting with restrictive safeguards and moving to permissive systems.

aiinterventioncompanion

Abstract ↗ PDF ↗

🟢 Applied

Designing for Robot Wranglers: A Synthesis of Literature and Practice

💡 Robot wranglers are experts who create the perfect environment for robots to work in human spaces, ensuring safe and effective interactions.

Robot wranglers are professionals who design and integrate robots into human spaces to ensure safe and effective interactions.

roboticshuman-robot interactionrobot designrobot integration

Abstract ↗ PDF ↗

🟢 Applied

Designing Datacenter Power Delivery Hierarchies for the AI Era

💡 Designing power delivery hierarchies for AI accelerators is difficult because the rapidly increasing demand for power density requires careful planning to maximize grid capacity.

Designing efficient power delivery hierarchies for AI accelerators is challenging because the rapidly increasing demand for power density requires careful planning to maximize grid capacity.

aigridpower

Abstract ↗ PDF ↗

🟢 Applied

Evaluating Design Video Generation: Metrics for Compositional Fidelity

💡 Design animation uses generative video to create content with prescribed motion, directions, and layout constraints.

Generative video models are used in design animation to create structured content with specific motion and layout requirements.

designanimationvideo generation

Abstract ↗ PDF ↗

🟢 Applied

ARIA: A Diagnostic Framework for Music Training Data Attribution

💡 ARIA helps music creators understand which musical elements are contributing to their generated music, improving training data quality.

ARIA is a framework for music training data attribution that helps creators understand which musical elements are contributing to their generated music, improving training data quality.

musiccopyrightdata

Abstract ↗ PDF ↗

🟢 Applied

GenShield: Unified Detection and Artifact Correction for AI-Generated Images

💡 GenShield helps users identify AI-generated images and fix them to prevent misinformation and other issues.

GenShield is a unified framework that detects and corrects AI-generated images by jointly performing explainable detection and artifact correction.

aigenerative aicontent moderationverification

Abstract ↗ PDF ↗

🟢 Applied

GEMS -- Guided Evolutionary Molecule Design for Sustainable Chemicals

💡 GEMS is a tool for designing sustainable chemicals using machine learning and genetic algorithms.

GEMS is a tool for designing sustainable chemicals using machine learning and genetic algorithms.

machine-learningdesignchemistrygenetics

Abstract ↗ PDF ↗

🟢 Applied

Synchronized Realities: Towards Magic Mobile Experiences through Aligned AR

💡 AR experiences can be made highly reactive through AI-generated content, which requires careful alignment of perceptual modalities for maximum immersion.

AR experiences can be made highly reactive through AI-generated content, requiring careful alignment of perceptual modalities for maximum immersion.

aiarvralignment

Abstract ↗ PDF ↗

🟢 Applied

Property-Guided LLM Program Synthesis for Planning

💡 LLMs can now find programs that satisfy complex requirements, and this method helps reduce evaluation time and cost.

This method uses LLMs to check if programs satisfy formal properties and provide counterexamples when violated, reducing evaluation time and cost.

llmverificationprogram synthesiscounterexamples

Abstract ↗ PDF ↗

🟡 Advanced

Generative Long-term User Interest Modeling for Click-Through Rate Prediction

💡 GenLI is a system that combines interest generation, behavior retrieval, and interest fusion to enhance click-through rate prediction in advertising and recommendation systems.

GenLI is a system that combines interest generation, behavior retrieval, and interest fusion to enhance click-through rate prediction in advertising and recommendation systems.

advertisingrecommendation systemsuser interest modelingclick-through rate

Abstract ↗ PDF ↗

🟢 Applied

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

💡 VideoSeeker uses agentic reasoning to solve instance-level video understanding problems, providing precise spatial and temporal references to assist human models.

VideoSeeker integrates agentic reasoning with instance-level video understanding to solve spatiotemporal localization problems.

agentreasoningvideolarge_language_modelspatialtemporal

Abstract ↗ PDF ↗

🟢 Applied

Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making

💡 Ada-Diffuser lets machines learn how to plan and control by simultaneously modeling the underlying dynamics and observed interactions.

Ada-Diffuser is a causal diffusion model that learns temporal dynamics and latent interactions simultaneously for planning and control.

diffusioncausalplanningcontroladaptive

Abstract ↗ PDF ↗

🟡 Advanced

Reasoners or Translators? Contamination-aware Evaluation and Neuro-Symbolic Robustness in Tax Law

💡 Large language models can be used for legal reasoning, but their performance can be inflated by contamination.

This study examines whether large language models for legal reasoning are truly reasoning or artifacts of data contamination.

llmlegalreasoning

Abstract ↗ PDF ↗

🟢 Applied

XSearch: Explainable Code Search via Concept-to-Code Alignment

💡 XSearch explains why certain code snippets were returned by aligning concepts with code, making the search results more understandable and trustworthy.

XSearch is a code search framework that uses concept-to-code alignment to explain search results, making the search more transparent and trustworthy.

codesearchexplainabilitysemantic

Abstract ↗ PDF ↗

🟢 Applied

Constrained latent state modeling: A unifying perspective on representation learning under competing constraints

💡 This paper proposes a unified perspective on learning latent representations, offering a more comprehensive approach to addressing the fragmentation in current machine learning methods.

Constrained latent state modeling provides a unified framework for learning latent representations under competing constraints, addressing the fragmentation in current machine learning approaches.

learningrepresentationconstraints

Abstract ↗ PDF ↗

🟢 Applied

Beyond Content: A Comprehensive Speech Toxicity Dataset and Detection Framework Incorporating Paralinguistic Cues

💡 This dataset helps researchers understand how certain words or phrases can harm speech, which is important for creating better safety systems.

ToxiAlert-Bench is a large-scale audio dataset containing 30,000+ clips with 20 fine-grained labels for speech toxicity.

machine_learningtoxicityaudiospeech

Abstract ↗ PDF ↗

🟢 Applied

Driving Through the Network: Performance and Workload Under Latency and Video Impairment

💡 Researchers tested how different changes in speed and video quality affect driving performance and body signals.

This study measured G2G latency and physiological responses under two latency and bitrate manipulations in a fixed-base driving simulator.

drivinglatencyvideophysiologysimulator

Abstract ↗ PDF ↗

🟢 Applied

Prospective multi-pathogen disease forecasting using autonomous LLM-guided tree search

💡 This system uses machine learning to create software that helps forecast infectious diseases, which is crucial for public health decisions.

An autonomous system using LLM-guided tree search to generate, evaluate, and optimize executable forecasting software for infectious diseases.

machine learningdisease forecastingpublic health

Abstract ↗ PDF ↗

🟢 Applied

Inside Baseball: The Automated Ball-Strike System as an Object Lesson in Technological Rule Enforcement

💡 ABS uses technology to accurately determine the distance between pitches and strike zones, ensuring fair play in baseball.

ABS is a technological rule enforcement mechanism that translates the rule into an objective determination of distance between pitches and strike zones.

baseballtechnologyrules

Abstract ↗ PDF ↗

🟢 Applied

An Algebraic Exposition of the Theory of Dyadic Morality

💡 This paper explains how to formally represent and compute dyadic morality using structural causal modeling, making the theory accessible to neurosymbolic AI systems.

This paper provides an algebraic exposition of the theory of dyadic morality (TDM) using structural causal modeling (SCM) notation to enable neurosymbolic AI systems to compute morality mathematically.

moralityneurosymbolic aistructural causal modeling

Abstract ↗ PDF ↗

🟢 Applied

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers

💡 We use this new objective to create a training-free entropic inference-time scheduler for probability paths.

We derive a new objective for discretizing probability paths and use it to create a training-free entropic inference-time scheduler.

flowschrödinger-bridgesdiscretizationentropyinference-time

Abstract ↗ PDF ↗

🟢 Applied

ShopGym: An Integrated Framework for Realistic Simulation and Scalable Benchmarking of E-Commerce Web Agents

💡 ShopGym helps researchers create realistic and diverse evaluation settings for e-commerce web agents.

ShopGym is an integrated framework for realistic simulation and scalable benchmarking of e-commerce web agents.

e-commerceweb agentsbenchmarkingsimulation

Abstract ↗ PDF ↗

🟢 Applied

Sign-Separated Finite-Time Error Analysis of Q-Learning

💡 This research provides a method to analyze Q-learning error dynamics by separating signs, which helps identify potential control limits and optimal policy lower bounds for negative errors.

This paper analyzes Q-learning error dynamics with sign separation to identify max-induced asymmetry and optimal policy lower bounds for negative errors.

controlq-learningerror analysis

Abstract ↗ PDF ↗

Multi-level Self-supervised Pretraining on Compositional Hierarchical Graph for Molecular Property Prediction

Self-supervised pretraining on molecular graphs has emerged as a promising approach for molecular property prediction . Most existing methods operate at a single structural granularity and treat bond information as auxiliary edge attributes rather than as an independent semantic layer . We propose MolCHG, a multi-level self-supervisory pretraining framework built upon a novel Compositional Hierarchical Graph .

Abstract ↗ PDF ↗

🔬

Lightweight Systems

🟢 Applied

Evaluating Container Orchestration for Neuromorphic Workloads in Virtual Edge Environments

💡 This study examines how neuromorphic computing can be applied to edge environments to achieve energy efficiency.

Research on using neuromorphic computing for energy-efficient edge environments.

edge computingneuromorphic computingenergy efficiencyworkload optimization

Abstract ↗ PDF ↗

🟢 Applied

Scale: Deep Reinforcement Learning for Container Scheduling in Serverless Edge Computing

💡 It optimizes requests to containers to reduce resource over provisioning and unnecessary data movement.

Scale is a container scheduling and resource allocation framework for serverless edge computing.

cloudcontainerserverlessedgeschedulingresource

Abstract ↗ PDF ↗

🟢 Applied

ADS-IMC: Accelerating Data Sorting with In-Memory Computation

💡 This research demonstrates a significant 3.4x reduction in latency for in-memory sorting compared to memristor-based approaches, utilizing 6T SRAM for in-memory sorting.

This work presents a 3.4x latency reduction in IMC sorting compared to memristor-based approaches, utilizing 6T SRAM for in-memory sorting.

sram

Abstract ↗ PDF ↗

A GPU Accelerated Temporal Window-Based Random Walk Sampler

Temporal random walks are widely used to analyze time-stamped interactions in domains such as microservices, finance, and online platforms . Tempest combines a GPU-native dual-index organization over a shared edge store with a hierarchical cooperative scheduler that dispatches walks at thread, warp, or block granularity based on per-step node convergence .

Abstract ↗ PDF ↗

🟢 Applied

Embedded Made Easy -- Rethinking Embedded + Cloud Software Development (WIP)

💡 Applications can be developed and tested in a single environment, and then deployed to any component of the system.

A work-in-progress vision for a unified language and runtime system that allows applications to scale across the edge and cloud.

cloudembeddedsoftware

Abstract ↗ PDF ↗

APWA: A Distributed Architecture for Parallelizable Agentic Workflows

Autonomous multi-agent systems hit critical reasoning, coordination and scaling bottlenecks as the size and complexity of their tasks grow . APWA facilitates parallel execution by decomposing workflows into non-interfering subproblems that can be processed using independent resources without cross-communication .

Abstract ↗ PDF ↗

🟢 Applied

Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids

💡 Mat2Boundary is a powerful programming language for handling boundary conditions in computational physics, enabling efficient and flexible modeling of complex boundary behaviors.

Mat2Boundary is a DSL and compiler for boundary computations that models a broad class of boundary-conditions as affine sparse linear operators.

boundary conditionscomputationprogramming language

Abstract ↗ PDF ↗

🟢 Applied

Malleable Molecular Dynamics Simulations with GROMACS and DMR

💡 This new system allows multiple processes to share resources on a supercomputer without the overhead of traditional scheduler internals.

This paper discusses a new architecture for managing MPI resources on a supercomputer using a simple API decoupled from scheduler internals.

mpisupercomputerresource management

Abstract ↗ PDF ↗

🟢 Applied

Semi-Synchronous Exploration in Dynamic Graphs

💡 This research addresses a fundamental problem in graph exploration using mobile agents, showing that exploration is impossible if the number of agents exceeds a specific threshold.

We study graph exploration in dynamic graphs using mobile agents and show that exploration is impossible if the number of agents exceeds a specific threshold.

graphexplorationmobile agentsthreshold

Abstract ↗ PDF ↗

🟢 Applied

Memristor Technologies for Dynamic Vision Sensors: A Critical Assessment and Research Roadmap

💡 This research suggests a new approach to reducing energy consumption in edge AI systems by using in-memory analog compute paired with event-driven vision sensors.

This paper discusses a potential solution to energy bottlenecks in edge AI systems by using in-memory analog compute paired with event-driven vision sensors.

aienergyvisioncomputing

Abstract ↗ PDF ↗

🟡 Advanced

Sustainable Graph Analytics Workload Scheduling with Evolutionary Reinforcement Learning in Edge-Cloud Systems

💡 MERSEM optimizes how graph data is distributed across cloud resources to improve efficiency and reduce energy consumption.

MERSEM uses evolutionary search and reinforcement learning to optimize graph workload allocation and scheduling in edge-cloud systems.

graph analyticscloud systemsreinforcement learningenergy efficiency

Abstract ↗ PDF ↗

🟢 Applied

PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

💡 PipeSD is a practical framework for cloud-edge collaborative LLM inference that offloads cloud workloads and enables offline robustness and privacy enhancement through speculative decoding.

PipeSD accelerates LLM inference by offloading cloud workloads and enabling offline robustness and privacy enhancement through cloud-edge collaborative deployment with speculative decoding.

llmenergyprivacyinferencecloud-edgespeculative decoding

Abstract ↗ PDF ↗

Distributed Statistical Zero-Knowledge Proofs via Sumcheck

We study distributed zero-knowledge proofs, introduced by Bick, Kol, and Oshman (SODA 2022) While distributed interactive proofs have advanced rapidly, distributed interactive proof techniques remain limited and mostly problem-specific . Our main contribution is a distributed zero knowledge implementation of Sumcheck . For non-k-colorability, we obtain an $O(n)$-round distributed statistical zero knowledge proof deciding whether a graph is not k-colorable, for any constant k,

Abstract ↗ PDF ↗

🟡 Advanced

Evolving Layer-Specific Scalar Functions for Hardware-Aware Transformer Adaptation

💡 This method helps deploy Vision Transformers on edge devices by replacing normalization layers with scalar approximations that bypass the global reduction bottleneck.

This method replaces normalization layers with scalar approximations to bypass global reduction bottlenecks in Vision Transformers for edge deployment.

visionedge devicescomputational efficiency

Abstract ↗ PDF ↗

🟢 Applied

Constitutional Governance in Metric Spaces

💡 This system allows members to vote on proposals with public support under a supermajority rule, using a metric space for governance.

A constitutional system with a metric space, aggregation rule, and supermajority threshold for amendments.

constitutionmetric spaceamendmentssupermajority

Abstract ↗ PDF ↗

🟢 Applied

EMA: Efficient Model Adaptation for Learning-based Systems

💡 EMA is a machine learning method that adapts system models to handle diverse, long-running, and dynamic network environments, making it suitable for resource management and simulation tasks.

EMA adapts system models to handle diverse, long-running, and dynamic network environments, making it suitable for resource management and simulation tasks.

machinelearningnetwork

Abstract ↗ PDF ↗

🟡 Advanced

Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning

💡 LLMs can understand and generate human-like text, but training on private data enables them to learn from sensitive information, which is crucial for secure and useful applications in regulated industries.

Training LLMs on private data allows for better domain-specific knowledge and real-world utility, addressing a major challenge in secure AI systems.

llmprivate datafederated fine-tuningcross-domain benchmark

Abstract ↗ PDF ↗

🟢 Applied

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

💡 Rescaled Asynchronous SGD is a distributed optimization method that corrects bias toward a frequency-weighted average by rescaling worker-specific stepsizes, ensuring each worker contributes the same aggregate learning rate over a cycle.

Rescaled Asynchronous SGD is a distributed optimization method that corrects bias toward a frequency-weighted average by rescaling worker-specific stepsizes, ensuring each worker contributes the same aggregate learning rate over a cycle.

distributed optimizationstochastic gradient descentasynchronous sgd

Abstract ↗ PDF ↗

🟡 Advanced

TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

💡 TurboGR is a specialized system that accelerates training for large-scale Generative Recommendation using Transformer-based models, solving the fundamental system-level challenges of deploying GR at scale on Ascend NPUs.

TurboGR is an accelerated training system for large-scale Generative Recommendation using Transformer-based models that addresses fundamental system-level challenges of deploying GR at scale on Ascend NPUs.

transformergenerative recommendationnpu

Abstract ↗ PDF ↗

🟢 Applied

Swarm Network-as-a-Service (SNaaS)

💡 Swarm Network-as-a-Service uses drones to provide on-demand connectivity at scale, treating drone interactions as composable services.

Swarm Network-as-a-Service uses drones to provide on-demand connectivity at scale, treating drone interactions as composable services.

networkdronesservices

Abstract ↗ PDF ↗

GenAI-Driven Approach to RISC-V Supply Chain Exploration

This paper presents an LLM-empowered workflow for RISC-V supply chain analysis . The proposed approach addresses the challenges of heterogeneous and unstructured supply chain data by leveraging LLMs for textual understanding and VLMs for extracting information from visual artifacts such as diagrams and tables .

Abstract ↗ PDF ↗

🟢 Applied

A detailed algorithmic study on a reuse-aware, near memory, all-digital Ising machine

💡 SACHI is a machine learning architecture that uses L1 cache repurposing to create a near-memory, all-digital Ising machine for solving optimization problems.

SACHI is a nature-inspired computing architecture that uses L1 cache repurposing to create a near-memory, all-digital Ising machine for optimization problems.

machine learningisingall-digitalnature-inspired computing

Abstract ↗ PDF ↗

🟢 Applied

Heterogeneous SoC Integrating an Open-Source Recurrent SNN Accelerator for Neuromorphic Edge Computing on FPGA

💡 This paper discusses using heterogeneous SoC to combine open-source recurrent neural network accelerators with FPGA for cost-effective neuromorphic edge computing.

Research project integrating open-source recurrent SNN accelerator for neuromorphic edge computing on FPGA.

computingneurofpgahardwareneurocomputing

Abstract ↗ PDF ↗

🟢 Applied

The Distributed Complexity Landscape on Trees Depends on the Knowledge About the Network Size

💡 LCLs are a fundamental model in distributed systems that characterize solutions to graph problems using constant-radius neighborhoods.

LCLs are a key model in distributed computing that describe valid solutions to graph problems using constant-radius neighborhoods.

graphdistributedcomputing

Abstract ↗ PDF ↗

🟢 Applied

Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms

💡 Charm++ is a parallel programming system that demonstrates the utility of overdecomposition for various applications and contexts, while GPGPUs have emerged as a dominant compute component presenting challenges for this paradigm.

Charm++ is a parallel programming system that demonstrates the utility of overdecomposition for various applications and contexts, while GPGPUs have emerged as a dominant compute component presenting challenges for this paradigm.

parallel programmingoverdecompositiongpgpusdistributed memorygpu computing

Abstract ↗ PDF ↗

🔬

Offline-First / Local AI

FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy

FocalPolicy is a foresight-aware visuomotor policy that combines Frequency-Optimized Chunking with Locally Anchored flow matching . It combines time-domain alignment within the proximal actions while regularizing frequency-domain structure over multiple future action chunks to improve cross-chunk coherence .

Abstract ↗ PDF ↗

🟢 Applied

Unsupervised Domain Shift Detection with Interpretable Subspace Attribution

💡 This tool helps researchers detect subtle differences in dataset probability distributions by identifying localised density anomalies in high-dimensional feature spaces.

This tool detects subtle differences in dataset probability distributions by identifying localised density anomalies in high-dimensional feature spaces.

domain shiftfeature spaceanomaly detection

Abstract ↗ PDF ↗

🟢 Applied

Ti-iLSTM: A TinyDL Approach for Logic-Level Anomaly Detection in Industrial Water Treatment Systems

💡 This system can manipulate process behaviour without creating obvious devices outliers, which is a form of logic-layer deception that can preserve numerically plausible measurements while breaking expected cause-and-effect relationships.

This system can manipulate process behaviour without creating obvious devices outliers, which is a form of logic-layer deception that can preserve numerically plausible measurements while breaking expected cause-and-effect relationships.

cybersecurityanomaly detectionwater treatment

Abstract ↗ PDF ↗

🟢 Applied

Navigating Potholes with Geometry-Aware Sharpness Minimization

💡 This method uses a learned preconditioner to address the issue of ignoring loss geometry, making it applicable to practical applications.

LLQR+SAM combines sharpness-aware minimization with a learned preconditioner to address the issue of ignoring loss geometry.

optimizationmachine learninggeometryminimization

Abstract ↗ PDF ↗

🟢 Applied

Multi-Fidelity Flow Matching: Cascaded Refinement of PDE Solutions

💡 MFFM is a refinement technique that uses conditional flow matching to improve parametric solutions by conditioning on data-driven source distributions.

Multi-Fidelity Flow Matching is a refinement framework that uses cascading refinement to improve parametric PDE solutions by conditioning on data-driven source distributions.

flow_matchingpdeconditional_flow_matchingresidual_refinementparametric_solutions

Abstract ↗ PDF ↗

🟢 Applied

MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement

💡 MIND helps prevent model-induced label noise by decoupling the noise manifold into subspace-dependent components.

MIND decouples model-induced label noise using Latent Manifold Disentanglement to separate noise components.

neural networksdeep learningmodel-induced noisedisentanglementlatent manifold

Abstract ↗ PDF ↗

🟢 Applied

Looped SSMs: Depth-Recurrence and Input Reshaping for Time Series Classification

💡 Looped SSMs are a recurrent model family that can be improved by repeating blocks across layers and reshaping input data.

Looped SSMs are a recurrent model family that can be improved by repeating blocks across layers and reshaping input data.

recurrentssmmodelrepetitioninputreshaping

Abstract ↗ PDF ↗

🟢 Applied

Judge Circuits

💡 This study investigates how different models are graded by analyzing internal mechanisms using position-aware edge attribution patching.

This research explains how different model outputs are graded by examining internal mechanisms using position-aware edge attribution patching.

llmattentionmodeljudginggradingpatching

Abstract ↗ PDF ↗

🟢 Applied

Clock-state olfactory search in turbulent flows using Q-learning: The geometry of plume recovery

💡 This study uses Q-learning to train an agent to recover plumes in turbulent flows, combining insect behaviors to improve recovery.

This research uses Q-learning to train an agent to recover plumes in turbulent flows, combining insect behaviors to improve recovery.

learningcontrolbiologyplumeturbulence

Abstract ↗ PDF ↗

🟢 Applied

Shapley Neuron Values for Continual Learning: Which Neurons Matter Most?

💡 Shapley Neuron Valuation is a method for continual learning that quantifies Neuron importance by freezing important neurons while keeping others plastic, enabling buffer-free continual learning without expanding architecture.

Shapley Neuron Valuation quantifies Neuron importance in continual learning by freezing important neurons while keeping others plastic.

continual learningneuron importancelearning theory

Abstract ↗ PDF ↗

🟢 Applied

StippleDiffusion: Capacity-Constrained Stippling using Controlled Diffusion

💡 This method uses diffusion to create stipple patterns while ensuring the local density matches a learned prior and respects a continuous image-defined capacity constraint.

A diffusion-based sampler for stipple patterns that satisfies a learned local point-distribution prior and a continuous image-defined capacity constraint.

diffusionimage processingstipplepattern generation

Abstract ↗ PDF ↗

🟢 Applied

Continual Learning of Domain-Invariant Representations

💡 This paper describes a method that trains models to learn domain-invariant representations, preventing them from learning spurious domain-specific cues

Continual learning of domain-invariant representations addresses the problem of shortcut learning by training models to learn domain-invariant representations

continual learningdomain invariant representationsshortcut learning

Abstract ↗ PDF ↗

🟢 Applied

GOMA: Toward Structure-Driven Multimodal Alignment from a Graph Signal Smoothing Perspective

💡 GOMA is a structure-driven post-alignment framework that decouples three key design choices to improve multimodal alignment.

GOMA is a structure-driven multimodal alignment framework that decouples three key design choices: where messages flow, how multimodal evidence propagates, and smoothing depth.

multimodalgraphcommunicationalignment

Abstract ↗ PDF ↗

🟢 Applied

Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

💡 This system uses autonomous spiking dynamics in clockless digital circuits to create networks of interacting neurons with configurable weights, enabling scalable neuromorphic computing.

This system uses autonomous spiking dynamics in clockless digital circuits to create networks of interacting neurons with configurable weights, enabling scalable neuromorphic computing.

neurosciencecomputer architecturedistributed systemsspiking neural networksfield-programmable gate arraysneuromorphic computing

Abstract ↗ PDF ↗

🟢 Applied

A numerical study into neural network surrogate model performance for uncertainty propagation

💡 This research explores using neural networks to approximate complex physical solutions for stochastic problems, potentially lowering the computational cost of repeated simulations.

This study examines how neural network surrogate models can reduce repeated expensive forward model evaluations for solving boundary value problems in stochastic scenarios.

stochastic problemsboundary value problemssurrogate modelingnumerical analysis

Abstract ↗ PDF ↗

🟢 Applied

SAFE Quantum Machine Learning with Variational Quantum Classifiers

💡 This research explores using quantum machine learning to solve optimization problems, specifically a variational quantum classifier that operates on high-dimensional deep representations.

This paper proposes a variational quantum classifier for optimization problems using amplitude encoding and a learnable classical pre-encoding layer, with model reliability assessed via SAFE-AI metrics.

optimizationmachine learningquantum machine learningvariational quantum classifiers

Abstract ↗ PDF ↗

🟢 Applied

Explainable AI Isn't Enough! Rethinking Algorithmic Contestability

💡 Individuals can respond to negative decisions made by opaque machine learning systems, and we propose an operational definition of contestability as a complement to recourse.

Machine learning systems increasingly make life-changing decisions about individuals, and we propose an operational definition of contestability as a complement to recourse.

aimachine learningalgorithmiccontestability

Abstract ↗ PDF ↗

Mind Dreamer: Untethering Imagination via Active Latent Intervention on Latent Manifolds

Model-Based Reinforcement Learning (MBRL) leverages latent imagination for sample efficiency, yet remains constrained by Historical Tethering . We propose Mind Dreamer (MD), a framework that operationalizes Active Latent Intervention (ALI) to transcend Markovian continuity . MD reformulates discovery as the minimization of a global Relay Manifold Expected Free Energy (R-EFE)

Abstract ↗ PDF ↗

🟢 Applied

Variational Autoregressive Networks with probability priors

💡 This paper introduces a method for training neural networks that can incorporate physical priors to improve performance, using a prior probability distribution as a starting point.

This paper proposes a framework for training Variational Autoregressive Networks with probability priors, enhancing performance by incorporating physical priors.

machine learningneural networksphysics

Abstract ↗ PDF ↗

🟢 Applied

Adaptive Outer-Loop Control of Quadrotors via Reinforcement Learning

💡 This research introduces a new control method for quadrotors that uses Reinforcement Learning to adapt to real-world disturbances.

This paper proposes an adaptive control architecture for quadrotors using Reinforcement Learning to handle real-world perturbations.

airoboticscontrolflight control

Abstract ↗ PDF ↗

🟢 Applied

Testing properties of trees in graphical models with covariance queries

💡 Testing properties of graphs in graphical models using covariance queries is a specialized statistical problem that helps researchers analyze complex systems by examining relationships between variables.

Testing properties of graphs in graphical models using covariance queries is a specialized statistical problem that helps researchers analyze complex systems by examining relationships between variables.

machine learningstatisticsgraph theorydata science

Abstract ↗ PDF ↗

🟢 Applied

Entropy-Based Characterisation of the Polarised Regime in Latent Variable Models

💡 This paper proposes a simple method to identify active dimensions in variational models using entropy-based criteria.

This paper proposes a simple information-theoretic classification method based on entropy to identify active dimensions in variational models.

entropyvariancelatent variables

Abstract ↗ PDF ↗

🟢 Applied

Imperfect World Models are Exploitable

💡 We define exploitable models as those where one policy should be preferred over another while the environment's true model suggests the reverse, similar to how reward hacking is characterized.

We propose a novel definition of model exploitation in reinforcement learning, comparing it to reward hacking to characterize exploitable models.

reinforcement_learningmodel_exploitationexploitation

Abstract ↗ PDF ↗

A Retrieval-Enhanced Transformer for Multi-Step Port-of-Call Sequence Prediction in Global Liner Shipping

Connectivity-Constrained and Retrieval-Enhanced (CCRE) deep learning framework proposes a deep-learning framework . CCRE achieves a 72.3% first-destination accuracy and a 61.4% average three-step accuracy, outperforming baselines like CatBoost and LSTM .

Abstract ↗ PDF ↗

🟢 Applied

LoCO: Low-rank Compositional Rotation Fine-tuning

💡 LoCO lets researchers fine-tune large models with low-rank rotations, making it practical for tasks like NLP and CV.

LoCO enables efficient fine-tuning of large models using low-rank rotations.

fine-tuningrotationslarge_models

Abstract ↗ PDF ↗