Drift-Aware Online Dynamic Learning for Nonstationary Multivariate Time Series: Application to Sintering Quality Prediction
đĄ This research forecasting computer vision.
Drift-Aware Multi-Scale Dynamic Learning (DA-MSDL) framework is proposed to maintain robust multi-output predictive performance via online adaptive mechanisms on nonstationary data streams . The framework employs a multi-scale bi-branch convolutional network as its backbone to disentangle local fluctuations from long-term trends .
Distributed Online Convex Optimization with Compressed Communication: Optimal Regret and Applications
đĄ This research optimizes machine learning.
Distributed online convex optimization (D-OCO) is a powerful paradigm for modeling distributed scenarios with streaming data . However, the communication cost between local learners and the central server is substantial in large-scale applications .
Towards Lifelong Aerial Autonomy: Geometric Memory Management for Continual Visual Place Recognition in Dynamic Environments
đĄ This research explores techniques in computer vision.
Robust geo-localization in changing environmental conditions is critical for long-term aerial autonomy . Existing continual learning methods often fail here because geographic features exhibit severe intra-class variations . To respect strict onboard storage constraints, our pipeline decouples geographic knowledge into static satellite anchors and a dynamic experience replay buffer .
Nexus: Same Pretraining Loss, Better Downstream Generalization via Common Minima
đĄ This research running AI locally on devices for language AI.
Pretraining is the cornerstone of Large Language Models, dominating the vast majority of computational budget and data to serve as the primary engine for their capabilities . We hypothesize that the geometric "closeness" of task-specific minima is intrinsically linked to downstream generalization . We propose the Nexus optimizer, which encourages the closeness of these minima by maximizing gradient similarity during optimization . Nexus reduces the out-of-distribution loss by 0.012 and yields up to a 15
Temporal Patch Shuffle (TPS): Leveraging Patch-Level Shuffling to Boost Generalization and Robustness in Time Series Forecasting
đĄ This research improves machine learning.
Temporal Patch Shuffle (TPS) is a simple and model-agnostic data augmentation method for forecasting . It extracts overlapping temporal patches, selectively shuffles a subset of patches using variance-based ordering as a conservative heuristic . This design increases sample diversity while preserving forecast-consistent local temporal structure .
Feature-Label Modal Alignment for Robust Partial Multi-Label Learning
đĄ This research tackles the problem of machine learning.
In partial multi-label learning, each instance is associated with a set of candidate labels containing both ground-truth and noisy labels . The presence of noisy labels disrupts the correspondence between features and labels, degrading classification performance . We propose a novel PML method based on feature-label modal alignment (PML-MA)
Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks
đĄ This research explores techniques in machine learning.
PE-MAMoE equips each UAV with a sparsely gated mixture of experts actor whose router selects a single specialist per step . Phase Controller injects brief, expert-only stochastic perturbations after phase switches, resets the action log-standard-deviation, entropy and learning rate, and schedules the router temperature, all to re-plasticize the policy without destabilizing safe behaviors .
Biologically-Grounded Multi-Encoder Architectures as Developability Oracles for Antibody Design
đĄ This research proposes a method for language AI.
CrossAbSense is a framework of property-specific neural oracles that combine frozen protein language model encoders with configurable attention decoders . On the GDPa1 benchmark of 242 therapeutic IgGs, our oracles achieve notable improvements of 12--20\% over established baselines on three of five developability assays .
Bringing Clustering to MLL: Weakly-Supervised Clustering for Partial Multi-Label Learning
đĄ This research explores techniques in computer vision.
Label noise in multi-label learning (MLL) poses significant challenges for model training . We propose a novel weakly-supervised clustering approach for PML . WSC-PML employs a three-stage process: initial prototype learning from noisy labels, adaptive confidence-based weak supervision construction and joint optimization via iterative clustering refinement .
Hierarchical Flow Decomposition for Turning Movement Prediction at Signalized Intersections
đĄ This research proposes a method for machine learning.
HFD-TM (Hierarchical Flow-Decomposition for Turning Movement Prediction) is a deep learning framework that predicts turning movements by first forecasting corridor through-movements and then expanding these predictions to individual turning streams . The design is motivated by empirical traffic structure, where corridor flows account for 65.1% of total volume, exhibit lower volatility than turning movements .
Stability Enhanced Gaussian Process Variational Autoencoders
đĄ This research enhances machine learning.
A novel stability-enhanced Gaussian process variational autoencoder (SEGP-VAE) is proposed for indirectly training a low-dimensional linear time invariant (LTI) system, using high-dimensional video data . The mean and covariance function of the novel SEGP prior are derived from the definition of an LTI system .
Online Intention Prediction via Control-Informed Learning
đĄ This research presents techniques for machine learning.
This paper presents an online intention prediction framework for estimating the goal state of autonomous systems in real time . The problem is formulated as an inverse optimal control / inverse reinforcement learning task, with intention treated as a parameter in the objective .
Natural Riemannian gradient for learning functional tensor networks
đĄ This research optimizes machine learning.
We consider machine learning tasks with low-rank functional tree tensor networks (TTN) as the learning model . We propose a natural Riemannian gradient descent type approach applicable to arbitrary losses which is based on the natural gradient by Amari .
Beyond Segmentation: Structurally Informed Facade Parsing from Imperfect Images
đĄ This research tackles the problem of machine learning.
Standard object detectors typically treat architectural elements independently . We address this limitation by augmenting the YOLOv8 training objective with a custom lightweight alignment loss . This regularization encourages grid-consistent arrangements of bounding boxes during training .
Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training
đĄ This research presents techniques for machine learning.
The King Wen sequence of the I-Ching (c. 1000 BC) orders 64 hexagrams in a pattern that has puzzled scholars for three millennia . We present a rigorous statistical characterization of this ordering using Monte Carlo permutation analysis against 100,000 random baselines . We find that the sequence has four statistically significant properties: higher-than-random transition distance, negative lag-1 autocorrelation, yang-balanced groups of four, and asymmetric within-pair
A Predictive View on Streaming Hidden Markov Models
đĄ This research optimizes machine learning.
We develop a predictive-first optimisation framework for streaming hidden Markov models . We assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes . Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions .
On the Role of DAG topology in Energy-Aware Cloud Scheduling : A GNN-Based Deep Reinforcement Learning Approach
đĄ This research explores techniques in machine learning.
Cloud providers must assign heterogeneous compute resources to workflow DAGs while balancing competing objectives such as completion time, cost, and energy consumption . We identify specific out-of-distribution conditions under which GNN-based deep reinforcement learning schedulers fail . We demonstrate that performance degradation stems from structural mismatches between training and deployment environments .
Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies
đĄ This research explores techniques in language AI.
Existing benchmarks evaluate models against external standards but do not measure whether models understand and enforce their own stated boundaries . We introduce the Symbolic-Neural Consistency Audit (SNCA), a framework that extracts a model's self-stated safety rules via structured prompts, formalizes them as typed predicates .
MixFlow: Mixed Source Distributions Improve Rectified Flows
đĄ This research creating new content with computer vision.
MixFlow trains a flow model on linear mixtures of a fixed unconditional distribution and a $Îș\texttt{-FC$-based distribution . This simple mixture improves the alignment between the source and data, provides better generation quality with less required sampling steps, and accelerates training convergence considerably .
Generalization and Scaling Laws for Mixture-of-Experts Transformers
đĄ This research explores techniques in machine learning.
We develop a theory of generalization and scaling for Mixture-of-Experts (MoE) Transformers . By conditioning on fixed routing patterns and union-bounding across them, we derive a sup-norm covering-number bound whose metric entropy scales with the active parameter budget and incurs a MoE-specific routing overhead .
Truncated Rectified Flow Policy for Reinforcement Learning with One-Step Sampling
đĄ This research explores techniques in machine learning.
Maximum entropy reinforcement learning (MaxEnt RL) has become a standard framework for sequential decision making . MaxEnt RL's standard Gaussian policy parameterization is inherently unimodal, limiting its ability to model complex multimodal action distributions . We propose Truncated Rectified Rectified Flow Policy (TRFP), a framework built on a hybrid deterministic-stochastic architecture .
A fast and Generic Energy-Shifting Transformer for Hybrid Monte Carlo Radiotherapy Calculation
đĄ This research speeds up machine learning.
We introduce a novel learning framework for accelerated Monte Carlo (MC) dose calculation termed Energy-Shifting . This approach leverages deep learning to synthesize 6 MV TrueBeam Linear Accelerator (LINAC) dose distributions directly from monoenergetic inputs under identical beam configurations . We propose a novel 3D architecture termed TransUNetSE3D, featuring Transformer blocks for global context and Residual Squeeze-and-Excitation (SE) modules for adaptive channel-
Score-Driven Rating System for Sports
đĄ This research proposes a method for machine learning.
This paper introduces a score-driven rating system that employs the score, i.e. the gradient of the log-likelihood, as the updating mechanism for player and team ratings . The proposed framework extends beyond simple win/loss game outcomes and accommodates a wide range of game results, such as point differences, win/draw/loss outcomes .
Identifying Causal Effects Using a Single Proxy Variable
đĄ This research explores techniques in machine learning.
Unobserved confounding is a key challenge when estimating causal effects from a treatment on an outcome in scientific applications . We assume that we observe a single, potentially multi-dimensional proxy variable of the unobserved confounder . We develop a neural network based estimation framework, SPICE-Net, to estimate causal effects .
FIRE-CIR: Fine-grained Reasoning for Composed Fashion Image Retrieval
đĄ This research achieves better language AI.
Composed image retrieval (CIR) aims to retrieve a target image that depicts a reference image modified by a textual description . Recent vision-language models (VLMs) achieve promising CIR performance by embedding images and text into a shared space . Instead of relying solely on embedding similarity, FIRE-CIR performs question-driven visual reasoning .