Revealing graph bandits for maximizing local influence
π‘ This research explores techniques in edge computing.
We study a graph bandit setting where the objective of the learner is to detect the most influential node of a graph by requesting as little information from the graph as possible . We propose BARE, a bandit strategy for which we prove a regret guarantee that scales with the detectable dimension .
Scalable Context-Aware Graph Attention for Unsupervised Anomaly Detection in Large-Scale Mobile Networks
π‘ This research explores techniques in machine learning.
Mobile network operators must monitor thousands of heterogeneous network elements across the radio access network and the packet core . Scale and cost of incident labelling make supervised approaches impractical, motivating unsupervised anomaly detection robust to context shifts and nonstationarity . C-MTAD-GAT is an anomaly detection framework designed to operate as a single shared model across large populations of network elements .
CleanBase: Detecting Malicious Documents in RAG Knowledge Databases
π‘ This research running AI locally on devices for edge computing.
Retrieval-augmented generation (RAG) is vulnerable to prompt injection attacks, in which an adversary inserts malicious documents containing carefully crafted injected prompts into the knowledge database . When a user issues a question targeted by the attack, the RAG system may retrieve these malicious documents, whose injected prompts mislead it into generating attacker-specified answers . CleanBase constructs a similarity graph over the database, where each node represents a document and an edge connects two nodes if their semantic similarity exceeds a statistically
SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters
π‘ This research explores techniques in language AI.
AI agents execute tens to hundreds of chained LLM calls per task . GPU schedulers treat each call as independent, discarding gigabytes of intermediate state between steps and inflating end-to-end latency by 3-8x . We present SAGA, a distributed scheduler that implements this abstraction through three mechanisms .
ControBench: An Interaction-Aware Benchmark for Controversial Discourse Analysis on Social Networks
π‘ This research explores techniques in language AI.
ControBench is a benchmark for controversial discourse analysis that combines heterogeneous social interaction graphs with rich textual semantics . Built from Reddit discussions on three topics, Trump, abortion, and religion, Controbench contains 7,370 users, 1,783 posts, and 26,525 interactions .
Bridging Graph Drawing and Dimensionality Reduction with Stochastic Stress Optimization
π‘ This research reduces computer vision.
We present a scikit-learn compatible estimator that minimizes global stress through local pairwise updates, improving upon the existing implementation . Experiments on standard high-dimensional benchmarks show that our stochastic solver converges substantially faster than SMACOF .
Class Angular Distortion Index for Dimensionality Reduction
π‘ This research reduces computer vision.
Dimensionality reduction (DR) techniques are often characterized by whether they preserve global, high-level structures in data or local, neighborhood structures . Existing cluster quality metrics either only measure cluster separability or assume spherical, globular clusters in the original space . We introduce the Class Angular Distortion Index (CADI), a metric that uses internal angles among point triples to determine the faithfulness of cluster organization .
Gradient Regularized Newton Boosting Trees with Global Convergence
π‘ This research explores techniques in edge computing.
Restricted Newton Descent studies convex optimization with Newton's method on Hilbert spaces with inexact iterates . Modern implementations like XGBoost, LightGBM, and CatBoost are based on Newton boosting: a second-order descent step in the space of decision trees .
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance
π‘ This research achieves better language AI.
Large Language Model (LLM) Red-Teaming, which proactively identifies vulnerabilities of LLMs, is an essential process for ensuring safety . Generative Flow Networks (GFNs) that perform distribution matching are notorious for training instability and mode collapse . We propose Stable-GFN, which eliminates partition function $Z$ estimation in GFN and reduces training instability .
Scale-Aware Adversarial Analysis: A Diagnostic for Generative AI in Multiscale Complex Systems
π‘ This research explores techniques in machine learning.
Complex physical systems, from supersonic turbulence to the macroscopic structure of the universe, are governed by continuous multiscale dynamics . Modern machine learning architectures excel at mapping the high-dimensional observables of these systems, it remains unclear whether they internalize the governing physical laws or merely interpolate discrete statistical correlations .
A Comparative Study of QSPR Methods on a Unique Multitask PAMPA dataset
π‘ This research presents techniques for edge computing.
We present a unique, multitask dataset comprising 143 drug and drug candidate molecules . Each evaluated on in vitro, parallel artificial-membrane permeability assays (PAMPA) using six different model membranes . This is the most comprehensive study on simultaneous modeling of multiple organ-specific PAMPA membranes to date .
Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents
π‘ This research explores techniques in language AI.
Foresight Arena is the first permissionless, on-chain benchmark for evaluating AI forecasting agents on real-world prediction markets . Agents submit probabilistic forecasts on binary Polymarket markets via a commit-reveal protocol enforced by Solidity smart contracts . Performance is measured by the Brier Score and a novel Alpha Score .
AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments
π‘ This research reduces language AI.
AdaMeZO is a zeroth-order optimizer that leverages Adam-style first- and second-moment estimates without maintaining them in memory . It can outperform MeZO while requiring up to $70\%$ fewer forward passes .
From Prediction to Practice: A Task-Aware Evaluation Framework for Blood Glucose Forecasting
π‘ This research explores techniques in machine learning.
We present a task-aware evaluation framework for blood glucose forecasting built around two downstream uses: hypoglycemia early warning and insulin dosing decision support . We evaluate on real data from three clinical cohorts using event-level recall and false alarms per patient-day . We show that models appearing acceptable overall, with recall above 0.9 on the full test set, can fail badly in the post-bolus slice .
Knowing when to trust machine-learned interatomic potentials
π‘ This research forecasting machine learning.
Machine-learned uncertainty-quantification methods rely on ensembles of independently trained backbones . These methods scale unfavorably with foundation-scale MLIPs, and their member-disagreement signals correlate weakly with per-molecule prediction error . The resulting method, PROBE (Post-hoc Reliability frOm Backbone Embeddings), produces a per-prediction reliability probability that monotonically tracks actual error without modification to the underlying
Fairness of Classifiers in the Presence of Constraints between Features
π‘ This research explores techniques in machine learning.
In Machine Learning, an accepted definition of fairness of a decision taken by a classifier is that it should not depend on protected features, such as gender . We propose that a decision be considered fair if it has a fair explanation . We identify relationships between different definitions of fairness and study the computational complexity of testing fairness of classifiers .
Jailbreaking Vision-Language Models Through the Visual Modality
π‘ This research explores techniques in language AI.
The visual modality of vision-language models (VLMs) is an underexplored attack surface for bypassing safety alignment . We introduce four jailbreak attacks exploiting the vision component . They include encoding harmful instructions as visual symbol sequences with a decoding legend .
Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots
π‘ This research faster predictions in machine learning.
Inferring cellular trajectories from destructive snapshots is complicated by the challenges of stochasticity and non-conservative mass dynamics such as cell proliferation and apoptosis . Existing unbalanced Optimal Transport (OT) methods treat mass as a continuous fluid, performing inference at the population level . But this macroscopic view often fails to capture the discrete, jump-like nature of birth-death events at single-cell resolution . We present Unbalanced SchrΓΆdinger Bridge (USB
Vesselpose: Vessel Graph Reconstruction from Learned Voxel-wise Direction Vectors in 3D Vascular Images
π‘ This research explores techniques in computer vision.
The prevailing segment-then-fix paradigm is fundamentally limited regarding its suitability for modeling the task of complete and topologically accurate vascular network reconstruction . We propose an approach to extract topologically more accurate vascular graphs from 3D image data . Our approach achieves state-of-the-art performance on three benchmark datasets, spanning both synthetic and real imagery .
Multi-frame Restoration for High-rate Lissajous Confocal Laser Endomicroscopy
π‘ This research explores techniques in computer vision.
Lissajous confocal laser endomicroscopy (CLE) is a promising solution for high speed in vivo optical biopsy for handheld scenarios . However, at high frame rates, many pixels remain unvisited, creating structured holes . We propose MIRA, a lightweight recurrent framework for CLE restoration . MIRA outperforms both lightweight and high-complexity baselines in restoration quality .
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
π‘ This research optimizes computer vision.
Autoregressive image modeling relies on visual tokenizers to compress images into compact latent representations . We design an end-to-end training pipeline that jointly optimizes reconstruction and generation . This contrasts with prior two-stage approaches that train tokenizers and generative models separately .
LambdaRankIC: Directly Optimizing Rank IC for Financial Prediction
π‘ This research forecasting machine learning.
In financial predictions, the performance of machine learning models is often assessed by Rank IC . Rank IC is Spearman rank correlation between the model predictions and the realized asset returns . We propose LambdaRankIC, a novel learning-to-rank approach that directly optimizes Rank IC.
Distance metric learning for conditional anomaly detection
π‘ This research proposes a method for machine learning.
A recently proposed conditional anomaly detection framework extends anomaly detection to the problem of identifying anomalous patterns on a subset of attributes in the data . The anomaly always depends (is conditioned) on the value of remaining attributes . The work presented in this paper focuses on instance-based methods for detecting conditional anomalies .
Trading off rewards and errors in multi-armed bandits
π‘ This research presents techniques for machine learning.
In multi-armed bandits, the most-explored arms are the most informative, while reward maximization typically pulls only the best arm . We present an algorithm with regret guarantees that interpolates between the two objectives .
Near-optimal and Efficient First-Order Algorithm for Multi-Task Learning with Shared Linear Representation
π‘ This research makes more efficient machine learning.
Multi-task learning (MTL) has emerged as a pivotal paradigm in machine learning by leveraging shared structures across multiple related tasks . Despite its empirical success, the development of likelihood-based efficiently solvable algorithms remains largely underdeveloped . This paper introduces a first-order algorithm that jointly learns a shared representation and task-specific parameters .