How Do People Accept Robot in Public Space? A Cross-Cultural Study in Germany and Japan
💡 This research presents techniques for emotion AI.
Social Norms and Trust were the strongest positive EA predictors across cultures . For Germans, EA was directly influenced by Usefulness, Interest and Anger, showing a functional-affective pattern . For Japanese participants, Trust, Surprise and Fear were the direct associational factors, forming a trust-emotion pattern .
Fast and Forgettable: A Controlled Study of Novices' Performance, Learning, Workload, and Emotion in AI-Assisted and Human Pair Programming Paradigms
💡 This research explores techniques in emotion AI.
Code-generating Artificial Intelligence has gained popularity within both professional and educational programming settings over the past several years . While research and pedagogy are beginning to cope with this change, computing students are left to bear the unforeseen consequences of AI amidst a dearth of empirical evidence .
Symbolic Synthesis for LTLf+ Obligations
💡 This research explores techniques in machine learning.
We study synthesis for obligation properties expressed in LTLFP, the extension of LTLf to infinite traces . Obligation properties are positive Boolean combinations of safety and guarantee properties . They form the second level of the temporal hierarchy of Manna and Pnueli .
AlphaContext: An Evolutionary Tree-based Psychometric Context Generator for Creativity Assessment
💡 This research improves language AI.
AlphaContext is an evolutionary tree-based psychometric context generator for creativity assessment . Creativity has become a core competence in the era of LLMs and human-AI collaboration, underpinning innovation in real-world problem solving . AlphaContext yields an average improvement of 8% over competitive methods across 6 quality metrics .
Omni-Embed-Audio: Leveraging Multimodal LLMs for Robust Audio-Text Retrieval
💡 This research achieves better language AI.
Omni-Embed-Audio (OEA) is a retrieval-oriented encoder leveraging multimodal LLMs with native audio understanding . OEA achieves comparable text-to-text retrieval performance to state-of-the-art M2D-CLAP, while demonstrating clear advantages in two critical areas .
One Pass for All: A Discrete Diffusion Model for Knowledge Graph Triple Set Prediction
💡 This research running AI locally on devices for edge computing.
DiffTSP is a novel discrete diffusion model that treats TSP as a generative task . It adds noise to the KG through a discrete diffusion process . The reverse process gradually recovers the complete KG conditioned on the incomplete graph . Our approach achieves state-of-the-art performance on three public datasets .
TacticGen: Grounding Adaptable and Scalable Generation of Football Tactics
💡 This research explores techniques in machine learning.
TacticGen formulates tactics as sequences of multi-agent movements and interactions conditioned on the game context . Trained with over 3.3 million events and 100 million tracking frames from top-tier leagues .
Audio-DeepThinker: Progressive Reasoning-Aware Reinforcement Learning for High-Quality Chain-of-Thought Emergence in Audio Language Models
💡 This research enhances language AI.
Large Audio-Language Models (LALMs) have made significant progress in audio understanding, yet they operate as perception-and-answer systems without explicit reasoning processes . Existing methods for enhancing audio reasoning rely on supervised chain-of-thought (CoT) fine-tuning, which is limited by training data quality . We propose Audio-DeepThinker, a framework built on two core ideas .
IceBreaker for Conversational Agents: Breaking the First-Message Barrier with Personalized Starters
💡 This research enhances machine learning.
Conversational agents, such as ChatGPT and Doubao, have become essential daily assistants for billions of users . To further enhance engagement, these systems are evolving from passive responders to proactive companions .
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence
💡 This research explores techniques in language AI.
Large language models are increasingly expected to serve as general-purpose agents that interact with external, stateful tool environments . Training robust agents remains limited by the lack of realistic environments and principled mechanisms for life-long learning . We present a self-evolving training arena for advancing general agent intelligence through scalable environments .
STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMs
💡 This research improves language AI.
Scaffolded Task Design (STaD) framework generates controlled variations of benchmark tasks based on the concept of scaffolding . The approach enables systematic and scalable probing of model behavior by identifying the specific reasoning skill compositions they lack .
LLM Safety From Within: Detecting Harmful Content with Internal Representations
💡 This research presents techniques for language AI.
Guard models are widely used to detect harmful content in user prompts and LLM responses . However, state-of-the-art guard models rely on terminal-layer representations and overlook the rich safety-relevant features distributed across internal layers . We present SIREN, a lightweight guard model that harnesses these internal features .
WorldDB: A Vector Graph-of-Worlds Memory Engine with Ontology-Aware Write-Time Reconciliation
💡 This research explores techniques in edge computing.
Persistent memory is the bottleneck separating stateless chatbots from long-running agentic systems . We present WorldDB, a memory engine built on three commitments: (i) every node is a world -- a container with its own interior subgraph, ontology scope, composed embedding . (ii) nodes are content-addressed and immutable, so any edit produces a new hash at the node and every ancestor, giving a Merkle-style audit trail for free . (
Asset Harvester: Extracting 3D Assets from Autonomous Driving Logs for Simulation
💡 This research explores techniques in computer vision.
Closed-loop simulation is a core component of autonomous vehicle development, enabling scalable testing, training, and safety validation before real-world deployment . We present Asset Harvester, an image-to-3D model that converts sparse, in-the-wild object observations from real driving logs into simulation-ready assets .
An Integrated Deep-Learning Framework for Peptide-Protein Interaction Prediction and Target-Conditioned Peptide Generation with ConGA-PePPI and TC-PepGen
💡 This research forecasting machine learning.
Peptide-protein interactions (PepPIs) are central to cellular regulation and peptide therapeutics, but characterization remains too slow for large-scale screening . ConGA-PepPI uses asymmetric encoding, bidirectional cross-attention, and progressive transfer from pair prediction to binding-site localization .
From Awareness to Intent: Mitigating Silent Driving System Failures through Prospective Situation Awareness Enhancing Interfaces
💡 This research explores techniques in machine learning.
Silent automation failures pose a critical safety challenge for partially automated vehicles . How to support a driver in silent failure remains underexplored . We found that providing perceptual cues was most effective in enhancing SA, while communicating system intent was superior for building trust .
Randomly Initialized Networks Can Learn from Peer-to-Peer Consensus
💡 This research presents techniques for machine learning.
In self-supervised learning, self-distilled methods have shown impressive performance . However, state-of-the-art methods usually rely on ensembles of complex mechanisms, with many design choices that are not well understood . In this work, we explore the role of self-diffusion within learning dynamics . We show that even this minimal setup can lead to learned representations .
Toward Zero-Egress Psychiatric AI: On-Device LLM Deployment for Privacy-Preserving Mental Health Decision Support
💡 This research presents techniques for language AI.
Privacy represents one of the most critical yet underaddressed barriers to AI adoption in mental healthcare . The proposed system extends our prior work on fine-tuned LLM consortiums for psychiatric diagnosis standardization by fundamentally re-architecting the inference pipeline for fully local execution . The platform is designed to assist clinicians with differential diagnosis and evidence-linked symptom mapping .
Enhancing Tabular Anomaly Detection via Pseudo-Label-Guided Generation
💡 This research improves machine learning.
PLAG is a pseudo-label-guided anomaly generation method designed to enhance tabular anomaly detection . PLAG uses pseudo-anomalies as guidance signals and decoupling the overall anomaly quantification of a sample into an accumulation of feature-level abnormalities. PLAG achieves state-of-the-art performance against eight representative baselines .
Style-Based Neural Architectures for Real-Time Weather Classification
💡 This research presents techniques for computer vision.
In this paper, we present three neural network architectures designed for real-time classification of weather conditions (sunny, rain, snow, fog) from images . These models aim to capture the stylistic elements present in images . Although developed for weather detection, these architectures are also suitable for other appearance-based classification tasks .
Aether: Network Validation Using Agentic AI and Digital Twin
💡 This research explores techniques in machine learning.
Network change validation remains a critical yet predominantly manual, time-consuming, and error-prone process in modern network operations . We present a novel approach that integrates Generative Agentic AI with a multi-functional Network Digital Twin to automate and streamline network change validation workflows .
Continuous Focus Groups: A Longitudinal Method for Clinical HRI in Autism Care
💡 This research improves machine learning.
Qualitative methods are important to use alongside quantitative methods to improve Human-Robot Interaction (HRI) We introduce continuous focus groups, a longitudinal and co-agential method designed to sustain dialogue with assistive care professionals working with children with autism spectrum disorder .
Alleviating Linguistic and Interactional Anxiety of Non-Native Speakers in Multilingual Communication
💡 This research explores techniques in machine learning.
Non-native speakers (NNSs) often encounter speaking difficulties in multilingual communication . We introduce an AI tool with translation for real-time speaking support . It builds a channel for mutual understanding with native speakers (NSs) to mitigate interactional anxiety .
Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs
💡 This research achieves better language AI.
We present BLF (Bayesian Linguistic Forecaster), an agentic system for binary forecasting that achieves state-of-the-art performance on the ForecastBench benchmark . BLF outperforms all the top public methods, including Cassi, GPT-5, Grok~4.20, and Foresight-32B .
LQM: Linguistically Motivated Multidimensional Quality Metrics for Machine Translation
💡 This research explores techniques in language AI.
LQM is a hierarchical error taxonomy for diagnosing MT errors through six linguistically grounded levels: sociolinguistics, pragmatics, semantics, morphosyntax, orthography, and graphetics . We construct a bidirectional parallel corpus of 3,850 sentences (550 per variety) spanning seven Arabic dialects (Egyptian, Emirati, Jordanian, Mauritanian, Moroccan, Palestinian, and Yemeni) We evaluate six LLMs in a zero-shot