Latest AI News

arXiv cs.LG · about 5 hours ago · 5 min read

Neural Bayesian Sequential Routing

arXiv:2605.26147v1. Announce type: new.

Abstract: Human decision-making is sequential and uncertainty-aware, yet standard neural networks often rely on static, dense forward computation with limited visibility into evidence acquisition, uncertainty evolution, or when computation should stop. We introduce Neural Bayesian Sequential Routing (NBSR), a framework that models neural inference as active evidence accumulation over a hierarchical Directed Acyclic Graph (DAG). Within a Dirichlet-Categorical conjugate framework, neural experts query a persistent global knowledge oracle to extract positive evidence vectors, which act as pseudo-counts and update a Dirichlet belief state by exact conjugate addition. Coupled with a Gumbel-Softmax Straight-Through estimator, this update enables hard, path-dependent routing while preserving surrogate gradients for end-to-end training. The resulting Dirichlet precision and entropy provide mechanisms for uncertainty quantification, entropy-based early exiting, OOD abstention, and cost-aware evidence acquisition. We prove that, under strictly positive evidence extraction, total Dirichlet precision increases monotonically along any valid trajectory and marginal predictive variance is bounded, formalizing sequential "hypothesis sharpening"; under idealized capacity and optimization assumptions, the terminal Dirichlet expectation recovers the Bayes-optimal conditional distribution. Empirical evaluations across visual categorization, structured medical diagnosis, language modeling, partially observable control, and cost-aware Bayesian experimental design show that NBSR achieves competitive predictive performance while providing transparent routing traces, path-dependent evidence attribution, uncertainty-aware decision control, and resource-rational inference. Overall, NBSR offers a mathematically grounded framework for interpretable, modular, and resource-rational agentic AI.
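The conjugate update described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it leaves out the neural experts, the knowledge oracle, the DAG, and the Gumbel-Softmax routing, and simply feeds hand-picked evidence vectors into a Dirichlet belief state. The function names, the evidence values, and the threshold are hypothetical, and using the entropy of the Dirichlet mean as the exit criterion is an assumption, since the abstract does not say which entropy is used. The variance check relies on the standard Dirichlet property that each marginal variance is at most 1 / (4 (precision + 1)); the paper's own bound may be stated differently.

```python
import math

def dirichlet_update(alpha, evidence):
    """Conjugate update: add strictly positive pseudo-counts to the concentration vector."""
    if any(e <= 0 for e in evidence):
        raise ValueError("evidence must be strictly positive")
    return [a + e for a, e in zip(alpha, evidence)]

def precision(alpha):
    """Total Dirichlet precision (sum of concentration parameters)."""
    return sum(alpha)

def mean(alpha):
    """Expected categorical distribution under the Dirichlet belief."""
    s = sum(alpha)
    return [a / s for a in alpha]

def predictive_entropy(alpha):
    """Shannon entropy (nats) of the Dirichlet mean; an assumed exit criterion."""
    return -sum(p * math.log(p) for p in mean(alpha) if p > 0)

def marginal_variance(alpha):
    """Variance of each marginal of the Dirichlet distribution."""
    s = sum(alpha)
    return [a * (s - a) / (s * s * (s + 1)) for a in alpha]

def accumulate(prior, evidence_stream, entropy_threshold):
    """Apply evidence vectors in order; stop once the entropy drops below the threshold."""
    alpha = list(prior)
    trace = [precision(alpha)]
    for step, evidence in enumerate(evidence_stream, start=1):
        alpha = dirichlet_update(alpha, evidence)
        trace.append(precision(alpha))
        if predictive_entropy(alpha) < entropy_threshold:
            return alpha, trace, step  # early exit
    return alpha, trace, len(trace) - 1

# Hypothetical evidence vectors for a 3-class problem (illustrative values only).
prior = [1.0, 1.0, 1.0]
stream = [
    [2.0, 0.5, 0.5],
    [4.0, 0.25, 0.25],
    [8.0, 0.1, 0.1],
    [1.0, 1.0, 1.0],  # never reached if the exit fires earlier
]
alpha, trace, steps = accumulate(prior, stream, entropy_threshold=0.7)
print("steps used:", steps)
print("precision trace:", trace)
print("belief mean:", mean(alpha))
```

Because every evidence entry is strictly positive, each update adds a positive amount to the precision, so the trace is strictly increasing regardless of which vectors are supplied. In this toy setup the loop stops before the last vector is consumed, which is the kind of behavior "entropy-based early exiting" refers to.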

Latest News

AdaSD: Adaptive Speculative Decoding for Efficient Language Model Inference

SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection

Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

Neural Bayesian Sequential Routing

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works

Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks

Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks

When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection

Stateful Inference for Low-Latency Multi-Agent Tool Calling

AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion

Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks

Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering and Reasoning

Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers

Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models

AIDG: A Formal Decomposition of Information Extraction and Containment Asymmetries in Multi-Turn LLM Dialogue

ADRD-Bench: A Preliminary LLM Benchmark for Alzheimer's Disease and Related Dementias

How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework

Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction