Machine Learning

Authors and titles for recent submissions, skipping first 106

Tue, 9 Dec 2025 (continued, showing 25 of 264 entries)

[107]  arXiv:2512.06471 [pdf, ps, other]
Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
Comments: IFAC preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.06457 [pdf, ps, other]
Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[109]  arXiv:2512.06440 [pdf, ps, other]
Title: Neural expressiveness for beyond importance model compression
Subjects: Machine Learning (cs.LG)
[110]  arXiv:2512.06427 [pdf, ps, other]
Title: A new initialisation to Control Gradients in Sinusoidal Neural network
Subjects: Machine Learning (cs.LG)
[111]  arXiv:2512.06417 [pdf, ps, other]
Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[112]  arXiv:2512.06392 [pdf, ps, other]
Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[113]  arXiv:2512.06370 [pdf, ps, other]
Title: Optimizing Optimizers for Fast Gradient-Based Learning
Comments: 49 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[114]  arXiv:2512.06357 [pdf, ps, other]
Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction
Comments: Engineering Applications of Artificial Intelligence 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[115]  arXiv:2512.06356 [pdf, ps, other]
Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[116]  arXiv:2512.06351 [pdf, ps, other]
Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[117]  arXiv:2512.06347 [pdf, ps, other]
Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[118]  arXiv:2512.06343 [pdf, ps, other]
Title: When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[119]  arXiv:2512.06341 [pdf, ps, other]
Title: Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness
Authors: Ronald Katende
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Information Theory (cs.IT)
[120]  arXiv:2512.06303 [pdf, ps, other]
Title: Multimodal Graph Neural Networks for Prognostic Modeling of Brain Network Reorganization
Comments: 5 pages, 2 figures. IEEE conference-style format
Subjects: Machine Learning (cs.LG)
[121]  arXiv:2512.06301 [pdf, ps, other]
Title: Chemistry Integrated Language Model using Hierarchical Molecular Representation for Polymer Informatics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[122]  arXiv:2512.06297 [pdf, ps, other]
Title: Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks
Comments: Under Review
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[123]  arXiv:2512.06293 [pdf, ps, other]
Title: Importance-aware Topic Modeling for Discovering Public Transit Risk from Noisy Social Media
Subjects: Machine Learning (cs.LG)
[124]  arXiv:2512.06288 [pdf, ps, other]
Title: Theoretical Compression Bounds for Wide Multilayer Perceptrons
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[125]  arXiv:2512.06274 [pdf, ps, other]
Title: Networked Restless Multi-Arm Bandits with Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[126]  arXiv:2512.06252 [pdf, ps, other]
Title: Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
Comments: In Proceedings of the 4th Conference on Lifelong Learning Agents (CoLLAs)
Subjects: Machine Learning (cs.LG)
[127]  arXiv:2512.06250 [pdf, ps, other]
Title: Learning When to Switch: Adaptive Policy Selection via Reinforcement Learning
Authors: Chris Tava
Comments: 7 pages
Subjects: Machine Learning (cs.LG)
[128]  arXiv:2512.06244 [pdf, ps, other]
Title: Auto-exploration for online reinforcement learning
Comments: 35 pages (9 appendix), 1 figure. Comments are welcome
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[129]  arXiv:2512.06243 [pdf, ps, other]
Title: Quantization Blindspots: How Model Compression Breaks Backdoor Defenses
Authors: Rohan Pandey, Eric Ye
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[130]  arXiv:2512.06236 [pdf, ps, other]
Title: Empowering GNNs for Domain Adaptation via Denoising Target Graph
Subjects: Machine Learning (cs.LG)
[131]  arXiv:2512.06218 [pdf, ps, other]
Title: Average-reward reinforcement learning in semi-Markov decision processes via relative value iteration
Comments: 24 pages. This paper presents the reinforcement-learning material previously contained in version 2 of arXiv:2409.03915, which is now being split into two stand-alone papers. Minor corrections and improvements to the main results have also been made in the course of this reformatting
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)