Machine Learning

Authors and titles for recent submissions, skipping first 106

Tue, 9 Dec 2025 (continued, showing 25 of 264 entries)

[107] arXiv:2512.06471 [pdf, ps, other]: Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control

Authors: Nathan P. Lawrence, Ali Mesbah

Comments: IFAC preprint

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[108] arXiv:2512.06457 [pdf, ps, other]: Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination

Authors: Huizheng Wang, Hongbin Wang, Shaojun Wei, Yang Hu, Shouyi Yin

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[109] arXiv:2512.06440 [pdf, ps, other]: Title: Neural expressiveness for beyond importance model compression

Authors: Angelos-Christos Maroudis, Sotirios Xydis

Subjects: Machine Learning (cs.LG)
[110] arXiv:2512.06427 [pdf, ps, other]: Title: A new initialisation to Control Gradients in Sinusoidal Neural network

Authors: Andrea Combette, Antoine Venaille, Nelly Pustelnik

Subjects: Machine Learning (cs.LG)
[111] arXiv:2512.06417 [pdf, ps, other]: Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator

Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)

Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[112] arXiv:2512.06392 [pdf, ps, other]: Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs

Authors: Runlong Zhou, Lefan Zhang, Shang-Chen Wu, Kelvin Zou, Hanzhi Zhou, Ke Ye, Yihao Feng, Dong Yin, Alex Guillen Garcia, Dmytro Babych, Rohit Chatterjee, Matthew Hopkins, Xiang Kong, Chang Lan, Lezhi Li, Yiping Ma, Daniele Molinari, Senyu Tong, Yanchao Sun, Thomas Voice, Jianyu Wang, Chong Wang, Simon Wang, Floris Weers, Yechen Xu, Guolin Yin, Muyang Yu, Yi Zhang, Zheng Zhou, Danyang Zhuo, Ruoming Pang, Cheng Leong

Comments: 14 pages, 6 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[113] arXiv:2512.06370 [pdf, ps, other]: Title: Optimizing Optimizers for Fast Gradient-Based Learning

Authors: Jaerin Lee, Kyoung Mu Lee

Comments: 49 pages, 5 figures

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[114] arXiv:2512.06357 [pdf, ps, other]: Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction

Authors: Tony Salloom, Okyay Kaynak, Xinbo Yu, Wei He

Comments: Engineering Applications of Artificial Intelligence 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[115] arXiv:2512.06356 [pdf, ps, other]: Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction

Authors: Yifan Song, Fenglin Yu, Yihong Luo, Xingjian Tao, Siya Qiu, Kai Han, Jing Tang

Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[116] arXiv:2512.06351 [pdf, ps, other]: Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing

Authors: Zhiying Yang, Fang Liu, Wei Zhang, Xin Lou, Malcolm Yoke Hean Low, Boon Ping Gan

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[117] arXiv:2512.06347 [pdf, ps, other]: Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry

Authors: Naoki Yoshida, Isao Ishikawa, Masaaki Imaizumi

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[118] arXiv:2512.06343 [pdf, ps, other]: Title: When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models

Authors: Tong Xie, Andrew Bai, Yuanhao Ban, Yunqi Hong, Haoyu Li, Cho-jui Hsieh

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[119] arXiv:2512.06341 [pdf, ps, other]: Title: Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness

Authors: Ronald Katende

Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Information Theory (cs.IT)
[120] arXiv:2512.06303 [pdf, ps, other]: Title: Multimodal Graph Neural Networks for Prognostic Modeling of Brain Network Reorganization

Authors: Preksha Girish, Rachana Mysore, Kiran K. N., Hiranmayee R., Shipra Prashanth, Shrey Kumar

Comments: 5 pages, 2 figures. IEEE conference-style format

Subjects: Machine Learning (cs.LG)
[121] arXiv:2512.06301 [pdf, ps, other]: Title: Chemistry Integrated Language Model using Hierarchical Molecular Representation for Polymer Informatics

Authors: Jihun Ahn, Gabriella Pasya Irianti, Vikram Thapar, Su-Mi Hur

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[122] arXiv:2512.06297 [pdf, ps, other]: Title: Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks

Authors: Luca Di Carlo, Chase Goddard, David J. Schwab

Comments: Under Review

Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[123] arXiv:2512.06293 [pdf, ps, other]: Title: Importance-aware Topic Modeling for Discovering Public Transit Risk from Noisy Social Media

Authors: Fatima Ashraf, Muhammad Ayub Sabir, Jiaxin Deng, Junbiao Pang, Haitao Yu

Subjects: Machine Learning (cs.LG)
[124] arXiv:2512.06288 [pdf, ps, other]: Title: Theoretical Compression Bounds for Wide Multilayer Perceptrons

Authors: Houssam El Cheairi, David Gamarnik, Rahul Mazumder

Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[125] arXiv:2512.06274 [pdf, ps, other]: Title: Networked Restless Multi-Arm Bandits with Reinforcement Learning

Authors: Hanmo Zhang, Zenghui Sun, Kai Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[126] arXiv:2512.06252 [pdf, ps, other]: Title: Learning Without Time-Based Embodiment Resets in Soft-Actor Critic

Authors: Homayoon Farrahi, A. Rupam Mahmood

Comments: In Proceedings of the 4th Conference on Lifelong Learning Agents (CoLLAs)

Subjects: Machine Learning (cs.LG)
[127] arXiv:2512.06250 [pdf, ps, other]: Title: Learning When to Switch: Adaptive Policy Selection via Reinforcement Learning

Authors: Chris Tava

Comments: 7 pages

Subjects: Machine Learning (cs.LG)
[128] arXiv:2512.06244 [pdf, ps, other]: Title: Auto-exploration for online reinforcement learning

Authors: Caleb Ju, Guanghui Lan

Comments: 35 pages (9 appendix), 1 figure. Comments are welcome

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[129] arXiv:2512.06243 [pdf, ps, other]: Title: Quantization Blindspots: How Model Compression Breaks Backdoor Defenses

Authors: Rohan Pandey, Eric Ye

Comments: 10 pages

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[130] arXiv:2512.06236 [pdf, ps, other]: Title: Empowering GNNs for Domain Adaptation via Denoising Target Graph

Authors: Haiyang Yu, Meng-Chieh Lee, Xiang Song, Qi Zhu, Christos Faloutsos

Subjects: Machine Learning (cs.LG)
[131] arXiv:2512.06218 [pdf, ps, other]: Title: Average-reward reinforcement learning in semi-Markov decision processes via relative value iteration

Authors: Huizhen Yu, Yi Wan, Richard S. Sutton

Comments: 24 pages. This paper presents the reinforcement-learning material previously contained in version 2 of arXiv:2409.03915, which is now being split into two stand-alone papers. Minor corrections and improvements to the main results have also been made in the course of this reformatting

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
