Machine Learning

Authors and titles for recent submissions, skipping first 101

[ total of 803 entries; showing entries 102-126, 25 entries per page ]

Tue, 9 Dec 2025 (continued, showing 25 of 264 entries)

[102]  arXiv:2512.06547 [pdf, ps, other]
Title: A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[103]  arXiv:2512.06533 [pdf, ps, other]
Title: Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[104]  arXiv:2512.06520 [pdf, ps, other]
Title: Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Comments: 17 pages, 12 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[105]  arXiv:2512.06511 [pdf, ps, other]
Title: Diagnosis-based mortality prediction for intensive care unit patients via transfer learning
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[106]  arXiv:2512.06490 [pdf, ps, other]
Title: Optimizing LLMs Using Quantization for Mobile Execution
Comments: 11 pages, 1 equation, 2 tables. Author Accepted Manuscript (AAM) of a paper published in Springer LNNS, ICT4SD 2025. DOI: 10.1007/978-3-032-06697-8_33
Journal-ref: Fong, S., Dey, N., Joshi, A. (eds) ICT Analysis and Applications. ICT4SD 2025. Lecture Notes in Networks and Systems, vol 1654. Springer, Cham
Subjects: Machine Learning (cs.LG)
[107]  arXiv:2512.06471 [pdf, ps, other]
Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
Comments: IFAC preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.06457 [pdf, ps, other]
Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[109]  arXiv:2512.06440 [pdf, ps, other]
Title: Neural expressiveness for beyond importance model compression
Subjects: Machine Learning (cs.LG)
[110]  arXiv:2512.06427 [pdf, ps, other]
Title: A new initialisation to Control Gradients in Sinusoidal Neural network
Subjects: Machine Learning (cs.LG)
[111]  arXiv:2512.06417 [pdf, ps, other]
Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[112]  arXiv:2512.06392 [pdf, ps, other]
Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[113]  arXiv:2512.06370 [pdf, ps, other]
Title: Optimizing Optimizers for Fast Gradient-Based Learning
Comments: 49 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[114]  arXiv:2512.06357 [pdf, ps, other]
Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction
Comments: Engineering Applications of Artificial Intelligence 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[115]  arXiv:2512.06356 [pdf, ps, other]
Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[116]  arXiv:2512.06351 [pdf, ps, other]
Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[117]  arXiv:2512.06347 [pdf, ps, other]
Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[118]  arXiv:2512.06343 [pdf, ps, other]
Title: When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[119]  arXiv:2512.06341 [pdf, ps, other]
Title: Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness
Authors: Ronald Katende
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Information Theory (cs.IT)
[120]  arXiv:2512.06303 [pdf, ps, other]
Title: Multimodal Graph Neural Networks for Prognostic Modeling of Brain Network Reorganization
Comments: 5 pages, 2 figures. IEEE conference-style format
Subjects: Machine Learning (cs.LG)
[121]  arXiv:2512.06301 [pdf, ps, other]
Title: Chemistry Integrated Language Model using Hierarchical Molecular Representation for Polymer Informatics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[122]  arXiv:2512.06297 [pdf, ps, other]
Title: Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks
Comments: Under Review
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[123]  arXiv:2512.06293 [pdf, ps, other]
Title: Importance-aware Topic Modeling for Discovering Public Transit Risk from Noisy Social Media
Subjects: Machine Learning (cs.LG)
[124]  arXiv:2512.06288 [pdf, ps, other]
Title: Theoretical Compression Bounds for Wide Multilayer Perceptrons
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[125]  arXiv:2512.06274 [pdf, ps, other]
Title: Networked Restless Multi-Arm Bandits with Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[126]  arXiv:2512.06252 [pdf, ps, other]
Title: Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
Comments: In Proceedings of the 4th Conference on Lifelong Learning Agents (CoLLAs)
Subjects: Machine Learning (cs.LG)