We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 96

[ total of 803 entries: 1-25 | 22-46 | 47-71 | 72-96 | 97-121 | 122-146 | 147-171 | 172-196 | ... | 797-803 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 25 of 264 entries)

[97]  arXiv:2512.06609 [pdf, ps, other]
Title: Vector Quantization using Gaussian Variational Autoencoder
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.06607 [pdf, ps, other]
Title: A Fast and Effective Solution to the Problem of Look-ahead Bias in LLMs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[99]  arXiv:2512.06592 [pdf, ps, other]
Title: On fine-tuning Boltz-2 for protein-protein affinity prediction
Comments: MLSB 2025
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[100]  arXiv:2512.06582 [pdf, ps, other]
Title: QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling
Authors: Isaac Kofi Nti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[101]  arXiv:2512.06563 [pdf, ps, other]
Title: Deep Manifold Part 2: Neural Network Mathematics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[102]  arXiv:2512.06547 [pdf, ps, other]
Title: A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[103]  arXiv:2512.06533 [pdf, ps, other]
Title: Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[104]  arXiv:2512.06520 [pdf, ps, other]
Title: Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Comments: 17 pages, 12 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[105]  arXiv:2512.06511 [pdf, ps, other]
Title: Diagnosis-based mortality prediction for intensive care unit patients via transfer learning
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[106]  arXiv:2512.06490 [pdf, ps, other]
Title: Optimizing LLMs Using Quantization for Mobile Execution
Comments: 11 pages, 1 equation, 2 tables. Author Accepted Manuscript (AAM) of a paper published in Springer LNNS, ICT4SD 2025. DOI: 10.1007/978-3-032-06697-8_33
Journal-ref: Fong, S., Dey, N., Joshi, A. (eds) ICT Analysis and Applications. ICT4SD 2025. Lecture Notes in Networks and Systems, vol 1654. Springer, Cham
Subjects: Machine Learning (cs.LG)
[107]  arXiv:2512.06471 [pdf, ps, other]
Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
Comments: IFAC preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.06457 [pdf, ps, other]
Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[109]  arXiv:2512.06440 [pdf, ps, other]
Title: Neural expressiveness for beyond importance model compression
Subjects: Machine Learning (cs.LG)
[110]  arXiv:2512.06427 [pdf, ps, other]
Title: A new initialisation to Control Gradients in Sinusoidal Neural network
Subjects: Machine Learning (cs.LG)
[111]  arXiv:2512.06417 [pdf, ps, other]
Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[112]  arXiv:2512.06392 [pdf, ps, other]
Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[113]  arXiv:2512.06370 [pdf, ps, other]
Title: Optimizing Optimizers for Fast Gradient-Based Learning
Comments: 49 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[114]  arXiv:2512.06357 [pdf, ps, other]
Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction
Comments: Engineering Applications of Artificial Intelligence 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[115]  arXiv:2512.06356 [pdf, ps, other]
Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[116]  arXiv:2512.06351 [pdf, ps, other]
Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[117]  arXiv:2512.06347 [pdf, ps, other]
Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[118]  arXiv:2512.06343 [pdf, ps, other]
Title: When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[119]  arXiv:2512.06341 [pdf, ps, other]
Title: Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness
Authors: Ronald Katende
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Information Theory (cs.IT)
[120]  arXiv:2512.06303 [pdf, ps, other]
Title: Multimodal Graph Neural Networks for Prognostic Modeling of Brain Network Reorganization
Comments: 5 pages, 2 figures. IEEE conference-style format
Subjects: Machine Learning (cs.LG)
[121]  arXiv:2512.06301 [pdf, ps, other]
Title: Chemistry Integrated Language Model using Hierarchical Molecular Representation for Polymer Informatics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[ total of 803 entries: 1-25 | 22-46 | 47-71 | 72-96 | 97-121 | 122-146 | 147-171 | 172-196 | ... | 797-803 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)