We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 50

[ total of 979 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | ... | 976-979 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing 25 of 116 entries)

[51]  arXiv:2512.04299 [pdf, ps, other]
Title: When do spectral gradient updates help in deep learning?
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[52]  arXiv:2512.04296 [pdf, ps, other]
Title: GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers
Comments: Under Review
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[53]  arXiv:2512.04277 [pdf, ps, other]
Title: Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[54]  arXiv:2512.04268 [pdf, ps, other]
Title: The Initialization Determines Whether In-Context Learning Is Gradient Descent
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[55]  arXiv:2512.04264 [pdf, ps, other]
Title: Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2512.04252 [pdf, ps, other]
Title: Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
Authors: Baichuan Zeng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[57]  arXiv:2512.04223 [pdf, ps, other]
Title: ActVAE: Modelling human activity schedules with a deep conditional generative approach
Subjects: Machine Learning (cs.LG)
[58]  arXiv:2512.04198 [pdf, ps, other]
Title: Network of Theseus (like the ship)
Comments: Preprint. 24 pages, 9 figures, 8 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[59]  arXiv:2512.04189 [pdf, ps, other]
Title: BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[60]  arXiv:2512.04165 [pdf, ps, other]
Title: Mitigating the Curse of Detail: Scaling Arguments for Feature Learning and Sample Complexity
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[61]  arXiv:2512.04138 [pdf, ps, other]
Title: MechDetect: Detecting Data-Dependent Errors
Comments: International Conference on Data Science and Intelligent Systems (DSIS 2025)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
[62]  arXiv:2512.04135 [pdf, ps, other]
Title: Decoding Large Language Diffusion Models with Foreseeing Movement
Subjects: Machine Learning (cs.LG)
[63]  arXiv:2512.04125 [pdf, ps, other]
Title: ASCIIBench: Evaluating Language-Model-Based Understanding of Visually-Oriented Text
Comments: Accepted to The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025): LLM Evaluation Workshop & Multimodal Algorithmic Reasoning Workshop
Subjects: Machine Learning (cs.LG)
[64]  arXiv:2512.05112 (cross-list from cs.CV) [pdf, ps, other]
Title: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[65]  arXiv:2512.05106 (cross-list from cs.CV) [pdf, ps, other]
Title: NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[66]  arXiv:2512.05105 (cross-list from cs.CL) [pdf, ps, other]
Title: Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[67]  arXiv:2512.05100 (cross-list from cs.CL) [pdf, ps, other]
Title: Structured Document Translation via Format Reinforcement Learning
Comments: IJCNLP-AACL 2025 Main (Oral)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[68]  arXiv:2512.05092 (cross-list from stat.ML) [pdf, ps, other]
Title: Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[69]  arXiv:2512.05070 (cross-list from stat.ML) [pdf, ps, other]
Title: Control Consistency Losses for Diffusion Bridges
Comments: Frontiers in Probabilistic Inference: Sampling Meets Learning Workshop at NeurIPS 2025 (Oral)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[70]  arXiv:2512.05058 (cross-list from quant-ph) [pdf, ps, other]
Title: Meta-Learning for Quantum Optimization via Quantum Sequence Model
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[71]  arXiv:2512.05049 (cross-list from quant-ph) [pdf, ps, other]
Title: QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[72]  arXiv:2512.05033 (cross-list from cs.CL) [pdf, ps, other]
Title: Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
Comments: 22 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[73]  arXiv:2512.05024 (cross-list from stat.ME) [pdf, ps, other]
Title: Model-Free Assessment of Simulator Fidelity via Quantile Curves
Comments: 33 pages, 11 figures
Subjects: Methodology (stat.ME); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[74]  arXiv:2512.05021 (cross-list from cs.CV) [pdf, ps, other]
Title: HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[75]  arXiv:2512.04992 (cross-list from cs.NE) [pdf, ps, other]
Title: Evolutionary Architecture Search through Grammar-Based Sequence Alignment
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 979 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | ... | 976-979 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)