Machine Learning

Authors and titles for recent submissions, skipping first 50

[ total of 979 entries: 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | ... | 951-979 ]

Fri, 5 Dec 2025 (continued, showing 50 of 116 entries)

[51]  arXiv:2512.04299 [pdf, ps, other]
Title: When do spectral gradient updates help in deep learning?
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[52]  arXiv:2512.04296 [pdf, ps, other]
Title: GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers
Comments: Under Review
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[53]  arXiv:2512.04277 [pdf, ps, other]
Title: Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[54]  arXiv:2512.04268 [pdf, ps, other]
Title: The Initialization Determines Whether In-Context Learning Is Gradient Descent
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[55]  arXiv:2512.04264 [pdf, ps, other]
Title: Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2512.04252 [pdf, ps, other]
Title: Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
Authors: Baichuan Zeng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[57]  arXiv:2512.04223 [pdf, ps, other]
Title: ActVAE: Modelling human activity schedules with a deep conditional generative approach
Subjects: Machine Learning (cs.LG)
[58]  arXiv:2512.04198 [pdf, ps, other]
Title: Network of Theseus (like the ship)
Comments: Preprint. 24 pages, 9 figures, 8 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[59]  arXiv:2512.04189 [pdf, ps, other]
Title: BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[60]  arXiv:2512.04165 [pdf, ps, other]
Title: Mitigating the Curse of Detail: Scaling Arguments for Feature Learning and Sample Complexity
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[61]  arXiv:2512.04138 [pdf, ps, other]
Title: MechDetect: Detecting Data-Dependent Errors
Comments: International Conference on Data Science and Intelligent Systems (DSIS 2025)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
[62]  arXiv:2512.04135 [pdf, ps, other]
Title: Decoding Large Language Diffusion Models with Foreseeing Movement
Subjects: Machine Learning (cs.LG)
[63]  arXiv:2512.04125 [pdf, ps, other]
Title: ASCIIBench: Evaluating Language-Model-Based Understanding of Visually-Oriented Text
Comments: Accepted to The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025): LLM Evaluation Workshop & Multimodal Algorithmic Reasoning Workshop
Subjects: Machine Learning (cs.LG)
[64]  arXiv:2512.05112 (cross-list from cs.CV) [pdf, ps, other]
Title: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[65]  arXiv:2512.05106 (cross-list from cs.CV) [pdf, ps, other]
Title: NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[66]  arXiv:2512.05105 (cross-list from cs.CL) [pdf, ps, other]
Title: Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[67]  arXiv:2512.05100 (cross-list from cs.CL) [pdf, ps, other]
Title: Structured Document Translation via Format Reinforcement Learning
Comments: IJCNLP-AACL 2025 Main (Oral)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[68]  arXiv:2512.05092 (cross-list from stat.ML) [pdf, ps, other]
Title: Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[69]  arXiv:2512.05070 (cross-list from stat.ML) [pdf, ps, other]
Title: Control Consistency Losses for Diffusion Bridges
Comments: Frontiers in Probabilistic Inference: Sampling Meets Learning Workshop at NeurIPS 2025 (Oral)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[70]  arXiv:2512.05058 (cross-list from quant-ph) [pdf, ps, other]
Title: Meta-Learning for Quantum Optimization via Quantum Sequence Model
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[71]  arXiv:2512.05049 (cross-list from quant-ph) [pdf, ps, other]
Title: QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[72]  arXiv:2512.05033 (cross-list from cs.CL) [pdf, ps, other]
Title: Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
Comments: 22 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[73]  arXiv:2512.05024 (cross-list from stat.ME) [pdf, ps, other]
Title: Model-Free Assessment of Simulator Fidelity via Quantile Curves
Comments: 33 pages, 11 figures
Subjects: Methodology (stat.ME); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[74]  arXiv:2512.05021 (cross-list from cs.CV) [pdf, ps, other]
Title: HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[75]  arXiv:2512.04992 (cross-list from cs.NE) [pdf, ps, other]
Title: Evolutionary Architecture Search through Grammar-Based Sequence Alignment
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[76]  arXiv:2512.04985 (cross-list from stat.ML) [pdf, ps, other]
Title: Towards a unified framework for guided diffusion models
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[77]  arXiv:2512.04981 (cross-list from cs.CV) [pdf, ps, other]
Title: Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[78]  arXiv:2512.04980 (cross-list from stat.ML) [pdf, ps, other]
Title: Learning Causality for Longitudinal Data
Comments: PhD thesis manuscript
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
[79]  arXiv:2512.04969 (cross-list from cs.CV) [pdf, ps, other]
Title: Rethinking the Use of Vision Transformers for AI-Generated Image Detection
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[80]  arXiv:2512.04966 (cross-list from cs.IT) [pdf, ps, other]
Title: Environment-Aware Channel Inference via Cross-Modal Flow: From Multimodal Sensing to Wireless Channels
Comments: 13 pages, 13 figures, 40 references, submitted to IEEE for possible publication
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[81]  arXiv:2512.04874 (cross-list from math.FA) [pdf, ps, other]
Title: Shorting Dynamics and Structured Kernel Regularization
Authors: James Tian
Subjects: Functional Analysis (math.FA); Machine Learning (cs.LG)
[82]  arXiv:2512.04871 (cross-list from cs.AI) [pdf, ps, other]
Title: STELLA: Guiding Large Language Models for Time Series Forecasting with Semantic Abstractions
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[83]  arXiv:2512.04865 (cross-list from math.AG) [pdf, ps, other]
Title: Series of quasi-uniform scatterings with fast search, root systems and neural network classifications
Authors: Igor V. Netay
Subjects: Algebraic Geometry (math.AG); Machine Learning (cs.LG); Representation Theory (math.RT)
[84]  arXiv:2512.04832 (cross-list from cs.CV) [pdf, ps, other]
Title: Tokenizing Buildings: A Transformer for Layout Synthesis
Comments: 8 pages, 1 page References, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[85]  arXiv:2512.04827 (cross-list from cs.SD) [pdf, ps, other]
Title: Contract-Driven QoE Auditing for Speech and Singing Services: From MOS Regression to Service Graphs
Authors: Wenzhang Du
Comments: 11 pages, 3 figures
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[86]  arXiv:2512.04781 (cross-list from eess.SY) [pdf, ps, other]
Title: Pick-to-Learn for Systems and Control: Data-driven Synthesis with State-of-the-art Safety Guarantees
Comments: 27 double-column pages, 18 figures
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[87]  arXiv:2512.04771 (cross-list from cs.MA) [pdf, ps, other]
Title: Complementary Characterization of Agent-Based Models via Computational Mechanics and Diffusion Models
Authors: Roberto Garrone
Comments: 11 pages. Methods paper introducing a dual-domain framework for analyzing ABM dynamics. Companion temporal-analysis preprint: arXiv:2510.12729
Subjects: Multiagent Systems (cs.MA); Machine Learning (cs.LG)
[88]  arXiv:2512.04727 (cross-list from cs.AI) [pdf, ps, other]
Title: Sequential Enumeration in Large Language Models
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[89]  arXiv:2512.04697 (cross-list from math.OC) [pdf, ps, other]
Title: Continuous-time reinforcement learning for optimal switching over multiple regimes
Comments: Keywords: Optimal regime switching, multiple regimes, continuous-time reinforcement learning, system of HJB equations, policy improvement, policy iteration convergence
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Computational Finance (q-fin.CP)
[90]  arXiv:2512.04696 (cross-list from stat.ML) [pdf, ps, other]
Title: Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond
Authors: Kazuma Sawaya
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[91]  arXiv:2512.04690 (cross-list from stat.ML) [pdf, ps, other]
Title: Recurrent Neural Networks with Linear Structures for Electricity Price Forecasting
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[92]  arXiv:2512.04663 (cross-list from quant-ph) [pdf, ps, other]
Title: Fermionic neural Gibbs states
Subjects: Quantum Physics (quant-ph); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[93]  arXiv:2512.04653 (cross-list from cs.MA) [pdf, ps, other]
Title: Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
Comments: Co-first authors: Pouria Yazdani and Arash Rezaali
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[94]  arXiv:2512.04643 (cross-list from cs.CV) [pdf, ps, other]
Title: SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[95]  arXiv:2512.04597 (cross-list from cs.CV) [pdf, ps, other]
Title: When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[96]  arXiv:2512.04469 (cross-list from cs.AI) [pdf, ps, other]
Title: Mathematical Framing for Different Agent Strategies
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[97]  arXiv:2512.04452 (cross-list from physics.ao-ph) [pdf, ps, other]
Title: NORi: An ML-Augmented Ocean Boundary Layer Parameterization
Comments: 48 pages, 16 figures, submitted to Journal of Advances in Modeling Earth Systems (JAMES)
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Fluid Dynamics (physics.flu-dyn)
[98]  arXiv:2512.04434 (cross-list from physics.flu-dyn) [pdf, ps, other]
Title: Predicting Time-Dependent Flow Over Complex Geometries Using Operator Networks
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG)
[99]  arXiv:2512.04396 (cross-list from cs.CL) [pdf, ps, other]
Title: Sarcasm Detection on Reddit Using Classical Machine Learning and Feature Engineering
Authors: Subrata Karmaker
Comments: 11 pages, 2 figures, includes full Python code. Classical machine learning baseline for sarcasm detection on the SARC 2.0 dataset
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100]  arXiv:2512.04392 (cross-list from stat.ML) [pdf, ps, other]
Title: Informative missingness and its implications in semi-supervised learning
Comments: 1
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)