We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 40

[ total of 804 entries: 1-50 | 41-90 | 91-140 | 141-190 | 191-240 | ... | 791-804 ]
[ showing 50 entries per page: fewer | more | all ]

Wed, 10 Dec 2025 (continued, showing 50 of 143 entries)

[41]  arXiv:2512.08211 [pdf, ps, other]
Title: MobileFineTuner: A Unified End-to-End Framework for Fine-Tuning LLMs on Mobile Phones
Comments: 15 pages, 9 figures, submitted to Mobisys 2026
Subjects: Machine Learning (cs.LG)
[42]  arXiv:2512.08160 [pdf, ps, other]
Title: LayerPipe2: Multistage Pipelining and Weight Recompute via Improved Exponential Moving Average for Training Neural Networks
Comments: Proc. of 2025 Asilomar Conference on Signals, Systems, and Computers, October 2025, Pacific Grove, CA
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[43]  arXiv:2512.08153 [pdf, ps, other]
Title: TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Authors: Zheng Ding, Weirui Ye
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2512.08143 [pdf, ps, other]
Title: PolyLingua: Margin-based Inter-class Transformer for Robust Cross-domain Language Detection
Subjects: Machine Learning (cs.LG)
[45]  arXiv:2512.08139 [pdf, ps, other]
Title: Robust Agents in Open-Ended Worlds
Comments: PhD Thesis
Subjects: Machine Learning (cs.LG)
[46]  arXiv:2512.08130 [pdf, ps, other]
Title: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models I: The Task-Query Architecture
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[47]  arXiv:2512.08129 [pdf, ps, other]
Title: Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization
Subjects: Machine Learning (cs.LG)
[48]  arXiv:2512.08124 [pdf, ps, other]
Title: Long-only cryptocurrency portfolio management by ranking the assets: a neural network approach
Authors: Zijiang Yang
Journal-ref: 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 2025, pp. 1-8
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[49]  arXiv:2512.08121 [pdf, ps, other]
Title: Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Comments: 9 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[50]  arXiv:2512.08108 [pdf, ps, other]
Title: Scalable Offline Model-Based RL with Action Chunks
Comments: 22 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[51]  arXiv:2512.08093 [pdf, ps, other]
Title: Training LLMs for Honesty via Confessions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[52]  arXiv:2512.08091 [pdf, ps, other]
Title: Complexity of One-Dimensional ReLU DNNs
Comments: Presented at IEEE MIT URTC 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[53]  arXiv:2512.08077 [pdf, ps, other]
Title: Unveiling Latent Knowledge in Chemistry Language Models through Sparse Autoencoders
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[54]  arXiv:2512.08071 [pdf, ps, other]
Title: CAMO: Causality-Guided Adversarial Multimodal Domain Generalization for Crisis Classification
Subjects: Machine Learning (cs.LG)
[55]  arXiv:2512.08063 [pdf, ps, other]
Title: Deep Kernel Aalen-Johansen Estimator: An Interpretable and Flexible Neural Net Framework for Competing Risks
Comments: Machine Learning for Health (ML4H) 2025 Spotlight
Subjects: Machine Learning (cs.LG)
[56]  arXiv:2512.08061 [pdf, ps, other]
Title: LUNA: Linear Universal Neural Attention with Generalization Guarantees
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[57]  arXiv:2512.08029 [pdf, ps, other]
Title: CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2512.08012 [pdf, ps, other]
Title: Benchmarking Offline Multi-Objective Reinforcement Learning in Critical Care
Subjects: Machine Learning (cs.LG)
[59]  arXiv:2512.07992 [pdf, ps, other]
Title: Bridging the Clinical Expertise Gap: Development of a Web-Based Platform for Accessible Time Series Forecasting and Analysis
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE)
[60]  arXiv:2512.07988 [pdf, ps, other]
Title: HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability
Subjects: Machine Learning (cs.LG); Graphics (cs.GR); Human-Computer Interaction (cs.HC)
[61]  arXiv:2512.07981 [pdf, ps, other]
Title: CIP-Net: Continual Interpretable Prototype-based Network
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2512.07961 [pdf, ps, other]
Title: Towards symbolic regression for interpretable clinical decision scores
Comments: 15 pages, 5 figures. Accepted for publication in Philosophical Transactions A. Autor Accepted Manuscript version
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[63]  arXiv:2512.07885 [pdf, ps, other]
Title: ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking
Comments: 21 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[64]  arXiv:2512.07884 [pdf, ps, other]
Title: GSPN-2: Efficient Parallel Sequence Modeling
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2512.07880 [pdf, ps, other]
Title: Semi-Supervised Contrastive Learning with Orthonormal Prototypes
Authors: Huanran Li (1), Manh Nguyen (2), Daniel Pimentel-Alarcón (3) ((1) Department of Electrical Engineering, (2) Statistics, (3) Biostatistics, Wisconsin Institute of Discovery, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[66]  arXiv:2512.07879 [pdf, ps, other]
Title: Nonnegative Matrix Factorization through Cone Collapse
Authors: Manh Nguyen (1), Daniel Pimentel-Alarcón (2) ((1) Department of Statistics, (2) Department of Biostatistics and Medical Informatics, Wisconsin Institute of Discovery, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[67]  arXiv:2512.07878 [pdf, ps, other]
Title: Graph Contrastive Learning via Spectral Graph Alignment
Authors: Manh Nguyen, Joshua Cape (Department of Statistics, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[68]  arXiv:2512.07877 [pdf, ps, other]
Title: Artificial Intelligence-Driven Network-on-Chip Design Space Exploration: Neural Network Architectures for Design
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[69]  arXiv:2512.07876 [pdf, ps, other]
Title: Fourier-Enhanced Recurrent Neural Networks for Electrical Load Time Series Downscaling
Comments: Submitted to IEEE PES General Meeting 2026
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[70]  arXiv:2512.07875 [pdf, ps, other]
Title: Softly Symbolifying Kolmogorov-Arnold Networks
Comments: 13 pages, 5 figures, 3 tables
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[71]  arXiv:2512.07874 [pdf, ps, other]
Title: Controllable risk scenario generation from human crash data for autonomous vehicle testing
Subjects: Machine Learning (cs.LG)
[72]  arXiv:2512.07873 [pdf, ps, other]
Title: Advancing physiological time series reconstruction and imputation via mixture of receptive fields and experts fusion
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[73]  arXiv:2512.07868 [pdf, ps, other]
Title: Bayesian Optimization for Function-Valued Responses under Min-Max Criteria
Comments: 25 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[74]  arXiv:2512.07866 [pdf, ps, other]
Title: Command & Control (C2) Traffic Detection Via Algorithm Generated Domain (Dga) Classification Using Deep Learning And Natural Language Processing
Comments: Language: Portuguese
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[75]  arXiv:2512.07865 [pdf, ps, other]
Title: Using Text-Based Life Trajectories from Swedish Register Data to Predict Residential Mobility with Pretrained Transformers
Subjects: Machine Learning (cs.LG)
[76]  arXiv:2512.07864 [pdf, ps, other]
Title: Pattern Recognition of Ozone-Depleting Substance Exports in Global Trade Data
Subjects: Machine Learning (cs.LG); Econometrics (econ.EM); General Economics (econ.GN)
[77]  arXiv:2512.07863 [pdf, ps, other]
Title: SetAD: Semi-Supervised Anomaly Learning in Contextual Sets
Comments: 9 pages
Subjects: Machine Learning (cs.LG)
[78]  arXiv:2512.07858 [pdf, ps, other]
Title: FAIM: Frequency-Aware Interactive Mamba for Time Series Classification
Subjects: Machine Learning (cs.LG)
[79]  arXiv:2512.07857 [pdf, ps, other]
Title: SA^2GFM: Enhancing Robust Graph Foundation Models with Structure-Aware Semantic Augmentation
Subjects: Machine Learning (cs.LG)
[80]  arXiv:2512.07856 [pdf, ps, other]
Title: Medical Test-free Disease Detection Based on Big Data
Subjects: Machine Learning (cs.LG)
[81]  arXiv:2512.07855 [pdf, ps, other]
Title: LAPA: Log-Domain Prediction-Driven Dynamic Sparsity Accelerator for Transformer Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[82]  arXiv:2512.07854 [pdf, ps, other]
Title: HSTMixer: A Hierarchical MLP-Mixer for Large-Scale Traffic Forecasting
Comments: 10 pages, 9 figures
Subjects: Machine Learning (cs.LG)
[83]  arXiv:2512.07853 [pdf, ps, other]
Title: GPU Memory Prediction for Multimodal Model Training
Comments: 1st Workshop on Systems for Agentic AI (SAA '25), co-located with SOSP 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[84]  arXiv:2512.07850 [pdf, ps, other]
Title: SABER: Small Actions, Big Errors -- Safeguarding Mutating Steps in LLM Agents
Comments: submitted to ICLR2026
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[85]  arXiv:2512.07848 [pdf, ps, other]
Title: RaX-Crash: A Resource Efficient and Explainable Small Model Pipeline with an Application to City Scale Injury Severity Prediction
Subjects: Machine Learning (cs.LG)
[86]  arXiv:2512.07847 [pdf, ps, other]
Title: CarBench: A Comprehensive Benchmark for Neural Surrogates on High-Fidelity 3D Car Aerodynamics
Subjects: Machine Learning (cs.LG)
[87]  arXiv:2512.07844 [pdf, ps, other]
Title: Space Alignment Matters: The Missing Piece for Inducing Neural Collapse in Long-Tailed Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[88]  arXiv:2512.07843 [pdf, ps, other]
Title: ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[89]  arXiv:2512.08931 (cross-list from cs.CV) [pdf, ps, other]
Title: Astra: General Interactive World Model with Autoregressive Denoising
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[90]  arXiv:2512.08920 (cross-list from cs.RO) [pdf, ps, other]
Title: OSMO: Open-Source Tactile Glove for Human-to-Robot Skill Transfer
Comments: Project website: this https URL
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[ total of 804 entries: 1-50 | 41-90 | 91-140 | 141-190 | 191-240 | ... | 791-804 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)