We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 21

[ total of 804 entries: 1-250 | 22-271 | 272-521 | 522-771 | 772-804 ]
[ showing 250 entries per page: fewer | more | all ]

Wed, 10 Dec 2025 (continued, showing last 122 of 143 entries)

[22]  arXiv:2512.08499 [pdf, ps, other]
Title: Developing Distance-Aware Uncertainty Quantification Methods in Physics-Guided Neural Networks for Reliable Bearing Health Prediction
Comments: Under review at Structural health Monitoring - SAGE
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[23]  arXiv:2512.08485 [pdf, ps, other]
Title: Optimal Perturbation Budget Allocation for Data Poisoning in Offline Reinforcement Learning
Authors: Junnan Qiu, Jie Li
Subjects: Machine Learning (cs.LG)
[24]  arXiv:2512.08475 [pdf, ps, other]
Title: Solving Over-Smoothing in GNNs via Nonlocal Message Passing: Algebraic Smoothing and Depth Scalability
Authors: Weiqi Guan, Junlin He
Comments: 18 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[25]  arXiv:2512.08462 [pdf, ps, other]
Title: Transformers for Multimodal Brain State Decoding: Integrating Functional Magnetic Resonance Imaging Data and Medical Metadata
Subjects: Machine Learning (cs.LG)
[26]  arXiv:2512.08459 [pdf, ps, other]
Title: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models III: Implementing the Bacterial Biothreat Benchmark (B3) Dataset
Comments: 19 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Emerging Technologies (cs.ET)
[27]  arXiv:2512.08451 [pdf, ps, other]
Title: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models II: Benchmark Generation Process
Comments: 18 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Emerging Technologies (cs.ET)
[28]  arXiv:2512.08443 [pdf, ps, other]
Title: Fully Decentralized Certified Unlearning
Subjects: Machine Learning (cs.LG)
[29]  arXiv:2512.08371 [pdf, ps, other]
Title: A Multivariate Bernoulli-Based Sampling Method for Multi-Label Data with Application to Meta-Research
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[30]  arXiv:2512.08314 [pdf, ps, other]
Title: Minimizing Layerwise Activation Norm Improves Generalization in Federated Learning
Comments: Accepted to WACV 2024
Subjects: Machine Learning (cs.LG)
[31]  arXiv:2512.08306 [pdf, ps, other]
Title: Jacobian Aligned Random Forests
Authors: Sarwesh Rauniyar
Subjects: Machine Learning (cs.LG)
[32]  arXiv:2512.08274 [pdf, ps, other]
Title: gHAWK: Local and Global Structure Encoding for Scalable Training of Graph Neural Networks on Knowledge Graphs
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[33]  arXiv:2512.08267 [pdf, ps, other]
Title: SOFA-FL: Self-Organizing Hierarchical Federated Learning with Adaptive Clustered Data Sharing
Subjects: Machine Learning (cs.LG)
[34]  arXiv:2512.08264 [pdf, ps, other]
Title: Mathematical Foundations of Neural Tangents and Infinite-Width Networks
Comments: 7 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[35]  arXiv:2512.08257 [pdf, ps, other]
Title: Geometric-Stochastic Multimodal Deep Learning for Predictive Modeling of SUDEP and Stroke Vulnerability
Comments: 7 pages, 3 figures
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[36]  arXiv:2512.08256 [pdf, ps, other]
Title: Wavelet-Accelerated Physics-Informed Quantum Neural Network for Multiscale Partial Differential Equations
Subjects: Machine Learning (cs.LG); Analysis of PDEs (math.AP); Quantum Algebra (math.QA)
[37]  arXiv:2512.08246 [pdf, ps, other]
Title: SPROCKET: Extending ROCKET to Distance-Based Time-Series Transformations With Prototypes
Authors: Nicholas Harner
Comments: 63 Pages, 28 in main body with 3 appendices for supplemental figures
Subjects: Machine Learning (cs.LG)
[38]  arXiv:2512.08241 [pdf, ps, other]
Title: Persistent Topological Structures and Cohomological Flows as a Mathematical Framework for Brain-Inspired Representation Learning
Comments: 6 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[39]  arXiv:2512.08218 [pdf, ps, other]
Title: PR-CapsNet: Pseudo-Riemannian Capsule Network with Adaptive Curvature Routing for Graph Learning
Comments: To appear in WSDM 2026 (ACM International Conference on Web Search and Data Mining)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[40]  arXiv:2512.08217 [pdf, ps, other]
Title: Correction of Decoupled Weight Decay
Subjects: Machine Learning (cs.LG)
[41]  arXiv:2512.08211 [pdf, ps, other]
Title: MobileFineTuner: A Unified End-to-End Framework for Fine-Tuning LLMs on Mobile Phones
Comments: 15 pages, 9 figures, submitted to Mobisys 2026
Subjects: Machine Learning (cs.LG)
[42]  arXiv:2512.08160 [pdf, ps, other]
Title: LayerPipe2: Multistage Pipelining and Weight Recompute via Improved Exponential Moving Average for Training Neural Networks
Comments: Proc. of 2025 Asilomar Conference on Signals, Systems, and Computers, October 2025, Pacific Grove, CA
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[43]  arXiv:2512.08153 [pdf, ps, other]
Title: TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Authors: Zheng Ding, Weirui Ye
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2512.08143 [pdf, ps, other]
Title: PolyLingua: Margin-based Inter-class Transformer for Robust Cross-domain Language Detection
Subjects: Machine Learning (cs.LG)
[45]  arXiv:2512.08139 [pdf, ps, other]
Title: Robust Agents in Open-Ended Worlds
Comments: PhD Thesis
Subjects: Machine Learning (cs.LG)
[46]  arXiv:2512.08130 [pdf, ps, other]
Title: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models I: The Task-Query Architecture
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[47]  arXiv:2512.08129 [pdf, ps, other]
Title: Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization
Subjects: Machine Learning (cs.LG)
[48]  arXiv:2512.08124 [pdf, ps, other]
Title: Long-only cryptocurrency portfolio management by ranking the assets: a neural network approach
Authors: Zijiang Yang
Journal-ref: 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 2025, pp. 1-8
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[49]  arXiv:2512.08121 [pdf, ps, other]
Title: Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Comments: 9 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[50]  arXiv:2512.08108 [pdf, ps, other]
Title: Scalable Offline Model-Based RL with Action Chunks
Comments: 22 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[51]  arXiv:2512.08093 [pdf, ps, other]
Title: Training LLMs for Honesty via Confessions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[52]  arXiv:2512.08091 [pdf, ps, other]
Title: Complexity of One-Dimensional ReLU DNNs
Comments: Presented at IEEE MIT URTC 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[53]  arXiv:2512.08077 [pdf, ps, other]
Title: Unveiling Latent Knowledge in Chemistry Language Models through Sparse Autoencoders
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[54]  arXiv:2512.08071 [pdf, ps, other]
Title: CAMO: Causality-Guided Adversarial Multimodal Domain Generalization for Crisis Classification
Subjects: Machine Learning (cs.LG)
[55]  arXiv:2512.08063 [pdf, ps, other]
Title: Deep Kernel Aalen-Johansen Estimator: An Interpretable and Flexible Neural Net Framework for Competing Risks
Comments: Machine Learning for Health (ML4H) 2025 Spotlight
Subjects: Machine Learning (cs.LG)
[56]  arXiv:2512.08061 [pdf, ps, other]
Title: LUNA: Linear Universal Neural Attention with Generalization Guarantees
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[57]  arXiv:2512.08029 [pdf, ps, other]
Title: CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2512.08012 [pdf, ps, other]
Title: Benchmarking Offline Multi-Objective Reinforcement Learning in Critical Care
Subjects: Machine Learning (cs.LG)
[59]  arXiv:2512.07992 [pdf, ps, other]
Title: Bridging the Clinical Expertise Gap: Development of a Web-Based Platform for Accessible Time Series Forecasting and Analysis
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE)
[60]  arXiv:2512.07988 [pdf, ps, other]
Title: HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability
Subjects: Machine Learning (cs.LG); Graphics (cs.GR); Human-Computer Interaction (cs.HC)
[61]  arXiv:2512.07981 [pdf, ps, other]
Title: CIP-Net: Continual Interpretable Prototype-based Network
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2512.07961 [pdf, ps, other]
Title: Towards symbolic regression for interpretable clinical decision scores
Comments: 15 pages, 5 figures. Accepted for publication in Philosophical Transactions A. Autor Accepted Manuscript version
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[63]  arXiv:2512.07885 [pdf, ps, other]
Title: ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking
Comments: 21 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[64]  arXiv:2512.07884 [pdf, ps, other]
Title: GSPN-2: Efficient Parallel Sequence Modeling
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2512.07880 [pdf, ps, other]
Title: Semi-Supervised Contrastive Learning with Orthonormal Prototypes
Authors: Huanran Li (1), Manh Nguyen (2), Daniel Pimentel-Alarcón (3) ((1) Department of Electrical Engineering, (2) Statistics, (3) Biostatistics, Wisconsin Institute of Discovery, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[66]  arXiv:2512.07879 [pdf, ps, other]
Title: Nonnegative Matrix Factorization through Cone Collapse
Authors: Manh Nguyen (1), Daniel Pimentel-Alarcón (2) ((1) Department of Statistics, (2) Department of Biostatistics and Medical Informatics, Wisconsin Institute of Discovery, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[67]  arXiv:2512.07878 [pdf, ps, other]
Title: Graph Contrastive Learning via Spectral Graph Alignment
Authors: Manh Nguyen, Joshua Cape (Department of Statistics, University of Wisconsin-Madison)
Subjects: Machine Learning (cs.LG)
[68]  arXiv:2512.07877 [pdf, ps, other]
Title: Artificial Intelligence-Driven Network-on-Chip Design Space Exploration: Neural Network Architectures for Design
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[69]  arXiv:2512.07876 [pdf, ps, other]
Title: Fourier-Enhanced Recurrent Neural Networks for Electrical Load Time Series Downscaling
Comments: Submitted to IEEE PES General Meeting 2026
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[70]  arXiv:2512.07875 [pdf, ps, other]
Title: Softly Symbolifying Kolmogorov-Arnold Networks
Comments: 13 pages, 5 figures, 3 tables
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[71]  arXiv:2512.07874 [pdf, ps, other]
Title: Controllable risk scenario generation from human crash data for autonomous vehicle testing
Subjects: Machine Learning (cs.LG)
[72]  arXiv:2512.07873 [pdf, ps, other]
Title: Advancing physiological time series reconstruction and imputation via mixture of receptive fields and experts fusion
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[73]  arXiv:2512.07868 [pdf, ps, other]
Title: Bayesian Optimization for Function-Valued Responses under Min-Max Criteria
Comments: 25 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[74]  arXiv:2512.07866 [pdf, ps, other]
Title: Command & Control (C2) Traffic Detection Via Algorithm Generated Domain (Dga) Classification Using Deep Learning And Natural Language Processing
Comments: Language: Portuguese
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[75]  arXiv:2512.07865 [pdf, ps, other]
Title: Using Text-Based Life Trajectories from Swedish Register Data to Predict Residential Mobility with Pretrained Transformers
Subjects: Machine Learning (cs.LG)
[76]  arXiv:2512.07864 [pdf, ps, other]
Title: Pattern Recognition of Ozone-Depleting Substance Exports in Global Trade Data
Subjects: Machine Learning (cs.LG); Econometrics (econ.EM); General Economics (econ.GN)
[77]  arXiv:2512.07863 [pdf, ps, other]
Title: SetAD: Semi-Supervised Anomaly Learning in Contextual Sets
Comments: 9 pages
Subjects: Machine Learning (cs.LG)
[78]  arXiv:2512.07858 [pdf, ps, other]
Title: FAIM: Frequency-Aware Interactive Mamba for Time Series Classification
Subjects: Machine Learning (cs.LG)
[79]  arXiv:2512.07857 [pdf, ps, other]
Title: SA^2GFM: Enhancing Robust Graph Foundation Models with Structure-Aware Semantic Augmentation
Subjects: Machine Learning (cs.LG)
[80]  arXiv:2512.07856 [pdf, ps, other]
Title: Medical Test-free Disease Detection Based on Big Data
Subjects: Machine Learning (cs.LG)
[81]  arXiv:2512.07855 [pdf, ps, other]
Title: LAPA: Log-Domain Prediction-Driven Dynamic Sparsity Accelerator for Transformer Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[82]  arXiv:2512.07854 [pdf, ps, other]
Title: HSTMixer: A Hierarchical MLP-Mixer for Large-Scale Traffic Forecasting
Comments: 10 pages, 9 figures
Subjects: Machine Learning (cs.LG)
[83]  arXiv:2512.07853 [pdf, ps, other]
Title: GPU Memory Prediction for Multimodal Model Training
Comments: 1st Workshop on Systems for Agentic AI (SAA '25), co-located with SOSP 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[84]  arXiv:2512.07850 [pdf, ps, other]
Title: SABER: Small Actions, Big Errors -- Safeguarding Mutating Steps in LLM Agents
Comments: submitted to ICLR2026
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[85]  arXiv:2512.07848 [pdf, ps, other]
Title: RaX-Crash: A Resource Efficient and Explainable Small Model Pipeline with an Application to City Scale Injury Severity Prediction
Subjects: Machine Learning (cs.LG)
[86]  arXiv:2512.07847 [pdf, ps, other]
Title: CarBench: A Comprehensive Benchmark for Neural Surrogates on High-Fidelity 3D Car Aerodynamics
Subjects: Machine Learning (cs.LG)
[87]  arXiv:2512.07844 [pdf, ps, other]
Title: Space Alignment Matters: The Missing Piece for Inducing Neural Collapse in Long-Tailed Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[88]  arXiv:2512.07843 [pdf, ps, other]
Title: ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[89]  arXiv:2512.08931 (cross-list from cs.CV) [pdf, ps, other]
Title: Astra: General Interactive World Model with Autoregressive Denoising
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[90]  arXiv:2512.08920 (cross-list from cs.RO) [pdf, ps, other]
Title: OSMO: Open-Source Tactile Glove for Human-to-Robot Skill Transfer
Comments: Project website: this https URL
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[91]  arXiv:2512.08882 (cross-list from cs.CR) [pdf, ps, other]
Title: Decentralized Trust for Space AI: Blockchain-Based Federated Learning Across Multi-Vendor LEO Satellite Networks
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[92]  arXiv:2512.08862 (cross-list from cs.CR) [pdf, ps, other]
Title: Secure and Privacy-Preserving Federated Learning for Next-Generation Underground Mine Safety
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[93]  arXiv:2512.08854 (cross-list from cs.CV) [pdf, ps, other]
Title: Generation is Required for Data-Efficient Perception
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94]  arXiv:2512.08819 (cross-list from cs.CL) [pdf, ps, other]
Title: Do Depth-Grown Models Overcome the Curse of Depth? An In-Depth Analysis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[95]  arXiv:2512.08810 (cross-list from cs.SE) [pdf, ps, other]
Title: Multicalibration for LLM-based Code Generation
Comments: Accepted at AI-SQE 2026 (The 1st International Workshop on AI for Software Quality Evaluation: Judgment, Metrics, Benchmarks, and Beyond)
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[96]  arXiv:2512.08809 (cross-list from cs.CR) [pdf, ps, other]
Title: PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration
Comments: Accepted at IEEE INFOCOM 2026 (full version)
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[97]  arXiv:2512.08733 (cross-list from cs.CV) [pdf, ps, other]
Title: Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[98]  arXiv:2512.08715 (cross-list from cs.PF) [pdf, ps, other]
Title: Multi-domain performance analysis with scores tailored to user preferences
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[99]  arXiv:2512.08705 (cross-list from eess.SY) [pdf, ps, other]
Title: Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
[100]  arXiv:2512.08667 (cross-list from eess.SY) [pdf, ps, other]
Title: Direct transfer of optimized controllers to similar systems using dimensionless MPC
Comments: 7 pages, 4 figures
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[101]  arXiv:2512.08659 (cross-list from cs.CL) [pdf, ps, other]
Title: An Agentic AI System for Multi-Framework Communication Coding
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[102]  arXiv:2512.08657 (cross-list from cs.SE) [pdf, ps, other]
Title: Reusability in MLOps: Leveraging Ports and Adapters to Build a Microservices Architecture for the Maritime Domain
Authors: Renato Cordeiro Ferreira (1,2,3,4), Aditya Dhinavahi (1,2), Rowanne Trapmann (1,3), Willem-Jan van den Heuvel (1,2,3) ((1) Jheronimus Academy of Data Science, (2) Technical University of Eindhoven, (3) Tilburg University, (4) University of São Paulo)
Comments: 7 pages, 3 figures (3 diagrams), submitted to ICSA 2026
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2512.08601 (cross-list from stat.ML) [pdf, ps, other]
Title: Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[104]  arXiv:2512.08577 (cross-list from cs.CV) [pdf, ps, other]
Title: Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[105]  arXiv:2512.08513 (cross-list from econ.EM) [pdf, ps, other]
Title: Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice
Authors: Masahiro Kato
Subjects: Econometrics (econ.EM); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[106]  arXiv:2512.08510 (cross-list from physics.bio-ph) [pdf, ps, other]
Title: Data-Efficient Learning of Anomalous Diffusion with Wavelet Representations: Enabling Direct Learning from Experimental Trajectories
Comments: 23 pages, 16 figures
Subjects: Biological Physics (physics.bio-ph); Soft Condensed Matter (cond-mat.soft); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an)
[107]  arXiv:2512.08508 (cross-list from q-bio.BM) [pdf, ps, other]
Title: Fused Gromov-Wasserstein Contrastive Learning for Effective Enzyme-Reaction Screening
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
[108]  arXiv:2512.08463 (cross-list from cs.AI) [pdf, ps, other]
Title: Using reinforcement learning to probe the role of feedback in skill acquisition
Comments: Website: this https URL
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[109]  arXiv:2512.08445 (cross-list from cs.CV) [pdf, ps, other]
Title: Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110]  arXiv:2512.08444 (cross-list from eess.IV) [pdf, ps, other]
Title: Learned iterative networks: An operator learning perspective
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Functional Analysis (math.FA); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[111]  arXiv:2512.08436 (cross-list from eess.SY) [pdf, ps, other]
Title: Beyond Wave Variables: A Data-Driven Ensemble Approach for Enhanced Teleoperation Transparency and Stability
Comments: 14 pages, 8 figures, 5 tables
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[112]  arXiv:2512.08365 (cross-list from cs.DC) [pdf, ps, other]
Title: Magneton: Optimizing Energy Efficiency of ML Systems via Differential Energy Debugging
Comments: 12 pages, 10 fi
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[113]  arXiv:2512.08360 (cross-list from cs.NE) [pdf, ps, other]
Title: Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata
Authors: Ali Sakour
Comments: 13 pages, 5 figures. Code available at: this https URL
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114]  arXiv:2512.08344 (cross-list from cs.AI) [pdf, ps, other]
Title: Enhancing Explainability of Graph Neural Networks Through Conceptual and Structural Analyses and Their Extensions
Authors: Tien Cuong Bui
Comments: 157 pages, Doctoral dissertation at Seoul National University (submitted in 2024.08 to SNU library, slightly updated in 2025.11 for open digital version)
Subjects: Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[115]  arXiv:2512.08341 (cross-list from cs.NI) [pdf, ps, other]
Title: Multi-Agent Deep Reinforcement Learning for Collaborative UAV Relay Networks under Jamming Atatcks
Comments: IEEE ICC 2026
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[116]  arXiv:2512.08340 (cross-list from cs.AI) [pdf, ps, other]
Title: Predicting California Bearing Ratio with Ensemble and Neural Network Models: A Case Study from Türkiye
Comments: Presented at the 13th International Symposium on Intelligent Manufacturing and Service Systems, Duzce, Turkey, Sep 25-27, 2025. Also available on Zenodo: DOI 10.5281/zenodo.17530868
Journal-ref: Proc. of the 13th Int. Symp. on Intelligent Manufacturing and Service Systems, pp. 563-570, 2025, ISBN 978-625-00-3472-9
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[117]  arXiv:2512.08329 (cross-list from cs.CV) [pdf, ps, other]
Title: Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models
Comments: 32 pages, 17 figures, 1 table, 5 algorithms, preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[118]  arXiv:2512.08327 (cross-list from cs.CV) [pdf, ps, other]
Title: Low Rank Support Quaternion Matrix Machine
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[119]  arXiv:2512.08309 (cross-list from cs.CV) [pdf, ps, other]
Title: Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation
Authors: Alexander Goslin
Comments: Project website: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[120]  arXiv:2512.08305 (cross-list from astro-ph.SR) [pdf, ps, other]
Title: Magnetic activity of ultracool dwarfs in the LAMOST DR11
Comments: 13 pages, 10 figures, accepted for publication in ApJ
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG)
[121]  arXiv:2512.08286 (cross-list from cs.SE) [pdf, ps, other]
Title: Empowering smart app development with SolidGPT: an edge-cloud hybrid AI agent framework
Journal-ref: Advances in Engineering Innovation 16.7 (2025): 86-92
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[122]  arXiv:2512.08281 (cross-list from cs.MA) [pdf, ps, other]
Title: Probabilistic Multi-Agent Aircraft Landing Time Prediction
Comments: 13 pages, 8 figures, accepted at AIAA SciTech 2026
Subjects: Multiagent Systems (cs.MA); Machine Learning (cs.LG)
[123]  arXiv:2512.08277 (cross-list from cs.SE) [pdf, ps, other]
Title: FedLAD: A Modular and Adaptive Testbed for Federated Log Anomaly Detection
Comments: Accepted Artifact at ACSOS 2025
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[124]  arXiv:2512.08271 (cross-list from cs.RO) [pdf, ps, other]
Title: Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation
Comments: Published and Presented at 3rd Workshop on Human-Centric Multilateral Teleoperation in ICRA 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[125]  arXiv:2512.08243 (cross-list from cs.CV) [pdf, ps, other]
Title: Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI
Authors: Saeeda Naz, Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering and Applied Sciences (UEAS), Swat, Pakistan)
Comments: 26 Pages, 10 Figures, 4 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[126]  arXiv:2512.08216 (cross-list from eess.IV) [pdf, ps, other]
Title: Tumor-anchored deep feature random forests for out-of-distribution detection in lung cancer segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[127]  arXiv:2512.08176 (cross-list from stat.ML) [pdf, ps, other]
Title: Worst-case generation via minimax optimization in Wasserstein space
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[128]  arXiv:2512.08138 (cross-list from cs.GT) [pdf, ps, other]
Title: Robust equilibria in continuous games: From strategic to dynamic robustness
Comments: 33 pages, 5 figures
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Optimization and Control (math.OC)
[129]  arXiv:2512.08132 (cross-list from cs.GT) [pdf, ps, other]
Title: Multi-agent learning under uncertainty: Recurrence vs. concentration
Comments: 44 pages, 17 figures
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Optimization and Control (math.OC)
[130]  arXiv:2512.08055 (cross-list from cs.SI) [pdf, ps, other]
Title: Fairness-aware PageRank via Edge Reweighting
Journal-ref: WSDM 2026
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[131]  arXiv:2512.08052 (cross-list from cs.RO) [pdf, ps, other]
Title: An Introduction to Deep Reinforcement and Imitation Learning
Authors: Pedro Santana
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[132]  arXiv:2512.08022 (cross-list from stat.ML) [pdf, ps, other]
Title: Provable Diffusion Posterior Sampling for Bayesian Inversion
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA); Probability (math.PR); Statistics Theory (math.ST)
[133]  arXiv:2512.08013 (cross-list from eess.SY) [pdf, ps, other]
Title: Learning Dynamics from Infrequent Output Measurements for Uncertainty-Aware Optimal Control
Comments: Submitted to the 2026 IFAC World Congress
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
[134]  arXiv:2512.07997 (cross-list from cs.HC) [pdf, ps, other]
Title: A Comparative Study of EMG- and IMU-based Gesture Recognition at the Wrist and Forearm
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[135]  arXiv:2512.07946 (cross-list from hep-th) [pdf, ps, other]
Title: Conformal Defects in Neural Network Field Theories
Comments: 23 pages, 1 figure
Subjects: High Energy Physics - Theory (hep-th); Machine Learning (cs.LG)
[136]  arXiv:2512.07926 (cross-list from cs.AI) [pdf, ps, other]
Title: Can AI autonomously build, operate, and use the entire data stack?
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG)
[137]  arXiv:2512.07890 (cross-list from cs.MA) [pdf, ps, other]
Title: CrowdLLM: Building LLM-Based Digital Populations Augmented with Generative Models
Subjects: Multiagent Systems (cs.MA); Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[138]  arXiv:2512.07888 (cross-list from stat.ML) [pdf, ps, other]
Title: Functional Random Forest with Adaptive Cost-Sensitive Splitting for Imbalanced Functional Data Classification
Comments: 23 pages, 4 figures
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
[139]  arXiv:2512.07860 (cross-list from q-fin.ST) [pdf, ps, other]
Title: Integrating LSTM Networks with Neural Levy Processes for Financial Forecasting
Subjects: Statistical Finance (q-fin.ST); Machine Learning (cs.LG)
[140]  arXiv:2512.07846 (cross-list from cs.IR) [pdf, ps, other]
Title: MixLM: High-Throughput and Effective LLM Ranking via Text-Embedding Mix-Interaction
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[141]  arXiv:2512.07838 (cross-list from cs.CV) [pdf, ps, other]
Title: Detection of Cyberbullying in GIF using AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[142]  arXiv:2512.07785 (cross-list from physics.data-an) [pdf, ps, other]
Title: Automating High Energy Physics Data Analysis with LLM-Powered Agents
Comments: 16 pages, 6 figures, 2 tables, the 39th Conference on Neural Information Processing Systems (NeurIPS 2025) - Machine Learning and the Physical Sciences (ML4PS) workshop (poster)
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex)
[143]  arXiv:2512.05791 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm
Comments: Submitted to Magnetic Resonance in Medicine
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Probability (math.PR)

Tue, 9 Dec 2025 (showing first 128 of 264 entries)

[144]  arXiv:2512.07828 [pdf, ps, other]
Title: The Adoption and Usage of AI Agents: Early Evidence from Perplexity
Subjects: Machine Learning (cs.LG); General Economics (econ.GN)
[145]  arXiv:2512.07818 [pdf, ps, other]
Title: Provable Long-Range Benefits of Next-Token Prediction
Comments: 66 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[146]  arXiv:2512.07805 [pdf, ps, other]
Title: Group Representational Position Encoding
Comments: Project Page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[147]  arXiv:2512.07782 [pdf, ps, other]
Title: GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory
Subjects: Machine Learning (cs.LG)
[148]  arXiv:2512.07766 [pdf, ps, other]
Title: Formalized Hopfield Networks and Boltzmann Machines
Subjects: Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[149]  arXiv:2512.07741 [pdf, ps, other]
Title: A multimodal Bayesian Network for symptom-level depression and anxiety prediction from voice and speech data
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[150]  arXiv:2512.07723 [pdf, ps, other]
Title: Enabling Delayed-Full Charging Through Transformer-Based Real-Time-to-Departure Modeling for EV Battery Longevity
Comments: 16 pages, 9 figures, AAAI'26 (accepted)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[151]  arXiv:2512.07705 [pdf, ps, other]
Title: In-Context and Few-Shots Learning for Forecasting Time Series Data based on Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[152]  arXiv:2512.07676 [pdf, ps, other]
Title: A Bootstrap Perspective on Stochastic Gradient Descent
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[153]  arXiv:2512.07667 [pdf, ps, other]
Title: Depth-Wise Activation Steering for Honest Language Models
Comments: See \url{this https URL}. for code and experiments
Subjects: Machine Learning (cs.LG)
[154]  arXiv:2512.07647 [pdf, ps, other]
Title: A Mathematical Theory of Top-$k$ Sparse Attention via Total Variation Distance
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[155]  arXiv:2512.07624 [pdf, ps, other]
Title: Time Series Foundation Models for Process Model Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[156]  arXiv:2512.07569 [pdf, ps, other]
Title: Weighted Contrastive Learning for Anomaly-Aware Time-Series Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[157]  arXiv:2512.07558 [pdf, ps, other]
Title: ReLaX: Reasoning with Latent Exploration for Large Reasoning Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2512.07542 [pdf, ps, other]
Title: RRAEDy: Adaptive Latent Linearization of Nonlinear Dynamical Systems
Subjects: Machine Learning (cs.LG)
[159]  arXiv:2512.07539 [pdf, ps, other]
Title: FRWKV:Frequency-Domain Linear Attention for Long-Term Time Series Forecasting
Subjects: Machine Learning (cs.LG)
[160]  arXiv:2512.07528 [pdf, ps, other]
Title: Model-Based Reinforcement Learning Under Confounding
Comments: 9 pages, 2 figures - decompressed draft
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[161]  arXiv:2512.07519 [pdf, ps, other]
Title: Machine Learning: Progress and Prospects
Comments: Inaugural Lecture. 18 pages, 13 figures, Published in 1997 by Royal Holloway, University of London, ISBN 0 900145 93 5
Subjects: Machine Learning (cs.LG)
[162]  arXiv:2512.07509 [pdf, ps, other]
Title: Exploring possible vector systems for faster training of neural networks with preconfigured latent spaces
Authors: Nikita Gabdullin
Comments: 9 pages, 5 figures, 1 table, 4 equations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.07490 [pdf, ps, other]
Title: Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[164]  arXiv:2512.07486 [pdf, ps, other]
Title: Materium: An Autoregressive Approach for Material Generation
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[165]  arXiv:2512.07463 [pdf, ps, other]
Title: Parallel Algorithms for Combined Regularized Support Vector Machines: Application in Music Genre Classification
Subjects: Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
[166]  arXiv:2512.07450 [pdf, ps, other]
Title: Forget and Explain: Transparent Verification of GNN Unlearning
Authors: Imran Ahsan (1), Hyunwook Yu (2), Jinsung Kim (2), Mucheol Kim (2) ((1) Department of Smart Cities, Chung-Ang University, (2) Department of Computer Science and Engineering, Chung-Ang University)
Comments: To appear in WSDM 2026 (ACM International Conference on Web Search and Data Mining). Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.07437 [pdf, ps, other]
Title: KAN-Dreamer: Benchmarking Kolmogorov-Arnold Networks as Function Approximators in World Models
Comments: 23 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[168]  arXiv:2512.07433 [pdf, ps, other]
Title: Mitigating Bias in Graph Hyperdimensional Computing
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[169]  arXiv:2512.07430 [pdf, ps, other]
Title: MIDG: Mixture of Invariant Experts with knowledge injection for Domain Generalization in Multimodal Sentiment Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[170]  arXiv:2512.07419 [pdf, ps, other]
Title: Revolutionizing Mixed Precision Quantization: Towards Training-free Automatic Proxy Discovery via Large Language Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.07417 [pdf, ps, other]
Title: Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[172]  arXiv:2512.07400 [pdf, ps, other]
Title: Asymptotic analysis of shallow and deep forgetting in replay with Neural Collapse
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[173]  arXiv:2512.07393 [pdf, other]
Title: Empirical Results for Adjusting Truncated Backpropagation Through Time while Training Neural Audio Effects
Authors: Yann Bourdin (ASTRAL), Pierrick Legrand (ASTRAL, ENSC, IMS), Fanny Roche
Journal-ref: 28th International Conference on Digital Audio Effects (DAFx25), Sep 2025, Ancona, Italy
Subjects: Machine Learning (cs.LG)
[174]  arXiv:2512.07390 [pdf, ps, other]
Title: Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood
Comments: Accepted to WACV 2026
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.07375 [pdf, ps, other]
Title: LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[176]  arXiv:2512.07374 [pdf, ps, other]
Title: Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[177]  arXiv:2512.07332 [pdf, ps, other]
Title: Local-Curvature-Aware Knowledge Graph Embedding: An Extended Ricci Flow Approach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[178]  arXiv:2512.07313 [pdf, ps, other]
Title: Learning-Augmented Ski Rental with Discrete Distributions: A Bayesian Approach
Comments: 7 pages
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[179]  arXiv:2512.07310 [pdf, ps, other]
Title: Towards a Relationship-Aware Transformer for Tabular Data
Subjects: Machine Learning (cs.LG)
[180]  arXiv:2512.07287 [pdf, ps, other]
Title: SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[181]  arXiv:2512.07249 [pdf, ps, other]
Title: IFFair: Influence Function-driven Sample Reweighting for Fair Classification
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[182]  arXiv:2512.07244 [pdf, ps, other]
Title: PINE: Pipeline for Important Node Exploration in Attributed Networks
Subjects: Machine Learning (cs.LG)
[183]  arXiv:2512.07222 [pdf, ps, other]
Title: Pay Less Attention to Function Words for Free Robustness of Vision-Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[184]  arXiv:2512.07208 [pdf, ps, other]
Title: Geometric Prior-Guided Federated Prompt Calibration
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[185]  arXiv:2512.07200 [pdf, ps, other]
Title: Less is More: Non-uniform Road Segments are Efficient for Bus Arrival Prediction
Subjects: Machine Learning (cs.LG)
[186]  arXiv:2512.07184 [pdf, ps, other]
Title: UniDiff: A Unified Diffusion Framework for Multimodal Time Series Forecasting
Subjects: Machine Learning (cs.LG)
[187]  arXiv:2512.07175 [pdf, ps, other]
Title: SPACE: Noise Contrastive Estimation Stabilizes Self-Play Fine-Tuning for Large Language Models
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG)
[188]  arXiv:2512.07173 [pdf, ps, other]
Title: Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration
Comments: 8 pages, 3 figures. Preprint under review
Subjects: Machine Learning (cs.LG)
[189]  arXiv:2512.07150 [pdf, ps, other]
Title: FlowLPS: Langevin-Proximal Sampling for Flow-based Inverse Problem Solvers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.07142 [pdf, ps, other]
Title: Winning the Lottery by Preserving Network Training Dynamics with Concrete Ticket Search
Comments: This work plans to be submitted to the IEEE for possible publication
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[191]  arXiv:2512.07113 [pdf, ps, other]
Title: PlantBiMoE: A Bidirectional Foundation Model with SparseMoE for Plant Genomes
Comments: 6 pages, 5 figures, accept to BIBM
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
[192]  arXiv:2512.07112 [pdf, ps, other]
Title: FOAM: Blocked State Folding for Memory-Efficient LLM Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[193]  arXiv:2512.07100 [pdf, ps, other]
Title: Dual Refinement Cycle Learning: Unsupervised Text Classification of Mamba and Community Detection on Text Attributed Graph
Subjects: Machine Learning (cs.LG)
[194]  arXiv:2512.07092 [pdf, ps, other]
Title: The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models
Authors: Zhixiang Wang
Comments: 10 pages, 3 figures, 1 table. Code and dataset available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[195]  arXiv:2512.07082 [pdf, ps, other]
Title: TRACE: A Generalizable Drift Detector for Streaming Data-Driven Optimization
Comments: Accepted by AAAI 2026
Subjects: Machine Learning (cs.LG)
[196]  arXiv:2512.07079 [pdf, ps, other]
Title: Procrustean Bed for AI-Driven Retrosynthesis: A Unified Framework for Reproducible Evaluation
Comments: 11 pages + 7 pages of SI. RetroCast is available on GitHub, see this https URL SynthArena is publicly available, see this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[197]  arXiv:2512.07064 [pdf, ps, other]
Title: Self-Supervised Learning on Molecular Graphs: A Systematic Investigation of Masking Design
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[198]  arXiv:2512.07040 [pdf, ps, other]
Title: Transformation of Biological Networks into Images via Semantic Cartography for Visual Interpretation and Scalable Deep Analysis
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.07021 [pdf, ps, other]
Title: Transferring Clinical Knowledge into ECGs Representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[200]  arXiv:2512.07011 [pdf, ps, other]
Title: Block Sparse Flash Attention
Comments: 10 pages, 5 figures. Code: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Performance (cs.PF)
[201]  arXiv:2512.07010 [pdf, ps, other]
Title: Always Keep Your Promises: DynamicLRP, A Model-Agnostic Solution To Layer-Wise Relevance Propagation
Comments: Work in progress, (12 pages manuscript, 6 figures, 6 tables, 3 pages references, 14 pages appendix)
Subjects: Machine Learning (cs.LG)
[202]  arXiv:2512.06993 [pdf, ps, other]
Title: Toward Reliable Machine Unlearning: Theory, Algorithms, and Evaluation
Subjects: Machine Learning (cs.LG)
[203]  arXiv:2512.06989 [pdf, ps, other]
Title: Flash Multi-Head Feed-Forward Network
Comments: 17 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[204]  arXiv:2512.06987 [pdf, ps, other]
Title: OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[205]  arXiv:2512.06982 [pdf, ps, other]
Title: LLM-Driven Composite Neural Architecture Search for Multi-Source RL State Encoding
Comments: NeurIPS 2025 Workshop on Bridging Language, Agent, and World Models for Reasoning and Planning
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[206]  arXiv:2512.06971 [pdf, ps, other]
Title: Prediction with Expert Advice under Local Differential Privacy
Comments: 19 pages, 3 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[207]  arXiv:2512.06969 [pdf, ps, other]
Title: Comparing BFGS and OGR for Second-Order Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[208]  arXiv:2512.06944 [pdf, ps, other]
Title: A Unifying Human-Centered AI Fairness Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[209]  arXiv:2512.06932 [pdf, ps, other]
Title: Hidden Leaks in Time Series Forecasting: How Data Leakage Affects LSTM Evaluation Across Configurations and Validation Strategies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[210]  arXiv:2512.06929 [pdf, ps, other]
Title: Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding
Authors: MinCheol Jeon
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[211]  arXiv:2512.06926 [pdf, ps, other]
Title: Evaluating the Sensitivity of BiLSTM Forecasting Models to Sequence Length and Input Noise
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[212]  arXiv:2512.06925 [pdf, ps, other]
Title: Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features
Authors: Aseer Al Faisal
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[213]  arXiv:2512.06920 [pdf, ps, other]
Title: Parent-Guided Semantic Reward Model (PGSRM): Embedding-Based Reward Functions for Reinforcement Learning of Transformer Language Models
Subjects: Machine Learning (cs.LG)
[214]  arXiv:2512.06917 [pdf, ps, other]
Title: Know your Trajectory -- Trustworthy Reinforcement Learning deployment through Importance-Based Trajectory Analysis
Comments: Accepted at 4th Deployable AI Workshop at AAAI 2026
Subjects: Machine Learning (cs.LG)
[215]  arXiv:2512.06837 [pdf, ps, other]
Title: Neural Factorization-based Bearing Fault Diagnosis
Subjects: Machine Learning (cs.LG)
[216]  arXiv:2512.06813 [pdf, ps, other]
Title: Partial Inverse Design of High-Performance Concrete Using Cooperative Neural Networks for Constraint-Aware Mix Generation
Comments: 19 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[217]  arXiv:2512.06791 [pdf, ps, other]
Title: Small-Gain Nash: Certified Contraction to Nash Equilibria in Differentiable Games
Authors: Vedansh Sharma
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC)
[218]  arXiv:2512.06785 [pdf, ps, other]
Title: Angular Regularization for Positive-Unlabeled Learning on the Hypersphere
Comments: Featured Certification, J2C Certification. Transactions on Machine Learning Research, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[219]  arXiv:2512.06782 [pdf, ps, other]
Title: Measuring Over-smoothing beyond Dirichlet energy
Authors: Weiqi Guan, Zihao Shi
Comments: 17 pages, 1 figure
Subjects: Machine Learning (cs.LG)
[220]  arXiv:2512.06758 [pdf, ps, other]
Title: Optimal Analysis for Bandit Learning in Matching Markets with Serial Dictatorship
Authors: Zilong Wang, Shuai Li
Journal-ref: Theor. Comput. Sci. 1010, C (Sep 2024)
Subjects: Machine Learning (cs.LG)
[221]  arXiv:2512.06752 [pdf, ps, other]
Title: Multi-Scale Protein Structure Modelling with Geometric Graph U-Nets
Comments: Presented at Machine Learning in Structural Biology, 2025. Open-source code: this https URL
Subjects: Machine Learning (cs.LG)
[222]  arXiv:2512.06737 [pdf, ps, other]
Title: Arc Gradient Descent: A Mathematically Derived Reformulation of Gradient Descent with Phase-Aware, User-Controlled Step Dynamics
Comments: 80 pages, 6 tables, 2 figures, 5 appendices, proof-of-concept
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[223]  arXiv:2512.06730 [pdf, ps, other]
Title: Enhancing Interpretability of AR-SSVEP-Based Motor Intention Recognition via CNN-BiLSTM and SHAP Analysis on EEG Data
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2512.06727 [pdf, ps, other]
Title: KV-CAR: KV Cache Compression using Autoencoders and KV Reuse in Large Language Models
Subjects: Machine Learning (cs.LG)
[225]  arXiv:2512.06725 [pdf, ps, other]
Title: Decoding Motor Behavior Using Deep Learning and Reservoir Computing
Authors: Tian Lan
Comments: 10 pages, 3 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[226]  arXiv:2512.06714 [pdf, ps, other]
Title: A Novel Deep Neural Network Architecture for Real-Time Water Demand Forecasting
Journal-ref: Journal of Hydrology 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[227]  arXiv:2512.06708 [pdf, ps, other]
Title: A Novel Multimodal RUL Framework for Remaining Useful Life Estimation with Layer-wise Explanations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[228]  arXiv:2512.06702 [pdf, ps, other]
Title: Pathway to $O(\sqrt{d})$ Complexity bound under Wasserstein metric of flow-based models
Subjects: Machine Learning (cs.LG)
[229]  arXiv:2512.06695 [pdf, ps, other]
Title: Mitigating Barren plateaus in quantum denoising diffusion probabilistic models
Comments: 22 pages, 9 figures
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[230]  arXiv:2512.06692 [pdf, ps, other]
Title: State Diversity Matters in Offline Behavior Distillation
Comments: 12 pages, 5 figures, 5 tables
Subjects: Machine Learning (cs.LG)
[231]  arXiv:2512.06678 [pdf, ps, other]
Title: GradientSpace: Unsupervised Data Clustering for Improved Instruction Tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[232]  arXiv:2512.06666 [pdf, ps, other]
Title: The Meta-Learning Gap: Combining Hydra and Quant for Large-Scale Time Series Classification
Authors: Urav Maniar
Comments: Link to the repository: this https URL
Subjects: Machine Learning (cs.LG)
[233]  arXiv:2512.06665 [pdf, ps, other]
Title: Rethinking Robustness: A New Approach to Evaluating Feature Attribution Methods
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[234]  arXiv:2512.06655 [pdf, ps, other]
Title: GSAE: Graph-Regularized Sparse Autoencoders for Robust LLM Safety Steering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[235]  arXiv:2512.06652 [pdf, ps, other]
Title: Adaptive Test-Time Training for Predicting Need for Invasive Mechanical Ventilation in Multi-Center Cohorts
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[236]  arXiv:2512.06649 [pdf, ps, other]
Title: Estimating Black Carbon Concentration from Urban Traffic Using Vision-Based Machine Learning
Comments: 12 pages, 16 figures, 4 tables, 4 pages Appendix, in submission and under review for ACM MobiSys 2026 as of December 6th, 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Emerging Technologies (cs.ET)
[237]  arXiv:2512.06648 [pdf, ps, other]
Title: Financial Fraud Identification and Interpretability Study for Listed Companies Based on Convolutional Neural Network
Authors: Xiao Li
Comments: in Chinese language
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[238]  arXiv:2512.06638 [pdf, ps, other]
Title: The Impact of Data Characteristics on GNN Evaluation for Detecting Fake News
Comments: Preprint. Approximately 15 pages, 5 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[239]  arXiv:2512.06630 [pdf, ps, other]
Title: Quantum Temporal Convolutional Neural Networks for Cross-Sectional Equity Return Prediction: A Comparative Benchmark Study
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[240]  arXiv:2512.06609 [pdf, ps, other]
Title: Vector Quantization using Gaussian Variational Autoencoder
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[241]  arXiv:2512.06607 [pdf, ps, other]
Title: A Fast and Effective Solution to the Problem of Look-ahead Bias in LLMs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[242]  arXiv:2512.06592 [pdf, ps, other]
Title: On fine-tuning Boltz-2 for protein-protein affinity prediction
Comments: MLSB 2025
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[243]  arXiv:2512.06582 [pdf, ps, other]
Title: QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling
Authors: Isaac Kofi Nti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[244]  arXiv:2512.06563 [pdf, ps, other]
Title: Deep Manifold Part 2: Neural Network Mathematics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[245]  arXiv:2512.06547 [pdf, ps, other]
Title: A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[246]  arXiv:2512.06533 [pdf, ps, other]
Title: Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[247]  arXiv:2512.06520 [pdf, ps, other]
Title: Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Comments: 17 pages, 12 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[248]  arXiv:2512.06511 [pdf, ps, other]
Title: Diagnosis-based mortality prediction for intensive care unit patients via transfer learning
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[249]  arXiv:2512.06490 [pdf, ps, other]
Title: Optimizing LLMs Using Quantization for Mobile Execution
Comments: 11 pages, 1 equation, 2 tables. Author Accepted Manuscript (AAM) of a paper published in Springer LNNS, ICT4SD 2025. DOI: 10.1007/978-3-032-06697-8_33
Journal-ref: Fong, S., Dey, N., Joshi, A. (eds) ICT Analysis and Applications. ICT4SD 2025. Lecture Notes in Networks and Systems, vol 1654. Springer, Cham
Subjects: Machine Learning (cs.LG)
[250]  arXiv:2512.06471 [pdf, ps, other]
Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
Comments: IFAC preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[251]  arXiv:2512.06457 [pdf, ps, other]
Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[252]  arXiv:2512.06440 [pdf, ps, other]
Title: Neural expressiveness for beyond importance model compression
Subjects: Machine Learning (cs.LG)
[253]  arXiv:2512.06427 [pdf, ps, other]
Title: A new initialisation to Control Gradients in Sinusoidal Neural network
Subjects: Machine Learning (cs.LG)
[254]  arXiv:2512.06417 [pdf, ps, other]
Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[255]  arXiv:2512.06392 [pdf, ps, other]
Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[256]  arXiv:2512.06370 [pdf, ps, other]
Title: Optimizing Optimizers for Fast Gradient-Based Learning
Comments: 49 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257]  arXiv:2512.06357 [pdf, ps, other]
Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction
Comments: Engineering Applications of Artificial Intelligence 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[258]  arXiv:2512.06356 [pdf, ps, other]
Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[259]  arXiv:2512.06351 [pdf, ps, other]
Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[260]  arXiv:2512.06347 [pdf, ps, other]
Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[261]  arXiv:2512.06343 [pdf, ps, other]
Title: When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[262]  arXiv:2512.06341 [pdf, ps, other]
Title: Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness
Authors: Ronald Katende
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Information Theory (cs.IT)
[263]  arXiv:2512.06303 [pdf, ps, other]
Title: Multimodal Graph Neural Networks for Prognostic Modeling of Brain Network Reorganization
Comments: 5 pages, 2 figures. IEEE conference-style format
Subjects: Machine Learning (cs.LG)
[264]  arXiv:2512.06301 [pdf, ps, other]
Title: Chemistry Integrated Language Model using Hierarchical Molecular Representation for Polymer Informatics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[265]  arXiv:2512.06297 [pdf, ps, other]
Title: Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks
Comments: Under Review
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[266]  arXiv:2512.06293 [pdf, ps, other]
Title: Importance-aware Topic Modeling for Discovering Public Transit Risk from Noisy Social Media
Subjects: Machine Learning (cs.LG)
[267]  arXiv:2512.06288 [pdf, ps, other]
Title: Theoretical Compression Bounds for Wide Multilayer Perceptrons
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[268]  arXiv:2512.06274 [pdf, ps, other]
Title: Networked Restless Multi-Arm Bandits with Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[269]  arXiv:2512.06252 [pdf, ps, other]
Title: Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
Comments: In Proceedings of the 4th Conference on Lifelong Learning Agents (CoLLAs)
Subjects: Machine Learning (cs.LG)
[270]  arXiv:2512.06250 [pdf, ps, other]
Title: Learning When to Switch: Adaptive Policy Selection via Reinforcement Learning
Authors: Chris Tava
Comments: 7 pages
Subjects: Machine Learning (cs.LG)
[271]  arXiv:2512.06244 [pdf, ps, other]
Title: Auto-exploration for online reinforcement learning
Comments: 35 pages (9 appendix), 1 figure. Comments are welcome
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[ total of 804 entries: 1-250 | 22-271 | 272-521 | 522-771 | 772-804 ]
[ showing 250 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)