We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 160

[ total of 804 entries: 1-100 | 61-160 | 161-260 | 261-360 | 361-460 | 461-560 | ... | 761-804 ]
[ showing 100 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 100 of 264 entries)

[161]  arXiv:2512.07519 [pdf, ps, other]
Title: Machine Learning: Progress and Prospects
Comments: Inaugural Lecture. 18 pages, 13 figures, Published in 1997 by Royal Holloway, University of London, ISBN 0 900145 93 5
Subjects: Machine Learning (cs.LG)
[162]  arXiv:2512.07509 [pdf, ps, other]
Title: Exploring possible vector systems for faster training of neural networks with preconfigured latent spaces
Authors: Nikita Gabdullin
Comments: 9 pages, 5 figures, 1 table, 4 equations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.07490 [pdf, ps, other]
Title: Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[164]  arXiv:2512.07486 [pdf, ps, other]
Title: Materium: An Autoregressive Approach for Material Generation
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[165]  arXiv:2512.07463 [pdf, ps, other]
Title: Parallel Algorithms for Combined Regularized Support Vector Machines: Application in Music Genre Classification
Subjects: Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
[166]  arXiv:2512.07450 [pdf, ps, other]
Title: Forget and Explain: Transparent Verification of GNN Unlearning
Authors: Imran Ahsan (1), Hyunwook Yu (2), Jinsung Kim (2), Mucheol Kim (2) ((1) Department of Smart Cities, Chung-Ang University, (2) Department of Computer Science and Engineering, Chung-Ang University)
Comments: To appear in WSDM 2026 (ACM International Conference on Web Search and Data Mining). Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.07437 [pdf, ps, other]
Title: KAN-Dreamer: Benchmarking Kolmogorov-Arnold Networks as Function Approximators in World Models
Comments: 23 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[168]  arXiv:2512.07433 [pdf, ps, other]
Title: Mitigating Bias in Graph Hyperdimensional Computing
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[169]  arXiv:2512.07430 [pdf, ps, other]
Title: MIDG: Mixture of Invariant Experts with knowledge injection for Domain Generalization in Multimodal Sentiment Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[170]  arXiv:2512.07419 [pdf, ps, other]
Title: Revolutionizing Mixed Precision Quantization: Towards Training-free Automatic Proxy Discovery via Large Language Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.07417 [pdf, ps, other]
Title: Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[172]  arXiv:2512.07400 [pdf, ps, other]
Title: Asymptotic analysis of shallow and deep forgetting in replay with Neural Collapse
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[173]  arXiv:2512.07393 [pdf, other]
Title: Empirical Results for Adjusting Truncated Backpropagation Through Time while Training Neural Audio Effects
Authors: Yann Bourdin (ASTRAL), Pierrick Legrand (ASTRAL, ENSC, IMS), Fanny Roche
Journal-ref: 28th International Conference on Digital Audio Effects (DAFx25), Sep 2025, Ancona, Italy
Subjects: Machine Learning (cs.LG)
[174]  arXiv:2512.07390 [pdf, ps, other]
Title: Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood
Comments: Accepted to WACV 2026
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.07375 [pdf, ps, other]
Title: LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[176]  arXiv:2512.07374 [pdf, ps, other]
Title: Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[177]  arXiv:2512.07332 [pdf, ps, other]
Title: Local-Curvature-Aware Knowledge Graph Embedding: An Extended Ricci Flow Approach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[178]  arXiv:2512.07313 [pdf, ps, other]
Title: Learning-Augmented Ski Rental with Discrete Distributions: A Bayesian Approach
Comments: 7 pages
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[179]  arXiv:2512.07310 [pdf, ps, other]
Title: Towards a Relationship-Aware Transformer for Tabular Data
Subjects: Machine Learning (cs.LG)
[180]  arXiv:2512.07287 [pdf, ps, other]
Title: SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[181]  arXiv:2512.07249 [pdf, ps, other]
Title: IFFair: Influence Function-driven Sample Reweighting for Fair Classification
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[182]  arXiv:2512.07244 [pdf, ps, other]
Title: PINE: Pipeline for Important Node Exploration in Attributed Networks
Subjects: Machine Learning (cs.LG)
[183]  arXiv:2512.07222 [pdf, ps, other]
Title: Pay Less Attention to Function Words for Free Robustness of Vision-Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[184]  arXiv:2512.07208 [pdf, ps, other]
Title: Geometric Prior-Guided Federated Prompt Calibration
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[185]  arXiv:2512.07200 [pdf, ps, other]
Title: Less is More: Non-uniform Road Segments are Efficient for Bus Arrival Prediction
Subjects: Machine Learning (cs.LG)
[186]  arXiv:2512.07184 [pdf, ps, other]
Title: UniDiff: A Unified Diffusion Framework for Multimodal Time Series Forecasting
Subjects: Machine Learning (cs.LG)
[187]  arXiv:2512.07175 [pdf, ps, other]
Title: SPACE: Noise Contrastive Estimation Stabilizes Self-Play Fine-Tuning for Large Language Models
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG)
[188]  arXiv:2512.07173 [pdf, ps, other]
Title: Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration
Comments: 8 pages, 3 figures. Preprint under review
Subjects: Machine Learning (cs.LG)
[189]  arXiv:2512.07150 [pdf, ps, other]
Title: FlowLPS: Langevin-Proximal Sampling for Flow-based Inverse Problem Solvers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.07142 [pdf, ps, other]
Title: Winning the Lottery by Preserving Network Training Dynamics with Concrete Ticket Search
Comments: This work plans to be submitted to the IEEE for possible publication
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[191]  arXiv:2512.07113 [pdf, ps, other]
Title: PlantBiMoE: A Bidirectional Foundation Model with SparseMoE for Plant Genomes
Comments: 6 pages, 5 figures, accept to BIBM
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
[192]  arXiv:2512.07112 [pdf, ps, other]
Title: FOAM: Blocked State Folding for Memory-Efficient LLM Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[193]  arXiv:2512.07100 [pdf, ps, other]
Title: Dual Refinement Cycle Learning: Unsupervised Text Classification of Mamba and Community Detection on Text Attributed Graph
Subjects: Machine Learning (cs.LG)
[194]  arXiv:2512.07092 [pdf, ps, other]
Title: The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models
Authors: Zhixiang Wang
Comments: 10 pages, 3 figures, 1 table. Code and dataset available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[195]  arXiv:2512.07082 [pdf, ps, other]
Title: TRACE: A Generalizable Drift Detector for Streaming Data-Driven Optimization
Comments: Accepted by AAAI 2026
Subjects: Machine Learning (cs.LG)
[196]  arXiv:2512.07079 [pdf, ps, other]
Title: Procrustean Bed for AI-Driven Retrosynthesis: A Unified Framework for Reproducible Evaluation
Comments: 11 pages + 7 pages of SI. RetroCast is available on GitHub, see this https URL SynthArena is publicly available, see this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[197]  arXiv:2512.07064 [pdf, ps, other]
Title: Self-Supervised Learning on Molecular Graphs: A Systematic Investigation of Masking Design
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[198]  arXiv:2512.07040 [pdf, ps, other]
Title: Transformation of Biological Networks into Images via Semantic Cartography for Visual Interpretation and Scalable Deep Analysis
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.07021 [pdf, ps, other]
Title: Transferring Clinical Knowledge into ECGs Representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[200]  arXiv:2512.07011 [pdf, ps, other]
Title: Block Sparse Flash Attention
Comments: 10 pages, 5 figures. Code: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Performance (cs.PF)
[201]  arXiv:2512.07010 [pdf, ps, other]
Title: Always Keep Your Promises: DynamicLRP, A Model-Agnostic Solution To Layer-Wise Relevance Propagation
Comments: Work in progress, (12 pages manuscript, 6 figures, 6 tables, 3 pages references, 14 pages appendix)
Subjects: Machine Learning (cs.LG)
[202]  arXiv:2512.06993 [pdf, ps, other]
Title: Toward Reliable Machine Unlearning: Theory, Algorithms, and Evaluation
Subjects: Machine Learning (cs.LG)
[203]  arXiv:2512.06989 [pdf, ps, other]
Title: Flash Multi-Head Feed-Forward Network
Comments: 17 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[204]  arXiv:2512.06987 [pdf, ps, other]
Title: OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[205]  arXiv:2512.06982 [pdf, ps, other]
Title: LLM-Driven Composite Neural Architecture Search for Multi-Source RL State Encoding
Comments: NeurIPS 2025 Workshop on Bridging Language, Agent, and World Models for Reasoning and Planning
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[206]  arXiv:2512.06971 [pdf, ps, other]
Title: Prediction with Expert Advice under Local Differential Privacy
Comments: 19 pages, 3 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[207]  arXiv:2512.06969 [pdf, ps, other]
Title: Comparing BFGS and OGR for Second-Order Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[208]  arXiv:2512.06944 [pdf, ps, other]
Title: A Unifying Human-Centered AI Fairness Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[209]  arXiv:2512.06932 [pdf, ps, other]
Title: Hidden Leaks in Time Series Forecasting: How Data Leakage Affects LSTM Evaluation Across Configurations and Validation Strategies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[210]  arXiv:2512.06929 [pdf, ps, other]
Title: Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding
Authors: MinCheol Jeon
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[211]  arXiv:2512.06926 [pdf, ps, other]
Title: Evaluating the Sensitivity of BiLSTM Forecasting Models to Sequence Length and Input Noise
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[212]  arXiv:2512.06925 [pdf, ps, other]
Title: Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features
Authors: Aseer Al Faisal
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[213]  arXiv:2512.06920 [pdf, ps, other]
Title: Parent-Guided Semantic Reward Model (PGSRM): Embedding-Based Reward Functions for Reinforcement Learning of Transformer Language Models
Subjects: Machine Learning (cs.LG)
[214]  arXiv:2512.06917 [pdf, ps, other]
Title: Know your Trajectory -- Trustworthy Reinforcement Learning deployment through Importance-Based Trajectory Analysis
Comments: Accepted at 4th Deployable AI Workshop at AAAI 2026
Subjects: Machine Learning (cs.LG)
[215]  arXiv:2512.06837 [pdf, ps, other]
Title: Neural Factorization-based Bearing Fault Diagnosis
Subjects: Machine Learning (cs.LG)
[216]  arXiv:2512.06813 [pdf, ps, other]
Title: Partial Inverse Design of High-Performance Concrete Using Cooperative Neural Networks for Constraint-Aware Mix Generation
Comments: 19 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[217]  arXiv:2512.06791 [pdf, ps, other]
Title: Small-Gain Nash: Certified Contraction to Nash Equilibria in Differentiable Games
Authors: Vedansh Sharma
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC)
[218]  arXiv:2512.06785 [pdf, ps, other]
Title: Angular Regularization for Positive-Unlabeled Learning on the Hypersphere
Comments: Featured Certification, J2C Certification. Transactions on Machine Learning Research, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[219]  arXiv:2512.06782 [pdf, ps, other]
Title: Measuring Over-smoothing beyond Dirichlet energy
Authors: Weiqi Guan, Zihao Shi
Comments: 17 pages, 1 figure
Subjects: Machine Learning (cs.LG)
[220]  arXiv:2512.06758 [pdf, ps, other]
Title: Optimal Analysis for Bandit Learning in Matching Markets with Serial Dictatorship
Authors: Zilong Wang, Shuai Li
Journal-ref: Theor. Comput. Sci. 1010, C (Sep 2024)
Subjects: Machine Learning (cs.LG)
[221]  arXiv:2512.06752 [pdf, ps, other]
Title: Multi-Scale Protein Structure Modelling with Geometric Graph U-Nets
Comments: Presented at Machine Learning in Structural Biology, 2025. Open-source code: this https URL
Subjects: Machine Learning (cs.LG)
[222]  arXiv:2512.06737 [pdf, ps, other]
Title: Arc Gradient Descent: A Mathematically Derived Reformulation of Gradient Descent with Phase-Aware, User-Controlled Step Dynamics
Comments: 80 pages, 6 tables, 2 figures, 5 appendices, proof-of-concept
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[223]  arXiv:2512.06730 [pdf, ps, other]
Title: Enhancing Interpretability of AR-SSVEP-Based Motor Intention Recognition via CNN-BiLSTM and SHAP Analysis on EEG Data
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2512.06727 [pdf, ps, other]
Title: KV-CAR: KV Cache Compression using Autoencoders and KV Reuse in Large Language Models
Subjects: Machine Learning (cs.LG)
[225]  arXiv:2512.06725 [pdf, ps, other]
Title: Decoding Motor Behavior Using Deep Learning and Reservoir Computing
Authors: Tian Lan
Comments: 10 pages, 3 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[226]  arXiv:2512.06714 [pdf, ps, other]
Title: A Novel Deep Neural Network Architecture for Real-Time Water Demand Forecasting
Journal-ref: Journal of Hydrology 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[227]  arXiv:2512.06708 [pdf, ps, other]
Title: A Novel Multimodal RUL Framework for Remaining Useful Life Estimation with Layer-wise Explanations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[228]  arXiv:2512.06702 [pdf, ps, other]
Title: Pathway to $O(\sqrt{d})$ Complexity bound under Wasserstein metric of flow-based models
Subjects: Machine Learning (cs.LG)
[229]  arXiv:2512.06695 [pdf, ps, other]
Title: Mitigating Barren plateaus in quantum denoising diffusion probabilistic models
Comments: 22 pages, 9 figures
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[230]  arXiv:2512.06692 [pdf, ps, other]
Title: State Diversity Matters in Offline Behavior Distillation
Comments: 12 pages, 5 figures, 5 tables
Subjects: Machine Learning (cs.LG)
[231]  arXiv:2512.06678 [pdf, ps, other]
Title: GradientSpace: Unsupervised Data Clustering for Improved Instruction Tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[232]  arXiv:2512.06666 [pdf, ps, other]
Title: The Meta-Learning Gap: Combining Hydra and Quant for Large-Scale Time Series Classification
Authors: Urav Maniar
Comments: Link to the repository: this https URL
Subjects: Machine Learning (cs.LG)
[233]  arXiv:2512.06665 [pdf, ps, other]
Title: Rethinking Robustness: A New Approach to Evaluating Feature Attribution Methods
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[234]  arXiv:2512.06655 [pdf, ps, other]
Title: GSAE: Graph-Regularized Sparse Autoencoders for Robust LLM Safety Steering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[235]  arXiv:2512.06652 [pdf, ps, other]
Title: Adaptive Test-Time Training for Predicting Need for Invasive Mechanical Ventilation in Multi-Center Cohorts
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[236]  arXiv:2512.06649 [pdf, ps, other]
Title: Estimating Black Carbon Concentration from Urban Traffic Using Vision-Based Machine Learning
Comments: 12 pages, 16 figures, 4 tables, 4 pages Appendix, in submission and under review for ACM MobiSys 2026 as of December 6th, 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Emerging Technologies (cs.ET)
[237]  arXiv:2512.06648 [pdf, ps, other]
Title: Financial Fraud Identification and Interpretability Study for Listed Companies Based on Convolutional Neural Network
Authors: Xiao Li
Comments: in Chinese language
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[238]  arXiv:2512.06638 [pdf, ps, other]
Title: The Impact of Data Characteristics on GNN Evaluation for Detecting Fake News
Comments: Preprint. Approximately 15 pages, 5 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[239]  arXiv:2512.06630 [pdf, ps, other]
Title: Quantum Temporal Convolutional Neural Networks for Cross-Sectional Equity Return Prediction: A Comparative Benchmark Study
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[240]  arXiv:2512.06609 [pdf, ps, other]
Title: Vector Quantization using Gaussian Variational Autoencoder
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[241]  arXiv:2512.06607 [pdf, ps, other]
Title: A Fast and Effective Solution to the Problem of Look-ahead Bias in LLMs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[242]  arXiv:2512.06592 [pdf, ps, other]
Title: On fine-tuning Boltz-2 for protein-protein affinity prediction
Comments: MLSB 2025
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[243]  arXiv:2512.06582 [pdf, ps, other]
Title: QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling
Authors: Isaac Kofi Nti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[244]  arXiv:2512.06563 [pdf, ps, other]
Title: Deep Manifold Part 2: Neural Network Mathematics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[245]  arXiv:2512.06547 [pdf, ps, other]
Title: A-3PO: Accelerating Asynchronous LLM Training with Staleness-aware Proximal Policy Approximation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[246]  arXiv:2512.06533 [pdf, ps, other]
Title: Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[247]  arXiv:2512.06520 [pdf, ps, other]
Title: Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Comments: 17 pages, 12 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[248]  arXiv:2512.06511 [pdf, ps, other]
Title: Diagnosis-based mortality prediction for intensive care unit patients via transfer learning
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[249]  arXiv:2512.06490 [pdf, ps, other]
Title: Optimizing LLMs Using Quantization for Mobile Execution
Comments: 11 pages, 1 equation, 2 tables. Author Accepted Manuscript (AAM) of a paper published in Springer LNNS, ICT4SD 2025. DOI: 10.1007/978-3-032-06697-8_33
Journal-ref: Fong, S., Dey, N., Joshi, A. (eds) ICT Analysis and Applications. ICT4SD 2025. Lecture Notes in Networks and Systems, vol 1654. Springer, Cham
Subjects: Machine Learning (cs.LG)
[250]  arXiv:2512.06471 [pdf, ps, other]
Title: Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
Comments: IFAC preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[251]  arXiv:2512.06457 [pdf, ps, other]
Title: BitStopper: An Efficient Transformer Attention Accelerator via Stage-fusion and Early Termination
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[252]  arXiv:2512.06440 [pdf, ps, other]
Title: Neural expressiveness for beyond importance model compression
Subjects: Machine Learning (cs.LG)
[253]  arXiv:2512.06427 [pdf, ps, other]
Title: A new initialisation to Control Gradients in Sinusoidal Neural network
Subjects: Machine Learning (cs.LG)
[254]  arXiv:2512.06417 [pdf, ps, other]
Title: Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
Authors: Yifan Sun (1), Lei Cheng (1), Jianlong Li (1), Peter Gerstoft (2) ((1) College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China, (2) Scripps Institution of Oceanography, University of California San Diego, La Jolla, USA)
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[255]  arXiv:2512.06392 [pdf, ps, other]
Title: RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[256]  arXiv:2512.06370 [pdf, ps, other]
Title: Optimizing Optimizers for Fast Gradient-Based Learning
Comments: 49 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257]  arXiv:2512.06357 [pdf, ps, other]
Title: Proportional integral derivative booster for neural networks-based time-series prediction: Case of water demand prediction
Comments: Engineering Applications of Artificial Intelligence 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[258]  arXiv:2512.06356 [pdf, ps, other]
Title: DDFI: Diverse and Distribution-aware Missing Feature Imputation via Two-step Reconstruction
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[259]  arXiv:2512.06351 [pdf, ps, other]
Title: LLM-Upgraded Graph Reinforcement Learning for Carbon-Aware Job Scheduling in Smart Manufacturing
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[260]  arXiv:2512.06347 [pdf, ps, other]
Title: Zero Generalization Error Theorem for Random Interpolators via Algebraic Geometry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[ total of 804 entries: 1-100 | 61-160 | 161-260 | 261-360 | 361-460 | 461-560 | ... | 761-804 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)