We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 91

[ total of 866 entries: 1-100 | 92-191 | 192-291 | 292-391 | 392-491 | ... | 792-866 ]
[ showing 100 entries per page: fewer | more | all ]

Mon, 8 Dec 2025 (continued, showing last 22 of 113 entries)

[92]  arXiv:2512.05438 (cross-list from cs.HC) [pdf, ps, other]
Title: EXR: An Interactive Immersive EHR Visualization in Extended Reality
Comments: 11 pages, 6 figures. Preprint version. This paper has been accepted to IEEE ICIR 2025. This is the author-prepared version and not the final published version. The final version will appear in IEEE Xplo
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[93]  arXiv:2512.05430 (cross-list from cs.CL) [pdf, ps, other]
Title: ArtistMus: A Globally Diverse, Artist-Centric Benchmark for Retrieval-Augmented Music Question Answering
Comments: Submitted to LREC 2026. This work is an evolution of our earlier preprint arXiv:2507.23334
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[94]  arXiv:2512.05415 (cross-list from cs.CV) [pdf, ps, other]
Title: Moving object detection from multi-depth images with an attention-enhanced CNN
Comments: 14 pages, 22 figures, submitted to PASJ
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[95]  arXiv:2512.05362 (cross-list from cs.CV) [pdf, ps, other]
Title: PoolNet: Deep Learning for 2D to 3D Video Process Validation
Comments: All code related to this paper can be found at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[96]  arXiv:2512.05361 (cross-list from physics.optics) [pdf, ps, other]
Title: FieldSeer I: Physics-Guided World Models for Long-Horizon Electromagnetic Dynamics under Partial Observability
Subjects: Optics (physics.optics); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[97]  arXiv:2512.05337 (cross-list from stat.ML) [pdf, ps, other]
Title: Symmetric Linear Dynamical Systems are Learnable from Few Observations
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
[98]  arXiv:2512.05331 (cross-list from cs.CL) [pdf, ps, other]
Title: Exposing Pink Slime Journalism: Linguistic Signatures and Robust Detection Against LLM-Generated Threats
Comments: Published in RANLP 2025
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[99]  arXiv:2512.05325 (cross-list from cs.CL) [pdf, ps, other]
Title: LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[100]  arXiv:2512.05318 (cross-list from cs.CL) [pdf, ps, other]
Title: To Think or Not to Think: The Hidden Cost of Meta-Training with Excessive CoT Examples
Comments: 26 pages, 45 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[101]  arXiv:2512.05288 (cross-list from cs.CR) [pdf, ps, other]
Title: Beyond Detection: A Comprehensive Benchmark and Study on Representation Learning for Fine-Grained Webshell Family Classification
Authors: Feijiang Han
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[102]  arXiv:2512.05271 (cross-list from cs.GT) [pdf, ps, other]
Title: Robust forecast aggregation via additional queries
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[103]  arXiv:2512.05267 (cross-list from cs.IT) [pdf, ps, other]
Title: Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2512.05251 (cross-list from stat.ML) [pdf, ps, other]
Title: One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[105]  arXiv:2512.05245 (cross-list from q-bio.BM) [pdf, ps, other]
Title: STAR-GO: Improving Protein Function Prediction by Learning to Hierarchically Integrate Ontology-Informed Semantic Embeddings
Comments: 14 pages, 2 figures, 6 tables
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
[106]  arXiv:2512.05207 (cross-list from cs.NI) [pdf, ps, other]
Title: Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem
Comments: Submitted to IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN) 2026
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[107]  arXiv:2512.05198 (cross-list from cs.CV) [pdf, ps, other]
Title: Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models
Comments: 16 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[108]  arXiv:2512.05162 (cross-list from stat.ML) [pdf, ps, other]
Title: How to Tame Your LLM: Semantic Collapse in Continuous Systems
Authors: C. M. Wyss
Comments: 35 pages, 1 figure. Exolytica AI Technical Report XTR-2025-01
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Dynamical Systems (math.DS); Probability (math.PR)
[109]  arXiv:2512.05158 (cross-list from math.DS) [pdf, ps, other]
Title: Continuous-Time Homeostatic Dynamics for Reentrant Inference Models
Authors: Byung Gyu Chae
Comments: 13 pages, 4 figures
Subjects: Dynamical Systems (math.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[110]  arXiv:2512.05156 (cross-list from cs.AI) [pdf, ps, other]
Title: Semantic Faithfulness and Entropy Production Measures to Tame Your LLM Demons and Manage Hallucinations
Authors: Igor Halperin
Comments: 23 pages, 6 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (cs.LG); Computational Finance (q-fin.CP)
[111]  arXiv:2512.05139 (cross-list from cs.CV) [pdf, ps, other]
Title: Spatiotemporal Satellite Image Downscaling with Transfer Encoders and Autoregressive Generative Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[112]  arXiv:2512.05134 (cross-list from cs.CV) [pdf, ps, other]
Title: InvarDiff: Cross-Scale Invariance Caching for Accelerated Diffusion Models
Authors: Zihao Wu
Comments: 8 pages main, 8 pages appendix, 16 figures, 5 tables. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[113]  arXiv:2512.05127 (cross-list from physics.optics) [pdf, ps, other]
Title: Bayesian Optimization and Convolutional Neural Networks for Zernike-Based Wavefront Correction in High Harmonic Generation
Subjects: Optics (physics.optics); Machine Learning (cs.LG)

Fri, 5 Dec 2025 (showing first 78 of 116 entries)

[114]  arXiv:2512.05117 [pdf, ps, other]
Title: The Universal Weight Subspace Hypothesis
Comments: 37 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.05116 [pdf, ps, other]
Title: Value Gradient Guidance for Flow Matching Alignment
Comments: Accepted at NeurIPS 2025; 26 pages, 20 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.05114 [pdf, ps, other]
Title: Deep infant brain segmentation from multi-contrast MRI
Comments: 8 pages, 8 figures, 1 table, website at this https URL, presented at the 2025 IEEE Asilomar Conference on Signals, Systems, and Computers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[117]  arXiv:2512.05103 [pdf, ps, other]
Title: TV2TV: A Unified Framework for Interleaved Language and Video Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.05089 [pdf, ps, other]
Title: The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception
Authors: Eduardo Di Santi
Comments: 35 pages, 6 figures. This preprint develops a deterministic functional-topological framework showing that physical systems generate compact perceptual manifolds with finite radius. We provide theory, Monte-Carlo estimators, and validation across PM, battery, and ECG domains, unifying biological perception and self-supervised AI
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[119]  arXiv:2512.05084 [pdf, ps, other]
Title: Gradient Descent with Provably Tuned Learning-rate Schedules
Authors: Dravyansh Sharma
Subjects: Machine Learning (cs.LG)
[120]  arXiv:2512.05080 [pdf, ps, other]
Title: OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design
Comments: Presented at the Machine Learning for Structural Biology Workshop, 2025
Subjects: Machine Learning (cs.LG)
[121]  arXiv:2512.05073 [pdf, ps, other]
Title: David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Software Engineering (cs.SE)
[122]  arXiv:2512.05069 [pdf, ps, other]
Title: Hybrid Quantum-Classical Autoencoders for Unsupervised Network Intrusion Detection
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Quantum Physics (quant-ph)
[123]  arXiv:2512.05066 [pdf, ps, other]
Title: Multi-LLM Collaboration for Medication Recommendation
Comments: 8 pages, 5 figures, 1 table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[124]  arXiv:2512.05038 [pdf, ps, other]
Title: SuperActivators: Only the Tail of the Distribution Contains Reliable Concept Signals
Subjects: Machine Learning (cs.LG)
[125]  arXiv:2512.05030 [pdf, ps, other]
Title: Dual-Path Region-Guided Attention Network for Ground Reaction Force and Moment Regression
Authors: Xuan Li, Samuel Bello
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[126]  arXiv:2512.04974 [pdf, ps, other]
Title: Efficient Generative Transformer Operators For Million-Point PDEs
Subjects: Machine Learning (cs.LG)
[127]  arXiv:2512.04958 [pdf, ps, other]
Title: Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[128]  arXiv:2512.04954 [pdf, ps, other]
Title: Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows
Authors: Rajneil Baruah
Comments: 14 pages, 8 figures
Subjects: Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); High Energy Physics - Phenomenology (hep-ph); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[129]  arXiv:2512.04949 [pdf, ps, other]
Title: CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent
Comments: 10 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[130]  arXiv:2512.04918 [pdf, ps, other]
Title: Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty
Subjects: Machine Learning (cs.LG)
[131]  arXiv:2512.04912 [pdf, ps, other]
Title: A result relating convex n-widths to covering numbers with some applications to neural networks
Subjects: Machine Learning (cs.LG)
[132]  arXiv:2512.04763 [pdf, ps, other]
Title: MemLoRA: Distilling Expert Adapters for On-Device Memory Systems
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.04752 [pdf, ps, other]
Title: RLHFSpec: Breaking the Efficiency Bottleneck in RLHF Training via Adaptive Drafting
Subjects: Machine Learning (cs.LG)
[134]  arXiv:2512.04747 [pdf, ps, other]
Title: A Tutorial on Regression Analysis: From Linear Models to Deep Learning -- Lecture Notes on Artificial Intelligence
Subjects: Machine Learning (cs.LG)
[135]  arXiv:2512.04703 [pdf, ps, other]
Title: Towards Continuous-Time Approximations for Stochastic Gradient Descent without Replacement
Authors: Stefan Perko
Subjects: Machine Learning (cs.LG); Probability (math.PR)
[136]  arXiv:2512.04695 [pdf, ps, other]
Title: TRINITY: An Evolved LLM Coordinator
Subjects: Machine Learning (cs.LG)
[137]  arXiv:2512.04694 [pdf, ps, other]
Title: TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[138]  arXiv:2512.04644 [pdf, ps, other]
Title: Contract-Governed Training for Earth Observation: Observed Service Agreement Graphs and Coverage-Accuracy Trade-offs
Authors: Wenzhang Du
Comments: 9 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[139]  arXiv:2512.04635 [pdf, ps, other]
Title: Federated Learning for Anomaly Detection in Maritime Movement Data
Comments: Accepted at MDM2024
Subjects: Machine Learning (cs.LG)
[140]  arXiv:2512.04625 [pdf, ps, other]
Title: Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective
Comments: Accepted to IEEE TNNLS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2512.04617 [pdf, ps, other]
Title: Score Matching for Estimating Finite Point Processes
Subjects: Machine Learning (cs.LG)
[142]  arXiv:2512.04601 [pdf, ps, other]
Title: Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
Comments: 22 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[143]  arXiv:2512.04596 [pdf, ps, other]
Title: QoSDiff: An Implicit Topological Embedding Learning Framework Leveraging Denoising Diffusion and Adversarial Attention for Robust QoS Prediction
Comments: Preprint submitted to IEEE Transactions on Services Computing
Subjects: Machine Learning (cs.LG)
[144]  arXiv:2512.04590 [pdf, ps, other]
Title: Exploiting \texttt{ftrace}'s \texttt{function\_graph} Tracer Features for Machine Learning: A Case Study on Encryption Detection
Comments: Conference paper presented at AICCSA 2025
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[145]  arXiv:2512.04571 [pdf, ps, other]
Title: Temp-SCONE: A Novel Out-of-Distribution Detection and Domain Generalization Framework for Wild Data with Temporal Shift
Comments: 22 pages, 12 figures, 72 subfigures, 6 tables
Subjects: Machine Learning (cs.LG)
[146]  arXiv:2512.04566 [pdf, ps, other]
Title: Reliable Statistical Guarantees for Conformal Predictors with Small Datasets
Subjects: Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[147]  arXiv:2512.04562 [pdf, ps, other]
Title: LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models
Comments: 46 pages, 17 figures, 16 tables
Subjects: Machine Learning (cs.LG)
[148]  arXiv:2512.04559 [pdf, ps, other]
Title: Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Comments: 36 pages, 21 figures, 4 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[149]  arXiv:2512.04558 [pdf, ps, other]
Title: On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
Comments: 45 pages, 6 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[150]  arXiv:2512.04530 [pdf, ps, other]
Title: Explainable Graph Representation Learning via Graph Pattern Analysis
Comments: Full version with appendix of the paper published in the Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI-25), Main Track
Journal-ref: Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI-25), Main Track, pages 3426-3434, 2025
Subjects: Machine Learning (cs.LG)
[151]  arXiv:2512.04524 [pdf, ps, other]
Title: Prototype-Based Semantic Consistency Alignment for Domain Adaptive Retrieval
Comments: This paper was accepted by AAAI2026 main tech track not long ago. This is an expanded version with an appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[152]  arXiv:2512.04476 [pdf, ps, other]
Title: Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[153]  arXiv:2512.04475 [pdf, ps, other]
Title: GraphBench: Next-generation graph learning benchmarking
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[154]  arXiv:2512.04464 [pdf, ps, other]
Title: Feature Engineering vs. Deep Learning for Automated Coin Grading: A Comparative Study on Saint-Gaudens Double Eagles
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.04388 [pdf, ps, other]
Title: Learning to Orchestrate Agents in Natural Language with the Conductor
Subjects: Machine Learning (cs.LG)
[156]  arXiv:2512.04385 [pdf, ps, other]
Title: STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.04354 [pdf, ps, other]
Title: SmartAlert: Implementing Machine Learning-Driven Clinical Decision Support for Inpatient Lab Utilization Reduction
Authors: April S. Liang (1), Fatemeh Amrollahi (2), Yixing Jiang (2), Conor K. Corbin (3), Grace Y.E. Kim (4), David Mui (5), Trevor Crowell (6), Aakash Acharya (7), Sreedevi Mony (7), Soumya Punnathanam (7), Jack McKeown (7), Margaret Smith (6 and 8), Steven Lin (6 and 8), Arnold Milstein (8 and 9), Kevin Schulman (1 and 9), Jason Hom (1), Michael A. Pfeffer (1 and 7), Tho D. Pham (10), David Svec (1 and 11), Weihan Chu (1 and 11), Lisa Shieh (1), Christopher Sharp (8), Stephen P. Ma (1), Jonathan H. Chen (1, 2, 9, and 12) ((1) Division of Hospital Medicine, Department of Medicine, Stanford University School of Medicine, Palo Alto, CA, (2) Department of Biomedical Data Science, Stanford University, Palo Alto, CA, (3) SmarterDx, New York, NY, (4) The Johns Hopkins University School of Medicine, Baltimore, MD, (5) Department of Medicine, Stanford University School of Medicine, Palo Alto, CA, (6) Stanford Healthcare AI Applied Research Team, Stanford University School of Medicine, Palo Alto, CA, (7) Technology and Digital Solutions, Stanford Health Care, Palo Alto, CA, (8) Division of Primary Care and Population Health, Department of Medicine, Stanford University School of Medicine, Palo Alto, CA, (9) Clinical Excellence Research Center, Stanford University, Palo Alto, CA, (10) Department of Pathology, Stanford University School of Medicine, Palo Alto, CA, (11) Stanford Health Care Tri-Valley, Pleasanton, CA, (12) Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA)
Comments: 22 pages, 5 figures
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)
[158]  arXiv:2512.04351 [pdf, ps, other]
Title: Distance Is All You Need: Radial Dispersion for Uncertainty Estimation in Large Language Models
Subjects: Machine Learning (cs.LG)
[159]  arXiv:2512.04341 [pdf, ps, other]
Title: Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism
Comments: Preprint (52 pages, 15 figures)
Subjects: Machine Learning (cs.LG)
[160]  arXiv:2512.04333 [pdf, ps, other]
Title: RGE-GCN: Recursive Gene Elimination with Graph Convolutional Networks for RNA-seq based Early Cancer Detection
Comments: 12 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[161]  arXiv:2512.04332 [pdf, ps, other]
Title: Data-regularized Reinforcement Learning for Diffusion Models at Scale
Subjects: Machine Learning (cs.LG)
[162]  arXiv:2512.04310 [pdf, ps, other]
Title: RNNs perform task computations by dynamically warping neural representations
Comments: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Differential Geometry (math.DG); Dynamical Systems (math.DS); Neurons and Cognition (q-bio.NC)
[163]  arXiv:2512.04307 [pdf, ps, other]
Title: Evaluating Long-Context Reasoning in LLM-Based WebAgents
Comments: Accepted NeurIPS 25 LAW Workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[164]  arXiv:2512.04299 [pdf, ps, other]
Title: When do spectral gradient updates help in deep learning?
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[165]  arXiv:2512.04296 [pdf, ps, other]
Title: GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers
Comments: Under Review
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[166]  arXiv:2512.04277 [pdf, ps, other]
Title: Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.04268 [pdf, ps, other]
Title: The Initialization Determines Whether In-Context Learning Is Gradient Descent
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[168]  arXiv:2512.04264 [pdf, ps, other]
Title: Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2512.04252 [pdf, ps, other]
Title: Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
Authors: Baichuan Zeng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[170]  arXiv:2512.04223 [pdf, ps, other]
Title: ActVAE: Modelling human activity schedules with a deep conditional generative approach
Subjects: Machine Learning (cs.LG)
[171]  arXiv:2512.04198 [pdf, ps, other]
Title: Network of Theseus (like the ship)
Comments: Preprint. 24 pages, 9 figures, 8 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172]  arXiv:2512.04189 [pdf, ps, other]
Title: BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI)
[173]  arXiv:2512.04165 [pdf, ps, other]
Title: Mitigating the Curse of Detail: Scaling Arguments for Feature Learning and Sample Complexity
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[174]  arXiv:2512.04138 [pdf, ps, other]
Title: MechDetect: Detecting Data-Dependent Errors
Comments: International Conference on Data Science and Intelligent Systems (DSIS 2025)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Information Retrieval (cs.IR)
[175]  arXiv:2512.04135 [pdf, ps, other]
Title: Decoding Large Language Diffusion Models with Foreseeing Movement
Subjects: Machine Learning (cs.LG)
[176]  arXiv:2512.04125 [pdf, ps, other]
Title: ASCIIBench: Evaluating Language-Model-Based Understanding of Visually-Oriented Text
Comments: Accepted to The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025): LLM Evaluation Workshop & Multimodal Algorithmic Reasoning Workshop
Subjects: Machine Learning (cs.LG)
[177]  arXiv:2512.05112 (cross-list from cs.CV) [pdf, ps, other]
Title: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[178]  arXiv:2512.05106 (cross-list from cs.CV) [pdf, ps, other]
Title: NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[179]  arXiv:2512.05105 (cross-list from cs.CL) [pdf, ps, other]
Title: Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[180]  arXiv:2512.05100 (cross-list from cs.CL) [pdf, ps, other]
Title: Structured Document Translation via Format Reinforcement Learning
Comments: IJCNLP-AACL 2025 Main (Oral)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[181]  arXiv:2512.05092 (cross-list from stat.ML) [pdf, ps, other]
Title: Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[182]  arXiv:2512.05070 (cross-list from stat.ML) [pdf, ps, other]
Title: Control Consistency Losses for Diffusion Bridges
Comments: Frontiers in Probabilistic Inference: Sampling Meets Learning Workshop at NeurIPS 2025 (Oral)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[183]  arXiv:2512.05058 (cross-list from quant-ph) [pdf, ps, other]
Title: Meta-Learning for Quantum Optimization via Quantum Sequence Model
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[184]  arXiv:2512.05049 (cross-list from quant-ph) [pdf, ps, other]
Title: QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[185]  arXiv:2512.05033 (cross-list from cs.CL) [pdf, ps, other]
Title: Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
Comments: 22 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186]  arXiv:2512.05024 (cross-list from stat.ME) [pdf, ps, other]
Title: Model-Free Assessment of Simulator Fidelity via Quantile Curves
Comments: 33 pages, 11 figures
Subjects: Methodology (stat.ME); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[187]  arXiv:2512.05021 (cross-list from cs.CV) [pdf, ps, other]
Title: HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[188]  arXiv:2512.04992 (cross-list from cs.NE) [pdf, ps, other]
Title: Evolutionary Architecture Search through Grammar-Based Sequence Alignment
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[189]  arXiv:2512.04985 (cross-list from stat.ML) [pdf, ps, other]
Title: Towards a unified framework for guided diffusion models
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[190]  arXiv:2512.04981 (cross-list from cs.CV) [pdf, ps, other]
Title: Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[191]  arXiv:2512.04980 (cross-list from stat.ML) [pdf, ps, other]
Title: Learning Causality for Longitudinal Data
Comments: PhD thesis manuscript
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
[ total of 866 entries: 1-100 | 92-191 | 192-291 | 292-391 | 392-491 | ... | 792-866 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)