We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 748

[ total of 778 entries: 1-25 | ... | 674-698 | 699-723 | 724-748 | 749-773 | 774-778 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 25 of 278 entries)

[749]  arXiv:2512.01252 (cross-list from cs.LG) [pdf, ps, other]
Title: Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
Comments: 9 pages, 7 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[750]  arXiv:2512.01181 (cross-list from cs.LG) [pdf, ps, other]
Title: First On-Orbit Demonstration of a Geospatial Foundation Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[751]  arXiv:2512.01152 (cross-list from cs.LG) [pdf, ps, other]
Title: Open-Set Domain Adaptation Under Background Distribution Shift: Challenges and A Provably Efficient Solution
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[752]  arXiv:2512.01104 (cross-list from cs.RO) [pdf, ps, other]
Title: Estimation of Kinematic Motion from Dashcam Footage
Comments: 8 pages, 10 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[753]  arXiv:2512.01061 (cross-list from cs.RO) [pdf, ps, other]
Title: Opening the Sim-to-Real Door for Humanoid Pixel-to-Action Policy Transfer
Comments: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[754]  arXiv:2512.01009 (cross-list from cs.RO) [pdf, ps, other]
Title: FOM-Nav: Frontier-Object Maps for Object Goal Navigation
Comments: Project page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[755]  arXiv:2512.00883 (cross-list from cs.MM) [pdf, ps, other]
Title: Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[756]  arXiv:2512.00818 (cross-list from cs.AI) [pdf, ps, other]
Title: Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[757]  arXiv:2512.00777 (cross-list from cs.RO) [pdf, ps, other]
Title: Sign Language Recognition using Bidirectional Reservoir Computing
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[758]  arXiv:2512.00736 (cross-list from cs.LG) [pdf, ps, other]
Title: REM: Evaluating LLM Embodied Spatial Reasoning through Multi-Frame Trajectories
Journal-ref: Proceedings of the Conference on Language Modeling (COLM 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2512.00659 (cross-list from cs.RO) [pdf, ps, other]
Title: Fast, Robust, Permutation-and-Sign Invariant SO(3) Pattern Alignment
Subjects: Robotics (cs.RO); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[760]  arXiv:2512.00403 (cross-list from cs.LG) [pdf, ps, other]
Title: SelfAI: Building a Self-Training AI System with LLM Agents
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[761]  arXiv:2512.00396 (cross-list from cs.LG) [pdf, ps, other]
Title: Time-Series at the Edge: Tiny Separable CNNs for Wearable Gait Detection and Optimal Sensor Placement
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[762]  arXiv:2512.00350 (cross-list from eess.IV) [pdf, ps, other]
Title: MedCondDiff: Lightweight, Robust, Semantically Guided Diffusion for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[763]  arXiv:2512.00324 (cross-list from cs.RO) [pdf, ps, other]
Title: MILE: A Mechanically Isomorphic Exoskeleton Data Collection System with Fingertip Visuotactile Sensing for Dexterous Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[764]  arXiv:2512.00287 (cross-list from cs.RO) [pdf, ps, other]
Title: RealAppliance: Let High-fidelity Appliance Assets Controllable and Workable as Aligned Real Manuals
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[765]  arXiv:2512.00229 (cross-list from cs.LG) [pdf, ps, other]
Title: TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[766]  arXiv:2512.00138 (cross-list from cs.AR) [pdf, ps, other]
Title: Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS
Comments: 6 pages.12 figures & 2 table
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[767]  arXiv:2512.00120 (cross-list from cs.SD) [pdf, ps, other]
Title: Art2Music: Generating Music for Art Images with Multi-modal Feeling Alignment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[768]  arXiv:2512.00115 (cross-list from cs.SD) [pdf, ps, other]
Title: MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning
Comments: 10 pages, 5 figures
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[769]  arXiv:2512.00094 (cross-list from cs.CR) [pdf, ps, other]
Title: HMARK: Radioactive Multi-Bit Semantic-Latent Watermarking for Diffusion Models
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[770]  arXiv:2512.00076 (cross-list from cs.RO) [pdf, ps, other]
Title: Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[771]  arXiv:2512.00074 (cross-list from cs.RO) [pdf, ps, other]
Title: Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[772]  arXiv:2512.00052 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Coarse-to-Fine Non-Rigid Registration for Side-Scan Sonar Mosaicking
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[773]  arXiv:2512.00041 (cross-list from cs.RO) [pdf, ps, other]
Title: VISTAv2: World Imagination for Indoor Vision-and-Language Navigation
Comments: 11 pages, 5 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-25 | ... | 674-698 | 699-723 | 724-748 | 749-773 | 774-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)