Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 753

[ total of 778 entries: 1-25 | ... | 679-703 | 704-728 | 729-753 | 754-778 ]

Tue, 2 Dec 2025 (continued, showing last 25 of 278 entries)

[754]  arXiv:2512.01009 (cross-list from cs.RO) [pdf, ps, other]
Title: FOM-Nav: Frontier-Object Maps for Object Goal Navigation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[755]  arXiv:2512.00883 (cross-list from cs.MM) [pdf, ps, other]
Title: Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[756]  arXiv:2512.00818 (cross-list from cs.AI) [pdf, ps, other]
Title: Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[757]  arXiv:2512.00777 (cross-list from cs.RO) [pdf, ps, other]
Title: Sign Language Recognition using Bidirectional Reservoir Computing
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[758]  arXiv:2512.00736 (cross-list from cs.LG) [pdf, ps, other]
Title: REM: Evaluating LLM Embodied Spatial Reasoning through Multi-Frame Trajectories
Journal-ref: Proceedings of the Conference on Language Modeling (COLM 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2512.00659 (cross-list from cs.RO) [pdf, ps, other]
Title: Fast, Robust, Permutation-and-Sign Invariant SO(3) Pattern Alignment
Subjects: Robotics (cs.RO); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[760]  arXiv:2512.00403 (cross-list from cs.LG) [pdf, ps, other]
Title: SelfAI: Building a Self-Training AI System with LLM Agents
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[761]  arXiv:2512.00396 (cross-list from cs.LG) [pdf, ps, other]
Title: Time-Series at the Edge: Tiny Separable CNNs for Wearable Gait Detection and Optimal Sensor Placement
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[762]  arXiv:2512.00350 (cross-list from eess.IV) [pdf, ps, other]
Title: MedCondDiff: Lightweight, Robust, Semantically Guided Diffusion for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[763]  arXiv:2512.00324 (cross-list from cs.RO) [pdf, ps, other]
Title: MILE: A Mechanically Isomorphic Exoskeleton Data Collection System with Fingertip Visuotactile Sensing for Dexterous Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[764]  arXiv:2512.00287 (cross-list from cs.RO) [pdf, ps, other]
Title: RealAppliance: Let High-fidelity Appliance Assets Controllable and Workable as Aligned Real Manuals
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[765]  arXiv:2512.00229 (cross-list from cs.LG) [pdf, ps, other]
Title: TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[766]  arXiv:2512.00138 (cross-list from cs.AR) [pdf, ps, other]
Title: Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS
Comments: 6 pages, 12 figures, 2 tables
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[767]  arXiv:2512.00120 (cross-list from cs.SD) [pdf, ps, other]
Title: Art2Music: Generating Music for Art Images with Multi-modal Feeling Alignment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[768]  arXiv:2512.00115 (cross-list from cs.SD) [pdf, ps, other]
Title: MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning
Comments: 10 pages, 5 figures
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[769]  arXiv:2512.00094 (cross-list from cs.CR) [pdf, ps, other]
Title: HMARK: Radioactive Multi-Bit Semantic-Latent Watermarking for Diffusion Models
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[770]  arXiv:2512.00076 (cross-list from cs.RO) [pdf, ps, other]
Title: Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[771]  arXiv:2512.00074 (cross-list from cs.RO) [pdf, ps, other]
Title: Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[772]  arXiv:2512.00052 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Coarse-to-Fine Non-Rigid Registration for Side-Scan Sonar Mosaicking
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[773]  arXiv:2512.00041 (cross-list from cs.RO) [pdf, ps, other]
Title: VISTAv2: World Imagination for Indoor Vision-and-Language Navigation
Comments: 11 pages, 5 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[774]  arXiv:2512.00037 (cross-list from cs.RO) [pdf, ps, other]
Title: ICD-Net: Inertial Covariance Displacement Network for Drone Visual-Inertial SLAM
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[775]  arXiv:2512.00027 (cross-list from cs.RO) [pdf, ps, other]
Title: A Survey on Improving Human Robot Collaboration through Vision-and-Language Navigation
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[776]  arXiv:2512.00024 (cross-list from cs.RO) [pdf, ps, other]
Title: Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos
Authors: X. Hu, G. Ye
Comments: Accepted to RSS 2025 Workshop
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[777]  arXiv:2512.00021 (cross-list from cs.RO) [pdf, ps, other]
Title: Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges
Comments: Under review
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[778]  arXiv:2512.00019 (cross-list from cs.RO) [pdf, ps, other]
Title: A Comprehensive Survey on Surgical Digital Twin
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)