We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 854

[ total of 778 entries: 1-50 | ... | 579-628 | 629-678 | 679-728 | 729-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing last 50 of 278 entries)

[729]  arXiv:2512.00078 [pdf, ps, other]
Title: Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[730]  arXiv:2512.00075 [pdf, ps, other]
Title: Adapter Shield: A Unified Framework with Built-in Authentication for Preventing Unauthorized Zero-Shot Image-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[731]  arXiv:2512.00073 [pdf, ps, other]
Title: ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[732]  arXiv:2512.00065 [pdf, ps, other]
Title: Satellite to Street : Disaster Impact Estimator
Comments: 11 pages,9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[733]  arXiv:2512.00061 [pdf, ps, other]
Title: DL-CapsNet: A Deep and Light Capsule Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734]  arXiv:2512.00060 [pdf, ps, other]
Title: PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[735]  arXiv:2512.00042 [pdf, ps, other]
Title: Closing the Gap: Data-Centric Fine-Tuning of Vision Language Models for the Standardized Exam Questions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[736]  arXiv:2512.00008 [pdf, ps, other]
Title: MOTION: ML-Assisted On-Device Low-Latency Motion Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[737]  arXiv:2512.02020 (cross-list from cs.RO) [pdf, ps, other]
Title: EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI
Comments: Accepted by AAAI 2026. Project Page: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[738]  arXiv:2512.01993 (cross-list from cs.RO) [pdf, ps, other]
Title: RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies
Comments: Preprint
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[739]  arXiv:2512.01979 (cross-list from cs.AI) [pdf, ps, other]
Title: Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[740]  arXiv:2512.01946 (cross-list from cs.RO) [pdf, ps, other]
Title: Guardian: Detecting Robotic Planning and Execution Errors with Vision-Language Models
Comments: Code, Data, and Models available at this https URL The paper contains 8 pages, 9 figures, 6 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[741]  arXiv:2512.01913 (cross-list from eess.IV) [pdf, ps, other]
Title: Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies
Comments: Submitted to Medical Image Analysis. Journal Extension of arXiv:2407.19274
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[742]  arXiv:2512.01822 (cross-list from cs.CL) [pdf, ps, other]
Title: InnoGym: Benchmarking the Innovation Potential of AI Agents
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[743]  arXiv:2512.01818 (cross-list from cs.LG) [pdf, ps, other]
Title: Forget Less, Retain More: A Lightweight Regularizer for Rehearsal-Based Continual Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[744]  arXiv:2512.01687 (cross-list from cs.NE) [pdf, ps, other]
Title: Revisiting Direct Encoding: Learnable Temporal Dynamics for Static Image Spiking Neural Networks
Authors: Huaxu He
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[745]  arXiv:2512.01550 (cross-list from cs.RO) [pdf, ps, other]
Title: NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[746]  arXiv:2512.01461 (cross-list from cs.LG) [pdf, ps, other]
Title: Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[747]  arXiv:2512.01329 (cross-list from cs.GR) [pdf, ps, other]
Title: TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[748]  arXiv:2512.01324 (cross-list from hep-ex) [pdf, ps, other]
Title: Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics
Comments: 23 pages, 15 figures, preprint. Project page at this https URL
Subjects: High Energy Physics - Experiment (hep-ex); Computer Vision and Pattern Recognition (cs.CV)
[749]  arXiv:2512.01252 (cross-list from cs.LG) [pdf, ps, other]
Title: Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
Comments: 9 pages, 7 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[750]  arXiv:2512.01181 (cross-list from cs.LG) [pdf, ps, other]
Title: First On-Orbit Demonstration of a Geospatial Foundation Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[751]  arXiv:2512.01152 (cross-list from cs.LG) [pdf, ps, other]
Title: Open-Set Domain Adaptation Under Background Distribution Shift: Challenges and A Provably Efficient Solution
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[752]  arXiv:2512.01104 (cross-list from cs.RO) [pdf, ps, other]
Title: Estimation of Kinematic Motion from Dashcam Footage
Comments: 8 pages, 10 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[753]  arXiv:2512.01061 (cross-list from cs.RO) [pdf, ps, other]
Title: Opening the Sim-to-Real Door for Humanoid Pixel-to-Action Policy Transfer
Comments: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[754]  arXiv:2512.01009 (cross-list from cs.RO) [pdf, ps, other]
Title: FOM-Nav: Frontier-Object Maps for Object Goal Navigation
Comments: Project page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[755]  arXiv:2512.00883 (cross-list from cs.MM) [pdf, ps, other]
Title: Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[756]  arXiv:2512.00818 (cross-list from cs.AI) [pdf, ps, other]
Title: Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[757]  arXiv:2512.00777 (cross-list from cs.RO) [pdf, ps, other]
Title: Sign Language Recognition using Bidirectional Reservoir Computing
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[758]  arXiv:2512.00736 (cross-list from cs.LG) [pdf, ps, other]
Title: REM: Evaluating LLM Embodied Spatial Reasoning through Multi-Frame Trajectories
Journal-ref: Proceedings of the Conference on Language Modeling (COLM 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2512.00659 (cross-list from cs.RO) [pdf, ps, other]
Title: Fast, Robust, Permutation-and-Sign Invariant SO(3) Pattern Alignment
Subjects: Robotics (cs.RO); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[760]  arXiv:2512.00403 (cross-list from cs.LG) [pdf, ps, other]
Title: SelfAI: Building a Self-Training AI System with LLM Agents
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[761]  arXiv:2512.00396 (cross-list from cs.LG) [pdf, ps, other]
Title: Time-Series at the Edge: Tiny Separable CNNs for Wearable Gait Detection and Optimal Sensor Placement
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[762]  arXiv:2512.00350 (cross-list from eess.IV) [pdf, ps, other]
Title: MedCondDiff: Lightweight, Robust, Semantically Guided Diffusion for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[763]  arXiv:2512.00324 (cross-list from cs.RO) [pdf, ps, other]
Title: MILE: A Mechanically Isomorphic Exoskeleton Data Collection System with Fingertip Visuotactile Sensing for Dexterous Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[764]  arXiv:2512.00287 (cross-list from cs.RO) [pdf, ps, other]
Title: RealAppliance: Let High-fidelity Appliance Assets Controllable and Workable as Aligned Real Manuals
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[765]  arXiv:2512.00229 (cross-list from cs.LG) [pdf, ps, other]
Title: TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[766]  arXiv:2512.00138 (cross-list from cs.AR) [pdf, ps, other]
Title: Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS
Comments: 6 pages.12 figures & 2 table
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[767]  arXiv:2512.00120 (cross-list from cs.SD) [pdf, ps, other]
Title: Art2Music: Generating Music for Art Images with Multi-modal Feeling Alignment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[768]  arXiv:2512.00115 (cross-list from cs.SD) [pdf, ps, other]
Title: MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning
Comments: 10 pages, 5 figures
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[769]  arXiv:2512.00094 (cross-list from cs.CR) [pdf, ps, other]
Title: HMARK: Radioactive Multi-Bit Semantic-Latent Watermarking for Diffusion Models
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[770]  arXiv:2512.00076 (cross-list from cs.RO) [pdf, ps, other]
Title: Arcadia: Toward a Full-Lifecycle Framework for Embodied Lifelong Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[771]  arXiv:2512.00074 (cross-list from cs.RO) [pdf, ps, other]
Title: Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[772]  arXiv:2512.00052 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Coarse-to-Fine Non-Rigid Registration for Side-Scan Sonar Mosaicking
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[773]  arXiv:2512.00041 (cross-list from cs.RO) [pdf, ps, other]
Title: VISTAv2: World Imagination for Indoor Vision-and-Language Navigation
Comments: 11 pages, 5 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[774]  arXiv:2512.00037 (cross-list from cs.RO) [pdf, ps, other]
Title: ICD-Net: Inertial Covariance Displacement Network for Drone Visual-Inertial SLAM
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[775]  arXiv:2512.00027 (cross-list from cs.RO) [pdf, ps, other]
Title: A Survey on Improving Human Robot Collaboration through Vision-and-Language Navigation
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[776]  arXiv:2512.00024 (cross-list from cs.RO) [pdf, ps, other]
Title: Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos
Authors: X. Hu, G. Ye
Comments: Accepted to RSS 2025 Workshop
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[777]  arXiv:2512.00021 (cross-list from cs.RO) [pdf, ps, other]
Title: Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges
Comments: Under review
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[778]  arXiv:2512.00019 (cross-list from cs.RO) [pdf, ps, other]
Title: A Comprehensive Survey on Surgical Digital Twin
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-50 | ... | 579-628 | 629-678 | 679-728 | 729-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)