We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 120

[ total of 747 entries: 1-50 | 21-70 | 71-120 | 121-170 | 171-220 | 221-270 | 271-320 | ... | 721-747 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 12 Dec 2025 (continued, showing 50 of 118 entries)

[121]  arXiv:2512.10935 [pdf, ps, other]
Title: Any4D: Unified Feed-Forward Metric 4D Reconstruction
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[122]  arXiv:2512.10932 [pdf, ps, other]
Title: BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[123]  arXiv:2512.10927 [pdf, ps, other]
Title: FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Comments: Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.10894 [pdf, ps, other]
Title: DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.10888 [pdf, ps, other]
Title: PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction
Comments: 15 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.10881 [pdf, ps, other]
Title: MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.10867 [pdf, ps, other]
Title: From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.10863 [pdf, ps, other]
Title: MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[129]  arXiv:2512.10860 [pdf, ps, other]
Title: SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2512.10840 [pdf, ps, other]
Title: PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.10818 [pdf, ps, other]
Title: Self-Ensemble Post Learning for Noisy Domain Generalization
Authors: Wang Lu, Jindong Wang
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2512.10808 [pdf, ps, other]
Title: Graph Laplacian Transformer with Progressive Sampling for Prostate Cancer Grading
Comments: International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.10794 [pdf, ps, other]
Title: What matters for Representation Alignment: Global Information or Spatial Structure?
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[134]  arXiv:2512.10765 [pdf, ps, other]
Title: Blood Pressure Prediction for Coronary Artery Disease Diagnosis using Coronary Computed Tomography Angiography
Comments: 19 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2512.10750 [pdf, ps, other]
Title: LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2512.10730 [pdf, ps, other]
Title: IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation
Comments: 25 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2512.10725 [pdf, ps, other]
Title: Video Depth Propagation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2512.10719 [pdf, ps, other]
Title: SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2512.10715 [pdf, ps, other]
Title: CheXmask-U: Quantifying uncertainty in landmark-based anatomical segmentation for X-ray images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.10685 [pdf, ps, other]
Title: Sharp Monocular View Synthesis in Less Than a Second
Comments: Code and weights available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[141]  arXiv:2512.10683 [pdf, other]
Title: Optimal transport unlocks end-to-end learning for single-molecule localization
Authors: Romain Seailles (DI-ENS), Jean-Baptiste Masson (IP, CNRS, UPCité), Jean Ponce (DI-ENS, CDS), Julien Mairal (LJK)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[142]  arXiv:2512.10674 [pdf, ps, other]
Title: Geo6DPose: Fast Zero-Shot 6D Object Pose Estimation via Geometry-Filtered Feature Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.10668 [pdf, ps, other]
Title: XDen-1K: A Density Field Dataset of Real-World Objects
Comments: 10 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2512.10660 [pdf, ps, other]
Title: NaviHydra: Controllable Navigation-guided End-to-end Autonomous Driving with Hydra-distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2512.10652 [pdf, ps, other]
Title: TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[146]  arXiv:2512.10628 [pdf, ps, other]
Title: K-Track: Kalman-Enhanced Tracking for Accelerating Deep Point Trackers on Edge Devices
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2512.10619 [pdf, ps, other]
Title: DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.10617 [pdf, ps, other]
Title: Lang2Motion: Bridging Language and Motion through Joint Embedding Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2512.10608 [pdf, ps, other]
Title: Robust Multi-Disease Retinal Classification via Xception-Based Transfer Learning and W-Net Vessel Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150]  arXiv:2512.10607 [pdf, ps, other]
Title: Track and Caption Any Motion: Query-Free Motion Discovery and Description in Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[151]  arXiv:2512.10596 [pdf, ps, other]
Title: Beyond Pixels: A Training-Free, Text-to-Text Framework for Remote Sensing Image Retrieval
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152]  arXiv:2512.10592 [pdf, ps, other]
Title: Salient Object Detection in Complex Weather Conditions via Noise Indicators
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.10581 [pdf, ps, other]
Title: Unleashing Degradation-Carrying Features in Symmetric U-Net: Simpler and Stronger Baselines for All-in-One Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2512.10571 [pdf, ps, other]
Title: Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.10562 [pdf, ps, other]
Title: Data-Efficient American Sign Language Recognition via Few-Shot Prototypical Networks
Authors: Meher Md Saad
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2512.10554 [pdf, ps, other]
Title: Grounding Everything in Tokens for Multimodal Large Language Models
Comments: 19 pages, 16 figures, 12 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.10548 [pdf, ps, other]
Title: Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2512.10521 [pdf, ps, other]
Title: Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2512.10517 [pdf, ps, other]
Title: 3D Blood Pulsation Maps
Comments: 9 pages (without references), supplementals attached, waiting for publication. In order to access the dataset,see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2512.10498 [pdf, ps, other]
Title: Robust Shape from Focus via Multiscale Directional Dilated Laplacian and Recurrent Network
Comments: Accepted to IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2512.10450 [pdf, ps, other]
Title: Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2512.10437 [pdf, ps, other]
Title: An M-Health Algorithmic Approach to Identify and Assess Physiotherapy Exercises in Real Time
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[163]  arXiv:2512.10421 [pdf, ps, other]
Title: Neural Collapse in Test-Time Adaptation
Comments: 10 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.10419 [pdf, ps, other]
Title: TransLocNet: Cross-Modal Attention for Aerial-Ground Vehicle Localization with Contrastive Learning
Comments: 8 pages, 4 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2512.10416 [pdf, ps, other]
Title: Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction
Comments: v2: Corrected the abstract to accurately reflect the paper content. Updated the project link to the correct repository<a href="this https URL" target="_blank" rel="noopener noreferrer nofollow"></a>. No changes to the main text
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[166]  arXiv:2512.10408 [pdf, ps, other]
Title: MultiHateLoc: Towards Temporal Localisation of Multimodal Hate Content in Online Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2512.10386 [pdf, ps, other]
Title: Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.10384 [pdf, ps, other]
Title: Towards Fine-Grained Recognition with Large Visual Language Models: Benchmark and Optimization Strategies
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[169]  arXiv:2512.10379 [pdf, ps, other]
Title: Self-Supervised Contrastive Embedding Adaptation for Endoscopic Image Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2512.10376 [pdf, ps, other]
Title: RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds
Comments: Accepted by AAAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 747 entries: 1-50 | 21-70 | 71-120 | 121-170 | 171-220 | 221-270 | 271-320 | ... | 721-747 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)