Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 28

[ total of 737 entries: 1-50 | 29-78 | 79-128 | 129-178 | 179-228 | ... | 729-737 ]

Fri, 12 Dec 2025 (continued, showing 50 of 118 entries)

[29]  arXiv:2512.10794 [pdf, ps, other]
Title: What matters for Representation Alignment: Global Information or Spatial Structure?
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[30]  arXiv:2512.10765 [pdf, ps, other]
Title: Blood Pressure Prediction for Coronary Artery Disease Diagnosis using Coronary Computed Tomography Angiography
Comments: 19 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2512.10750 [pdf, ps, other]
Title: LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2512.10730 [pdf, ps, other]
Title: IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation
Comments: 25 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2512.10725 [pdf, ps, other]
Title: Video Depth Propagation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2512.10719 [pdf, ps, other]
Title: SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2512.10715 [pdf, ps, other]
Title: CheXmask-U: Quantifying uncertainty in landmark-based anatomical segmentation for X-ray images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2512.10685 [pdf, ps, other]
Title: Sharp Monocular View Synthesis in Less Than a Second
Comments: Code and weights available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[37]  arXiv:2512.10683 [pdf, other]
Title: Optimal transport unlocks end-to-end learning for single-molecule localization
Authors: Romain Seailles (DI-ENS), Jean-Baptiste Masson (IP, CNRS, UPCité), Jean Ponce (DI-ENS, CDS), Julien Mairal (LJK)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[38]  arXiv:2512.10674 [pdf, ps, other]
Title: Geo6DPose: Fast Zero-Shot 6D Object Pose Estimation via Geometry-Filtered Feature Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2512.10668 [pdf, ps, other]
Title: XDen-1K: A Density Field Dataset of Real-World Objects
Comments: 10 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2512.10660 [pdf, ps, other]
Title: NaviHydra: Controllable Navigation-guided End-to-end Autonomous Driving with Hydra-distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2512.10652 [pdf, ps, other]
Title: TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[42]  arXiv:2512.10628 [pdf, ps, other]
Title: K-Track: Kalman-Enhanced Tracking for Accelerating Deep Point Trackers on Edge Devices
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2512.10619 [pdf, ps, other]
Title: DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2512.10617 [pdf, ps, other]
Title: Lang2Motion: Bridging Language and Motion through Joint Embedding Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2512.10608 [pdf, ps, other]
Title: Robust Multi-Disease Retinal Classification via Xception-Based Transfer Learning and W-Net Vessel Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2512.10607 [pdf, ps, other]
Title: Track and Caption Any Motion: Query-Free Motion Discovery and Description in Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2512.10596 [pdf, ps, other]
Title: Beyond Pixels: A Training-Free, Text-to-Text Framework for Remote Sensing Image Retrieval
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[48]  arXiv:2512.10592 [pdf, ps, other]
Title: Salient Object Detection in Complex Weather Conditions via Noise Indicators
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2512.10581 [pdf, ps, other]
Title: Unleashing Degradation-Carrying Features in Symmetric U-Net: Simpler and Stronger Baselines for All-in-One Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2512.10571 [pdf, ps, other]
Title: Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2512.10562 [pdf, ps, other]
Title: Data-Efficient American Sign Language Recognition via Few-Shot Prototypical Networks
Authors: Meher Md Saad
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2512.10554 [pdf, ps, other]
Title: Grounding Everything in Tokens for Multimodal Large Language Models
Comments: 19 pages, 16 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2512.10548 [pdf, ps, other]
Title: Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2512.10521 [pdf, ps, other]
Title: Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2512.10517 [pdf, ps, other]
Title: 3D Blood Pulsation Maps
Comments: 9 pages (without references), supplementals attached, waiting for publication. To access the dataset, see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2512.10498 [pdf, ps, other]
Title: Robust Shape from Focus via Multiscale Directional Dilated Laplacian and Recurrent Network
Comments: Accepted to IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57]  arXiv:2512.10450 [pdf, ps, other]
Title: Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2512.10437 [pdf, ps, other]
Title: An M-Health Algorithmic Approach to Identify and Assess Physiotherapy Exercises in Real Time
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59]  arXiv:2512.10421 [pdf, ps, other]
Title: Neural Collapse in Test-Time Adaptation
Comments: 10 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2512.10419 [pdf, ps, other]
Title: TransLocNet: Cross-Modal Attention for Aerial-Ground Vehicle Localization with Contrastive Learning
Comments: 8 pages, 4 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2512.10416 [pdf, ps, other]
Title: Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[62]  arXiv:2512.10408 [pdf, ps, other]
Title: MultiHateLoc: Towards Temporal Localisation of Multimodal Hate Content in Online Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63]  arXiv:2512.10386 [pdf, ps, other]
Title: Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2512.10384 [pdf, ps, other]
Title: Towards Fine-Grained Recognition with Large Visual Language Models: Benchmark and Optimization Strategies
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[65]  arXiv:2512.10379 [pdf, ps, other]
Title: Self-Supervised Contrastive Embedding Adaptation for Endoscopic Image Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2512.10376 [pdf, ps, other]
Title: RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds
Comments: Accepted by AAAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67]  arXiv:2512.10369 [pdf, ps, other]
Title: Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views
Comments: 20 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2512.10363 [pdf, ps, other]
Title: Point to Span: Zero-Shot Moment Retrieval for Navigating Unseen Hour-Long Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2512.10362 [pdf, ps, other]
Title: Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[70]  arXiv:2512.10359 [pdf, ps, other]
Title: Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task
Comments: Accepted by NeurIPS 2025 main track
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2512.10357 [pdf, ps, other]
Title: mmCounter: Static People Counting in Dense Indoor Scenarios Using mmWave Radar
Comments: Accepted at the 22nd International Conference on Embedded Wireless Systems and Networks (EWSN 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2512.10353 [pdf, ps, other]
Title: Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2512.10352 [pdf, ps, other]
Title: Topology-Agnostic Animal Motion Generation from Text Prompt
Comments: 10 pages, 7 figures. Conference submission
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2512.10342 [pdf, ps, other]
Title: CoSPlan: Corrective Sequential Planning via Scene Graph Incremental Updates
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2512.10340 [pdf, ps, other]
Title: Zero-shot Adaptation of Stable Diffusion via Plug-in Hierarchical Degradation Representation for Real-World Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76]  arXiv:2512.10334 [pdf, ps, other]
Title: A Conditional Generative Framework for Synthetic Data Augmentation in Segmenting Thin and Elongated Structures in Biological Images
Authors: Yi Liu, Yichi Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2512.10327 [pdf, ps, other]
Title: Simple Yet Effective Selective Imputation for Incomplete Multi-view Clustering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[78]  arXiv:2512.10326 [pdf, ps, other]
Title: StainNet: A Special Staining Self-Supervised Vision Transformer for Computational Pathology
Comments: 15 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)