Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 120

Fri, 12 Dec 2025 (continued, showing 50 of 118 entries)

[121] arXiv:2512.10935 [pdf, ps, other]: Title: Any4D: Unified Feed-Forward Metric 4D Reconstruction

Authors: Jay Karhade, Nikhil Keetha, Yuchen Zhang, Tanisha Gupta, Akash Sharma, Sebastian Scherer, Deva Ramanan

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[122] arXiv:2512.10932 [pdf, ps, other]: Title: BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

Authors: Shengao Wang, Wenqi Wang, Zecheng Wang, Max Whitton, Michael Wakeham, Arjun Chandra, Joey Huang, Pengyue Zhu, Helen Chen, David Li, Jeffrey Li, Shawn Li, Andrew Zagula, Amy Zhao, Andrew Zhu, Sayaka Nakamura, Yuki Yamamoto, Jerry Jun Yokono, Aaron Mueller, Bryan A. Plummer, Kate Saenko, Venkatesh Saligrama, Boqing Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[123] arXiv:2512.10927 [pdf, ps, other]: Title: FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Authors: Yulu Gan, Ligeng Zhu, Dandan Shan, Baifeng Shi, Hongxu Yin, Boris Ivanovic, Song Han, Trevor Darrell, Jitendra Malik, Marco Pavone, Boyi Li

Comments: Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124] arXiv:2512.10894 [pdf, ps, other]: Title: DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance

Authors: Peiying Zhang, Nanxuan Zhao, Matthew Fisher, Yiran Xu, Jing Liao, Difan Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2512.10888 [pdf, ps, other]: Title: PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

Authors: Brandon Smock, Valerie Faucon-Morin, Max Sokolov, Libin Liang, Tayyibah Khanam, Maury Courtland

Comments: 15 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2512.10881 [pdf, ps, other]: Title: MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Authors: Kehong Gong, Zhengyu Wen, Weixia He, Mingxi Xu, Qi Wang, Ning Zhang, Zhengyu Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2512.10867 [pdf, ps, other]: Title: From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models

Authors: Zongzhao Li, Xiangzhe Kong, Jiahui Su, Zongyang Ma, Mingze Li, Songyou Li, Yuelin Zhang, Yu Rong, Tingyang Xu, Deli Zhao, Wenbing Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2512.10863 [pdf, ps, other]: Title: MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Authors: Jingli Lin, Runsen Xu, Shaohao Zhu, Sihan Yang, Peizhou Cao, Yunlong Ran, Miao Hu, Chenming Zhu, Yiman Xie, Yilin Long, Wenbo Hu, Dahua Lin, Tai Wang, Jiangmiao Pang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[129] arXiv:2512.10860 [pdf, ps, other]: Title: SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation

Authors: Kehong Gong, Zhengyu Wen, Mingxi Xu, Weixia He, Qi Wang, Ning Zhang, Zhengyu Li, Chenbin Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2512.10840 [pdf, ps, other]: Title: PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning

Authors: Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2512.10818 [pdf, ps, other]: Title: Self-Ensemble Post Learning for Noisy Domain Generalization

Authors: Wang Lu, Jindong Wang

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132] arXiv:2512.10808 [pdf, ps, other]: Title: Graph Laplacian Transformer with Progressive Sampling for Prostate Cancer Grading

Authors: Masum Shah Junayed, John Derek Van Vessem, Qian Wan, Gahie Nam, Sheida Nabavi

Comments: International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2512.10794 [pdf, ps, other]: Title: What matters for Representation Alignment: Global Information or Spatial Structure?

Authors: Jaskirat Singh, Xingjian Leng, Zongze Wu, Liang Zheng, Richard Zhang, Eli Shechtman, Saining Xie

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[134] arXiv:2512.10765 [pdf, ps, other]: Title: Blood Pressure Prediction for Coronary Artery Disease Diagnosis using Coronary Computed Tomography Angiography

Authors: Rene Lisasi, Michele Esposito, Chen Zhao

Comments: 19 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2512.10750 [pdf, ps, other]: Title: LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation

Authors: Tianyu Zhou, Junyi Tang, Zehui Li, Dahong Qian, Suncheng Xiang

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136] arXiv:2512.10730 [pdf, ps, other]: Title: IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation

Authors: Yuan-Ming Li, Qize Yang, Nan Lei, Shenghao Fu, Ling-An Zeng, Jian-Fang Hu, Xihan Wei, Wei-Shi Zheng

Comments: 25 pages, 16 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2512.10725 [pdf, ps, other]: Title: Video Depth Propagation

Authors: Luigi Piccinelli, Thiemo Wandel, Christos Sakaridis, Wim Abbeloos, Luc Van Gool

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138] arXiv:2512.10719 [pdf, ps, other]: Title: SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving

Authors: Peizheng Li, Zhenghao Zhang, David Holtz, Hang Yu, Yutong Yang, Yuzhi Lai, Rui Song, Andreas Geiger, Andreas Zell

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2512.10715 [pdf, ps, other]: Title: CheXmask-U: Quantifying uncertainty in landmark-based anatomical segmentation for X-ray images

Authors: Matias Cosarinsky, Nicolas Gaggion, Rodrigo Echeveste, Enzo Ferrante

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2512.10685 [pdf, ps, other]: Title: Sharp Monocular View Synthesis in Less Than a Second

Authors: Lars Mescheder, Wei Dong, Shiwei Li, Xuyang Bai, Marcel Santos, Peiyun Hu, Bruno Lecouat, Mingmin Zhen, Amaël Delaunoy, Tian Fang, Yanghai Tsin, Stephan R. Richter, Vladlen Koltun

Comments: Code and weights available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[141] arXiv:2512.10683 [pdf, other]: Title: Optimal transport unlocks end-to-end learning for single-molecule localization

Authors: Romain Seailles (DI-ENS), Jean-Baptiste Masson (IP, CNRS, UPCité), Jean Ponce (DI-ENS, CDS), Julien Mairal (LJK)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[142] arXiv:2512.10674 [pdf, ps, other]: Title: Geo6DPose: Fast Zero-Shot 6D Object Pose Estimation via Geometry-Filtered Feature Matching

Authors: Javier Villena Toro, Mehdi Tarkian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2512.10668 [pdf, ps, other]: Title: XDen-1K: A Density Field Dataset of Real-World Objects

Authors: Jingxuan Zhang, Tianqi Yu, Yatu Zhang, Jinze Wu, Kaixin Yao, Jingyang Liu, Yuyao Zhang, Jiayuan Gu, Jingyi Yu

Comments: 10 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144] arXiv:2512.10660 [pdf, ps, other]: Title: NaviHydra: Controllable Navigation-guided End-to-end Autonomous Driving with Hydra-distillation

Authors: Hanfeng Wu, Marlon Steiner, Michael Schmidt, Alvaro Marcos-Ramiro, Christoph Stiller

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2512.10652 [pdf, ps, other]: Title: TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection

Authors: Jian-Yu Jiang-Lin, Kang-Yang Huang, Ling Zou, Ling Lo, Sheng-Ping Yang, Yu-Wen Tseng, Kun-Hsiang Lin, Chia-Ling Chen, Yu-Ting Ta, Yan-Tsung Wang, Po-Ching Chen, Hongxia Xie, Hong-Han Shuai, Wen-Huang Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[146] arXiv:2512.10628 [pdf, ps, other]: Title: K-Track: Kalman-Enhanced Tracking for Accelerating Deep Point Trackers on Edge Devices

Authors: Bishoy Galoaa, Pau Closas, Sarah Ostadabbas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2512.10619 [pdf, ps, other]: Title: DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Authors: Qintong Zhang, Junyuan Zhang, Zhifei Ren, Linke Ouyang, Zichen Wen, Junbo Niu, Yuan Qu, Bin Wang, Ka-Ho Chow, Conghui He, Wentao Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2512.10617 [pdf, ps, other]: Title: Lang2Motion: Bridging Language and Motion through Joint Embedding Spaces

Authors: Bishoy Galoaa, Xiangyu Bai, Sarah Ostadabbas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2512.10608 [pdf, ps, other]: Title: Robust Multi-Disease Retinal Classification via Xception-Based Transfer Learning and W-Net Vessel Segmentation

Authors: Mohammad Sadegh Gholizadeh, Amir Arsalan Rezapour

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2512.10607 [pdf, ps, other]: Title: Track and Caption Any Motion: Query-Free Motion Discovery and Description in Videos

Authors: Bishoy Galoaa, Sarah Ostadabbas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[151] arXiv:2512.10596 [pdf, ps, other]: Title: Beyond Pixels: A Training-Free, Text-to-Text Framework for Remote Sensing Image Retrieval

Authors: J. Xiao, Y. Guo, X. Zi, K. Thiyagarajan, C. Moreira, M. Prasad

Comments: 6 pages, 1 figure

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152] arXiv:2512.10592 [pdf, ps, other]: Title: Salient Object Detection in Complex Weather Conditions via Noise Indicators

Authors: Quan Chen, Xiaokai Yang, Tingyu Wang, Rongfeng Lu, Xichun Sheng, Yaoqi Sun, Chenggang Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153] arXiv:2512.10581 [pdf, ps, other]: Title: Unleashing Degradation-Carrying Features in Symmetric U-Net: Simpler and Stronger Baselines for All-in-One Image Restoration

Authors: Wenlong Jiao, Heyang Lee, Ping Wang, Pengfei Zhu, Qinghua Hu, Dongwei Ren

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154] arXiv:2512.10571 [pdf, ps, other]: Title: Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner

Authors: Haojie Zheng, Shuchen Weng, Jingqi Liu, Siqi Yang, Boxin Shi, Xinlong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155] arXiv:2512.10562 [pdf, ps, other]: Title: Data-Efficient American Sign Language Recognition via Few-Shot Prototypical Networks

Authors: Meher Md Saad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2512.10554 [pdf, ps, other]: Title: Grounding Everything in Tokens for Multimodal Large Language Models

Authors: Xiangxuan Ren, Zhongdao Wang, Liping Hou, Pin Tang, Guoqing Wang, Chao Ma

Comments: 19 pages, 16 figures, 12 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2512.10548 [pdf, ps, other]: Title: Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding

Authors: Yuchen Feng, Zhenyu Zhang, Naibin Gu, Yilong Chen, Peng Fu, Zheng Lin, Shuohuan Wang, Yu Sun, Hua Wu, Weiping Wang, Haifeng Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158] arXiv:2512.10521 [pdf, ps, other]: Title: Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA

Authors: Pasquale De Marinis, Gennaro Vessio, Giovanna Castellano

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2512.10517 [pdf, ps, other]: Title: 3D Blood Pulsation Maps

Authors: Maurice Rohr, Tobias Reinhardt, Tizian Dege, Justus Thies, Christoph Hoog Antink

Comments: 9 pages (without references), supplementals attached, waiting for publication. In order to access the dataset, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2512.10498 [pdf, ps, other]: Title: Robust Shape from Focus via Multiscale Directional Dilated Laplacian and Recurrent Network

Authors: Khurram Ashfaq, Muhammad Tariq Mahmood

Comments: Accepted to IJCV

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161] arXiv:2512.10450 [pdf, ps, other]: Title: Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment

Authors: Han Li, Shaohui Li, Wenrui Dai, Chenglin Li, Xinlong Pan, Haipeng Wang, Junni Zou, Hongkai Xiong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162] arXiv:2512.10437 [pdf, ps, other]: Title: An M-Health Algorithmic Approach to Identify and Assess Physiotherapy Exercises in Real Time

Authors: Stylianos Kandylakis, Christos Orfanopoulos, Georgios Siolas, Panayiotis Tsanakas

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[163] arXiv:2512.10421 [pdf, ps, other]: Title: Neural Collapse in Test-Time Adaptation

Authors: Xiao Chen, Zhongjing Du, Jiazhen Huang, Xu Jiang, Li Lu, Jingyan Jiang, Zhi Wang

Comments: 10 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2512.10419 [pdf, ps, other]: Title: TransLocNet: Cross-Modal Attention for Aerial-Ground Vehicle Localization with Contrastive Learning

Authors: Phu Pham, Damon Conover, Aniket Bera

Comments: 8 pages, 4 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165] arXiv:2512.10416 [pdf, ps, other]: Title: Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction

Authors: Wenfei Guan, Jilin Mei, Tong Shen, Xumin Wu, Shuo Wang, Cheng Min, Yu Hu

Comments: v2: Corrected the abstract to accurately reflect the paper content. Updated the project link to the correct repository. No changes to the main text

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[166] arXiv:2512.10408 [pdf, ps, other]: Title: MultiHateLoc: Towards Temporal Localisation of Multimodal Hate Content in Online Videos

Authors: Qiyue Sun, Tailin Chen, Yinghui Zhang, Yuchen Zhang, Jiangbei Yue, Jianbo Jiao, Zeyu Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167] arXiv:2512.10386 [pdf, ps, other]: Title: Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method

Authors: Ge Zhang, Chunyang Wang, Bo Xiao, Xuelian Liu, Bin Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168] arXiv:2512.10384 [pdf, ps, other]: Title: Towards Fine-Grained Recognition with Large Visual Language Models: Benchmark and Optimization Strategies

Authors: Cong Pang, Hongtao Yu, Zixuan Chen, Lewei Lu, Xin Lou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[169] arXiv:2512.10379 [pdf, ps, other]: Title: Self-Supervised Contrastive Embedding Adaptation for Endoscopic Image Matching

Authors: Alberto Rota, Elena De Momi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170] arXiv:2512.10376 [pdf, ps, other]: Title: RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds

Authors: Jingyun Fu, Zhiyu Xiang, Na Zhao

Comments: Accepted by AAAI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
