Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 500

[ total of 778 entries: 1-50 | ... | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | ... | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (showing first 50 of 278 entries)

[501] arXiv:2512.02018 [pdf, ps, other]: Title: Data-Centric Visual Development for Self-Driving Labs

Authors: Anbang Liu, Guanzhong Hu, Jiayi Wang, Ping Guo, Han Liu

Comments: 11 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[502] arXiv:2512.02017 [pdf, ps, other]: Title: Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion

Authors: Shaowei Liu, David Yifan Yao, Saurabh Gupta, Shenlong Wang

Comments: Accepted to NeurIPS 2025. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[503] arXiv:2512.02016 [pdf, ps, other]: Title: Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now

Authors: Varun Varma Thozhiyoor, Shivam Tripathi, Venkatesh Babu Radhakrishnan, Anand Bhattad

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504] arXiv:2512.02015 [pdf, ps, other]: Title: Generative Video Motion Editing with 3D Point Tracks

Authors: Yao-Chih Lee, Zhoutong Zhang, Jiahui Huang, Jui-Hsien Wang, Joon-Young Lee, Jia-Bin Huang, Eli Shechtman, Zhengqi Li

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[505] arXiv:2512.02014 [pdf, ps, other]: Title: TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Authors: Zhiheng Liu, Weiming Ren, Haozhe Liu, Zijian Zhou, Shoufa Chen, Haonan Qiu, Xiaoke Huang, Zhaochong An, Fanny Yang, Aditya Patel, Viktar Atliha, Tony Ng, Xiao Han, Chuyan Zhu, Chenyang Zhang, Ding Liu, Juan-Manuel Perez-Rua, Sen He, Jürgen Schmidhuber, Wenhu Chen, Ping Luo, Wei Liu, Tao Xiang, Jonas Schult, Yuren Cong

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506] arXiv:2512.02012 [pdf, ps, other]: Title: Improved Mean Flows: On the Challenges of Fastforward Generative Models

Authors: Zhengyang Geng, Yiyang Lu, Zongze Wu, Eli Shechtman, J. Zico Kolter, Kaiming He

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[507] arXiv:2512.02009 [pdf, ps, other]: Title: AirSim360: A Panoramic Simulation Platform within Drone View

Authors: Xian Ge, Yuling Pan, Yuhang Zhang, Xiang Li, Weijun Zhang, Dizhe Zhang, Zhaoliang Wan, Xin Lin, Xiangkai Zhang, Juntao Liang, Jason Li, Wenjie Jiang, Bo Du, Ming-Hsuan Yang, Lu Qi

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508] arXiv:2512.02006 [pdf, ps, other]: Title: MV-TAP: Tracking Any Point in Multi-View Videos

Authors: Jahyeok Koo, Inès Hyeonsu Kim, Mungyeom Kim, Junghyun Park, Seohyun Park, Jaeyeong Kim, Jung Yi, Seokju Cho, Seungryong Kim

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509] arXiv:2512.02005 [pdf, ps, other]: Title: Learning Visual Affordance from Audio

Authors: Lidong Lu, Guo Chen, Zhu Wei, Yicheng Liu, Tong Lu

Comments: 15 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510] arXiv:2512.01989 [pdf, ps, other]: Title: PAI-Bench: A Comprehensive Benchmark For Physical AI

Authors: Fengzhe Zhou, Jiannan Huang, Jialuo Li, Deva Ramanan, Humphrey Shi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511] arXiv:2512.01988 [pdf, ps, other]: Title: Artemis: Structured Visual Reasoning for Perception Policy Learning

Authors: Wei Tang, Yanpeng Sun, Shan Zhang, Xiaofan Li, Piotr Koniusz, Wei Li, Na Zhao, Zechao Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512] arXiv:2512.01975 [pdf, ps, other]: Title: SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning

Authors: Xu Zhang, Jin Yuan, Hanwang Zhang, Guojin Zhong, Yongsheng Zang, Jiacheng Lin, Zhiyong Li

Comments: Accept by AAAI-2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513] arXiv:2512.01960 [pdf, ps, other]: Title: SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation

Authors: Zisu Li, Hengye Lyu, Jiaxin Shi, Yufeng Zeng, Mingming Fan, Hanwang Zhang, Chen Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[514] arXiv:2512.01952 [pdf, ps, other]: Title: GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment

Authors: Haoyang He, Jay Patrikar, Dong-Ki Kim, Max Smith, Daniel McGann, Ali-akbar Agha-mohammadi, Shayegan Omidshafiei, Sebastian Scherer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[515] arXiv:2512.01949 [pdf, ps, other]: Title: Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Authors: Zhongyu Yang, Dannong Xu, Wei Pang, Yingfang Yuan

Comments: Published in Transactions on Machine Learning Research, Project in this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516] arXiv:2512.01934 [pdf, ps, other]: Title: Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory

Authors: Chenyi Wang, Yanmao Man, Raymond Muller, Ming Li, Z. Berkay Celik, Ryan Gerdes, Jonathan Petit

Comments: Accepted to Annual Computer Security Applications Conference (ACSAC) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517] arXiv:2512.01922 [pdf, ps, other]: Title: Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding

Authors: Zahra Mahdavi, Zahra Khodakaramimaghsoud, Hooman Khaloo, Sina Bakhshandeh Taleshani, Erfan Hashemi, Javad Mirzapour Kaleybar, Omid Nejati Manzari

Journal-ref: Computers in Biology and Medicine (2026)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[518] arXiv:2512.01908 [pdf, ps, other]: Title: SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception

Authors: Gurmeher Khurana, Lan Wei, Dandan Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2512.01895 [pdf, ps, other]: Title: StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data

Authors: Avirup Dey, Vinay Namboodiri

Comments: 15 pages, 14 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520] arXiv:2512.01889 [pdf, ps, other]: Title: KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM

Authors: Zaid Nasser, Mikhail Iumanov, Tianhao Li, Maxim Popov, Jaafar Mahmoud, Malik Mohrat, Ilya Obrubov, Ekaterina Derevyanka, Ivan Sosin, Sergey Kolyubin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[521] arXiv:2512.01885 [pdf, ps, other]: Title: TransientTrack: Advanced Multi-Object Tracking and Classification of Cancer Cells with Transient Fluorescent Signals

Authors: Florian Bürger, Martim Dias Gomes, Nica Gutu, Adrián E. Granada, Noémie Moreau, Katarzyna Bozek

Comments: 13 pages, 7 figures, 2 tables. This work has been submitted to IEEE Transactions on Medical Imaging

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cell Behavior (q-bio.CB); Quantitative Methods (q-bio.QM)
[522] arXiv:2512.01853 [pdf, ps, other]: Title: COACH: Collaborative Agents for Contextual Highlighting -- A Multi-Agent Framework for Sports Video Analysis

Authors: Tsz-To Wong, Ching-Chun Huang, Hong-Han Shuai

Comments: Accepted by AAAI 2026 Workshop LaMAS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523] arXiv:2512.01850 [pdf, ps, other]: Title: Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching

Authors: Yue Pan, Tao Sun, Liyuan Zhu, Lucas Nunes, Iro Armeni, Jens Behley, Cyrill Stachniss

Comments: 22 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524] arXiv:2512.01843 [pdf, ps, other]: Title: PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models

Authors: Zeqing Wang, Keze Wang, Lei Zhang

Comments: 17 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525] arXiv:2512.01830 [pdf, ps, other]: Title: OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic

Authors: Songyan Zhang, Wenhui Huang, Zhan Chen, Chua Jiahao Collister, Qihang Huang, Chen Lv

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[526] arXiv:2512.01827 [pdf, ps, other]: Title: CauSight: Learning to Supersense for Visual Causal Discovery

Authors: Yize Zhang, Meiqi Chen, Sirui Chen, Bo Peng, Yanxi Zhang, Tianyu Li, Chaochao Lu

Comments: project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[527] arXiv:2512.01821 [pdf, ps, other]: Title: Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling

Authors: Meng Cao, Haokun Lin, Haoyuan Li, Haoran Tang, Rongtao Xu, Dong An, Xue Liu, Ian Reid, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[528] arXiv:2512.01816 [pdf, ps, other]: Title: Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Authors: Juanxi Tian, Siyuan Li, Conghui He, Lijun Wu, Cheng Tan

Comments: 35 pages, 12 figures, 10 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[529] arXiv:2512.01803 [pdf, ps, other]: Title: Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos

Authors: Xavier Thomas, Youngsun Lim, Ananya Srinivasan, Audrey Zheng, Deepti Ghadiyaram

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530] arXiv:2512.01789 [pdf, ps, other]: Title: SAM3-UNet: Simplified Adaptation of Segment Anything Model 3

Authors: Xinyu Xiong, Zihuang Wu, Lei Lu, Yufa Xia

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[531] arXiv:2512.01788 [pdf, ps, other]: Title: Learned Image Compression for Earth Observation: Implications for Downstream Segmentation Tasks

Authors: Christian Mollière, Iker Cumplido, Marco Zeulner, Lukas Liesenhoff, Matthias Schubert, Julia Gottfriedsen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2512.01774 [pdf, ps, other]: Title: Evaluating SAM2 for Video Semantic Segmentation

Authors: Syed Hesham Syed Ariff, Yun Liu, Guolei Sun, Jing Yang, Henghui Ding, Xue Geng, Xudong Jiang

Comments: 17 pages, 3 figures and 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[533] arXiv:2512.01771 [pdf, ps, other]: Title: Robust Rigid and Non-Rigid Medical Image Registration Using Learnable Edge Kernels

Authors: Ahsan Raza Siyal, Markus Haltmeier, Ruth Steiger, Malik Galijasevic, Elke Ruth Gizewski, Astrid Ellen Grams

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[534] arXiv:2512.01769 [pdf, ps, other]: Title: VideoScoop: A Non-Traditional Domain-Independent Framework For Video Analysis

Authors: Hafsa Billah

Comments: This is a report submitted as part of PhD proposal defense of Hafsa Billah

Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[535] arXiv:2512.01763 [pdf, ps, other]: Title: HiconAgent: History Context-aware Policy Optimization for GUI Agents

Authors: Xurui Zhou, Gongwei Chen, Yuquan Xie, Zaijing Li, Kaiwen Zhou, Shuai Wang, Shuo Yang, Zhuotao Tian, Rui Shao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[536] arXiv:2512.01755 [pdf, ps, other]: Title: FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing

Authors: Yucheng Liao, Jiajun Liang, Kaiqian Cui, Baoquan Zhao, Haoran Xie, Wei Liu, Qing Li, Xudong Mao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537] arXiv:2512.01707 [pdf, ps, other]: Title: StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

Authors: Daeun Lee, Subhojyoti Mukherjee, Branislav Kveton, Ryan A. Rossi, Viet Dac Lai, Seunghyun Yoon, Trung Bui, Franck Dernoncourt, Mohit Bansal

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[538] arXiv:2512.01701 [pdf, ps, other]: Title: SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation

Authors: Xiuli Bi, Die Xiao, Junchao Fan, Bin Xiao

Comments: Accepted in AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539] arXiv:2512.01686 [pdf, ps, other]: Title: DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models

Authors: Patrick Kwon, Chen Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[540] arXiv:2512.01681 [pdf, ps, other]: Title: Cross-Domain Validation of a Resection-Trained Self-Supervised Model on Multicentre Mesothelioma Biopsies

Authors: Farzaneh Seyedshahi, Francesca Damiola, Sylvie Lantuejoul, Ke Yuan, John Le Quesne

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541] arXiv:2512.01677 [pdf, ps, other]: Title: Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation

Authors: Haodong Yan, Hang Yu, Zhide Zhong, Weilin Yuan, Xin Gong, Zehang Luo, Chengxi Heyu, Junfeng Li, Wenxuan Song, Shunbo Zhou, Haoang Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542] arXiv:2512.01675 [pdf, ps, other]: Title: GRASP: Guided Residual Adapters with Sample-wise Partitioning

Authors: Felix Nützel, Mischa Dombrowski, Bernhard Kainz

Comments: 10 pages, 4 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[543] arXiv:2512.01665 [pdf, ps, other]: Title: Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery

Authors: Zhicheng Zhao, Yin Huang, Lingma Sun, Chenglong Li, Jin Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544] arXiv:2512.01657 [pdf, ps, other]: Title: DB-KAUNet: An Adaptive Dual Branch Kolmogorov-Arnold UNet for Retinal Vessel Segmentation

Authors: Hongyu Xu, Panpan Meng, Meng Wang, Dayu Hu, Liming Liang, Xiaoqi Sheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545] arXiv:2512.01643 [pdf, ps, other]: Title: ViT$^3$: Unlocking Test-Time Training in Vision

Authors: Dongchen Han, Yining Li, Tianyu Li, Zixuan Cao, Ziming Wang, Jun Song, Yu Cheng, Bo Zheng, Gao Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[546] arXiv:2512.01636 [pdf, ps, other]: Title: Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval

Authors: Xin Wang, Haipeng Zhang, Mang Li, Zhaohui Xia, Yueguo Chen, Yu Zhang, Chunyu Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[547] arXiv:2512.01629 [pdf, ps, other]: Title: SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge

Authors: Yumeng He, Ying Jiang, Jiayin Lu, Yin Yang, Chenfanfu Jiang

Comments: Project page: this https URL 17 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[548] arXiv:2512.01611 [pdf, ps, other]: Title: Depth Matching Method Based on ShapeDTW for Oil-Based Mud Imager

Authors: Fengfeng Li, Zhou Feng, Hongliang Wu, Hao Zhang, Han Tian, Peng Liu, Lixin Yuan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[549] arXiv:2512.01589 [pdf, ps, other]: Title: Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation

Authors: Thao Thi Phuong Dao, Tan-Cong Nguyen, Trong-Le Do, Truong Hoang Viet, Nguyen Chi Thanh, Huynh Nguyen Thuan, Do Vo Cong Nguyen, Minh-Khoi Pham, Mai-Khiem Tran, Viet-Tham Huynh, Trong-Thuan Nguyen, Trung-Nghia Le, Vo Thanh Toan, Tam V. Nguyen, Minh-Triet Tran, Thanh Dinh Le

Comments: The 2025 IEEE International Conference on Content-Based Multimedia Indexing (IEEE CBMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550] arXiv:2512.01582 [pdf, ps, other]: Title: RoleMotion: A Large-Scale Dataset towards Robust Scene-Specific Role-Playing Motion Synthesis with Fine-grained Descriptions

Authors: Junran Peng, Yiheng Huang, Silei Shen, Zeji Wei, Jingwei Yang, Baojie Wang, Yonghao He, Chuanchen Luo, Man Zhang, Xucheng Yin, Wei Sui

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

[ total of 778 entries: 1-50 | ... | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | ... | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 500

Tue, 2 Dec 2025 (showing first 50 of 278 entries)