Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 4

[ total of 737 entries: 1-25 | 5-29 | 30-54 | 55-79 | 80-104 | ... | 730-737 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 12 Dec 2025 (continued, showing 25 of 118 entries)

[5] arXiv:2512.10955 [pdf, ps, other]: Title: Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization

Authors: Tsai-Shien Chen, Aliaksandr Siarohin, Guocheng Gordon Qian, Kuan-Chieh Jackson Wang, Egor Nemchinov, Moayed Haji-Ali, Riza Alp Guler, Willi Menapace, Ivan Skorokhodov, Anil Kag, Jun-Yan Zhu, Sergey Tulyakov

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6] arXiv:2512.10954 [pdf, ps, other]: Title: Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration

Authors: Sicheng Mo, Thao Nguyen, Richard Zhang, Nick Kolkin, Siddharth Srinivasan Iyer, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7] arXiv:2512.10950 [pdf, ps, other]: Title: E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training

Authors: Qitao Zhao, Hao Tan, Qianqian Wang, Sai Bi, Kai Zhang, Kalyan Sunkavalli, Shubham Tulsiani, Hanwen Jiang

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8] arXiv:2512.10949 [pdf, ps, other]: Title: Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Authors: Yiwen Tang, Zoey Guo, Kaixin Zhu, Ray Zhang, Qizhi Chen, Dongzhi Jiang, Junli Liu, Bohan Zeng, Haoming Song, Delin Qu, Tianyi Bai, Dan Xu, Wentao Zhang, Bin Zhao

Comments: Code is released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[9] arXiv:2512.10948 [pdf, ps, other]: Title: ClusIR: Towards Cluster-Guided All-in-One Image Restoration

Authors: Shengkai Hu, Jiaqi Ma, Jun Wan, Wenwen Min, Yongcheng Jing, Lefei Zhang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10] arXiv:2512.10947 [pdf, ps, other]: Title: Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving

Authors: Jiawei Yang, Ziyu Chen, Yurong You, Yan Wang, Yiming Li, Yuxiao Chen, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11] arXiv:2512.10945 [pdf, ps, other]: Title: MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation

Authors: Henghui Ding, Chang Liu, Shuting He, Kaining Ying, Xudong Jiang, Chen Change Loy, Yu-Gang Jiang

Comments: IEEE TPAMI, Project Page: this https URL

Journal-ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 12, pp. 11400-11416, 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12] arXiv:2512.10943 [pdf, ps, other]: Title: AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation

Authors: Sharath Girish, Viacheslav Ivanov, Tsai-Shien Chen, Hao Chen, Aliaksandr Siarohin, Sergey Tulyakov

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[13] arXiv:2512.10942 [pdf, ps, other]: Title: VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Authors: Delong Chen, Mustafa Shukor, Theo Moutakanni, Willy Chung, Jade Yu, Tejaswi Kasarla, Allen Bolourchi, Yann LeCun, Pascale Fung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14] arXiv:2512.10941 [pdf, ps, other]: Title: Mull-Tokens: Modality-Agnostic Latent Thinking

Authors: Arijit Ray, Ahmed Abdelkader, Chengzhi Mao, Bryan A. Plummer, Kate Saenko, Ranjay Krishna, Leonidas Guibas, Wen-Sheng Chu

Comments: Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[15] arXiv:2512.10940 [pdf, ps, other]: Title: OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Authors: Xiang Fan, Sharath Girish, Vivek Ramanujan, Chaoyang Wang, Ashkan Mirzaei, Petr Sushko, Aliaksandr Siarohin, Sergey Tulyakov, Ranjay Krishna

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[16] arXiv:2512.10939 [pdf, ps, other]: Title: GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting

Authors: Madhav Agarwal, Mingtian Zhang, Laura Sevilla-Lara, Steven McDonagh

Comments: IEEE/CVF Winter Conference on Applications of Computer Vision 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2512.10935 [pdf, ps, other]: Title: Any4D: Unified Feed-Forward Metric 4D Reconstruction

Authors: Jay Karhade, Nikhil Keetha, Yuchen Zhang, Tanisha Gupta, Akash Sharma, Sebastian Scherer, Deva Ramanan

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[18] arXiv:2512.10932 [pdf, ps, other]: Title: BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

Authors: Shengao Wang, Wenqi Wang, Zecheng Wang, Max Whitton, Michael Wakeham, Arjun Chandra, Joey Huang, Pengyue Zhu, Helen Chen, David Li, Jeffrey Li, Shawn Li, Andrew Zagula, Amy Zhao, Andrew Zhu, Sayaka Nakamura, Yuki Yamamoto, Jerry Jun Yokono, Aaron Mueller, Bryan A. Plummer, Kate Saenko, Venkatesh Saligrama, Boqing Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[19] arXiv:2512.10927 [pdf, ps, other]: Title: FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Authors: Yulu Gan, Ligeng Zhu, Dandan Shan, Baifeng Shi, Hongxu Yin, Boris Ivanovic, Song Han, Trevor Darrell, Jitendra Malik, Marco Pavone, Boyi Li

Comments: Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[20] arXiv:2512.10894 [pdf, ps, other]: Title: DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance

Authors: Peiying Zhang, Nanxuan Zhao, Matthew Fisher, Yiran Xu, Jing Liao, Difan Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21] arXiv:2512.10888 [pdf, ps, other]: Title: PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

Authors: Brandon Smock, Valerie Faucon-Morin, Max Sokolov, Libin Liang, Tayyibah Khanam, Maury Courtland

Comments: 15 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22] arXiv:2512.10881 [pdf, ps, other]: Title: MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Authors: Kehong Gong, Zhengyu Wen, Weixia He, Mingxi Xu, Qi Wang, Ning Zhang, Zhengyu Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2512.10867 [pdf, ps, other]: Title: From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models

Authors: Zongzhao Li, Xiangzhe Kong, Jiahui Su, Zongyang Ma, Mingze Li, Songyou Li, Yuelin Zhang, Yu Rong, Tingyang Xu, Deli Zhao, Wenbing Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2512.10863 [pdf, ps, other]: Title: MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Authors: Jingli Lin, Runsen Xu, Shaohao Zhu, Sihan Yang, Peizhou Cao, Yunlong Ran, Miao Hu, Chenming Zhu, Yiman Xie, Yilin Long, Wenbo Hu, Dahua Lin, Tai Wang, Jiangmiao Pang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25] arXiv:2512.10860 [pdf, ps, other]: Title: SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation

Authors: Kehong Gong, Zhengyu Wen, Mingxi Xu, Weixia He, Qi Wang, Ning Zhang, Zhengyu Li, Chenbin Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2512.10840 [pdf, ps, other]: Title: PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning

Authors: Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[27] arXiv:2512.10818 [pdf, ps, other]: Title: Self-Ensemble Post Learning for Noisy Domain Generalization

Authors: Wang Lu, Jindong Wang

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28] arXiv:2512.10808 [pdf, ps, other]: Title: Graph Laplacian Transformer with Progressive Sampling for Prostate Cancer Grading

Authors: Masum Shah Junayed, John Derek Van Vessem, Qian Wan, Gahie Nam, Sheida Nabavi

Comments: International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29] arXiv:2512.10794 [pdf, ps, other]: Title: What matters for Representation Alignment: Global Information or Spatial Structure?

Authors: Jaskirat Singh, Xingjian Leng, Zongze Wu, Liang Zheng, Richard Zhang, Eli Shechtman, Saining Xie

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)

[ total of 737 entries: 1-25 | 5-29 | 30-54 | 55-79 | 80-104 | ... | 730-737 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 4

Fri, 12 Dec 2025 (continued, showing 25 of 118 entries)