Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 70

Total of 747 entries; showing entries 71-120 (50 per page)

Mon, 15 Dec 2025 (continued, showing last 34 of 104 entries)

[71] arXiv:2512.11226 [pdf, ps, other]: Title: FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model

Authors: Hongbin Lin, Yiming Yang, Yifan Zhang, Chaoda Zheng, Jie Feng, Sheng Wang, Zhennan Wang, Shijia Chen, Boyang Wang, Yu Zhang, Xianming Liu, Shuguang Cui, Zhen Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2512.11225 [pdf, ps, other]: Title: VFMF: World Modeling by Forecasting Vision Foundation Model Features

Authors: Gabrijel Boduljak, Yushi Lan, Christian Rupprecht, Andrea Vedaldi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[73] arXiv:2512.11215 [pdf, ps, other]: Title: SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection

Authors: Tianye Qi, Weihao Li, Nick Barnes

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2512.11203 [pdf, ps, other]: Title: AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Authors: Zhengyang Yu, Akio Hayakawa, Masato Ishii, Qingtao Yu, Takashi Shibuya, Jing Zhang, Yuki Mitsufuji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2512.11199 [pdf, ps, other]: Title: CADKnitter: Compositional CAD Generation from Text and Geometry Guidance

Authors: Tri Le, Khang Nguyen, Baoru Huang, Tung D. Ta, Anh Nguyen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[76] arXiv:2512.11189 [pdf, ps, other]: Title: Multi-task Learning with Extended Temporal Shift Module for Temporal Action Localization

Authors: Anh-Kiet Duong, Petra Gomez-Krämer

Comments: BinEgo360@ICCV25

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2512.11186 [pdf, ps, other]: Title: Lightweight 3D Gaussian Splatting Compression via Video Codec

Authors: Qi Yang, Geert Van Der Auwera, Zhu Li

Comments: Accepted by DCC2026 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2512.11167 [pdf, ps, other]: Title: Image Tiling for High-Resolution Reasoning: Balancing Local Detail with Global Context

Authors: Anatole Jacquin de Margerie, Alexis Roger, Irina Rish

Comments: Accepted in AAAI 2025 Workshop on Reproducible AI

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79] arXiv:2512.11141 [pdf, ps, other]: Title: Learning complete and explainable visual representations from itemized text supervision

Authors: Yiwei Lyu, Chenhui Zhao, Soumyanil Banerjee, Shixuan Liu, Akshay Rao, Akhil Kondepudi, Honglak Lee, Todd C. Hollon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2512.11130 [pdf, ps, other]: Title: Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Authors: Bowen Wen, Shaurya Dewan, Stan Birchfield

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[81] arXiv:2512.11121 [pdf, ps, other]: Title: Learning from a Generative Oracle: Domain Adaptation for Restoration

Authors: Yuyang Hu, Mojtaba Sahraee-Ardakan, Arpit Bansal, Kangfu Mei, Christian Qi, Peyman Milanfar, Mauricio Delbracio

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[82] arXiv:2512.11104 [pdf, ps, other]: Title: Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization

Authors: Brennan Flannery, Thomas DeSilvio, Jane Nguyen, Satish E. Viswanath

Comments: 29 Pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2512.11099 [pdf, ps, other]: Title: VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction

Authors: Weitai Kang, Jason Kuen, Mengwei Ren, Zijun Wei, Yan Yan, Kangning Liu

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2512.11098 [pdf, ps, other]: Title: Vision-Language Models for Infrared Industrial Sensing in Additive Manufacturing Scene Description

Authors: Nazanin Mahjourian, Vinh Nguyen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[85] arXiv:2512.11076 [pdf, ps, other]: Title: E-CHUM: Event-based Cameras for Human Detection and Urban Monitoring

Authors: Jack Brady, Andrew Dailey, Kristen Schang, Zo Vic Shong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[86] arXiv:2512.11061 [pdf, ps, other]: Title: VDAWorld: World Modelling via VLM-Directed Abstraction and Simulation

Authors: Felix O'Mahony, Roberto Cipolla, Ayush Tewari

Comments: Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2512.11060 [pdf, ps, other]: Title: Synthetic Vasculature and Pathology Enhance Vision-Language Model Reasoning

Authors: Chenjun Li, Cheng Wan, Laurin Lux, Alexander Berger, Richard B. Rosen, Martin J. Menten, Johannes C. Paetzold

Comments: 23 pages, 8 figures, 6 tables. Full paper under review for MIDL 2026 (Medical Imaging with Deep Learning)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2512.11057 [pdf, ps, other]: Title: Weakly Supervised Tuberculosis Localization in Chest X-rays through Knowledge Distillation

Authors: Marshal Ashif Shawkat, Moidul Hasan, Taufiq Hasan

Comments: 18 pages, 9 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2512.11016 [pdf, ps, other]: Title: SoccerMaster: A Vision Foundation Model for Soccer Understanding

Authors: Haolin Yang, Jiayuan Rao, Haoning Wu, Weidi Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[90] arXiv:2512.11015 [pdf, ps, other]: Title: Leveraging Text Guidance for Enhancing Demographic Fairness in Gender Classification

Authors: Anoop Krishnan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[91] arXiv:2512.11797 (cross-list from cs.RO) [pdf, ps, other]: Title: AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis

Authors: Junjie Ye, Rong Xue, Basile Van Hoorick, Pavel Tokmakov, Muhammad Zubair Irshad, Yue Wang, Vitor Guizilini

Comments: Project page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2512.11745 (cross-list from eess.IV) [pdf, ps, other]: Title: mViSE: A Visual Search Engine for Analyzing Multiplex IHC Brain Tissue Images

Authors: Liqiang Huang, Rachel W. Mills, Saikiran Mandula, Lin Bai, Mahtab Jeyhani, John Redell, Hien Van Nguyen, Saurabh Prasad, Dragan Maric, Badrinath Roysam

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2512.11695 (cross-list from physics.flu-dyn) [pdf, ps, other]: Title: Particle Image Velocimetry Refinement via Consensus ADMM

Authors: Alan Bonomi, Francesco Banelli, Antonio Terpin

Comments: Code: this https URL

Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
[94] arXiv:2512.11676 (cross-list from math.PR) [pdf, ps, other]: Title: Stochastics of shapes and Kunita flows

Authors: Stefan Sommer, Gefan Yang, Elizabeth Louise Baker

Subjects: Probability (math.PR); Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2512.11582 (cross-list from cs.LG) [pdf, ps, other]: Title: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Authors: Sam Gijsen, Marc-Andre Schulz, Kerstin Ritter

Comments: Code and pretrained models available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[96] arXiv:2512.11532 (cross-list from cs.DC) [pdf, ps, other]: Title: Parallax: Runtime Parallelization for Operator Fallbacks in Heterogeneous Edge Systems

Authors: Chong Tang, Hao Dai, Jagmohan Chauhan

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2512.11433 (cross-list from cs.AI) [pdf, other]: Title: Back to the Baseline: Examining Baseline Effects on Explainability Metrics

Authors: Agustin Martin Picard (ANITI), Thibaut Boissin (ANITI), Varshini Subhash, Rémi Cadène (SU), Thomas Fel (ANITI)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2512.11399 (cross-list from cs.CL) [pdf, ps, other]: Title: Minimal Clips, Maximum Salience: Long Video Summarization via Key Moment Extraction

Authors: Galann Pennec, Zhengyuan Liu, Nicholas Asher, Philippe Muller, Nancy F. Chen

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2512.11243 (cross-list from cs.LG) [pdf, ps, other]: Title: Task-Aware Multi-Expert Architecture For Lifelong Deep Learning

Authors: Jianyu Wang, Jacob Nean-Hua Sheikh, Cat P. Le, Hoda Bidkhori

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2512.11218 (cross-list from cs.RO) [pdf, ps, other]: Title: Seeing to Act, Prompting to Specify: A Bayesian Factorization of Vision Language Action Policy

Authors: Kechun Xu, Zhenjie Zhu, Anzhe Chen, Shuqi Zhao, Qing Huang, Yifei Yang, Haojian Lu, Rong Xiong, Masayoshi Tomizuka, Yue Wang

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2512.11194 (cross-list from cs.LG) [pdf, ps, other]: Title: Beyond Memorization: Gradient Projection Enables Selective Learning in Diffusion Models

Authors: Divya Kothandaraman, Jaclyn Pytlarz

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2512.11145 (cross-list from cs.LG) [pdf, ps, other]: Title: Autoencoder-based Semi-Supervised Dimensionality Reduction and Clustering for Scientific Ensembles

Authors: Lennard Manuel, Hamid Gadirov, Steffen Frey

Comments: Research Internship Project

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2512.11047 (cross-list from cs.RO) [pdf, ps, other]: Title: WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control

Authors: Haoran Jiang, Jin Chen, Qingwen Bu, Li Chen, Modi Shi, Yanjie Zhang, Delong Li, Chuanzhe Suo, Chuang Wang, Zhihui Peng, Hongyang Li

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2512.10966 (cross-list from cs.LG) [pdf, ps, other]: Title: Multimodal Fusion of Regional Brain Experts for Interpretable Alzheimer's Disease Diagnosis

Authors: Farica Zhuang, Dinara Aliyeva, Shu Yang, Zixuan Wen, Duy Duong-Tran, Christos Davatzikos, Tianlong Chen, Song Wang, Li Shen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Fri, 12 Dec 2025 (showing first 16 of 118 entries)

[105] arXiv:2512.10959 [pdf, ps, other]: Title: StereoSpace: Depth-Free Synthesis of Stereo Geometry via End-to-End Diffusion in a Canonical Space

Authors: Tjark Behrens, Anton Obukhov, Bingxin Ke, Fabio Tosi, Matteo Poggi, Konrad Schindler

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2512.10958 [pdf, ps, other]: Title: WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Authors: Ao Liang, Lingdong Kong, Tianyi Yan, Hongsi Liu, Wesley Yang, Ziqi Huang, Wei Yin, Jialong Zuo, Yixuan Hu, Dekai Zhu, Dongyue Lu, Youquan Liu, Guangfeng Jiang, Linfeng Li, Xiangtai Li, Long Zhuo, Lai Xing Ng, Benoit R. Cottereau, Changxin Gao, Liang Pan, Wei Tsang Ooi, Ziwei Liu

Comments: Preprint; 80 pages, 37 figures, 29 tables; Project Page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2512.10957 [pdf, ps, other]: Title: SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model

Authors: Yukai Shi, Weiyu Li, Zihao Wang, Hongyang Li, Xingyu Chen, Ping Tan, Lei Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[108] arXiv:2512.10956 [pdf, ps, other]: Title: Empowering Dynamic Urban Navigation with Stereo and Mid-Level Vision

Authors: Wentao Zhou, Xuweiyi Chen, Vignesh Rajagopal, Jeffrey Chen, Rohan Chandra, Zezhou Cheng

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2512.10955 [pdf, ps, other]: Title: Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization

Authors: Tsai-Shien Chen, Aliaksandr Siarohin, Guocheng Gordon Qian, Kuan-Chieh Jackson Wang, Egor Nemchinov, Moayed Haji-Ali, Riza Alp Guler, Willi Menapace, Ivan Skorokhodov, Anil Kag, Jun-Yan Zhu, Sergey Tulyakov

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110] arXiv:2512.10954 [pdf, ps, other]: Title: Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration

Authors: Sicheng Mo, Thao Nguyen, Richard Zhang, Nick Kolkin, Siddharth Srinivasan Iyer, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2512.10950 [pdf, ps, other]: Title: E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training

Authors: Qitao Zhao, Hao Tan, Qianqian Wang, Sai Bi, Kai Zhang, Kalyan Sunkavalli, Shubham Tulsiani, Hanwen Jiang

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2512.10949 [pdf, ps, other]: Title: Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Authors: Yiwen Tang, Zoey Guo, Kaixin Zhu, Ray Zhang, Qizhi Chen, Dongzhi Jiang, Junli Liu, Bohan Zeng, Haoming Song, Delin Qu, Tianyi Bai, Dan Xu, Wentao Zhang, Bin Zhao

Comments: Code is released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113] arXiv:2512.10948 [pdf, ps, other]: Title: ClusIR: Towards Cluster-Guided All-in-One Image Restoration

Authors: Shengkai Hu, Jiaqi Ma, Jun Wan, Wenwen Min, Yongcheng Jing, Lefei Zhang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2512.10947 [pdf, ps, other]: Title: Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving

Authors: Jiawei Yang, Ziyu Chen, Yurong You, Yan Wang, Yiming Li, Yuxiao Chen, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2512.10945 [pdf, ps, other]: Title: MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation

Authors: Henghui Ding, Chang Liu, Shuting He, Kaining Ying, Xudong Jiang, Chen Change Loy, Yu-Gang Jiang

Comments: IEEE TPAMI, Project Page: this https URL

Journal-ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 12, pp. 11400-11416, 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2512.10943 [pdf, ps, other]: Title: AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation

Authors: Sharath Girish, Viacheslav Ivanov, Tsai-Shien Chen, Hao Chen, Aliaksandr Siarohin, Sergey Tulyakov

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[117] arXiv:2512.10942 [pdf, ps, other]: Title: VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Authors: Delong Chen, Mustafa Shukor, Theo Moutakanni, Willy Chung, Jade Yu, Tejaswi Kasarla, Allen Bolourchi, Yann LeCun, Pascale Fung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2512.10941 [pdf, ps, other]: Title: Mull-Tokens: Modality-Agnostic Latent Thinking

Authors: Arijit Ray, Ahmed Abdelkader, Chengzhi Mao, Bryan A. Plummer, Kate Saenko, Ranjay Krishna, Leonidas Guibas, Wen-Sheng Chu

Comments: Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[119] arXiv:2512.10940 [pdf, ps, other]: Title: OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Authors: Xiang Fan, Sharath Girish, Vivek Ramanujan, Chaoyang Wang, Ashkan Mirzaei, Petr Sushko, Aliaksandr Siarohin, Sergey Tulyakov, Ranjay Krishna

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[120] arXiv:2512.10939 [pdf, ps, other]: Title: GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting

Authors: Madhav Agarwal, Mingtian Zhang, Laura Sevilla-Lara, Steven McDonagh

Comments: IEEE/CVF Winter Conference on Applications of Computer Vision 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)


