Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 69

[ total of 778 entries: 1-25 | 20-44 | 45-69 | 70-94 | 95-119 | 120-144 | 145-169 | ... | 770-778 ]
[ showing 25 entries per page: fewer | more | all ]

Mon, 8 Dec 2025 (continued, showing last 25 of 94 entries)

[70] arXiv:2512.05268 [pdf, ps, other]: Title: CARD: Correlation Aware Restoration with Diffusion

Authors: Niki Nezakati, Arnab Ghosh, Amit Roy-Chowdhury, Vishwanath Saragadam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2512.05259 [pdf, ps, other]: Title: Age-Inclusive 3D Human Mesh Recovery for Action-Preserving Data Anonymization

Authors: Georgios Chatzichristodoulou, Niki Efthymiou, Panagiotis Filntisis, Georgios Pavlakos, Petros Maragos

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2512.05240 [pdf, ps, other]: Title: IE2Video: Adapting Pretrained Diffusion Models for Event-Based Video Reconstruction

Authors: Dmitrii Torbunov, Onur Okuducu, Yi Huang, Odera Dim, Rebecca Coles, Yonggang Cui, Yihui Ren

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2512.05209 [pdf, ps, other]: Title: DEAR: Dataset for Evaluating the Aesthetics of RenderingDEAR: Dataset for Evaluating the Aesthetics of Rendering

Authors: Vsevolod Plohotnuk, Artyom Panshin, Nikola Banić, Simone Bianco, Michael Freeman, Egor Ershov

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2512.05198 [pdf, ps, other]: Title: Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models

Authors: Rowan Bradbury, Dazhi Zhong

Comments: 16 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[75] arXiv:2512.05172 [pdf, ps, other]: Title: Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning

Authors: Wentao Wang, Chunyang Liu, Kehua Sheng, Bo Zhang, Yan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[76] arXiv:2512.05152 [pdf, ps, other]: Title: EFDiT: Efficient Fine-grained Image Generation Using Diffusion Transformer Models

Authors: Kun Wang, Donglin Di, Tonghua Su, Lei Fan

Comments: 6pages, 5figures, published to 2025 IEEE International Conference on Multimedia and Expo (ICME), Nantes, France, 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2512.05150 [pdf, ps, other]: Title: TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Authors: Zhenglin Cheng, Peng Sun, Jianguo Li, Tao Lin

Comments: arxiv v0

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2512.05145 [pdf, ps, other]: Title: Self-Improving VLM Judges Without Human Annotations

Authors: Inna Wanyin Lin, Yushi Hu, Shuyue Stella Li, Scott Geng, Pang Wei Koh, Luke Zettlemoyer, Tim Althoff, Marjan Ghazvininejad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2512.05140 [pdf, other]: Title: FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation

Authors: Georges Le Bellier (CEDRIC - VERTIGO, Cnam), Nicolas Audebert (LaSTIG, IGN, CEDRIC - VERTIGO)

Comments: 2026 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Mar 2026, Tucson (AZ), United States

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[80] arXiv:2512.05139 [pdf, ps, other]: Title: Spatiotemporal Satellite Image Downscaling with Transfer Encoders and Autoregressive Generative Models

Authors: Yang Xiang, Jingwen Zhong, Yige Yan, Petros Koutrakis, Eric Garshick, Meredith Franklin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[81] arXiv:2512.05137 [pdf, ps, other]: Title: ChromouVQA: Benchmarking Vision-Language Models under Chromatic Camouflaged Images

Authors: Yunfei Zhang, Yizhuo He, Yuanxun Shao, Zhengtao Yao, Haoyan Xu, Junhao Dong, Zhen Yao, Zhikang Dong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[82] arXiv:2512.05136 [pdf, ps, other]: Title: Fine-tuning an ECG Foundation Model to Predict Coronary CT Angiography Outcomes

Authors: Yujie Xiao, Gongzhen Tang, Deyun Zhang, Jun Li, Guangkun Nie, Haoyu Wang, Shun Huang, Tong Liu, Qinghao Zhao, Kangyin Chen, Shenda Hong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[83] arXiv:2512.05134 [pdf, ps, other]: Title: InvarDiff: Cross-Scale Invariance Caching for Accelerated Diffusion Models

Authors: Zihao Wu

Comments: 8 pages main, 8 pages appendix, 16 figures, 5 tables. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[84] arXiv:2512.05132 [pdf, ps, other]: Title: Breaking Scale Anchoring: Frequency Representation Learning for Accurate High-Resolution Inference from Low-Resolution Training

Authors: Wenshuo Wang, Fan Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[85] arXiv:2512.05131 [pdf, ps, other]: Title: AREA3D: Active Reconstruction Agent with Unified Feed-Forward 3D Perception and Vision-Language Guidance

Authors: Tianling Xu, Shengzhe Gan, Leslie Gu, Yuelei Li, Fangneng Zhan, Hanspeter Pfister

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[86] arXiv:2512.05959 (cross-list from cs.CL) [pdf, ps, other]: Title: M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG

Authors: David Anugraha, Patrick Amadeus Irawan, Anshul Singh, En-Shiun Annie Lee, Genta Indra Winata

Comments: Preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2512.05955 (cross-list from cs.RO) [pdf, ps, other]: Title: SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models

Authors: Haowen Liu, Shaoxiong Yao, Haonan Chen, Jiawei Gao, Jiayuan Mao, Jia-Bin Huang, Yilun Du

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2512.05932 (cross-list from cs.RO) [pdf, ps, other]: Title: Physically-Based Simulation of Automotive LiDAR

Authors: L. Dudzik, M. Roschani, A. Sielemann, K. Trampert, J. Ziehn, J. Beyerer, C. Neumann

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2512.05824 (cross-list from cs.AI) [pdf, ps, other]: Title: Multimodal Oncology Agent for IDH1 Mutation Prediction in Low-Grade Glioma

Authors: Hafsa Akebli (1), Adam Shephard (2), Vincenzo Della Mea (1), Nasir Rajpoot (2 and 3) ((1) University of Udine, Udine, Italy, (2) University of Warwick, Coventry, UK, (3) Histofy Ltd, Coventry, UK)

Comments: 4 pages, 2 figures

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2512.05812 (cross-list from cs.RO) [pdf, ps, other]: Title: Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation

Authors: Fabian Konstantinidis, Moritz Sackmann, Ulrich Hofmann, Christoph Stiller

Comments: This work has been submitted to the IEEE for possible publication

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2512.05665 (cross-list from cs.CL) [pdf, ps, other]: Title: Interleaved Latent Visual Reasoning with Selective Perceptual Modeling

Authors: Shuai Dong, Siyuan Wang, Xingyu Liu, Zhongyu Wei

Comments: 11 pages, 6 figures. Code available at this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2512.05438 (cross-list from cs.HC) [pdf, ps, other]: Title: EXR: An Interactive Immersive EHR Visualization in Extended Reality

Authors: Benoit Marteau, Shaun Q. Y. Tan, Jieru Li, Andrew Hornback, Yishan Zhong, Shaunna Wang, Christian Lowson, Jason Woloff, Joshua M. Pahys, Steven W. Hwang, Coleman Hilton, May D. Wang

Comments: 11 pages, 6 figures. Preprint version. This paper has been accepted to IEEE ICIR 2025. This is the author-prepared version and not the final published version. The final version will appear in IEEE Xplo

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[93] arXiv:2512.05299 (cross-list from eess.SY) [pdf, ps, other]: Title: ARCAS: An Augmented Reality Collision Avoidance System with SLAM-Based Tracking for Enhancing VRU Safety

Authors: Ahmad Yehia, Jiseop Byeon, Tianyi Wang, Huihai Wang, Yiming Xu, Junfeng Jiao, Christian Claudel

Comments: 8 pages, 3 figures, 1 table

Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Robotics (cs.RO); Image and Video Processing (eess.IV)
[94] arXiv:2512.05126 (cross-list from eess.AS) [pdf, ps, other]: Title: SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model

Authors: Kaidi Wang, Yi He, Wenhao Guan, Weijie Wu, Hongwu Ding, Xiong Zhang, Di Wu, Meng Meng, Jian Luan, Lin Li, Qingyang Hong

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)

[ total of 778 entries: 1-25 | 20-44 | 45-69 | 70-94 | 95-119 | 120-144 | 145-169 | ... | 770-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 69

Mon, 8 Dec 2025 (continued, showing last 25 of 94 entries)