Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

[ total of 754 entries: 1-25 | ... | 62-86 | 87-111 | 112-136 | 137-161 | 162-186 | 187-211 | 212-236 | ... | 737-754 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 10 Dec 2025 (continued, showing 25 of 131 entries)

[137] arXiv:2512.08930 [pdf, ps, other]: Title: Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment

Authors: Youming Deng, Songyou Peng, Junyi Zhang, Kathryn Heal, Tiancheng Sun, John Flynn, Steve Marschner, Lucy Chai

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[138] arXiv:2512.08924 [pdf, ps, other]: Title: Efficiently Reconstructing Dynamic Scenes One D4RT at a Time

Authors: Chuhan Zhang, Guillaume Le Moing, Skanda Koppula, Ignacio Rocco, Liliane Momeni, Junyu Xie, Shuyang Sun, Rahul Sukthankar, Joëlle K. Barral, Raia Hadsell, Zoubin Ghahramani, Andrew Zisserman, Junlin Zhang, Mehdi S. M. Sajjadi

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2512.08922 [pdf, ps, other]: Title: Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration

Authors: Jin Hyeon Kim, Paul Hyunbin Cho, Claire Kim, Jaewon Min, Jaeeun Lee, Jihye Park, Yeji Choi, Seungryong Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2512.08912 [pdf, ps, other]: Title: LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception

Authors: Simon de Moreau, Andrei Bursuc, Hafid El-Idrissi, Fabien Moutarde

Comments: Preprint. 12 pages, 9 figures. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[141] arXiv:2512.08905 [pdf, ps, other]: Title: Self-Evolving 3D Scene Generation from a Single Image

Authors: Kaizhi Zheng, Yue Fan, Jing Gu, Zishuo Xu, Xuehai He, Xin Eric Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2512.08897 [pdf, ps, other]: Title: UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation

Authors: Zeyang Liu, Le Wang, Sanping Zhou, Yuxuan Wu, Xiaolong Sun, Gang Hua, Haoxiang Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2512.08889 [pdf, ps, other]: Title: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

Authors: Damiano Marsili, Georgia Gkioxari

Comments: Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[144] arXiv:2512.08888 [pdf, ps, other]: Title: Accelerated Rotation-Invariant Convolution for UAV Image Segmentation

Authors: Manduhu Manduhu, Alexander Dow, Gerard Dooly, James Riordan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145] arXiv:2512.08881 [pdf, ps, other]: Title: SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

Authors: Aysim Toker, Andreea-Maria Oncescu, Roy Miles, Ismail Elezi, Jiankang Deng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2512.08873 [pdf, ps, other]: Title: Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning

Authors: Jing Jie Tan, Anissa Mokraoui, Ban-Hoe Kwan, Danny Wee-Kiat Ng, Yan-Chai Hum

Comments: 6 pages

Journal-ref: 2024 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[147] arXiv:2512.08860 [pdf, ps, other]: Title: Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference

Authors: Amit Bendkhale

Comments: 6 pages, 3 figures. Code and data: this https URL Accepted to the AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2512.08854 [pdf, ps, other]: Title: Generation is Required for Data-Efficient Perception

Authors: Jack Brady, Bernhard Schölkopf, Thomas Kipf, Simon Buchholz, Wieland Brendel

Comments: Preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149] arXiv:2512.08829 [pdf, ps, other]: Title: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Authors: Hongyuan Tao, Bencheng Liao, Shaoyu Chen, Haoran Yin, Qian Zhang, Wenyu Liu, Xinggang Wang

Comments: 16 pages, 8 figures, conference or other essential info

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150] arXiv:2512.08820 [pdf, ps, other]: Title: Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning

Authors: Yi Zhang, Chun-Wun Cheng, Junyi He, Ke Yu, Yushun Tang, Carola-Bibiane Schönlieb, Zhihai He, Angelica I. Aviles-Rivero

Comments: Accepted in IEEE Transactions on Multimedia (TMM)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151] arXiv:2512.08789 [pdf, ps, other]: Title: MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance

Authors: Chaewon Kim, Seoyeon Lee, Jonghyuk Park

Comments: 10 pages, 7 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152] arXiv:2512.08785 [pdf, ps, other]: Title: LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models

Authors: Yiming Hao, Mutian Xu, Chongjie Ye, Jie Qin, Shunlin Lu, Yipeng Qin, Xiaoguang Han

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153] arXiv:2512.08774 [pdf, ps, other]: Title: Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps

Authors: Seoyeon Lee, Gwangyeol Yu, Chaewon Kim, Jonghyuk Park

Comments: 10 pages, 9 figures, 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[154] arXiv:2512.08765 [pdf, ps, other]: Title: Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Authors: Ruihang Chu, Yefei He, Zhekai Chen, Shiwei Zhang, Xiaogang Xu, Bin Xia, Dingdong Wang, Hongwei Yi, Xihui Liu, Hengshuang Zhao, Yu Liu, Yingya Zhang, Yujiu Yang

Comments: NeurlPS 2025. Code and data available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155] arXiv:2512.08751 [pdf, ps, other]: Title: Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices

Authors: Kuniko Paxton, Koorosh Aslansefat, Dhavalkumar Thakker, Yiannis Papadopoulos

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[156] arXiv:2512.08747 [pdf, ps, other]: Title: A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation

Authors: Artúr I. Károly, Péter Galambos

Comments: 20 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2512.08738 [pdf, ps, other]: Title: Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture

Authors: Samuel Ebimobowei Johnny, Blessed Guda, Emmanuel Enejo Aaron, Assane Gueye

Comments: To appear at AACL-IJCNLP 2025 Workshop WSLP

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[158] arXiv:2512.08733 [pdf, ps, other]: Title: Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting

Authors: Kuniko Paxton, Zeinab Dehghani, Koorosh Aslansefat, Dhavalkumar Thakker, Yiannis Papadopoulos

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[159] arXiv:2512.08730 [pdf, ps, other]: Title: SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images

Authors: Kaiyu Li, Shengqi Zhang, Yupeng Deng, Zhi Wang, Deyu Meng, Xiangyong Cao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2512.08700 [pdf, ps, other]: Title: Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth

Authors: Kyumin Hwang, Wonhyeok Choi, Kiljoon Han, Wonjoon Choi, Minwoo Choi, Yongcheon Na, Minwoo Park, Sunghoon Im

Comments: Accepted at IEEE Robotics and Automation Letters (RA-L) 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161] arXiv:2512.08697 [pdf, ps, other]: Title: What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance

Authors: Athena Psalta, Vasileios Tsironis, Konstantinos Karantzalos

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 754 entries: 1-25 | ... | 62-86 | 87-111 | 112-136 | 137-161 | 162-186 | 187-211 | 212-236 | ... | 737-754 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

Wed, 10 Dec 2025 (continued, showing 25 of 131 entries)