Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 88

[ total of 737 entries: 1-100 | 89-188 | 189-288 | 289-388 | 389-488 | ... | 689-737 ]

Fri, 12 Dec 2025 (continued, showing last 30 of 118 entries)

[89] arXiv:2512.10262 [pdf, ps, other]: Title: VLM-NCD: Novel Class Discovery with Vision-Based Large Language Models

Authors: Yuetong Su, Baoguo Wei, Xinyu Wang, Xu Li, Lixin Li

Comments: 8 pages, 5 figures, conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2512.10252 [pdf, ps, other]: Title: GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule

Authors: Rui Wang, Yimu Sun, Jingxing Guo, Huisi Wu, Jing Qin

Comments: Accepted to ICCV 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2512.10251 [pdf, ps, other]: Title: THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose

Authors: Eunho Lee, Chaehyeon Song, Seunghoon Jeong, Ayoung Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2512.10248 [pdf, ps, other]: Title: RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection

Authors: Zhuo Wang, Xiliang Liu, Ligang Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[93] arXiv:2512.10244 [pdf, ps, other]: Title: Solving Semi-Supervised Few-Shot Learning from an Auto-Annotation Perspective

Authors: Tian Liu, Anwesha Basu, James Caverlee, Shu Kong

Comments: website and code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94] arXiv:2512.10237 [pdf, ps, other]: Title: Multi-dimensional Preference Alignment by Conditioning Reward Itself

Authors: Jiho Jang, Jinyoung Kim, Kyungjune Baek, Nojun Kwak

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2512.10230 [pdf, ps, other]: Title: Emerging Standards for Machine-to-Machine Video Coding

Authors: Md Eimran Hossain Eimon, Velibor Adzic, Hari Kalva, Borko Furht

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96] arXiv:2512.10226 [pdf, ps, other]: Title: Latent Chain-of-Thought World Modeling for End-to-End Driving

Authors: Shuhan Tan, Kashyap Chitta, Yuxiao Chen, Ran Tian, Yurong You, Yan Wang, Wenjie Luo, Yulong Cao, Philipp Krahenbuhl, Marco Pavone, Boris Ivanovic

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[97] arXiv:2512.10209 [pdf, ps, other]: Title: Feature Coding for Scalable Machine Vision

Authors: Md Eimran Hossain Eimon, Juan Merlos, Ashan Perera, Hari Kalva, Velibor Adzic, Borko Furht

Comments: This article has been accepted for publication in IEEE Consumer Electronics Magazine

Journal-ref: 2025 IEEE Consumer Electronics Magazine

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2512.10151 [pdf, ps, other]: Title: Topological Conditioning for Mammography Models via a Stable Wavelet-Persistence Vectorization

Authors: Charles Fanning, Mehmet Emin Aktas

Comments: 8 Pages, 2 Figures, submitted to IEEE Transactions on Medical Imaging

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2512.10102 [pdf, ps, other]: Title: Hierarchical Instance Tracking to Balance Privacy Preservation with Accessible Information

Authors: Neelima Prasad, Jarek Reynolds, Neel Karsanbhai, Tanusree Sharma, Lotus Zhang, Abigale Stangl, Yang Wang, Leah Findlater, Danna Gurari

Comments: Accepted at WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2512.10095 [pdf, ps, other]: Title: TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing

Authors: Jiachen Tao, Junyi Wu, Haoxuan Wang, Zongxin Yang, Dawen Cai, Yan Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2512.10067 [pdf, ps, other]: Title: Independent Density Estimation

Authors: Jiahao Liu

Comments: 10 pages, 1 table, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102] arXiv:2512.10041 [pdf, ps, other]: Title: MetaVoxel: Joint Diffusion Modeling of Imaging and Clinical Metadata

Authors: Yihao Liu, Chenyu Gao, Lianrui Zuo, Michael E. Kim, Brian D. Boyd, Lisa L. Barnes, Walter A. Kukull, Lori L. Beason-Held, Susan M. Resnick, Timothy J. Hohman, Warren D. Taylor, Bennett A. Landman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[103] arXiv:2512.10038 [pdf, ps, other]: Title: Diffusion Is Your Friend in Show, Suggest and Tell

Authors: Jia Cheng Hu, Roberto Cavicchioli, Alessandro Capotondi

Journal-ref: 2025 IEEE International Conference on Big Data

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[104] arXiv:2512.10031 [pdf, ps, other]: Title: ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects

Authors: Woojin Lee, Hyugjae Chang, Jaeho Moon, Jaehyup Lee, Munchurl Kim

Comments: 17 pages, 11 figures, 8 tables, supplementary included. Accepted to CVPR 2025. Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[105] arXiv:2512.09969 [pdf, ps, other]: Title: Neuromorphic Eye Tracking for Low-Latency Pupil Detection

Authors: Paul Hueber, Luca Peres, Florian Pitters, Alejandro Gloriani, Oliver Rhodes

Comments: 8 pages, 2 figures, conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[106] arXiv:2512.10953 (cross-list from cs.LG) [pdf, ps, other]: Title: Bidirectional Normalizing Flow: From Data to Noise and Back

Authors: Yiyang Lu, Qiao Sun, Xianbang Wang, Zhicheng Jiang, Hanhong Zhao, Kaiming He

Comments: Tech report

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2512.10938 (cross-list from cs.LG) [pdf, ps, other]: Title: Stronger Normalization-Free Transformers

Authors: Mingzhi Chen, Taiming Lu, Jiachen Zhu, Mingjie Sun, Zhuang Liu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2512.10821 (cross-list from cs.AI) [pdf, ps, other]: Title: Agile Deliberation: Concept Deliberation for Subjective Visual Classification

Authors: Leijie Wang, Otilia Stretcu, Wei Qiao, Thomas Denby, Krishnamurthy Viswanathan, Enming Luo, Chun-Ta Lu, Tushar Dogra, Ranjay Krishna, Ariel Fuxman

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[109] arXiv:2512.10817 (cross-list from cs.LG) [pdf, ps, other]: Title: Extrapolation of Periodic Functions Using Binary Encoding of Continuous Numerical Values

Authors: Brian P. Powell, Jordan A. Caraballo-Vega, Mark L. Carroll, Thomas Maxwell, Andrew Ptak, Greg Olmschenk, Jorge Martinez-Palomera

Comments: Submitted to JMLR, under review

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[110] arXiv:2512.10805 (cross-list from cs.LG) [pdf, ps, other]: Title: Interpretable and Steerable Concept Bottleneck Sparse Autoencoders

Authors: Akshay Kulkarni, Tsui-Wei Weng, Vivek Narayanaswamy, Shusen Liu, Wesam A. Sakla, Kowshik Thopalli

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2512.10766 (cross-list from cs.CR) [pdf, ps, other]: Title: Metaphor-based Jailbreaking Attacks on Text-to-Image Models

Authors: Chenyu Zhang, Yiwen Ma, Lanjun Wang, Wenhui Li, Yi Tu, An-An Liu

Comments: This paper includes model-generated content that may contain offensive or distressing material

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2512.10691 (cross-list from cs.AI) [pdf, ps, other]: Title: Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning

Authors: Benjamin Gundersen, Nicolas Deperrois, Samuel Ruiperez-Campillo, Thomas M. Sutter, Julia E. Vogt, Michael Moor, Farhad Nooralahzadeh, Michael Krauthammer

Comments: 10 pages main text (3 figures, 3 tables), 31 pages in total

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2512.10675 (cross-list from cs.RO) [pdf, ps, other]: Title: Evaluating Gemini Robotics Policies in a Veo World Simulator

Authors: Gemini Robotics Team, Coline Devin, Yilun Du, Debidatta Dwibedi, Ruiqi Gao, Abhishek Jindal, Thomas Kipf, Sean Kirmani, Fangchen Liu, Anirudha Majumdar, Andrew Marmon, Carolina Parada, Yulia Rubanova, Dhruv Shah, Vikas Sindhwani, Jie Tan, Fei Xia, Ted Xiao, Sherry Yang, Wenhao Yu, Allan Zhou

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114] arXiv:2512.10524 (cross-list from cs.LG) [pdf, ps, other]: Title: Mode-Seeking for Inverse Problems with Diffusion Models

Authors: Sai Bharath Chandra Gutha, Ricardo Vinuesa, Hossein Azizpour

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2512.10319 (cross-list from cs.RO) [pdf, ps, other]: Title: Design of a six wheel suspension and a three-axis linear actuation mechanism for a laser weeding robot

Authors: Muhammad Usama, Muhammad Ibrahim Khan, Ahmad Hasan, Muhammad Shaaf Nadeem, Khawaja Fahad Iqbal, Jawad Aslam, Mian Ashfaq Ali, Asad Nisar Awan

Comments: 15 Pages, 10 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[116] arXiv:2512.10224 (cross-list from cs.LG) [pdf, ps, other]: Title: Federated Domain Generalization with Latent Space Inversion

Authors: Ragja Palakkadavath, Hung Le, Thanh Nguyen-Tang, Svetha Venkatesh, Sunil Gupta

Comments: Accepted at ICDM 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2512.09944 (cross-list from cs.AI) [pdf, ps, other]: Title: Echo-CoPilot: A Multi-View, Multi-Task Agent for Echocardiography Interpretation and Reporting

Authors: Moein Heidari, Mohammad Amin Roohi, Armin Khosravi, Ilker Hacihaliloglu

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[118] arXiv:2510.20875 (cross-list from cs.LG) [pdf, ps, other]: Title: CC-GRMAS: A Multi-Agent Graph Neural System for Spatiotemporal Landslide Risk Assessment in High Mountain Asia

Authors: Mihir Panchal, Ying-Jung Chen, Surya Parkash

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Thu, 11 Dec 2025 (showing first 70 of 135 entries)

[119] arXiv:2512.09925 [pdf, ps, other]: Title: GAINS: Gaussian-based Inverse Rendering from Sparse Multi-View Captures

Authors: Patrick Noras, Jun Myeong Choi, Didier Stricker, Pieter Peers, Roni Sengupta

Comments: 23 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2512.09924 [pdf, ps, other]: Title: ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Authors: Xinyu Liu, Hangjie Yuan, Yujie Wei, Jiazheng Xing, Yujin Han, Jiahao Pan, Yanbiao Ma, Chi-Min Chan, Kang Zhao, Shiwei Zhang, Wenhan Luo, Yike Guo

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2512.09923 [pdf, ps, other]: Title: Splatent: Splatting Diffusion Latents for Novel View Synthesis

Authors: Or Hirschorn, Omer Sela, Inbar Huberman-Spiegelglas, Netalee Efrat, Eli Alshan, Ianir Ideses, Frederic Devernay, Yochai Zvik, Lior Fritz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2512.09913 [pdf, ps, other]: Title: NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway

Authors: Sander Riisøen Jyhne, Aditya Gupta, Ben Worsley, Marianne Andersen, Ivar Oveland, Alexander Salveson Nossum

Comments: 8 pages, 2 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2512.09907 [pdf, ps, other]: Title: VisualActBench: Can VLMs See and Act like a Human?

Authors: Daoan Zhang, Pai Liu, Xiaofei Zhou, Yuan Ge, Guangchen Lan, Jing Bi, Christopher Brinton, Ehsan Hoque, Jiebo Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124] arXiv:2512.09874 [pdf, ps, other]: Title: Benchmarking Document Parsers on Mathematical Formula Extraction from PDFs

Authors: Pius Horn, Janis Keuper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[125] arXiv:2512.09871 [pdf, ps, other]: Title: Diffusion Posterior Sampler for Hyperspectral Unmixing with Spectral Variability Modeling

Authors: Yimin Zhu, Lincoln Linlin Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2512.09867 [pdf, ps, other]: Title: MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI

Authors: Fengli Wu, Vaidehi Patil, Jaehong Yoon, Yue Zhang, Mohit Bansal

Comments: Dataset and Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[127] arXiv:2512.09864 [pdf, ps, other]: Title: UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving

Authors: Hao Lu, Ziyang Liu, Guangfeng Jiang, Yuanfei Luo, Sheng Chen, Yangang Zhang, Ying-Cong Chen

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2512.09847 [pdf, ps, other]: Title: From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities

Authors: Shijia Feng, Michael Wray, Walterio Mayol-Cuevas

Comments: Accepted by WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129] arXiv:2512.09824 [pdf, ps, other]: Title: Composing Concepts from Images and Videos via Concept-prompt Binding

Authors: Xianghao Kong, Zeyu Zhang, Yuwei Guo, Zhuoran Zhao, Songchun Zhang, Anyi Rao

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[130] arXiv:2512.09814 [pdf, ps, other]: Title: DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation

Authors: Zhizhong Wang, Tianyi Chu, Zeyi Huang, Nanyang Wang, Kehan Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2512.09806 [pdf, ps, other]: Title: CHEM: Estimating and Understanding Hallucinations in Deep Learning for Image Processing

Authors: Jianfei Li, Ines Rosellon-Inclan, Gitta Kutyniok, Jean-Luc Starck

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[132] arXiv:2512.09801 [pdf, ps, other]: Title: Modality-Specific Enhancement and Complementary Fusion for Semi-Supervised Multi-Modal Brain Tumor Segmentation

Authors: Tien-Dat Chung, Ba-Thinh Lam, Thanh-Huy Nguyen, Thien Nguyen, Nguyen Lan Vi Vu, Hoang-Loc Cao, Phat Kim Huynh, Min Xu

Comments: 9 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2512.09792 [pdf, ps, other]: Title: FastPose-ViT: A Vision Transformer for Real-Time Spacecraft Pose Estimation

Authors: Pierre Ancey, Andrew Price, Saqib Javed, Mathieu Salzmann

Comments: Accepted to WACV 2026. Preprint version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134] arXiv:2512.09773 [pdf, other]: Title: Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts

Authors: Romain Mussard (UNIROUEN), Aurélien Gauffre (UGA), Ihsan Ullah, Thanh Gia Hieu Khuong (TAU, LISN), Massih-Reza Amini (UGA), Isabelle Guyon (TAU, LISN), Lisheng Sun-Hosoya (TAU, LISN)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2512.09700 [pdf, ps, other]: Title: LiM-YOLO: Less is More with Pyramid Level Shift and Normalized Auxiliary Branch for Ship Detection in Optical Remote Sensing Imagery

Authors: Seon-Hoon Kim, Hyeji Sim, Youeyun Jung, Ok-Chul Jung, Yerin Kim

Comments: 16 pages, 8 figures, 9 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[136] arXiv:2512.09687 [pdf, ps, other]: Title: Unconsciously Forget: Mitigating Memorization; Without Knowing What is being Memorized

Authors: Er Jin, Yang Zhang, Yongli Mou, Yanfei Dong, Stefan Decker, Kenji Kawaguchi, Johannes Stegmaier

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2512.09670 [pdf, ps, other]: Title: An Automated Tip-and-Cue Framework for Optimized Satellite Tasking and Visual Intelligence

Authors: Gil Weissman, Amir Ivry, Israel Cohen

Comments: Under review at IEEE Transactions on Geoscience and Remote Sensing (TGRS). 13 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[138] arXiv:2512.09665 [pdf, ps, other]: Title: OxEnsemble: Fair Ensembles for Low-Data Classification

Authors: Jonathan Rystrøm, Zihao Fu, Chris Russell

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[139] arXiv:2512.09663 [pdf, ps, other]: Title: IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting

Authors: Tao Zhang, Yuyang Hong, Yang Xia, Kun Ding, Zeyu Zhang, Ying Wang, Shiming Xiang, Chunhong Pan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2512.09646 [pdf, ps, other]: Title: VHOI: Controllable Video Generation of Human-Object Interactions from Sparse Trajectories via Motion Densification

Authors: Wanyue Zhang, Lin Geng Foo, Thabo Beeler, Rishabh Dabral, Christian Theobalt

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2512.09644 [pdf, ps, other]: Title: Kaapana: A Comprehensive Open-Source Platform for Integrating AI in Medical Imaging Research Environments

Authors: Ünal Akünal, Markus Bujotzek, Stefan Denner, Benjamin Hamm, Klaus Kades, Philipp Schader, Jonas Scherer, Marco Nolden, Peter Neher, Ralf Floca, Klaus Maier-Hein

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2512.09633 [pdf, ps, other]: Title: Benchmarking SAM2-based Trackers on FMOX

Authors: Senem Aktas, Charles Markham, John McDonald, Rozenn Dahyot

Journal-ref: 33rd International Conference on Artificial Intelligence and Cognitive Science (AICS 2025), December, 2025, Dublin, Ireland

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2512.09626 [pdf, ps, other]: Title: Beyond Sequences: A Benchmark for Atomic Hand-Object Interaction Using a Static RNN Encoder

Authors: Yousef Azizi Movahed, Fatemeh Ziaeetabar

Comments: Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144] arXiv:2512.09617 [pdf, ps, other]: Title: FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation

Authors: Hubert Kompanowski, Varun Jampani, Aaryaman Vasishta, Binh-Son Hua

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2512.09616 [pdf, ps, other]: Title: Rethinking Chain-of-Thought Reasoning for Videos

Authors: Yiwu Zhong, Zi-Yuan Hu, Yin Li, Liwei Wang

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146] arXiv:2512.09592 [pdf, ps, other]: Title: CS3D: An Efficient Facial Expression Recognition via Event Vision

Authors: Zhe Wang, Qijin Song, Yucen Peng, Weibang Bai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[147] arXiv:2512.09583 [pdf, ps, other]: Title: UnReflectAnything: RGB-Only Highlight Removal by Rendering Synthetic Specular Supervision

Authors: Alberto Rota, Mert Kiray, Mert Asim Karaoglu, Patrick Ruhkamp, Elena De Momi, Nassir Navab, Benjamin Busam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2512.09580 [pdf, ps, other]: Title: Content-Adaptive Image Retouching Guided by Attribute-Based Text Representation

Authors: Hancheng Zhu, Xinyu Liu, Rui Yao, Kunyang Sun, Leida Li, Abdulmotaleb El Saddik

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2512.09579 [pdf, ps, other]: Title: Hands-on Evaluation of Visual Transformers for Object Recognition and Detection

Authors: Dimitrios N. Vlachogiannis, Dimitrios A. Koutsomitropoulos

Journal-ref: 37th International Conference on Tools with Artificial Intelligence (ICTAI 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150] arXiv:2512.09576 [pdf, ps, other]: Title: Seeing Soil from Space: Towards Robust and Scalable Remote Soil Nutrient Analysis

Authors: David Seu (1), Nicolas Longepe (2), Gabriel Cioltea (1), Erik Maidik (1), Calin Andrei (1) ((1) CO2 Angels, Cluj-Napoca, Romania, (2) European Space Agency Phi-Lab, Frascati, Italy)

Comments: 23 pages, 13 figures, 13 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[151] arXiv:2512.09573 [pdf, ps, other]: Title: Investigate the Low-level Visual Perception in Vision-Language based Image Quality Assessment

Authors: Yuan Li, Zitang Sun, Yen-Ju Chen, Shin'ya Nishida

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152] arXiv:2512.09565 [pdf, ps, other]: Title: From Graphs to Gates: DNS-HyXNet, A Lightweight and Deployable Sequential Model for Real-Time DNS Tunnel Detection

Authors: Faraz Ali, Muhammad Afaq, Mahmood Niazi, Muzammil Behzad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153] arXiv:2512.09555 [pdf, ps, other]: Title: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment

Authors: Yuan Li, Zitang Sun, Yen-ju Chen, Shin'ya Nishida

Comments: Accepted to the ICONIP (International Conference on Neural Information Processing), 2025

Journal-ref: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment. In: Taniguchi, T., et al. Neural Information Processing. ICONIP 2025. Lecture Notes in Computer Science, vol 16310. Springer, Singapore

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154] arXiv:2512.09546 [pdf, ps, other]: Title: A Dual-Domain Convolutional Network for Hyperspectral Single-Image Super-Resolution

Authors: Murat Karayaka, Usman Muhammad, Jorma Laaksonen, Md Ziaul Hoque, Tapio Seppänen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155] arXiv:2512.09525 [pdf, ps, other]: Title: Masked Registration and Autoencoding of CT Images for Predictive Tibia Reconstruction

Authors: Hongyou Zhou, Cederic Aßmann, Alaa Bejaoui, Heiko Tzschätzsch, Mark Heyland, Julian Zierke, Niklas Tuttle, Sebastian Hölzl, Timo Auer, David A. Back, Marc Toussaint

Comments: DGM4MICCAI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2512.09497 [pdf, ps, other]: Title: Gradient-Guided Learning Network for Infrared Small Target Detection

Authors: Jinmiao Zhao, Chuang Yu, Zelin Shi, Yunpeng Liu, Yingdi Zhang

Comments: Accepted by GRSL 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2512.09492 [pdf, ps, other]: Title: StateSpace-SSL: Linear-Time Self-supervised Learning for Plant Disease Detection

Authors: Abdullah Al Mamun, Miaohua Zhang, David Ahmedt-Aristizabal, Zeeshan Hayder, Mohammad Awrangjeb

Comments: Accepted to AAAI workshop (AgriAI 2026)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158] arXiv:2512.09489 [pdf, ps, other]: Title: MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images

Authors: Shuaihao Han, Tingfa Xu, Peifu Liu, Jianan Li

Comments: 8 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2512.09477 [pdf, ps, other]: Title: Color encoding in Latent Space of Stable Diffusion Models

Authors: Guillem Arias, Ariadna Solà, Martí Armengod, Maria Vanrell

Comments: 6 pages, 8 figures, Color Imaging Conference 33

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[160] arXiv:2512.09471 [pdf, ps, other]: Title: Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach

Authors: Yiqun Wang, Lujun Li, Meiru Yue, Radu State

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[161] arXiv:2512.09463 [pdf, ps, other]: Title: Privacy-Preserving Computer Vision for Industry: Three Case Studies in Human-Centric Manufacturing

Authors: Sander De Coninck, Emilio Gamba, Bart Van Doninck, Abdellatif Bey-Temsamani, Sam Leroux, Pieter Simoens

Comments: Accepted to the AAAI26 HCM workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[162] arXiv:2512.09461 [pdf, ps, other]: Title: Cytoplasmic Strings Analysis in Human Embryo Time-Lapse Videos using Deep Learning Framework

Authors: Anabia Sohail, Mohamad Alansari, Ahmed Abughali, Asmaa Chehab, Abdelfatah Ahmed, Divya Velayudhan, Sajid Javed, Hasan Al Marzouqi, Ameena Saad Al-Sumaiti, Junaid Kashir, Naoufel Werghi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[163] arXiv:2512.09446 [pdf, ps, other]: Title: Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation

Authors: Nadeem Nazer, Hongkuan Zhou, Lavdim Halilaj, Ylli Sadikaj, Steffen Staab

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2512.09441 [pdf, ps, other]: Title: Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model

Authors: Jiantao Tan, Peixian Ma, Tong Yu, Wentao Zhang, Ruixuan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[165] arXiv:2512.09435 [pdf, ps, other]: Title: UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents

Authors: Xufan He, Yushuang Wu, Xiaoyang Guo, Chongjie Ye, Jiaqing Zhou, Tianlei Hu, Xiaoguang Han, Dong Du

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166] arXiv:2512.09423 [pdf, ps, other]: Title: FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds

Authors: Marco Pegoraro, Evan Atherton, Bruno Roy, Aliasghar Khani, Arianna Rampini

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167] arXiv:2512.09422 [pdf, ps, other]: Title: InfoMotion: A Graph-Based Approach to Video Dataset Distillation for Echocardiography

Authors: Zhe Li, Hadrien Reynaud, Alberto Gomez, Bernhard Kainz

Comments: Accepted at MICAD 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168] arXiv:2512.09418 [pdf, ps, other]: Title: Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis

Authors: Zhe Li, Hadrien Reynaud, Johanna P Müller, Bernhard Kainz

Comments: Accepted at MICAD 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169] arXiv:2512.09417 [pdf, ps, other]: Title: DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping

Authors: Yanan Wang, Shengcai Liao, Panwen Hu, Xin Li, Fan Yang, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170] arXiv:2512.09407 [pdf, ps, other]: Title: Generative Point Cloud Registration

Authors: Haobo Jiang, Jin Xie, Jian Yang, Liang Yu, Jianmin Zheng

Comments: 14 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171] arXiv:2512.09402 [pdf, ps, other]: Title: Wasserstein-Aligned Hyperbolic Multi-View Clustering

Authors: Rui Wang, Yuting Jiang, Xiaoqing Luo, Xiao-Jun Wu, Nicu Sebe, Ziheng Chen

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172] arXiv:2512.09393 [pdf, ps, other]: Title: Detection and Localization of Subdural Hematoma Using Deep Learning on Computed Tomography

Authors: Vasiliki Stoumpou, Rohan Kumar, Bernard Burman, Diego Ojeda, Tapan Mehta, Dimitris Bertsimas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[173] arXiv:2512.09383 [pdf, ps, other]: Title: Perception-Inspired Color Space Design for Photo White Balance Editing

Authors: Yang Cheng, Ziteng Cui, Shenghan Su, Lin Gu, Zenghui Zhang

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174] arXiv:2512.09375 [pdf, ps, other]: Title: Log NeRF: Comparing Spaces for Learning Radiance Fields

Authors: Sihe Chen (Northeastern University), Luv Verma (Northeastern University), Bruce A. Maxwell (Northeastern University)

Comments: The 36th British Machine Vision Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[175] arXiv:2512.09373 [pdf, ps, other]: Title: FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement

Authors: Haobo Jiang, Jin Xie, Jian Yang, Liang Yu, Jianmin Zheng

Comments: 13 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176] arXiv:2512.09364 [pdf, ps, other]: Title: ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation

Authors: Shengchao Zhou, Jiehong Lin, Jiahui Liu, Shizhen Zhao, Chirui Chang, Xiaojuan Qi

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177] arXiv:2512.09363 [pdf, ps, other]: Title: StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Authors: Ke Xing, Xiaojie Jin, Longfei Li, Yuyang Yin, Hanwen Liang, Guixun Luo, Chen Fang, Jue Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2512.09354 [pdf, ps, other]: Title: Video-QTR: Query-Driven Temporal Reasoning Framework for Lightweight Video Understanding

Authors: Xinkui Zhao, Zuxin Wang, Yifan Zhang, Guanjie Cheng, Yueshen Xu, Shuiguang Deng, Chang Liu, Naibo Wang, Jianwei Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179] arXiv:2512.09350 [pdf, ps, other]: Title: TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment

Authors: Kanghyun Baek, Sangyub Lee, Jin Young Choi, Jaewoo Song, Daemin Park, Jooyoung Choi, Chaehun Shin, Bohyung Han, Sungroh Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180] arXiv:2512.09335 [pdf, ps, other]: Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video

Authors: Seonghwa Choi, Moonkyeong Choi, Mingyu Jang, Jaekyung Kim, Jianfei Cai, Wen-Huang Cheng, Sanghoon Lee

Comments: 8 pages, 9 figures, published in ACM MM 2025

Journal-ref: In Proceedings of the 33rd ACM International Conference on Multimedia. 2025. p. 7405-7414

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[181] arXiv:2512.09327 [pdf, ps, other]: Title: UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

Authors: Xuangeng Chu, Ruicong Liu, Yifei Huang, Yun Liu, Yichen Peng, Bo Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[182] arXiv:2512.09315 [pdf, ps, other]: Title: Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook

Authors: Yuan Ma, Junlin Hou, Chao Zhang, Yukun Zhou, Zongyuan Ge, Haoran Xie, Lie Ju

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2512.09311 [pdf, ps, other]: Title: Transformer-Driven Multimodal Fusion for Explainable Suspiciousness Estimation in Visual Surveillance

Authors: Kuldeep Singh Yadav, Lalan Kumar

Comments: 12 pages, 10 figures, IEEE Transaction on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[184] arXiv:2512.09307 [pdf, ps, other]: Title: From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation

Authors: Shivanshu Agnihotri, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2512.09299 [pdf, ps, other]: Title: VABench: A Comprehensive Benchmark for Audio-Video Generation

Authors: Daili Hua, Xizhi Wang, Bohan Zeng, Xinyi Huang, Hao Liang, Junbo Niu, Xinlong Chen, Quanqing Xu, Wentao Zhang

Comments: 24 pages, 25 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[186] arXiv:2512.09296 [pdf, ps, other]: Title: Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving

Authors: Songhan Wu

Comments: 6 pages, 7 figures, 1 table. Accepted to The 2025 IEEE 3rd International Conference on Electrical, Automation and Computer Engineering (ICEACE), 2025. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187] arXiv:2512.09289 [pdf, ps, other]: Title: MelanomaNet: Explainable Deep Learning for Skin Lesion Classification

Authors: Sukhrobbek Ilyosbekov

Comments: 7 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2512.09282 [pdf, ps, other]: Title: FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model

Authors: Xiang Chen, Jinshan Pan, Jiangxin Dong, Jian Yang, Jinhui Tang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
