Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 390

[ total of 749 entries: 1-50 | ... | 241-290 | 291-340 | 341-390 | 391-440 | 441-490 | 491-540 | 541-590 | ... | 741-749 ]
[ showing 50 entries per page: fewer | more | all ]

Mon, 8 Dec 2025 (showing first 50 of 94 entries)

[391] arXiv:2512.05965 [pdf, ps, other]: Title: EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Authors: Hongyu Li, Manyuan Zhang, Dian Zheng, Ziyu Guo, Yimeng Jia, Kaituo Feng, Hao Yu, Yexin Liu, Yan Feng, Peng Pei, Xunliang Cai, Linjiang Huang, Hongsheng Li, Si Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2512.05960 [pdf, ps, other]: Title: AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement

Authors: Munsif Ali, Najmul Hassan, Lucia Ventura, Davide Di Bari, Simonepietro Canese

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[393] arXiv:2512.05941 [pdf, ps, other]: Title: Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding

Authors: Zhiyuan Jiang, Shenghao Xie, Wenyi Li, Wenqiang Zu, Peihang Li, Jiahao Qiu, Siqi Pei, Lei Ma, Tiejun Huang, Mengdi Wang, Shilong Liu

Comments: Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[394] arXiv:2512.05937 [pdf, ps, other]: Title: Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception

Authors: Anne Sielemann, Valentin Barner, Stefan Wolf, Masoud Roschani, Jens Ziehn, Juergen Beyerer

Comments: 8 pages, 2 figures, 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[395] arXiv:2512.05936 [pdf, ps, other]: Title: Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition

Authors: Anne Sielemann, Lena Loercher, Max-Lion Schumacher, Stefan Wolf, Masoud Roschani, Jens Ziehn

Comments: 8 pages, 8 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[396] arXiv:2512.05928 [pdf, ps, other]: Title: A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition

Authors: Pedro Vidal, Bernardo Biesseck, Luiz E. L. Coelho, Roger Granada, David Menotti

Comments: 18 pages, 17 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2512.05927 [pdf, ps, other]: Title: World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty

Authors: Zhiting Mei, Tenny Yin, Micah Baker, Ola Shorinwa, Anirudha Majumdar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[398] arXiv:2512.05922 [pdf, ps, other]: Title: LPD: Learnable Prototypes with Diversity Regularization for Weakly Supervised Histopathology Segmentation

Authors: Khang Le, Anh Mai Vu, Thi Kim Trang Vo, Ha Thach, Ngoc Bui Lam Quang, Thanh-Huy Nguyen, Minh H. N. Le, Zhu Han, Chandra Mohan, Hien Van Nguyen

Comments: Note: Khang Le and Anh Mai Vu contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2512.05920 [pdf, ps, other]: Title: NICE: Neural Implicit Craniofacial Model for Orthognathic Surgery Prediction

Authors: Jiawen Yang, Yihui Cao, Xuanyu Tian, Yuyao Zhang, Hongjiang Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[400] arXiv:2512.05905 [pdf, ps, other]: Title: SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Authors: Wenhao Yan, Sheng Ye, Zhuoyi Yang, Jiayan Teng, ZhenHui Dong, Kairui Wen, Xiaotao Gu, Yong-Jin Liu, Jie Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2512.05866 [pdf, ps, other]: Title: Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator

Authors: Md. Mahbub Hasan Akash, Aria Tasnim Mridula, Sheekar Banerjee, Ishtiak Al Mamoon

Comments: This paper has been accepted for presentation at the IEEE 28th International Conference on Computer and Information Technology (ICCIT), December 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2512.05859 [pdf, ps, other]: Title: Edit-aware RAW Reconstruction

Authors: Abhijith Punnappurath, Luxi Zhao, Ke Zhao, Hue Nguyen, Radek Grzeszczuk, Michael S. Brown

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403] arXiv:2512.05853 [pdf, ps, other]: Title: VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack

Authors: Shiji Zhao, Shukun Xiong, Yao Huang, Yan Jin, Zhenyu Wu, Jiyang Guan, Ranjie Duan, Jialing Tao, Hui Xue, Xingxing Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404] arXiv:2512.05830 [pdf, ps, other]: Title: Phase-OTDR Event Detection Using Image-Based Data Transformation and Deep Learning

Authors: Muhammet Cagri Yeke, Samil Sirin, Kivilcim Yuksel, Abdurrahman Gumus

Comments: 22 pages, 11 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[405] arXiv:2512.05814 [pdf, ps, other]: Title: UG-FedDA: Uncertainty-Guided Federated Domain Adaptation for Multi-Center Alzheimer's Disease Detection

Authors: Fubao Zhu, Zhanyuan Jia, Zhiguo Wang, Huan Huang, Danyang Sun, Chuang Han, Yanting Li, Jiaofen Nan, Chen Zhao, Weihua Zhou

Comments: The code is already available on GitHub: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406] arXiv:2512.05809 [pdf, ps, other]: Title: Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling

Authors: Saurav Jha, M. Jehanzeb Mirza, Wei Lin, Shiqi Yang, Sarath Chandar

Comments: Extended abstract at World Modeling Workshop 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[407] arXiv:2512.05802 [pdf, ps, other]: Title: Bring Your Dreams to Life: Continual Text-to-Video Customization

Authors: Jiahua Dong, Xudong Wang, Wenqi Liang, Zongyan Han, Meng Cao, Duzhen Zhang, Hanbin Zhao, Zhi Han, Salman Khan, Fahad Shahbaz Khan

Comments: Accepted to AAAI2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2512.05783 [pdf, ps, other]: Title: Curvature-Regularized Variational Autoencoder for 3D Scene Reconstruction from Sparse Depth

Authors: Maryam Yousefi, Soodeh Bakhshandeh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[409] arXiv:2512.05774 [pdf, ps, other]: Title: Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding

Authors: Ziyang Wang, Honglu Zhou, Shijie Wang, Junnan Li, Caiming Xiong, Silvio Savarese, Mohit Bansal, Michael S. Ryoo, Juan Carlos Niebles

Comments: Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[410] arXiv:2512.05762 [pdf, ps, other]: Title: FNOPT: Resolution-Agnostic, Self-Supervised Cloth Simulation using Meta-Optimization with Fourier Neural Operators

Authors: Ruochen Chen, Thuy Tran, Shaifali Parashar

Comments: Accepted for WACV

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[411] arXiv:2512.05759 [pdf, ps, other]: Title: Label-Efficient Point Cloud Segmentation with Active Learning

Authors: Johannes Meyer, Jasper Hoffmann, Felix Schulz, Dominik Merkle, Daniel Buescher, Alexander Reiterer, Joschka Boedecker, Wolfram Burgard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[412] arXiv:2512.05754 [pdf, ps, other]: Title: USV: Unified Sparsification for Accelerating Video Diffusion Models

Authors: Xinjian Wu, Hongmei Wang, Yuan Zhou, Qinglin Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413] arXiv:2512.05746 [pdf, ps, other]: Title: HQ-DM: Single Hadamard Transformation-Based Quantization-Aware Training for Low-Bit Diffusion Models

Authors: Shizhuo Mao, Hongtao Zou, Qihu Xie, Song Chen, Yi Kang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414] arXiv:2512.05740 [pdf, ps, other]: Title: Distilling Expert Surgical Knowledge: How to train local surgical VLMs for anatomy explanation in Complete Mesocolic Excision

Authors: Lennart Maack, Julia-Kristin Graß, Lisa-Marie Toscha, Nathaniel Melling, Alexander Schlaefer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2512.05710 [pdf, ps, other]: Title: Manifold-Aware Point Cloud Completion via Geodesic-Attentive Hierarchical Feature Learning

Authors: Jianan Sun, Dongzhihan Wang, Mingyu Fan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2512.05698 [pdf, ps, other]: Title: OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors Reasoning

Authors: Xusheng Guo, Wanfa Zhang, Shijia Zhao, Qiming Xia, Xiaolong Xie, Mingming Wang, Hai Wu, Chenglu Wen

Comments: The 40th Annual AAAI Conference on Artificial Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2512.05683 [pdf, ps, other]: Title: Physics-Informed Graph Neural Network with Frequency-Aware Learning for Optical Aberration Correction

Authors: Yong En Kok, Bowen Deng, Alexander Bentley, Andrew J. Parkes, Michael G. Somekh, Amanda J. Wright, Michael P. Pound

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[418] arXiv:2512.05674 [pdf, ps, other]: Title: Hyperspectral Unmixing with 3D Convolutional Sparse Coding and Projected Simplex Volume Maximization

Authors: Gargi Panda, Soumitra Kundu, Saumik Bhattacharya, Aurobinda Routray

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419] arXiv:2512.05672 [pdf, ps, other]: Title: InverseCrafter: Efficient Video ReCapture as a Latent Domain Inverse Problem

Authors: Yeobin Hong, Suhyeon Lee, Hyungjin Chung, Jong Chul Ye

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[420] arXiv:2512.05669 [pdf, ps, other]: Title: Deep Learning-Based Real-Time Sequential Facial Expression Analysis Using Geometric Features

Authors: Talha Enes Koksal, Abdurrahman Gumus

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421] arXiv:2512.05663 [pdf, ps, other]: Title: LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection

Authors: Johannes Meier, Jonathan Michel, Oussema Dhaouadi, Yung-Hsu Yang, Christoph Reich, Zuria Bauer, Stefan Roth, Marc Pollefeys, Jacques Kaiser, Daniel Cremers

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2512.05651 [pdf, ps, other]: Title: Self-Supervised AI-Generated Image Detection: A Camera Metadata Perspective

Authors: Nan Zhong, Mian Zou, Yiran Xu, Zhenxing Qian, Xinpeng Zhang, Baoyuan Wu, Kede Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423] arXiv:2512.05635 [pdf, ps, other]: Title: Experts-Guided Unbalanced Optimal Transport for ISP Learning from Unpaired and/or Paired Data

Authors: Georgy Perevozchikov, Nancy Mehta, Egor Ershov, Radu Timofte

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[424] arXiv:2512.05613 [pdf, ps, other]: Title: DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model

Authors: Pasquale De Marinis, Pieter M. Blok, Uzay Kaymak, Rogier Brussee, Gennaro Vessio, Giovanna Castellano

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2512.05610 [pdf, ps, other]: Title: NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections

Authors: Juho Korkeala, Jesse Muhojoki, Josef Taher, Klaara Salolahti, Matti Hyyppä, Antero Kukko, Juha Hyyppä

Comments: 19 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2512.05597 [pdf, ps, other]: Title: Fast SceneScript: Accurate and Efficient Structured Language Model via Multi-Token Prediction

Authors: Ruihong Yin, Xuepeng Shi, Oleksandr Bailo, Marco Manfredi, Theo Gevers

Comments: 10 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2512.05593 [pdf, ps, other]: Title: Learning High-Fidelity Cloth Animation via Skinning-Free Image Transfer

Authors: Rong Wang, Wei Mao, Changsheng Lu, Hongdong Li

Comments: Accepted to 3DV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2512.05571 [pdf, ps, other]: Title: MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging

Authors: Xingyu Zhang, Anna Reithmeir, Fryderyk Kögl, Rickmer Braren, Julia A. Schnabel, Daniel M. Lang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2512.05564 [pdf, ps, other]: Title: ProPhy: Progressive Physical Alignment for Dynamic World Simulation

Authors: Zijun Wang, Panwen Hu, Jing Wang, Terry Jingchen Zhang, Yuhao Cheng, Long Chen, Yiqiang Yan, Zutao Jiang, Hanhui Li, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2512.05557 [pdf, ps, other]: Title: 2K-Characters-10K-Stories: A Quality-Gated Stylized Narrative Dataset with Disentangled Control and Sequence Consistency

Authors: Xingxi Yin, Yicheng Li, Gong Yan, Chenglin Li, Jian Zhao, Cong Huang, Yue Deng, Yin Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[431] arXiv:2512.05546 [pdf, ps, other]: Title: Conscious Gaze: Adaptive Attention Mechanisms for Hallucination Mitigation in Vision-Language Models

Authors: Weijue Bu, Guan Yuan, Guixian Zhang

Comments: 6 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[432] arXiv:2512.05539 [pdf, ps, other]: Title: Ideal Observer for Segmentation of Dead Leaves Images

Authors: Swantje Mahncke, Malte Ott

Comments: 41 pages, 16 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Methodology (stat.ME)
[433] arXiv:2512.05529 [pdf, ps, other]: Title: See in Depth: Training-Free Surgical Scene Segmentation with Monocular Depth Priors

Authors: Kunyi Yang, Qingyu Wang, Cheng Yuan, Yutong Ban

Comments: The first two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[434] arXiv:2512.05524 [pdf, ps, other]: Title: VOST-SGG: VLM-Aided One-Stage Spatio-Temporal Scene Graph Generation

Authors: Chinthani Sugandhika, Chen Li, Deepu Rajan, Basura Fernando

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2512.05515 [pdf, ps, other]: Title: DashFusion: Dual-stream Alignment with Hierarchical Bottleneck Fusion for Multimodal Sentiment Analysis

Authors: Yuhua Wen, Qifei Li, Yingying Zhou, Yingming Gao, Zhengqi Wen, Jianhua Tao, Ya Li

Comments: Accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[436] arXiv:2512.05513 [pdf, ps, other]: Title: Know-Show: Benchmarking Video-Language Models on Spatio-Temporal Grounded Reasoning

Authors: Chinthani Sugandhika, Chen Li, Deepu Rajan, Basura Fernando

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2512.05511 [pdf, ps, other]: Title: Rethinking Infrared Small Target Detection: A Foundation-Driven Efficient Paradigm

Authors: Chuang Yu, Jinmiao Zhao, Yunpeng Liu, Yaokun Li, Xiujun Shu, Yuanhao Feng, Bo Wang, Yimian Dai, Xiangyu Yue

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2512.05494 [pdf, ps, other]: Title: Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation

Authors: Fan Zhang, Zhiwei Gu, Hua Wang

Comments: Accepted to AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2512.05492 [pdf, ps, other]: Title: WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field

Authors: Qi Zhu, Jingyi Zhang, Naishan Zheng, Wei Yu, Jinghao Zhang, Deyi Ji, Feng Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440] arXiv:2512.05482 [pdf, ps, other]: Title: Concept-based Explainable Data Mining with VLM for 3D Detection

Authors: Mai Tsujimoto

Comments: 28 pages including appendix. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 749 entries: 1-50 | ... | 241-290 | 291-340 | 341-390 | 391-440 | 441-490 | 491-540 | 541-590 | ... | 741-749 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 390

Mon, 8 Dec 2025 (showing first 50 of 94 entries)