Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 54

[ total of 759 entries: 1-50 | 5-54 | 55-104 | 105-154 | 155-204 | 205-254 | ... | 755-759 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)

[55] arXiv:2512.07379 [pdf, ps, other]: Title: Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency

Authors: Mahila Moghadami, Mohammad Ali Keyvanrad, Melika Sabaghian

Comments: 22 pages, 16 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2512.07360 [pdf, ps, other]: Title: Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation

Authors: Qiming Huang, Hao Ai, Jianbo Jiao

Comments: Accepted to WACV2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57] arXiv:2512.07351 [pdf, ps, other]: Title: DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection

Authors: Sayeem Been Zaman, Wasimul Karim, Arefin Ittesafun Abian, Reem E. Mohamed, Md Rafiqul Islam, Asif Karim, Sami Azam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD)
[58] arXiv:2512.07348 [pdf, ps, other]: Title: MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

Authors: Xinyu Wei, Kangrui Cen, Hongyang Wei, Zhen Guo, Bairui Li, Zeqing Wang, Jinrui Zhang, Lei Zhang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[59] arXiv:2512.07345 [pdf, ps, other]: Title: Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting

Authors: Shilong Jin, Haoran Duan, Litao Hua, Wentao Huang, Yuan Zhou

Comments: 15 pages, 8 figures, 5 tables, 2 algorithms, Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2512.07338 [pdf, ps, other]: Title: Generalized Referring Expression Segmentation on Aerial Photos

Authors: Luís Marnoto, Alexandre Bernardino, Bruno Martins

Comments: Submitted to IEEE J-STARS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2512.07331 [pdf, ps, other]: Title: The Inductive Bottleneck: Data-Driven Emergence of Representational Sparsity in Vision Transformers

Authors: Kanishk Awadhiya

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2512.07328 [pdf, ps, other]: Title: ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation

Authors: Ziyang Mai, Yu-Wing Tai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[63] arXiv:2512.07305 [pdf, ps, other]: Title: Reevaluating Automated Wildlife Species Detection: A Reproducibility Study on a Custom Image Dataset

Authors: Tobias Abraham Haider

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2512.07302 [pdf, ps, other]: Title: Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts

Authors: Mingning Guo, Mengwei Wu, Shaoxian Li, Haifeng Li, Chao Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[65] arXiv:2512.07276 [pdf, ps, other]: Title: Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery

Authors: Mai Tsujimoto, Junjue Wang, Weihao Xuan, Naoto Yokoya

Comments: Accepted to WACV 2026. Camera-ready-based version with minor edits for readability (no change in the contents)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2512.07275 [pdf, ps, other]: Title: Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation

Authors: Siyu Wang, Hua Wang, Huiyu Li, Fan Zhang

Comments: The paper has been accepted by BIBM 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67] arXiv:2512.07273 [pdf, ps, other]: Title: RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation

Authors: Zhi Rao, Yucheng Zhou, Benjia Zhou, Yiqing Huang, Sergio Escalera, Jun Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2512.07269 [pdf, ps, other]: Title: A graph generation pipeline for critical infrastructures based on heuristics, images and depth data

Authors: Mike Diessner, Yannick Tarant

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69] arXiv:2512.07253 [pdf, ps, other]: Title: DGGAN: Degradation Guided Generative Adversarial Network for Real-time Endoscopic Video Enhancement

Authors: Handing Xu, Zhenguo Nie, Tairan Peng, Huimin Pan, Xin-Jun Liu

Comments: 18 pages, 8 figures, and 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[70] arXiv:2512.07251 [pdf, ps, other]: Title: See More, Change Less: Anatomy-Aware Diffusion for Contrast Enhancement

Authors: Junqi Liu, Zejun Wu, Pedro R. A. S. Bassi, Xinze Zhou, Wenxuan Li, Ibrahim E. Hamamci, Sezgin Er, Tianyu Lin, Yi Luo, Szymon Płotka, Bjoern Menze, Daguang Xu, Kai Ding, Kang Wang, Yang Yang, Yucheng Tang, Alan L. Yuille, Zongwei Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2512.07247 [pdf, ps, other]: Title: AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing

Authors: Ziming Hong, Tianyu Huang, Runnan Chen, Shanshan Ye, Mingming Gong, Bo Han, Tongliang Liu

Comments: 40 pages, 34 figures, 18 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[72] arXiv:2512.07245 [pdf, ps, other]: Title: Zero-Shot Textual Explanations via Translating Decision-Critical Features

Authors: Toshinori Yamauchi, Hiroshi Kera, Kazuhiko Kawamoto

Comments: 11+6 pages, 8 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2512.07241 [pdf, ps, other]: Title: Squeezed-Eff-Net: Edge-Computed Boost of Tomography Based Brain Tumor Classification leveraging Hybrid Neural Network Architecture

Authors: Md. Srabon Chowdhury, Syeda Fahmida Tanzim, Sheekar Banerjee, Ishtiak Al Mamoon, AKM Muzahidul Islam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2512.07237 [pdf, ps, other]: Title: Unified Camera Positional Encoding for Controlled Video Generation

Authors: Cheng Zhang, Boying Li, Meng Wei, Yan-Pei Cao, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2512.07234 [pdf, ps, other]: Title: Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models

Authors: Biao Chen, Lin Zuo, Mengmeng Jing, Kunbin He, Yuchen Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[76] arXiv:2512.07230 [pdf, ps, other]: Title: STRinGS: Selective Text Refinement in Gaussian Splatting

Authors: Abhinav Raundhal, Gaurav Behera, P J Narayanan, Ravi Kiran Sarvadevabhatla, Makarand Tapaswi

Comments: Accepted to WACV 2026. Project Page, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2512.07229 [pdf, ps, other]: Title: ReLKD: Inter-Class Relation Learning with Knowledge Distillation for Generalized Category Discovery

Authors: Fang Zhou, Zhiqiang Chen, Martin Pavlovski, Yizhong Zhang

Comments: Accepted to the Main Track of the 28th European Conference on Artificial Intelligence (ECAI 2025). To appear in the proceedings published by IOS Press (DOI: 10.3233/FAIA413)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2512.07228 [pdf, ps, other]: Title: Towards Robust Protective Perturbation against DeepFake Face Swapping

Authors: Hengyang Yao, Lin Li, Ke Sun, Jianing Qiu, Huiping Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[79] arXiv:2512.07215 [pdf, ps, other]: Title: VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation

Authors: Md Selim Sarowar, Sungho Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[80] arXiv:2512.07211 [pdf, ps, other]: Title: Object Pose Distribution Estimation for Determining Revolution and Reflection Uncertainty in Point Clouds

Authors: Frederik Hagelskjær, Dimitrios Arapis, Steffen Madsen, Thorbjørn Mosekjær Iversen

Comments: 8 pages, 8 figures, 5 tables, ICCR 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2512.07206 [pdf, ps, other]: Title: AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation and Lugano Staging on FDG-PET/CT

Authors: Boyang Pan, Zeyu Zhang, Hongyu Meng, Bin Cui, Yingying Zhang, Wenli Hou, Junhao Li, Langdi Zhong, Xiaoxiao Chen, Xiaoyu Xu, Changjin Zuo, Chao Cheng, Nan-Jie Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[82] arXiv:2512.07203 [pdf, ps, other]: Title: MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning

Authors: Xuhui Zheng, Kang An, Ziliang Wang, Yuhang Wang, Faqiang Qian, Yichao Wu

Comments: 7 pages, 1 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2512.07201 [pdf, ps, other]: Title: Understanding Diffusion Models via Code Execution

Authors: Cheng Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[84] arXiv:2512.07198 [pdf, ps, other]: Title: Generating Storytelling Images with Rich Chains-of-Reasoning

Authors: Xiujie Song, Qi Jia, Shota Watanabe, Xiaoyi Pang, Ruijie Chen, Mengyue Wu, Kenny Q. Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[85] arXiv:2512.07197 [pdf, ps, other]: Title: SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting

Authors: Seokhyun Youn, Soohyun Lee, Geonho Kim, Weeyoung Kwon, Sung-Ho Bae, Jihyong Oh

Comments: The first three authors contributed equally to this work. The last two authors are co-corresponding authors. Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2512.07192 [pdf, ps, other]: Title: HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression

Authors: Niu Yi, Xu Tianyi, Ma Mingming, Wang Xinkun

Comments: 12 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2512.07191 [pdf, ps, other]: Title: RefLSM: Linearized Structural-Prior Reflectance Model for Medical Image Segmentation and Bias-Field Correction

Authors: Wenqi Zhao, Jiacheng Sang, Fenghua Cheng, Yonglu Shu, Dong Li, Xiaofeng Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2512.07190 [pdf, ps, other]: Title: Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification

Authors: Pengfei Gu, Huimin Li, Haoteng Tang, Dongkuan (DK)Xu, Erik Enriquez, DongChul Kim, Bin Fu, Danny Z. Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2512.07186 [pdf, ps, other]: Title: START: Spatial and Textual Learning for Chart Understanding

Authors: Zhuoming Liu, Xiaofeng Gao, Feiyang Niu, Qiaozi Gao, Liu Liu, Robinson Piramuthu

Comments: WACV2026 Camera Ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[90] arXiv:2512.07171 [pdf, ps, other]: Title: TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration

Authors: Shravan Venkatraman, Rakesh Raj Madavan, Pavan Kumar S, Muthu Subash Kavitha

Comments: 21 pages, 11 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2512.07170 [pdf, ps, other]: Title: Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach

Authors: Jiayang Li, Chengjie Jiang, Junjun Jiang, Pengwei Liang, Jiayi Ma, Liqiang Nie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[92] arXiv:2512.07166 [pdf, ps, other]: Title: When Privacy Meets Recovery: The Overlooked Half of Surrogate-Driven Privacy Preservation for MLLM Editing

Authors: Siyuan Xu, Yibing Liu, Peilin Chen, Yung-Hui Li, Shiqi Wang, Sam Kwong

Comments: 9 pages,7figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2512.07165 [pdf, ps, other]: Title: MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation

Authors: Muyu Xu, Fangneng Zhan, Xiaoqin Zhang, Ling Shao, Shijian Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[94] arXiv:2512.07155 [pdf, ps, other]: Title: CHIMERA: Adaptive Cache Injection and Semantic Anchor Prompting for Zero-shot Image Morphing with Morphing-oriented Metrics

Authors: Dahyeon Kye, Jeahun Sung, MinKyu Jeon, Jihyong Oh

Comments: Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2512.07141 [pdf, ps, other]: Title: Think-Reflect-Revise: A Policy-Guided Reflective Framework for Safety Alignment in Large Vision Language Models

Authors: Fenghua Weng, Chaochao Lu, Xia Hu, Wenqi Shao, Wenjie Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[96] arXiv:2512.07136 [pdf, ps, other]: Title: A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning

Authors: Siyang Jiang, Mu Yuan, Xiang Ji, Bufang Yang, Zeyu Liu, Lilin Xu, Yang Li, Yuting He, Liran Dong, Wenrui Lu, Zhenyu Yan, Xiaofan Jiang, Wei Gao, Hongkai Chen, Guoliang Xing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[97] arXiv:2512.07135 [pdf, ps, other]: Title: TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning

Authors: Zebin Xing, Pengxuan Yang, Linbo Wang, Yichen Zhang, Yiming Hu, Yupeng Zheng, Junli Wang, Yinfeng Gao, Guang Li, Kun Ma, Long Chen, Zhongpu Xia, Qichao Zhang, Hangjun Ye, Dongbin Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[98] arXiv:2512.07128 [pdf, ps, other]: Title: MulCLIP: A Multi-level Alignment Framework for Enhancing Fine-grained Long-context CLIP

Authors: Chau Truong, Hieu Ta Quang, Dung D. Le

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2512.07126 [pdf, ps, other]: Title: Training-free Clothing Region of Interest Self-correction for Virtual Try-On

Authors: Shengjie Lu, Zhibin Wan, Jiejie Liu, Quan Zhang, Mingjie Sun

Comments: 16 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2512.07110 [pdf, ps, other]: Title: MSN: Multi-directional Similarity Network for Hand-crafted and Deep-synthesized Copy-Move Forgery Detection

Authors: Liangwei Jiang, Jinluo Xie, Yecheng Huang, Hua Zhang, Hongyu Yang, Di Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2512.07107 [pdf, ps, other]: Title: COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision

Authors: Jaeyoon Lee, Hojoon Jung, Sungtae Hwang, Jihyong Oh, Jongwon Choi

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2512.07078 [pdf, ps, other]: Title: DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection

Authors: Bo Gao, Jingcheng Tong, Xingsheng Chen, Han Yu, Zichen Li

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[103] arXiv:2512.07076 [pdf, ps, other]: Title: Context-measure: Contextualizing Metric for Camouflage

Authors: Chen-Yang Wang, Gepeng Ji, Song Shao, Ming-Ming Cheng, Deng-Ping Fan

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2512.07065 [pdf, ps, other]: Title: Persistent Homology-Guided Frequency Filtering for Image Compression

Authors: Anil Chintapalli, Peter Tenholder, Henry Chen, Arjun Rao

Comments: 17 pages, 8 figures, code available at github.com/RMATH3/persistent-homology-compression

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 759 entries: 1-50 | 5-54 | 55-104 | 105-154 | 155-204 | 205-254 | ... | 755-759 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 54

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)