Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 346

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)

[347] arXiv:2512.07206 [pdf, ps, other]: Title: AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation and Lugano Staging on FDG-PET/CT

Authors: Boyang Pan, Zeyu Zhang, Hongyu Meng, Bin Cui, Yingying Zhang, Wenli Hou, Junhao Li, Langdi Zhong, Xiaoxiao Chen, Xiaoyu Xu, Changjin Zuo, Chao Cheng, Nan-Jie Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[348] arXiv:2512.07203 [pdf, ps, other]: Title: MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning

Authors: Xuhui Zheng, Kang An, Ziliang Wang, Yuhang Wang, Faqiang Qian, Yichao Wu

Comments: 7 pages, 1 figure

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2512.07201 [pdf, ps, other]: Title: Understanding Diffusion Models via Code Execution

Authors: Cheng Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[350] arXiv:2512.07198 [pdf, ps, other]: Title: Generating Storytelling Images with Rich Chains-of-Reasoning

Authors: Xiujie Song, Qi Jia, Shota Watanabe, Xiaoyi Pang, Ruijie Chen, Mengyue Wu, Kenny Q. Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[351] arXiv:2512.07197 [pdf, ps, other]: Title: SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting

Authors: Seokhyun Youn, Soohyun Lee, Geonho Kim, Weeyoung Kwon, Sung-Ho Bae, Jihyong Oh

Comments: The first three authors contributed equally to this work. The last two authors are co-corresponding authors. Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2512.07192 [pdf, ps, other]: Title: HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression

Authors: Niu Yi, Xu Tianyi, Ma Mingming, Wang Xinkun

Comments: 12 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2512.07191 [pdf, ps, other]: Title: RefLSM: Linearized Structural-Prior Reflectance Model for Medical Image Segmentation and Bias-Field Correction

Authors: Wenqi Zhao, Jiacheng Sang, Fenghua Cheng, Yonglu Shu, Dong Li, Xiaofeng Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354] arXiv:2512.07190 [pdf, ps, other]: Title: Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification

Authors: Pengfei Gu, Huimin Li, Haoteng Tang, Dongkuan (DK) Xu, Erik Enriquez, DongChul Kim, Bin Fu, Danny Z. Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[355] arXiv:2512.07186 [pdf, ps, other]: Title: START: Spatial and Textual Learning for Chart Understanding

Authors: Zhuoming Liu, Xiaofeng Gao, Feiyang Niu, Qiaozi Gao, Liu Liu, Robinson Piramuthu

Comments: WACV 2026 Camera Ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356] arXiv:2512.07171 [pdf, ps, other]: Title: TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration

Authors: Shravan Venkatraman, Rakesh Raj Madavan, Pavan Kumar S, Muthu Subash Kavitha

Comments: 21 pages, 11 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2512.07170 [pdf, ps, other]: Title: Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach

Authors: Jiayang Li, Chengjie Jiang, Junjun Jiang, Pengwei Liang, Jiayi Ma, Liqiang Nie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[358] arXiv:2512.07166 [pdf, ps, other]: Title: When Privacy Meets Recovery: The Overlooked Half of Surrogate-Driven Privacy Preservation for MLLM Editing

Authors: Siyuan Xu, Yibing Liu, Peilin Chen, Yung-Hui Li, Shiqi Wang, Sam Kwong

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359] arXiv:2512.07165 [pdf, ps, other]: Title: MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation

Authors: Muyu Xu, Fangneng Zhan, Xiaoqin Zhang, Ling Shao, Shijian Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360] arXiv:2512.07155 [pdf, ps, other]: Title: CHIMERA: Adaptive Cache Injection and Semantic Anchor Prompting for Zero-shot Image Morphing with Morphing-oriented Metrics

Authors: Dahyeon Kye, Jeahun Sung, Mingyu Jeon, Jihyong Oh

Comments: Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361] arXiv:2512.07141 [pdf, ps, other]: Title: Think-Reflect-Revise: A Policy-Guided Reflective Framework for Safety Alignment in Large Vision Language Models

Authors: Fenghua Weng, Chaochao Lu, Xia Hu, Wenqi Shao, Wenjie Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[362] arXiv:2512.07136 [pdf, ps, other]: Title: A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning

Authors: Siyang Jiang, Mu Yuan, Xiang Ji, Bufang Yang, Zeyu Liu, Lilin Xu, Yang Li, Yuting He, Liran Dong, Wenrui Lu, Zhenyu Yan, Xiaofan Jiang, Wei Gao, Hongkai Chen, Guoliang Xing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[363] arXiv:2512.07135 [pdf, ps, other]: Title: TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning

Authors: Zebin Xing, Pengxuan Yang, Linbo Wang, Yichen Zhang, Yiming Hu, Yupeng Zheng, Junli Wang, Yinfeng Gao, Guang Li, Kun Ma, Long Chen, Zhongpu Xia, Qichao Zhang, Hangjun Ye, Dongbin Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[364] arXiv:2512.07128 [pdf, ps, other]: Title: MulCLIP: A Multi-level Alignment Framework for Enhancing Fine-grained Long-context CLIP

Authors: Chau Truong, Hieu Ta Quang, Dung D. Le

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365] arXiv:2512.07126 [pdf, ps, other]: Title: Training-free Clothing Region of Interest Self-correction for Virtual Try-On

Authors: Shengjie Lu, Zhibin Wan, Jiejie Liu, Quan Zhang, Mingjie Sun

Comments: 16 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2512.07110 [pdf, ps, other]: Title: MSN: Multi-directional Similarity Network for Hand-crafted and Deep-synthesized Copy-Move Forgery Detection

Authors: Liangwei Jiang, Jinluo Xie, Yecheng Huang, Hua Zhang, Hongyu Yang, Di Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2512.07107 [pdf, ps, other]: Title: COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision

Authors: Jaeyoon Lee, Hojoon Jung, Sungtae Hwang, Jihyong Oh, Jongwon Choi

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2512.07078 [pdf, ps, other]: Title: DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection

Authors: Bo Gao, Jingcheng Tong, Xingsheng Chen, Han Yu, Zichen Li

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[369] arXiv:2512.07076 [pdf, ps, other]: Title: Context-measure: Contextualizing Metric for Camouflage

Authors: Chen-Yang Wang, Gepeng Ji, Song Shao, Ming-Ming Cheng, Deng-Ping Fan

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2512.07065 [pdf, ps, other]: Title: Persistent Homology-Guided Frequency Filtering for Image Compression

Authors: Anil Chintapalli, Peter Tenholder, Henry Chen, Arjun Rao

Comments: 17 pages, 8 figures, code available at github.com/RMATH3/persistent-homology-compression

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2512.07062 [pdf, ps, other]: Title: $\mathrm{D}^{\mathrm{3}}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction

Authors: Changliang Xia, Chengyou Jia, Minnan Luo, Zhuohang Dang, Xin Shen, Bowen Ping

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[372] arXiv:2512.07052 [pdf, ps, other]: Title: RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting

Authors: Hoang-Nhat Tran, Francesco Di Sario, Gabriele Spadaro, Giuseppe Valenzise, Enzo Tartaglione

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2512.07051 [pdf, ps, other]: Title: DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation

Authors: Adnan Munir, Shujaat Khan

Comments: 11 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[374] arXiv:2512.07037 [pdf, ps, other]: Title: Evaluating and Preserving High-level Fidelity in Super-Resolution

Authors: Josep M. Rocafort, Shaolin Su, Alexandra Gomez-Villa, Javier Vazquez-Corral

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[375] arXiv:2512.07034 [pdf, ps, other]: Title: Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues

Authors: Tuan-Anh Vu, Hai Nguyen-Truong, Ziqiang Zheng, Binh-Son Hua, Qing Guo, Ivor Tsang, Sai-Kit Yeung

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[376] arXiv:2512.06981 [pdf, ps, other]: Title: Selective Masking based Self-Supervised Learning for Image Semantic Segmentation

Authors: Yuemin Wang, Ian Stavness

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[377] arXiv:2512.06949 [pdf, ps, other]: Title: Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology

Authors: Shravan Venkatraman, Muthu Subash Kavitha, Joe Dhanith P R, V Manikandarajan, Jia Wu

Comments: 19 pages, 5 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378] arXiv:2512.06921 [pdf, ps, other]: Title: NeuroABench: A Multimodal Evaluation Benchmark for Neurosurgical Anatomy Identification

Authors: Ziyang Song, Zelin Zang, Xiaofan Ye, Boqiang Xu, Long Bai, Jinlin Wu, Hongliang Ren, Hongbin Liu, Jiebo Luo, Zhen Lei

Comments: Accepted by IEEE ICIA 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[379] arXiv:2512.06905 [pdf, ps, other]: Title: Scaling Zero-Shot Reference-to-Video Generation

Authors: Zijian Zhou, Shikun Liu, Haozhe Liu, Haonan Qiu, Zhaochong An, Weiming Ren, Zhiheng Liu, Xiaoke Huang, Kam Woh Ng, Tian Xie, Xiao Han, Yuren Cong, Hang Li, Chuyan Zhu, Aditya Patel, Tao Xiang, Sen He

Comments: Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2512.06888 [pdf, ps, other]: Title: Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation

Authors: Liyang Song, Hardik Bishnoi, Sai Kumar Reddy Manne, Sarah Ostadabbas, Briana J. Taylor, Michael Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381] arXiv:2512.06886 [pdf, ps, other]: Title: Balanced Learning for Domain Adaptive Semantic Segmentation

Authors: Wangkai Li, Rui Sun, Bohao Liao, Zhaoyang Li, Tianzhu Zhang

Comments: Accepted by International Conference on Machine Learning (ICML 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2512.06885 [pdf, ps, other]: Title: JoPano: Unified Panorama Generation via Joint Modeling

Authors: Wancheng Feng, Chen An, Zhenliang He, Meina Kan, Shiguang Shan, Lukun Wang

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383] arXiv:2512.06882 [pdf, ps, other]: Title: Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion

Authors: Yu Zhu, Naoya Chiba, Koichi Hashimoto

Comments: Accepted to BMVC 2025 (Sheffield, UK, Nov 24-27, 2025). Supplementary video and poster available upon request

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2512.06877 [pdf, ps, other]: Title: SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification

Authors: Mohammed Q. Alkhatib, Ali Jamali, Swalpa Kumar Roy

Comments: Accepted and presented at ICSPIS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2512.06870 [pdf, ps, other]: Title: Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective

Authors: Wangkai Li, Rui Sun, Zhaoyang Li, Tianzhu Zhang

Comments: Accepted by Conference on Neural Information Processing Systems (NeurIPS 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386] arXiv:2512.06866 [pdf, ps, other]: Title: Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior

Authors: Yulin Li, Haokun Gui, Ziyang Fan, Junjie Wang, Bin Kang, Bin Chen, Zhuotao Tian

Comments: Accepted by NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[387] arXiv:2512.06865 [pdf, ps, other]: Title: Spatial Retrieval Augmented Autonomous Driving

Authors: Xiaosong Jia, Chenhe Zhang, Yule Jiang, Songbur Wong, Zhiyuan Zhang, Chen Chen, Shaofeng Zhang, Xuanhe Zhou, Xue Yang, Junchi Yan, Yu-Gang Jiang

Comments: Demo Page: this https URL with open sourced code, dataset, and checkpoints

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2512.06864 [pdf, ps, other]: Title: Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training

Authors: Kaixuan Lu, Mehmet Onurcan Kaya, Dim P. Papadopoulos

Comments: Accepted to WACV 2026. arXiv admin note: substantial text overlap with arXiv:2508.19808

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2512.06862 [pdf, ps, other]: Title: Omni-Referring Image Segmentation

Authors: Qiancheng Zheng, Yunhang Shen, Gen Luo, Baiyang Song, Xing Sun, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2512.06849 [pdf, ps, other]: Title: Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT

Authors: Matan Atad, Alexander W. Marka, Lisa Steinhelfer, Anna Curto-Vilalta, Yannik Leonhardt, Sarah C. Foreman, Anna-Sophia Walburga Dietrich, Robert Graf, Alexandra S. Gersing, Bjoern Menze, Daniel Rueckert, Jan S. Kirschke, Hendrik Möller

Comments: In submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[391] arXiv:2512.06845 [pdf, ps, other]: Title: Pseudo Anomalies Are All You Need: Diffusion-Based Generation for Weakly-Supervised Video Anomaly Detection

Authors: Satoshi Hashimoto, Hitoshi Nishimura, Yanan Wang, Mori Kurokawa

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2512.06840 [pdf, ps, other]: Title: CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles

Authors: Satoshi Hashimoto, Tatsuya Konishi, Tomoya Kaichi, Kazunori Matsumoto, Mori Kurokawa

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2512.06838 [pdf, ps, other]: Title: SparseCoop: Cooperative Perception with Kinematic-Grounded Queries

Authors: Jiahao Wang, Zhongwei Jiang, Wenchao Sun, Jiaru Zhong, Haibao Yu, Yuner Zhang, Chenyang Lu, Chuang Zhang, Lei He, Shaobing Xu, Jianqiang Wang

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394] arXiv:2512.06818 [pdf, ps, other]: Title: MeshSplatting: Differentiable Rendering with Opaque Meshes

Authors: Jan Held, Sanghyun Son, Renaud Vandeghen, Daniel Rebain, Matheus Gadelha, Yi Zhou, Anthony Cioppa, Ming C. Lin, Marc Van Droogenbroeck, Andrea Tagliasacchi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2512.06811 [pdf, ps, other]: Title: RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models

Authors: Xiang Lin, Weixin Li, Shu Guo, Lihong Wang, Di Huang

Comments: Accepted by AAAI 2026 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[396] arXiv:2512.06810 [pdf, ps, other]: Title: MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

Authors: Yueqian Wang, Songxiang Liu, Disong Wang, Nuo Xu, Guanglu Wan, Huishuai Zhang, Dongyan Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
