Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 225

[ total of 749 entries: 1-50 | ... | 76-125 | 126-175 | 176-225 | 226-275 | 276-325 | 326-375 | 376-425 | ... | 726-749 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)

[226] arXiv:2512.07141 [pdf, ps, other]: Title: Think-Reflect-Revise: A Policy-Guided Reflective Framework for Safety Alignment in Large Vision Language Models

Authors: Fenghua Weng, Chaochao Lu, Xia Hu, Wenqi Shao, Wenjie Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[227] arXiv:2512.07136 [pdf, ps, other]: Title: A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning

Authors: Siyang Jiang, Mu Yuan, Xiang Ji, Bufang Yang, Zeyu Liu, Lilin Xu, Yang Li, Yuting He, Liran Dong, Wenrui Lu, Zhenyu Yan, Xiaofan Jiang, Wei Gao, Hongkai Chen, Guoliang Xing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[228] arXiv:2512.07135 [pdf, ps, other]: Title: TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning

Authors: Zebin Xing, Pengxuan Yang, Linbo Wang, Yichen Zhang, Yiming Hu, Yupeng Zheng, Junli Wang, Yinfeng Gao, Guang Li, Kun Ma, Long Chen, Zhongpu Xia, Qichao Zhang, Hangjun Ye, Dongbin Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[229] arXiv:2512.07128 [pdf, ps, other]: Title: MulCLIP: A Multi-level Alignment Framework for Enhancing Fine-grained Long-context CLIP

Authors: Chau Truong, Hieu Ta Quang, Dung D. Le

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230] arXiv:2512.07126 [pdf, ps, other]: Title: Training-free Clothing Region of Interest Self-correction for Virtual Try-On

Authors: Shengjie Lu, Zhibin Wan, Jiejie Liu, Quan Zhang, Mingjie Sun

Comments: 16 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231] arXiv:2512.07110 [pdf, ps, other]: Title: MSN: Multi-directional Similarity Network for Hand-crafted and Deep-synthesized Copy-Move Forgery Detection

Authors: Liangwei Jiang, Jinluo Xie, Yecheng Huang, Hua Zhang, Hongyu Yang, Di Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2512.07107 [pdf, ps, other]: Title: COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision

Authors: Jaeyoon Lee, Hojoon Jung, Sungtae Hwang, Jihyong Oh, Jongwon Choi

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233] arXiv:2512.07078 [pdf, ps, other]: Title: DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection

Authors: Bo Gao, Jingcheng Tong, Xingsheng Chen, Han Yu, Zichen Li

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234] arXiv:2512.07076 [pdf, ps, other]: Title: Context-measure: Contextualizing Metric for Camouflage

Authors: Chen-Yang Wang, Gepeng Ji, Song Shao, Ming-Ming Cheng, Deng-Ping Fan

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235] arXiv:2512.07065 [pdf, ps, other]: Title: Persistent Homology-Guided Frequency Filtering for Image Compression

Authors: Anil Chintapalli, Peter Tenholder, Henry Chen, Arjun Rao

Comments: 17 pages, 8 figures, code available at github.com/RMATH3/persistent-homology-compression

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[236] arXiv:2512.07062 [pdf, ps, other]: Title: $\mathrm{D}^{\mathrm{3}}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction

Authors: Changliang Xia, Chengyou Jia, Minnan Luo, Zhuohang Dang, Xin Shen, Bowen Ping

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[237] arXiv:2512.07052 [pdf, ps, other]: Title: RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting

Authors: Hoang-Nhat Tran, Francesco Di Sario, Gabriele Spadaro, Giuseppe Valenzise, Enzo Tartaglione

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[238] arXiv:2512.07051 [pdf, ps, other]: Title: DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation

Authors: Adnan Munir, Shujaat Khan

Comments: 11 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[239] arXiv:2512.07037 [pdf, ps, other]: Title: Evaluating and Preserving High-level Fidelity in Super-Resolution

Authors: Josep M. Rocafort, Shaolin Su, Alexandra Gomez-Villa, Javier Vazquez-Corral

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[240] arXiv:2512.07034 [pdf, ps, other]: Title: Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues

Authors: Tuan-Anh Vu, Hai Nguyen-Truong, Ziqiang Zheng, Binh-Son Hua, Qing Guo, Ivor Tsang, Sai-Kit Yeung

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[241] arXiv:2512.06981 [pdf, ps, other]: Title: Selective Masking based Self-Supervised Learning for Image Semantic Segmentation

Authors: Yuemin Wang, Ian Stavness

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[242] arXiv:2512.06949 [pdf, ps, other]: Title: Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology

Authors: Shravan Venkatraman, Muthu Subash Kavitha, Joe Dhanith P R, V Manikandarajan, Jia Wu

Comments: 19 pages, 5 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[243] arXiv:2512.06921 [pdf, ps, other]: Title: NeuroABench: A Multimodal Evaluation Benchmark for Neurosurgical Anatomy Identification

Authors: Ziyang Song, Zelin Zang, Xiaofan Ye, Boqiang Xu, Long Bai, Jinlin Wu, Hongliang Ren, Hongbin Liu, Jiebo Luo, Zhen Lei

Comments: Accepted by IEEE ICIA 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[244] arXiv:2512.06905 [pdf, ps, other]: Title: Scaling Zero-Shot Reference-to-Video Generation

Authors: Zijian Zhou, Shikun Liu, Haozhe Liu, Haonan Qiu, Zhaochong An, Weiming Ren, Zhiheng Liu, Xiaoke Huang, Kam Woh Ng, Tian Xie, Xiao Han, Yuren Cong, Hang Li, Chuyan Zhu, Aditya Patel, Tao Xiang, Sen He

Comments: Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[245] arXiv:2512.06888 [pdf, ps, other]: Title: Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation

Authors: Liyang Song, Hardik Bishnoi, Sai Kumar Reddy Manne, Sarah Ostadabbas, Briana J. Taylor, Michael Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246] arXiv:2512.06886 [pdf, ps, other]: Title: Balanced Learning for Domain Adaptive Semantic Segmentation

Authors: Wangkai Li, Rui Sun, Bohao Liao, Zhaoyang Li, Tianzhu Zhang

Comments: Accepted by International Conference on Machine Learning (ICML 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2512.06885 [pdf, ps, other]: Title: JoPano: Unified Panorama Generation via Joint Modeling

Authors: Wancheng Feng, Chen An, Zhenliang He, Meina Kan, Shiguang Shan, Lukun Wang

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[248] arXiv:2512.06882 [pdf, ps, other]: Title: Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion

Authors: Yu Zhu, Naoya Chiba, Koichi Hashimoto

Comments: Accepted to BMVC 2025 (Sheffield, UK, Nov 24-27, 2025). Supplementary video and poster available upon request

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2512.06877 [pdf, ps, other]: Title: SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification

Authors: Mohammed Q. Alkhatib, Ali Jamali, Swalpa Kumar Roy

Comments: Accepted and presented in ICSPIS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2512.06870 [pdf, ps, other]: Title: Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective

Authors: Wangkai Li, Rui Sun, Zhaoyang Li, Tianzhu Zhang

Comments: Accepted by Conference on Neural Information Processing Systems (NeurIPS 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251] arXiv:2512.06866 [pdf, ps, other]: Title: Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior

Authors: Yulin Li, Haokun Gui, Ziyang Fan, Junjie Wang, Bin Kang, Bin Chen, Zhuotao Tian

Comments: Accepted by NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[252] arXiv:2512.06865 [pdf, ps, other]: Title: Spatial Retrieval Augmented Autonomous Driving

Authors: Xiaosong Jia, Chenhe Zhang, Yule Jiang, Songbur Wong, Zhiyuan Zhang, Chen Chen, Shaofeng Zhang, Xuanhe Zhou, Xue Yang, Junchi Yan, Yu-Gang Jiang

Comments: Demo Page: this https URL with open sourced code, dataset, and checkpoints

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2512.06864 [pdf, ps, other]: Title: Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training

Authors: Kaixuan Lu, Mehmet Onurcan Kaya, Dim P. Papadopoulos

Comments: Accepted to WACV 2026. arXiv admin note: substantial text overlap with arXiv:2508.19808

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254] arXiv:2512.06862 [pdf, ps, other]: Title: Omni-Referring Image Segmentation

Authors: Qiancheng Zheng, Yunhang Shen, Gen Luo, Baiyang Song, Xing Sun, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255] arXiv:2512.06849 [pdf, ps, other]: Title: Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT

Authors: Matan Atad, Alexander W. Marka, Lisa Steinhelfer, Anna Curto-Vilalta, Yannik Leonhardt, Sarah C. Foreman, Anna-Sophia Walburga Dietrich, Robert Graf, Alexandra S. Gersing, Bjoern Menze, Daniel Rueckert, Jan S. Kirschke, Hendrik Möller

Comments: In submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[256] arXiv:2512.06845 [pdf, ps, other]: Title: Pseudo Anomalies Are All You Need: Diffusion-Based Generation for Weakly-Supervised Video Anomaly Detection

Authors: Satoshi Hashimoto, Hitoshi Nishimura, Yanan Wang, Mori Kurokawa

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2512.06840 [pdf, ps, other]: Title: CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles

Authors: Satoshi Hashimoto, Tatsuya Konishi, Tomoya Kaichi, Kazunori Matsumoto, Mori Kurokawa

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2512.06838 [pdf, ps, other]: Title: SparseCoop: Cooperative Perception with Kinematic-Grounded Queries

Authors: Jiahao Wang, Zhongwei Jiang, Wenchao Sun, Jiaru Zhong, Haibao Yu, Yuner Zhang, Chenyang Lu, Chuang Zhang, Lei He, Shaobing Xu, Jianqiang Wang

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2512.06818 [pdf, ps, other]: Title: MeshSplatting: Differentiable Rendering with Opaque Meshes

Authors: Jan Held, Sanghyun Son, Renaud Vandeghen, Daniel Rebain, Matheus Gadelha, Yi Zhou, Anthony Cioppa, Ming C. Lin, Marc Van Droogenbroeck, Andrea Tagliasacchi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[260] arXiv:2512.06811 [pdf, ps, other]: Title: RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models

Authors: Xiang Lin, Weixin Li, Shu Guo, Lihong Wang, Di Huang

Comments: Accepted by AAAI 2026(Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[261] arXiv:2512.06810 [pdf, ps, other]: Title: MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

Authors: Yueqian Wang, Songxiang Liu, Disong Wang, Nuo Xu, Guanglu Wan, Huishuai Zhang, Dongyan Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[262] arXiv:2512.06802 [pdf, ps, other]: Title: VDOT: Efficient Unified Video Creation via Optimal Transport Distillation

Authors: Yutong Wang, Haiyu Zhang, Tianfan Xue, Yu Qiao, Yaohui Wang, Chang Xu, Xinyuan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2512.06793 [pdf, ps, other]: Title: Generalized Geometry Encoding Volume for Real-time Stereo Matching

Authors: Jiaxin Liu, Gangwei Xu, Xianqi Wang, Chengliang Zhang, Xin Yang

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2512.06783 [pdf, ps, other]: Title: Physics Informed Human Posture Estimation Based on 3D Landmarks from Monocular RGB-Videos

Authors: Tobias Leuthold, Michele Xiloyannis, Yves Zimmermann

Comments: 16 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2512.06774 [pdf, ps, other]: Title: RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting

Authors: Longjie Zhao, Ziming Hong, Zhenyang Ren, Runnan Chen, Mingming Gong, Tongliang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[266] arXiv:2512.06769 [pdf, ps, other]: Title: Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding

Authors: Hang Yin, Xiaomin He, PeiWen Yuan, Yiwei Li, Jiayi Shi, Wenxiao Fan, Shaoxiong Feng, Kan Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[267] arXiv:2512.06763 [pdf, ps, other]: Title: JOCA: Task-Driven Joint Optimisation of Camera Hardware and Adaptive Camera Control Algorithms

Authors: Chengyang Yan, Mitch Bryson, Donald G. Dansereau

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2512.06759 [pdf, ps, other]: Title: VisChainBench: A Benchmark for Multi-Turn, Multi-Image Visual Reasoning Beyond Language Priors

Authors: Wenbo Lyu, Yingjun Du, Jinglin Zhao, Xianton Zhen, Ling Shao

Comments: 12 pages,13figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[269] arXiv:2512.06750 [pdf, ps, other]: Title: UARE: A Unified Vision-Language Model for Image Quality Assessment, Restoration, and Enhancement

Authors: Weiqi Li, Xuanyu Zhang, Bin Chen, Jingfen Xie, Yan Wang, Kexin Zhang, Junlin Li, Li Zhang, Jian Zhang, Shijie Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2512.06746 [pdf, ps, other]: Title: Task-Model Alignment: A Simple Path to Generalizable AI-Generated Image Detection

Authors: Ruoxin Chen, Jiahui Gao, Kaiqing Lin, Keyue Zhang, Yandan Zhao, Isabel Guan, Taiping Yao, Shouhong Ding

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[271] arXiv:2512.06738 [pdf, ps, other]: Title: FedSCAl: Leveraging Server and Client Alignment for Unsupervised Federated Source-Free Domain Adaptation

Authors: M Yashwanth, Sampath Koti, Arunabh Singh, Shyam Marjit, Anirban Chakraborty

Comments: Accepted to Winter Conference on Applications of Computer Vision (WACV) 2026, Round 1

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272] arXiv:2512.06736 [pdf, ps, other]: Title: Graph Convolutional Long Short-Term Memory Attention Network for Post-Stroke Compensatory Movement Detection Based on Skeleton Data

Authors: Jiaxing Fan, Jiaojiao Liu, Wenkong Wang, Yang Zhang, Xin Ma, Jichen Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2512.06726 [pdf, ps, other]: Title: The Role of Entropy in Visual Grounding: Analysis and Optimization

Authors: Shuo Li, Jiajun Sun, Zhihao Zhang, Xiaoran Fan, Senjie Jin, Hui Li, Yuming Yang, Junjie Ye, Lixing Shen, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[274] arXiv:2512.06689 [pdf, ps, other]: Title: Lightweight Wasserstein Audio-Visual Model for Unified Speech Enhancement and Separation

Authors: Jisoo Park, Seonghak Lee, Guisik Kim, Taewoo Kim, Junseok Kwon

Comments: Accepted to ASRU 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[275] arXiv:2512.06684 [pdf, ps, other]: Title: EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy

Authors: Yumeng He, Zanwei Zhou, Yekun Zheng, Chen Liang, Yunbo Wang, Xiaokang Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 749 entries: 1-50 | ... | 76-125 | 126-175 | 176-225 | 226-275 | 276-325 | 326-375 | 376-425 | ... | 726-749 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 225

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)