Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 159

[ total of 749 entries: 1-50 | 10-59 | 60-109 | 110-159 | 160-209 | 210-259 | 260-309 | 310-359 | ... | 710-749 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)

[160] arXiv:2512.07651 [pdf, ps, other]: Title: Liver Fibrosis Quantification and Analysis: The LiQA Dataset and Baseline Method

Authors: Yuanye Liu, Hanxiao Zhang, Nannan Shi, Yuxin Shi, Arif Mahmood, Murtaza Taj, Xiahai Zhuang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161] arXiv:2512.07628 [pdf, ps, other]: Title: MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation

Authors: Zhiqi Li, Wenhuan Li, Tengfei Wang, Zhenwei Wang, Junta Wu, Haoyuan Wang, Yunhan Yang, Zehuan Huang, Yang Li, Peidong Liu, Chunchao Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162] arXiv:2512.07606 [pdf, ps, other]: Title: Decomposition Sampling for Efficient Region Annotations in Active Learning

Authors: Jingna Qiu, Frauke Wilm, Mathias Öttl, Jonas Utz, Maja Schlereth, Moritz Schillinger, Marc Aubreville, Katharina Breininger

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163] arXiv:2512.07599 [pdf, ps, other]: Title: Online Segment Any 3D Thing as Instance Tracking

Authors: Hanshi Wang, Zijian Cai, Jin Gao, Yiwei Zhang, Weiming Hu, Ke Wang, Zhipeng Zhang

Comments: NeurIPS 2025, Code is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2512.07596 [pdf, ps, other]: Title: More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

Authors: Wenzhen Dong, Jieming Yu, Yiming Huang, Hongqiu Wang, Lei Zhu, Albert C. S. Chung, Hongliang Ren, Long Bai

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[165] arXiv:2512.07590 [pdf, ps, other]: Title: Robust Variational Model Based Tailored UNet: Leveraging Edge Detector and Mean Curvature for Improved Image Segmentation

Authors: Kaili Qi, Zhongyi Huang, Wenli Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166] arXiv:2512.07584 [pdf, ps, other]: Title: LongCat-Image Technical Report

Authors: Meituan LongCat Team: Hanghang Ma, Haoxian Tan, Jiale Huang, Junqiang Wu, Jun-Yan He, Lishuai Gao, Songlin Xiao, Xiaoming Wei, Xiaoqi Ma, Xunliang Cai, Yayong Guan, Jie Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167] arXiv:2512.07580 [pdf, ps, other]: Title: All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs

Authors: Yahong Wang, Juncheng Wu, Zhangkai Ni, Longzhen Yang, Yihang Liu, Chengmei Yang, Ying Wen, Xianfeng Tang, Hui Liu, Yuyin Zhou, Lianghua He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168] arXiv:2512.07568 [pdf, ps, other]: Title: Dual-Stream Cross-Modal Representation Learning via Residual Semantic Decorrelation

Authors: Xuecheng Li, Weikuan Jia, Alisher Kurbonaliev, Qurbonaliev Alisher, Khudzhamkulov Rustam, Ismoilov Shuhratjon, Eshmatov Javhariddin, Yuanjie Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[169] arXiv:2512.07564 [pdf, ps, other]: Title: Toward More Reliable Artificial Intelligence: Reducing Hallucinations in Vision-Language Models

Authors: Kassoum Sanogo, Renzo Ardiccioni

Comments: 24 pages, 3 figures, 2 tables. Training-free self-correction framework for vision-language models. Code and implementation details will be released at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170] arXiv:2512.07527 [pdf, ps, other]: Title: From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images

Authors: Fei Yu, Yu Liu, Luyang Tang, Mingchao Sun, Zengye Ge, Rui Bu, Yuchao Jin, Haisen Zhao, He Sun, Yangyan Li, Mu Xu, Wenzheng Chen, Baoquan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[171] arXiv:2512.07514 [pdf, ps, other]: Title: MeshRipple: Structured Autoregressive Generation of Artist-Meshes

Authors: Junkai Lin, Hang Long, Huipeng Guo, Jielei Zhang, JiaYi Yang, Tianle Guo, Yang Yang, Jianwen Li, Wenxiao Zhang, Matthias Nießner, Wei Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172] arXiv:2512.07504 [pdf, ps, other]: Title: ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points

Authors: Ryota Okumura, Kaede Shiohara, Toshihiko Yamasaki

Comments: Accepted to WACV 2026, 8 pages, supplementary included. Dataset and code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173] arXiv:2512.07503 [pdf, ps, other]: Title: SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation

Authors: Yao Teng, Zhihuan Jiang, Han Shi, Xian Liu, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174] arXiv:2512.07500 [pdf, ps, other]: Title: MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer

Authors: Penghui Liu, Jiangshan Wang, Yutong Shen, Shanhui Mo, Chenyang Qi, Yue Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2512.07498 [pdf, ps, other]: Title: Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior

Authors: Chih-Chung Hsu, Shao-Ning Chen, Chia-Ming Lee, Yi-Fang Wang, Yi-Shiuan Chou

Comments: 16 pages (including appendix)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176] arXiv:2512.07480 [pdf, ps, other]: Title: Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance

Authors: Naifu Xue, Zhaoyang Jia, Jiahao Li, Bin Li, Zihan Zheng, Yuan Zhang, Yan Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177] arXiv:2512.07469 [pdf, ps, other]: Title: Unified Video Editing with Temporal Reasoner

Authors: Xiangpeng Yang, Ji Xie, Yiyuan Yang, Yan Huang, Min Xu, Qiang Wu

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2512.07426 [pdf, ps, other]: Title: When normalization hallucinates: unseen risks in AI-powered whole slide image processing

Authors: Karel Moens, Matthew B. Blaschko, Tinne Tuytelaars, Bart Diricx, Jonas De Vylder, Mustafa Yousif

Comments: 4 pages, accepted for oral presentation at SPIE Medical Imaging, 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[179] arXiv:2512.07415 [pdf, ps, other]: Title: Data-driven Exploration of Mobility Interaction Patterns

Authors: Gabriele Galatolo, Mirco Nanni

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[180] arXiv:2512.07410 [pdf, ps, other]: Title: InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs

Authors: Bin Li, Ruichi Zhang, Han Liang, Jingyan Zhang, Juze Zhang, Xin Chen, Lan Xu, Jingyi Yu, Jingya Wang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181] arXiv:2512.07394 [pdf, ps, other]: Title: Reconstructing Objects along Hand Interaction Timelines in Egocentric Video

Authors: Zhifan Zhu, Siddhant Bansal, Shashank Tripathi, Dima Damen

Comments: webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2512.07391 [pdf, ps, other]: Title: GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring

Authors: Đorđe Nedeljković

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2512.07385 [pdf, ps, other]: Title: How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline

Authors: Chunhui Zhang, Li Liu, Zhipeng Zhang, Yong Wang, Hao Wen, Xi Zhou, Shiming Ge, Yanfeng Wang

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184] arXiv:2512.07383 [pdf, ps, other]: Title: LogicCBMs: Logic-Enhanced Concept-Based Learning

Authors: Deepika SN Vemuri, Gautham Bellamkonda, Aditya Pola, Vineeth N Balasubramanian

Comments: 18 pages, 19 figures, WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2512.07381 [pdf, ps, other]: Title: Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects

Authors: Shuohan Tao, Boyao Zhou, Hanzhang Tu, Yuwang Wang, Yebin Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2512.07379 [pdf, ps, other]: Title: Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency

Authors: Mahila Moghadami, Mohammad Ali Keyvanrad, Melika Sabaghian

Comments: 22 pages, 16 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187] arXiv:2512.07360 [pdf, ps, other]: Title: Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation

Authors: Qiming Huang, Hao Ai, Jianbo Jiao

Comments: Accepted to WACV2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[188] arXiv:2512.07351 [pdf, ps, other]: Title: DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection

Authors: Sayeem Been Zaman, Wasimul Karim, Arefin Ittesafun Abian, Reem E. Mohamed, Md Rafiqul Islam, Asif Karim, Sami Azam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD)
[189] arXiv:2512.07348 [pdf, ps, other]: Title: MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

Authors: Xinyu Wei, Kangrui Cen, Hongyang Wei, Zhen Guo, Bairui Li, Zeqing Wang, Jinrui Zhang, Lei Zhang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[190] arXiv:2512.07345 [pdf, ps, other]: Title: Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting

Authors: Shilong Jin, Haoran Duan, Litao Hua, Wentao Huang, Yuan Zhou

Comments: 15 pages, 8 figures, 5 tables, 2 algorithms, Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191] arXiv:2512.07338 [pdf, ps, other]: Title: Generalized Referring Expression Segmentation on Aerial Photos

Authors: Luís Marnoto, Alexandre Bernardino, Bruno Martins

Comments: Submitted to IEEE J-STARS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[192] arXiv:2512.07331 [pdf, ps, other]: Title: The Inductive Bottleneck: Data-Driven Emergence of Representational Sparsity in Vision Transformers

Authors: Kanishk Awadhiya

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193] arXiv:2512.07328 [pdf, ps, other]: Title: ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation

Authors: Ziyang Mai, Yu-Wing Tai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[194] arXiv:2512.07305 [pdf, ps, other]: Title: Reevaluating Automated Wildlife Species Detection: A Reproducibility Study on a Custom Image Dataset

Authors: Tobias Abraham Haider

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2512.07302 [pdf, ps, other]: Title: Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts

Authors: Mingning Guo, Mengwei Wu, Shaoxian Li, Haifeng Li, Chao Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[196] arXiv:2512.07276 [pdf, ps, other]: Title: Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery

Authors: Mai Tsujimoto, Junjue Wang, Weihao Xuan, Naoto Yokoya

Comments: Accepted to WACV 2026. Camera-ready-based version with minor edits for readability (no change in the contents)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[197] arXiv:2512.07275 [pdf, ps, other]: Title: Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation

Authors: Siyu Wang, Hua Wang, Huiyu Li, Fan Zhang

Comments: The paper has been accepted by BIBM 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[198] arXiv:2512.07273 [pdf, ps, other]: Title: RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation

Authors: Zhi Rao, Yucheng Zhou, Benjia Zhou, Yiqing Huang, Sergio Escalera, Jun Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2512.07269 [pdf, ps, other]: Title: A graph generation pipeline for critical infrastructures based on heuristics, images and depth data

Authors: Mike Diessner, Yannick Tarant

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[200] arXiv:2512.07253 [pdf, ps, other]: Title: DGGAN: Degradation Guided Generative Adversarial Network for Real-time Endoscopic Video Enhancement

Authors: Handing Xu, Zhenguo Nie, Tairan Peng, Huimin Pan, Xin-Jun Liu

Comments: 18 pages, 8 figures, and 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[201] arXiv:2512.07251 [pdf, ps, other]: Title: See More, Change Less: Anatomy-Aware Diffusion for Contrast Enhancement

Authors: Junqi Liu, Zejun Wu, Pedro R. A. S. Bassi, Xinze Zhou, Wenxuan Li, Ibrahim E. Hamamci, Sezgin Er, Tianyu Lin, Yi Luo, Szymon Płotka, Bjoern Menze, Daguang Xu, Kai Ding, Kang Wang, Yang Yang, Yucheng Tang, Alan L. Yuille, Zongwei Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202] arXiv:2512.07247 [pdf, ps, other]: Title: AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing

Authors: Ziming Hong, Tianyu Huang, Runnan Chen, Shanshan Ye, Mingming Gong, Bo Han, Tongliang Liu

Comments: 40 pages, 34 figures, 18 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[203] arXiv:2512.07245 [pdf, ps, other]: Title: Zero-Shot Textual Explanations via Translating Decision-Critical Features

Authors: Toshinori Yamauchi, Hiroshi Kera, Kazuhiko Kawamoto

Comments: 11+6 pages, 8 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2512.07241 [pdf, ps, other]: Title: Squeezed-Eff-Net: Edge-Computed Boost of Tomography Based Brain Tumor Classification leveraging Hybrid Neural Network Architecture

Authors: Md. Srabon Chowdhury, Syeda Fahmida Tanzim, Sheekar Banerjee, Ishtiak Al Mamoon, AKM Muzahidul Islam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2512.07237 [pdf, ps, other]: Title: Unified Camera Positional Encoding for Controlled Video Generation

Authors: Cheng Zhang, Boying Li, Meng Wei, Yan-Pei Cao, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206] arXiv:2512.07234 [pdf, ps, other]: Title: Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models

Authors: Biao Chen, Lin Zuo, Mengmeng Jing, Kunbin He, Yuchen Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[207] arXiv:2512.07230 [pdf, ps, other]: Title: STRinGS: Selective Text Refinement in Gaussian Splatting

Authors: Abhinav Raundhal, Gaurav Behera, P J Narayanan, Ravi Kiran Sarvadevabhatla, Makarand Tapaswi

Comments: Accepted to WACV 2026. Project Page, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208] arXiv:2512.07229 [pdf, ps, other]: Title: ReLKD: Inter-Class Relation Learning with Knowledge Distillation for Generalized Category Discovery

Authors: Fang Zhou, Zhiqiang Chen, Martin Pavlovski, Yizhong Zhang

Comments: Accepted to the Main Track of the 28th European Conference on Artificial Intelligence (ECAI 2025). To appear in the proceedings published by IOS Press (DOI: 10.3233/FAIA413)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[209] arXiv:2512.07228 [pdf, ps, other]: Title: Towards Robust Protective Perturbation against DeepFake Face Swapping

Authors: Hengyang Yao, Lin Li, Ke Sun, Jianing Qiu, Huiping Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)

[ total of 749 entries: 1-50 | 10-59 | 60-109 | 110-159 | 160-209 | 210-259 | 260-309 | 310-359 | ... | 710-749 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 159

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)