Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 550

[ total of 778 entries: 1-50 | ... | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[551] arXiv:2512.01563 [pdf, ps, other]: Title: MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration

Authors: Thao Thi Phuong Dao, Tan-Cong Nguyen, Nguyen Chi Thanh, Truong Hoang Viet, Trong-Le Do, Mai-Khiem Tran, Minh-Khoi Pham, Trung-Nghia Le, Minh-Triet Tran, Thanh Dinh Le

Comments: The 14th International Symposium on Information and Communication Technology Conference SoICT 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[552] arXiv:2512.01540 [pdf, ps, other]: Title: FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention

Authors: Zipeng Wang, Dan Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553] arXiv:2512.01534 [pdf, ps, other]: Title: Deep Unsupervised Anomaly Detection in Brain Imaging: Large-Scale Benchmarking and Bias Analysis

Authors: Alexander Frotscher, Christian F. Baumgartner, Thomas Wolfers

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[554] arXiv:2512.01533 [pdf, ps, other]: Title: Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling

Authors: Hailong Yang, Te Zhang, Kup-sze Choi, Zhaohong Deng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555] arXiv:2512.01519 [pdf, ps, other]: Title: QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions

Authors: Can Polat, Erchin Serpedin, Mustafa Kurban, Hasan Kurban

Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci); Quantum Physics (quant-ph)
[556] arXiv:2512.01510 [pdf, ps, other]: Title: Semantic-aware Random Convolution and Source Matching for Domain Generalization in Medical Image Segmentation

Authors: Franz Thaler, Martin Urschler, Mateusz Kozinski, Matthias AF Gsell, Gernot Plank, Darko Stern

Comments: Preprint submitted to Computer Methods and Programs in Biomedicine (currently under revision)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[557] arXiv:2512.01495 [pdf, ps, other]: Title: ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark

Authors: Joanne Lin, Ruirui Lin, Yini Li, David Bull, Nantheera Anantrasirichai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558] arXiv:2512.01494 [pdf, other]: Title: A variational method for curve extraction with curvature-dependent energies

Authors: Majid Arthaud (ENPC, MOKAPLAN, UMich), Antonin Chambolle (CEREMADE, MOKAPLAN), Vincent Duval (MOKAPLAN)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559] arXiv:2512.01481 [pdf, ps, other]: Title: ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling

Authors: Qisen Wang, Yifan Zhao, Peisen Shen, Jialu Li, Jia Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[560] arXiv:2512.01478 [pdf, ps, other]: Title: CourtMotion: Learning Event-Driven Motion Representations from Skeletal Data for Basketball

Authors: Omer Sela (1 and 2), Michael Chertok (1), Lior Wolf (2) ((1) Amazon, (2) Tel Aviv University)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[561] arXiv:2512.01444 [pdf, ps, other]: Title: FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation

Authors: Jian Shu, Nanjie Yao, Gangjian Zhang, Junlong Ren, Yu Feng, Hao Wang

Comments: 9 pages,4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562] arXiv:2512.01427 [pdf, ps, other]: Title: Language-Guided Open-World Anomaly Segmentation

Authors: Klara Reichard, Nikolas Brasch, Nassir Navab, Federico Tombari

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[563] arXiv:2512.01426 [pdf, ps, other]: Title: ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers

Authors: Yiyang Ma, Feng Zhou, Xuedan Yin, Pu Cao, Yonghao Dang, Jianqin Yin

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564] arXiv:2512.01424 [pdf, ps, other]: Title: ViRectify: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models

Authors: Xusen Hei, Jiali Chen, Jinyu Yang, Mengchen Zhao, Yi Cai

Comments: 22 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[565] arXiv:2512.01422 [pdf, ps, other]: Title: MDiff4STR: Mask Diffusion Model for Scene Text Recognition

Authors: Yongkun Du, Miaomiao Zhao, Songlin Fan, Zhineng Chen, Caiyan Jia, Yu-Gang Jiang

Comments: Accepted by AAAI 2026 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[566] arXiv:2512.01419 [pdf, ps, other]: Title: Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries

Authors: Tushar Pranav, Eshan Pandey, Austria Lyka Diane Bala, Aman Chadha, Indriyati Atmosukarto, Donny Soh Cheng Lock

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[567] arXiv:2512.01390 [pdf, ps, other]: Title: FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution

Authors: Seungho Choi, Jeahun Sung, Jihyong Oh

Comments: Comments: Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[568] arXiv:2512.01383 [pdf, ps, other]: Title: PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications

Authors: Yunze Liu, Zifan Wang, Peiran Wu, Jiayang Ao

Comments: Accepted by WACV2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[569] arXiv:2512.01382 [pdf, ps, other]: Title: Reversible Inversion for Training-Free Exemplar-guided Image Editing

Authors: Yuke Li, Lianli Gao, Ji Zhang, Pengpeng Zeng, Lichuan Xiang, Hongkai Wen, Heng Tao Shen, Jingkuan Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570] arXiv:2512.01380 [pdf, ps, other]: Title: Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network

Authors: Tianyu Luan, Xuelu Feng, Zixin Zhu, Phani Nuney, Sheng Liu, Xuan Gong, David Doermann, Chunming Qiao, Junsong Yuan

Comments: Accepted by AAAI26

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571] arXiv:2512.01373 [pdf, ps, other]: Title: SRAM: Shape-Realism Alignment Metric for No Reference 3D Shape Evaluation

Authors: Sheng Liu, Tianyu Luan, Phani Nuney, Xuelu Feng, Junsong Yuan

Comments: Accepted by AAAI2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572] arXiv:2512.01366 [pdf, ps, other]: Title: BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud

Authors: Yunzhe Li, Jiajun Yan, Yuzhou Wei, Kechen Liu, Yize Zhao, Chong Zhang, Hongzi Zhu, Li Lu, Shan Chang, Minyi Guo

Comments: This is the author-accepted version of the paper published in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Vol. 9, No. 4, Article 191, 2025. Final published version: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[573] arXiv:2512.01352 [pdf, ps, other]: Title: OpenBox: Annotate Any Bounding Boxes in 3D

Authors: In-Jae Lee, Mungyeom Kim, Kwonyoung Ryu, Pierre Musacchio, Jaesik Park

Comments: Accepted by NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574] arXiv:2512.01348 [pdf, ps, other]: Title: Handwritten Text Recognition for Low Resource Languages

Authors: Sayantan Dey, Alireza Alaei, Partha Pratim Roy

Comments: 21 Pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575] arXiv:2512.01342 [pdf, ps, other]: Title: InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Authors: Chenting Wang, Yuhan Zhu, Yicheng Xu, Jiange Yang, Ziang Yan, Yali Wang, Yi Wang, Limin Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576] arXiv:2512.01340 [pdf, ps, other]: Title: EvalTalker: Learning to Evaluate Real-Portrait-Driven Multi-Subject Talking Humans

Authors: Yingjie Zhou, Xilei Zhu, Siyu Ren, Ziyi Zhao, Ziwen Wang, Farong Wen, Yu Zhou, Jiezhang Cao, Xiongkuo Min, Fengjiao Chen, Xiaoyu Li, Xuezhi Cao, Guangtao Zhai, Xiaohong Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577] arXiv:2512.01334 [pdf, ps, other]: Title: AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video Generation

Authors: Yexin Liu, Wen-Jie Shu, Zile Huang, Haoze Zheng, Yueze Wang, Manyuan Zhang, Ser-Nam Lim, Harry Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578] arXiv:2512.01333 [pdf, ps, other]: Title: Optimizing Stroke Risk Prediction: A Machine Learning Pipeline Combining ROS-Balanced Ensembles and XAI

Authors: A S M Ahsanul Sarkar Akib, Raduana Khawla, Abdul Hasib

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[579] arXiv:2512.01319 [pdf, ps, other]: Title: Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications

Authors: Feiyang Xiao, Yichi Zhang, Xigui Li, Yuanye Zhou, Chen Jiang, Xin Guo, Limei Han, Yuxin Li, Fengping Zhu, Yuan Cheng

Comments: 18 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[580] arXiv:2512.01315 [pdf, ps, other]: Title: FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection

Authors: Ashish Vashist, Qiranul Saadiyean, Suresh Sundaram, Chandra Sekhar Seelamantula

Comments: 8 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[581] arXiv:2512.01314 [pdf, ps, other]: Title: TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance

Authors: Pei Yang, Yepeng Liu, Kelly Peng, Yuan Gao, Yiren Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582] arXiv:2512.01312 [pdf, ps, other]: Title: IVCR-200K: A Large-Scale Multi-turn Dialogue Benchmark for Interactive Video Corpus Retrieval

Authors: Ning Han, Yawen Zeng, Shaohua Long, Chengqing Li, Sijie Yang, Dun Tan, Jianfeng Dong, Jingjing Chen

Comments: Accepted by SIGIR2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583] arXiv:2512.01310 [pdf, ps, other]: Title: Lost in Distortion: Uncovering the Domain Gap Between Computer Vision and Brain Imaging - A Study on Pretraining for Age Prediction

Authors: Yanteng Zhang, Songheng Li, Zeyu Shen, Qizhen Lan, Lipei Zhang, Yang Liu, Vince Calhoun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[584] arXiv:2512.01306 [pdf, ps, other]: Title: Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians

Authors: Hongru Yan, Xiang Zhang, Zeyuan Chen, Fangyin Wei, Zhuowen Tu

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[585] arXiv:2512.01302 [pdf, ps, other]: Title: DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy

Authors: Jaewoo Song, Jooyoung Choi, Kanghyun Baek, Sangyub Lee, Daemin Park, Sungroh Yoon

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[586] arXiv:2512.01298 [pdf, ps, other]: Title: TBT-Former: Learning Temporal Boundary Distributions for Action Localization

Authors: Thisara Rathnayaka, Uthayasanker Thayasivam

Comments: 8 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[587] arXiv:2512.01296 [pdf, ps, other]: Title: EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly

Authors: Xiaokun Pan, Zhenzhe Li, Zhichao Ye, Hongjia Zhai, Guofeng Zhang

Comments: SIGGRAPH ASIA 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[588] arXiv:2512.01292 [pdf, ps, other]: Title: Diffusion Model in Latent Space for Medical Image Segmentation Task

Authors: Huynh Trinh Ngoc, Toan Nguyen Hai, Ba Luong Son, Long Tran Quoc

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[589] arXiv:2512.01291 [pdf, ps, other]: Title: Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI

Authors: Kamal Basha S, Athira Nambiar

Comments: Accepted to CVIP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590] arXiv:2512.01273 [pdf, ps, other]: Title: nnMobileNet++: Towards Efficient Hybrid Networks for Retinal Image Analysis

Authors: Xin Li, Wenhui Zhu, Xuanzhao Dong, Hao Wang, Yujian Xiong, Oana Dumitrascu, Yalin Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591] arXiv:2512.01268 [pdf, ps, other]: Title: ViscNet: Vision-Based In-line Viscometry for Fluid Mixing Process

Authors: Jongwon Sohn, Juhyeon Moon, Hyunjoon Jung, Jaewook Nam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[592] arXiv:2512.01248 [pdf, ps, other]: Title: TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Authors: Junyuan Zhang, Bin Wang, Qintong Zhang, Fan Wu, Zichen Wen, Jialin Lu, Junjie Shan, Ziqi Zhao, Shuya Yang, Ziling Wang, Ziyang Miao, Huaping Zhong, Yuhang Zang, Xiaoyi Dong, Ka-Ho Chow, Conghui He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593] arXiv:2512.01242 [pdf, ps, other]: Title: Generative Adversarial Gumbel MCTS for Abstract Visual Composition Generation

Authors: Zirui Zhao, Boye Niu, David Hsu, Wee Sun Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[594] arXiv:2512.01236 [pdf, ps, other]: Title: PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards

Authors: Shulei Wang, Longhui Wei, Xin He, Jianbo Ouyang, Hui Lu, Zhou Zhao, Qi Tian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[595] arXiv:2512.01223 [pdf, ps, other]: Title: S$^2$-MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance

Authors: Beining Xu, Siting Zhu, Zhao Jin, Junxian Li, Hesheng Wang

Comments: 18 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[596] arXiv:2512.01214 [pdf, ps, other]: Title: M4-BLIP: Advancing Multi-Modal Media Manipulation Detection through Face-Enhanced Local Analysis

Authors: Hang Wu, Ke Sun, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[597] arXiv:2512.01213 [pdf, ps, other]: Title: Closing the Approximation Gap of Partial AUC Optimization: A Tale of Two Formulations

Authors: Yangbangyan Jiang, Qianqian Xu, Huiyang Shao, Zhiyong Yang, Shilong Bao, Xiaochun Cao, Qingming Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[598] arXiv:2512.01204 [pdf, ps, other]: Title: TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image

Authors: Ziqian Wang, Yonghao He, Licheng Yang, Wei Zou, Hongxuan Ma, Liu Liu, Wei Sui, Yuxin Guo, Hu Su

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[599] arXiv:2512.01178 [pdf, ps, other]: Title: VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering

Authors: Zihua Liu, Hiroki Sakuma, Masatoshi Okutomi

Comments: arXiv admin note: text overlap with arXiv:2404.00149

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[600] arXiv:2512.01165 [pdf, ps, other]: Title: Real-Time On-the-Go Annotation Framework Using YOLO for Automated Dataset Generation

Authors: Mohamed Abdallah Salem (1), Ahmed Harb Rabia (1) ((1) North Dakota State University)

Comments: Copyright 2025 IEEE. This is the author's version of the work that has been accepted for publication in Proceedings of the 5. Interdisciplinary Conference on Electrics and Computer (INTCEC 2025) 15-16 September 2025, Chicago-USA. The final version of record is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)

[ total of 778 entries: 1-50 | ... | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 550

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)