Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 44

[ total of 754 entries: 1-50 | 45-94 | 95-144 | 145-194 | 195-244 | ... | 745-754 ]
[ showing 50 entries per page: fewer | more | all ]

Thu, 11 Dec 2025 (continued, showing 50 of 135 entries)

[45] arXiv:2512.09446 [pdf, ps, other]: Title: Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation

Authors: Nadeem Nazer, Hongkuan Zhou, Lavdim Halilaj, Ylli Sadikaj, Steffen Staab

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46] arXiv:2512.09441 [pdf, ps, other]: Title: Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model

Authors: Jiantao Tan, Peixian Ma, Tong Yu, Wentao Zhang, Ruixuan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[47] arXiv:2512.09435 [pdf, ps, other]: Title: UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents

Authors: Xufan He, Yushuang Wu, Xiaoyang Guo, Chongjie Ye, Jiaqing Zhou, Tianlei Hu, Xiaoguang Han, Dong Du

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2512.09423 [pdf, ps, other]: Title: FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds

Authors: Marco Pegoraro, Evan Atherton, Bruno Roy, Aliasghar Khani, Arianna Rampini

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2512.09422 [pdf, ps, other]: Title: InfoMotion: A Graph-Based Approach to Video Dataset Distillation for Echocardiography

Authors: Zhe Li, Hadrien Reynaud, Alberto Gomez, Bernhard Kainz

Comments: Accepted at MICAD 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2512.09418 [pdf, ps, other]: Title: Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis

Authors: Zhe Li, Hadrien Reynaud, Johanna P Müller, Bernhard Kainz

Comments: Accepted at MICAD 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51] arXiv:2512.09417 [pdf, ps, other]: Title: DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping

Authors: Yanan Wang, Shengcai Liao, Panwen Hu, Xin Li, Fan Yang, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2512.09407 [pdf, ps, other]: Title: Generative Point Cloud Registration

Authors: Haobo Jiang, Jin Xie, Jian Yang, Liang Yu, Jianmin Zheng

Comments: 14 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2512.09402 [pdf, ps, other]: Title: Wasserstein-Aligned Hyperbolic Multi-View Clustering

Authors: Rui Wang, Yuting Jiang, Xiaoqing Luo, Xiao-Jun Wu, Nicu Sebe, Ziheng Chen

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2512.09393 [pdf, ps, other]: Title: Detection and Localization of Subdural Hematoma Using Deep Learning on Computed Tomography

Authors: Vasiliki Stoumpou, Rohan Kumar, Bernard Burman, Diego Ojeda, Tapan Mehta, Dimitris Bertsimas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[55] arXiv:2512.09383 [pdf, ps, other]: Title: Perception-Inspired Color Space Design for Photo White Balance Editing

Authors: Yang Cheng, Ziteng Cui, Lin Gu, Shenghan Su, Zenghui Zhang

Comments: Accepted to WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2512.09375 [pdf, ps, other]: Title: Log NeRF: Comparing Spaces for Learning Radiance Fields

Authors: Sihe Chen (Northeastern University), Luv Verma (Northeastern University), Bruce A. Maxwell (Northeastern University)

Comments: The 36th British Machine Vision Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57] arXiv:2512.09373 [pdf, ps, other]: Title: FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement

Authors: Haobo Jiang, Jin Xie, Jian Yang, Liang Yu, Jianmin Zheng

Comments: 13 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2512.09364 [pdf, ps, other]: Title: ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation

Authors: Shengchao Zhou, Jiehong Lin, Jiahui Liu, Shizhen Zhao, Chirui Chang, Xiaojuan Qi

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[59] arXiv:2512.09363 [pdf, ps, other]: Title: StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Authors: Ke Xing, Longfei Li, Yuyang Yin, Hanwen Liang, Guixun Luo, Chen Fang, Jue Wang, Konstantinos N. Plataniotis, Xiaojie Jin, Yao Zhao, Yunchao Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2512.09354 [pdf, ps, other]: Title: Video-QTR: Query-Driven Temporal Reasoning Framework for Lightweight Video Understanding

Authors: Xinkui Zhao, Zuxin Wang, Yifan Zhang, Guanjie Cheng, Yueshen Xu, Shuiguang Deng, Chang Liu, Naibo Wang, Jianwei Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2512.09350 [pdf, ps, other]: Title: TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment

Authors: Kanghyun Baek, Sangyub Lee, Jin Young Choi, Jaewoo Song, Daemin Park, Jooyoung Choi, Chaehun Shin, Bohyung Han, Sungroh Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2512.09335 [pdf, ps, other]: Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video

Authors: Seonghwa Choi, Moonkyeong Choi, Mingyu Jang, Jaekyung Kim, Jianfei Cai, Wen-Huang Cheng, Sanghoon Lee

Comments: 8 pages, 9 figures, published in ACM MM 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[63] arXiv:2512.09327 [pdf, ps, other]: Title: UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

Authors: Xuangeng Chu, Ruicong Liu, Yifei Huang, Yun Liu, Yichen Peng, Bo Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[64] arXiv:2512.09315 [pdf, ps, other]: Title: Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook

Authors: Yuan Ma, Junlin Hou, Chao Zhang, Yukun Zhou, Zongyuan Ge, Haoran Xie, Lie Ju

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65] arXiv:2512.09311 [pdf, ps, other]: Title: Transformer-Driven Multimodal Fusion for Explainable Suspiciousness Estimation in Visual Surveillance

Authors: Kuldeep Singh Yadav, Lalan Kumar

Comments: 12 pages, 10 figures, IEEE Transaction on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[66] arXiv:2512.09307 [pdf, ps, other]: Title: From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation

Authors: Shivanshu Agnihotri, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67] arXiv:2512.09299 [pdf, ps, other]: Title: VABench: A Comprehensive Benchmark for Audio-Video Generation

Authors: Daili Hua, Xizhi Wang, Bohan Zeng, Xinyi Huang, Hao Liang, Junbo Niu, Xinlong Chen, Quanqing Xu, Wentao Zhang

Comments: 24 pages, 25 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[68] arXiv:2512.09296 [pdf, ps, other]: Title: Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving

Authors: Songhan Wu

Comments: 6 pages, 7 figures, 1 table. Accepted to The 2025 IEEE 3rd International Conference on Electrical, Automation and Computer Engineering (ICEACE), 2025. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69] arXiv:2512.09289 [pdf, ps, other]: Title: MelanomaNet: Explainable Deep Learning for Skin Lesion Classification

Authors: Sukhrobbek Ilyosbekov

Comments: 7 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2512.09282 [pdf, ps, other]: Title: FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model

Authors: Xiang Chen, Jinshan Pan, Jiangxin Dong, Jian Yang, Jinhui Tang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2512.09278 [pdf, ps, other]: Title: LoGoColor: Local-Global 3D Colorization for 360° Scenes

Authors: Yeonjin Chang, Juhwan Cho, Seunghyeon Seo, Wonsik Shin, Nojun Kwak

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2512.09276 [pdf, ps, other]: Title: Dynamic Facial Expressions Analysis Based Parkinson's Disease Auxiliary Diagnosis

Authors: Xiaochen Huang, Xiaochen Bi, Cuihua Lv, Xin Wang, Haoyan Zhang, Wenjing Jiang, Xin Ma, Yibin Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2512.09271 [pdf, ps, other]: Title: LongT2IBench: A Benchmark for Evaluating Long Text-to-Image Generation with Graph-structured Annotations

Authors: Zhichao Yang, Tianjiao Gu, Jianjie Wang, Feiyu Lin, Xiangfei Sheng, Pengfei Chen, Leida Li

Comments: The paper has been accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2512.09270 [pdf, ps, other]: Title: MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification

Authors: Sangwoon Kwak, Weeyoung Kwon, Jun Young Jeong, Geonho Kim, Won-Sik Cheong, Jihyong Oh

Comments: Please visit our project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2512.09258 [pdf, ps, other]: Title: ROI-Packing: Efficient Region-Based Compression for Machine Vision

Authors: Md Eimran Hossain Eimon, Alena Krause, Ashan Perera, Juan Merlos, Hari Kalva, Velibor Adzic, Borko Furht

Journal-ref: International Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA, 2025, pp. 233-238

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2512.09251 [pdf, ps, other]: Title: GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model

Authors: Lalit Maurya, Saurabh Kaushik, Beth Tellman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[77] arXiv:2512.09247 [pdf, ps, other]: Title: OmniPSD: Layered PSD Generation with Diffusion Transformer

Authors: Cheng Liu, Yiren Song, Haofan Wang, Mike Zheng Shou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2512.09244 [pdf, ps, other]: Title: A Clinically Interpretable Deep CNN Framework for Early Chronic Kidney Disease Prediction Using Grad-CAM-Based Explainable AI

Authors: Anas Bin Ayub, Nilima Sultana Niha, Md. Zahurul Haque

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79] arXiv:2512.09235 [pdf, ps, other]: Title: Efficient Feature Compression for Machines with Global Statistics Preservation

Authors: Md Eimran Hossain Eimon, Hyomin Choi, Fabien Racapé, Mateen Ulhaq, Velibor Adzic, Hari Kalva, Borko Furht

Journal-ref: 2025 IEEE International Symposium on Circuits and Systems (ISCAS), London, United Kingdom, 2025, pp. 1-5

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2512.09232 [pdf, ps, other]: Title: Enabling Next-Generation Consumer Experience with Feature Coding for Machines

Authors: Md Eimran Hossain Eimon, Juan Merlos, Ashan Perera, Hari Kalva, Velibor Adzic, Borko Furht

Journal-ref: 2025 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2025, pp. 1-4

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2512.09215 [pdf, ps, other]: Title: View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs

Authors: Yuanyuan Liu, Haiyang Mei, Dongyang Zhan, Jiayue Zhao, Dongsheng Zhou, Bo Dong, Xin Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[82] arXiv:2512.09185 [pdf, ps, other]: Title: Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation

Authors: Hao Chen, Rui Yin, Yifan Chen, Qi Chen, Chao Li

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[83] arXiv:2512.09172 [pdf, ps, other]: Title: Prompt-Based Continual Compositional Zero-Shot Learning

Authors: Sauda Maryam, Sara Nadeem, Faisal Qureshi, Mohsen Ali

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[84] arXiv:2512.09164 [pdf, ps, other]: Title: WonderZoom: Multi-Scale 3D World Generation

Authors: Jin Cao, Hong-Xing Yu, Jiajun Wu

Comments: Project website: this https URL The first two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[85] arXiv:2512.09162 [pdf, ps, other]: Title: GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars

Authors: Kelian Baert, Mae Younes, Francois Bourel, Marc Christie, Adnane Boukhayma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[86] arXiv:2512.09134 [pdf, ps, other]: Title: Integrated Pipeline for Coronary Angiography With Automated Lesion Profiling, Virtual Stenting, and 100-Vessel FFR Validation

Authors: Georgy Kopanitsa, Oleg Metsker, Alexey Yakovlev

Comments: 22 pages, 10 figures, 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[87] arXiv:2512.09115 [pdf, ps, other]: Title: SuperF: Neural Implicit Fields for Multi-Image Super-Resolution

Authors: Sander Riisøen Jyhne, Christian Igel, Morten Goodwin, Per-Arne Andersen, Serge Belongie, Nico Lang

Comments: 23 pages, 13 figures, 8 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2512.09112 [pdf, ps, other]: Title: GimbalDiffusion: Gravity-Aware Camera Control for Video Generation

Authors: Frédéric Fortier-Chouinard, Yannick Hold-Geoffroy, Valentin Deschaintre, Matheus Gadelha, Jean-François Lalonde

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2512.09095 [pdf, ps, other]: Title: Food Image Generation on Multi-Noun Categories

Authors: Xinyue Pan, Yuhao Chen, Jiangpeng He, Fengqing Zhu

Comments: Accepted by WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2512.09092 [pdf, ps, other]: Title: Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters

Authors: Mizanur Rahman Jewel, Mohamed Elmahallawy, Sanjay Madria, Samuel Frimpong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2512.09081 [pdf, ps, other]: Title: AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models

Authors: Arman Zarei, Jiacheng Pan, Matthew Gwilliam, Soheil Feizi, Zhenheng Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2512.09071 [pdf, ps, other]: Title: Adaptive Thresholding for Visual Place Recognition using Negative Gaussian Mixture Statistics

Authors: Nick Trinh, Damian Lyons

Comments: Accepted and presented at IEEE RoboticCC 2025. 4 pages short paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[93] arXiv:2512.09069 [pdf, ps, other]: Title: KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification

Authors: Erfan Nourbakhsh, Nasrin Sanjari, Ali Nourbakhsh

Comments: 7 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[94] arXiv:2512.09062 [pdf, ps, other]: Title: SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding

Authors: Seongyong Kim, Yong Kwon Cho

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

[ total of 754 entries: 1-50 | 45-94 | 95-144 | 145-194 | 195-244 | ... | 745-754 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 44

Thu, 11 Dec 2025 (continued, showing 50 of 135 entries)