Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 386

[ total of 759 entries: 1-50 | ... | 237-286 | 287-336 | 337-386 | 387-436 | 437-486 | 487-536 | 537-586 | ... | 737-759 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing 50 of 135 entries)

[387] arXiv:2512.04888 [pdf, ps, other]: Title: You Only Train Once (YOTO): A Retraining-Free Object Detection Framework

Authors: Priyanto Hidayatullah, Nurjannah Syakrani, Yudi Widhiyasana, Muhammad Rizqi Sholahuddin, Refdinal Tubagus, Zahri Al Adzani Hidayat, Hanri Fajar Ramadhan, Dafa Alfarizki Pratama, Farhan Muhammad Yasin

Comments: This manuscript was first submitted to the Engineering (Elsevier Journal). The preprint version was posted to arXiv afterwards to facilitate open access and community feedback

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2512.04883 [pdf, ps, other]: Title: SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms

Authors: Jiawen Wen, Yu Hu, Suixuan Qiu, Jinshan Huang, Xiaowen Chu

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2512.04875 [pdf, ps, other]: Title: SP-Det: Self-Prompted Dual-Text Fusion for Generalized Multi-Label Lesion Detection

Authors: Qing Xu, Yanqian Wang, Xiangjian Hea, Yue Li, Yixuan Zhang, Rong Qu, Wenting Duan, Zhen Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2512.04862 [pdf, ps, other]: Title: Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing

Authors: Maria-Paola Forte, Nikos Athanasiou, Giulia Ballardini, Jan Ulrich Bartels, Katherine J. Kuchenbecker, Michael J. Black

Comments: * Equal contribution. Minor figure corrections compared to the ICCV 2025 version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391] arXiv:2512.04857 [pdf, ps, other]: Title: Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens

Authors: Ziran Qin, Youru Lv, Mingbao Lin, Zeren Zhang, Chanfan Gan, Tieyuan Chen, Weiyao Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2512.04837 [pdf, ps, other]: Title: A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World

Authors: Jikang Cheng, Renye Yan, Zhiyuan Yan, Yaozhong Gan, Xueyi Zhang, Zhongyuan Wang, Wei Peng, Ling Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2512.04832 [pdf, ps, other]: Title: Tokenizing Buildings: A Transformer for Layout Synthesis

Authors: Manuel Ladron de Guevara, Jinmo Rhee, Ardavan Bidgoli, Vaidas Razgaitis, Michael Bergin

Comments: 8 pages, 1 page References, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[394] arXiv:2512.04830 [pdf, ps, other]: Title: FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

Authors: Shijie Chen, Peixi Peng

Comments: Novel View Synthesis, Driving Scene, Free Trajectory, Image Generation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2512.04821 [pdf, ps, other]: Title: LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation

Authors: Huynh Trinh Ngoc, Hoang Anh Nguyen Kim, Toan Nguyen Hai, Long Tran Quoc

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2512.04815 [pdf, ps, other]: Title: RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS

Authors: Chuanyu Fu, Guanying Chen, Yuqi Zhang, Kunbin Yao, Yuan Xiong, Chuan Huang, Shuguang Cui, Yasuyuki Matsushita, Xiaochun Cao

Comments: arXiv admin note: substantial text overlap with arXiv:2506.02751

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2512.04810 [pdf, ps, other]: Title: EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

Authors: Xin He, Longhui Wei, Jianbo Ouyang, Lingxi Xie, Qi Tian

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2512.04786 [pdf, ps, other]: Title: LaFiTe: A Generative Latent Field for 3D Native Texturing

Authors: Chia-Hao Chen, Zi-Xin Zou, Yan-Pei Cao, Ze Yuan, Guan Luo, Xiaojuan Qi, Ding Liang, Song-Hai Zhang, Yuan-Chen Guo

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2512.04784 [pdf, ps, other]: Title: PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Authors: Bowen Ping, Chengyou Jia, Minnan Luo, Changliang Xia, Xin Shen, Zhuohang Dang, Hangwei Qian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2512.04761 [pdf, ps, other]: Title: Order Matters: 3D Shape Generation from Sequential VR Sketches

Authors: Yizi Chen, Sidi Wu, Tianyi Xiao, Nina Wiedemann, Loic Landrieu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2512.04734 [pdf, ps, other]: Title: MT-Depth: Multi-task Instance feature analysis for the Depth Completion

Authors: Abdul Haseeb Nizamani, Dandi Zhou, Xinhai Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2512.04733 [pdf, ps, other]: Title: E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving

Authors: Yihong Tang, Haicheng Liao, Tong Nie, Junlin He, Ao Qu, Kehua Chen, Wei Ma, Zhenning Li, Lijun Sun, Chengzhong Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[403] arXiv:2512.04728 [pdf, ps, other]: Title: Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild

Authors: Yigui Feng, Qinglin Wang, Haotian Mo, Yang Liu, Ke Liu, Gencheng Liu, Xinhai Chen, Siqi Shen, Songzhu Mei, Jie Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[404] arXiv:2512.04699 [pdf, ps, other]: Title: OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution

Authors: Xinning Chai, Zhengxue Cheng, Yuhong Zhang, Hengsheng Zhang, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song

Comments: Accepted as TCSVT, 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2512.04686 [pdf, ps, other]: Title: Towards Cross-View Point Correspondence in Vision-Language Models

Authors: Yipu Wang, Yuheng Ji, Yuyang Liu, Enshen Zhou, Ziqiang Yang, Yuxuan Tian, Ziheng Qin, Yue Liu, Huajie Tan, Cheng Chi, Zhiyuan Ma, Daniel Dajun Zeng, Xiaolong Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406] arXiv:2512.04678 [pdf, ps, other]: Title: Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation

Authors: Yunhong Lu, Yanhong Zeng, Haobo Li, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Jiapeng Zhu, Hengyuan Cao, Zhipeng Zhang, Xing Zhu, Yujun Shen, Min Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2512.04677 [pdf, ps, other]: Title: Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Authors: Yubo Huang, Hailong Guo, Fangtai Wu, Shifeng Zhang, Shijie Huang, Qijun Gan, Lin Liu, Sirui Zhao, Enhong Chen, Jiaming Liu, Steven Hoi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2512.04660 [pdf, ps, other]: Title: I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models

Authors: Juntong Wang, Jiarui Wang, Huiyu Duan, Jiaxiang Kang, Guangtao Zhai, Xiongkuo Min

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[409] arXiv:2512.04643 [pdf, ps, other]: Title: SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding

Authors: Chang-Hsun Wu, Kai-Po Chang, Yu-Yang Sheng, Hung-Kai Chung, Kuei-Chun Wang, Yu-Chiang Frank Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[410] arXiv:2512.04619 [pdf, ps, other]: Title: Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence

Authors: Tianyu Yuan, Yuanbo Yang, Lin-Zhuo Chen, Yao Yao, Zhuzhong Qian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2512.04599 [pdf, ps, other]: Title: Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot

Authors: Sheng Hang, Chaoxiang He, Hongsheng Hu, Hanqing Hu, Bin Benjamin Zhu, Shi-Feng Sun, Dawu Gu, Shuo Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2512.04597 [pdf, ps, other]: Title: When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering

Authors: Tao Wu, Chuhao Zhou, Guangyu Zhao, Haozhi Cao, Yewen Pu, Jianfei Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[413] arXiv:2512.04585 [pdf, ps, other]: Title: SAM3-I: Segment Anything with Instructions

Authors: Jingjing Li, Yue Feng, Yuchen Guo, Jincai Huang, Yongri Piao, Qi Bi, Miao Zhang, Xiaoqi Zhao, Qiang Chen, Shihao Zou, Wei Ji, Huchuan Lu, Li Cheng

Comments: Preliminary results; work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414] arXiv:2512.04581 [pdf, ps, other]: Title: Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation

Authors: Houzhang Fang, Chenxing Wu, Kun Bai, Tianqi Chen, Xiaolin Wang, Xiyang Liu, Yi Chang, Luxin Yan

Comments: Accepted by IEEE TMM

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2512.04576 [pdf, ps, other]: Title: TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification

Authors: Zishuo Wan, Qinqin Kang, Yi Huang, Yun Bian, Dawei Ding, Ke Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2512.04568 [pdf, ps, other]: Title: Prompt2Craft: Generating Functional Craft Assemblies with LLMs

Authors: Vitor Hideyo Isume, Takuya Kiyokawa, Natsuki Yamanobe, Yukiyasu Domae, Weiwei Wan, Kensuke Harada

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2512.04564 [pdf, ps, other]: Title: Dataset creation for supervised deep learning-based analysis of microscopic images -- review of important considerations and recommendations

Authors: Christof A. Bertram, Viktoria Weiss, Jonas Ammeling, F. Maria Schabel, Taryn A. Donovan, Frauke Wilm, Christian Marzahl, Katharina Breininger, Marc Aubreville

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2512.04563 [pdf, ps, other]: Title: COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

Authors: Zefeng Zhang, Xiangzhao Hao, Hengzhu Tang, Zhenyu Zhang, Jiawei Sheng, Xiaodong Li, Zhenyang Li, Li Gao, Daiting Shi, Dawei Yin, Tingwen Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419] arXiv:2512.04554 [pdf, ps, other]: Title: Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering

Authors: Marco Pintore, Maura Pintor, Dimosthenis Karatzas, Battista Biggio

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2512.04542 [pdf, ps, other]: Title: Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization

Authors: Hong Kuang, Jianchen Liu

Comments: 28 pages,11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421] arXiv:2512.04540 [pdf, ps, other]: Title: VideoMem: Enhancing Ultra-Long Video Understanding via Adaptive Memory Management

Authors: Hongbo Jin, Qingyuan Wang, Wenhao Zhang, Yang Liu, Sijie Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2512.04537 [pdf, ps, other]: Title: X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale

Authors: Pei Yang, Hai Ci, Yiren Song, Mike Zheng Shou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423] arXiv:2512.04536 [pdf, ps, other]: Title: Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model

Authors: Bita Baroutian, Atefe Aghaei, Mohsen Ebrahimi Moghaddam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[424] arXiv:2512.04534 [pdf, ps, other]: Title: Refaçade: Editing Object with Given Reference Texture

Authors: Youze Huang (1), Penghui Ruan (2), Bojia Zi (3), Xianbiao Qi (4), Jianan Wang (5), Rong Xiao (4) ((1) University of Electronic Science and Technology of China, (2) The Hong Kong Polytechnic University, (3) The Chinese University of Hong Kong, (4) IntelliFusion Inc., (5) Astribot Inc.)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2512.04532 [pdf, ps, other]: Title: PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement

Authors: Yu-Wei Zhan, Xin Wang, Hong Chen, Tongtong Feng, Wei Feng, Ren Wang, Guangyao Li, Qing Li, Wenwu Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[426] arXiv:2512.04528 [pdf, ps, other]: Title: Auto3R: Automated 3D Reconstruction and Scanning via Data-driven Uncertainty Quantification

Authors: Chentao Shen, Sizhe Zheng, Bingqian Wu, Yaohua Feng, Yuanchen Fei, Mingyu Mei, Hanwen Jiang, Xiangru Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2512.04522 [pdf, ps, other]: Title: Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification

Authors: Guoqing Zhang, Zhun Wang, Hairui Wang, Zhonglin Ye, Yuhui Zheng

Comments: 14 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2512.04521 [pdf, ps, other]: Title: WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism

Authors: Ruijing Liu, Cunhua Pan, Jiaming Zeng, Hong Ren, Kezhi Wang, Lei Kong, Jiangzhou Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[429] arXiv:2512.04520 [pdf, ps, other]: Title: Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation

Authors: Chenlin Xu, Lei Zhang, Lituan Wang, Xinyu Pu, Pengfei Ma, Guangwu Qian, Zizhou Wang, Yan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2512.04519 [pdf, ps, other]: Title: VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory

Authors: Yifei Yu, Xiaoshan Wu, Xinting Hu, Tao Hu, Yangtian Sun, Xiaoyang Lyu, Bo Wang, Lin Ma, Yuewen Ma, Zhongrui Wang, Xiaojuan Qi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2512.04515 [pdf, ps, other]: Title: EgoLCD: Egocentric Video Generation with Long Context Diffusion

Authors: Liuzhou Zhang, Jiarui Ye, Yuanlei Wang, Ming Zhong, Mingju Cao, Wanke Xia, Bowen Zeng, Zeyu Zhang, Hao Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2512.04511 [pdf, ps, other]: Title: DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance

Authors: Yinghui Xing, Xiaoting Su, Shizhou Zhang, Donghao Chu, Di Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2512.04504 [pdf, ps, other]: Title: UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers

Authors: Min Zhao, Bokai Yan, Xue Yang, Hongzhou Zhu, Jintao Zhang, Shilong Liu, Chongxuan Li, Jun Zhu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434] arXiv:2512.04499 [pdf, ps, other]: Title: Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model

Authors: Yuduo Jin, Brandon Haworth

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[435] arXiv:2512.04496 [pdf, ps, other]: Title: Shift-Window Meets Dual Attention: A Multi-Model Architecture for Specular Highlight Removal

Authors: Tianci Huo, Lingfeng Qi, Yuhan Chen, Qihong Xue, Jinyuan Shao, Hai Yu, Jie Li, Zhanhua Zhang, Guofa Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2512.04487 [pdf, ps, other]: Title: Controllable Long-term Motion Generation with Extended Joint Targets

Authors: Eunjong Lee, Eunhee Kim, Sanghoon Hong, Eunho Jung, Jihoon Kim

Comments: WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 759 entries: 1-50 | ... | 237-286 | 287-336 | 337-386 | 387-436 | 437-486 | 487-536 | 537-586 | ... | 737-759 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 386

Fri, 5 Dec 2025 (continued, showing 50 of 135 entries)