Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 29

Thu, 1 Jan 2026 (continued, showing last 95 of 124 entries)

[30] arXiv:2512.24622 [pdf, ps, other]: Title: FireRescue: A UAV-Based Dataset and Enhanced YOLO Model for Object Detection in Fire Rescue Scenes

Authors: Qingyu Xu, Runtong Zhang, Zihuan Qiu, Fanman Meng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2512.24620 [pdf, ps, other]: Title: LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning

Authors: Shuyuan Lin, Yu Guo, Xiao Chen, Yanjie Liang, Guobao Xiao, Feiran Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2512.24605 [pdf, ps, other]: Title: MoniRefer: A Real-world Large-scale Multi-modal Dataset based on Roadside Infrastructure for 3D Visual Grounding

Authors: Panquan Yang, Junfei Huang, Zongzhangbao Yin, Yingsong Hu, Anni Xu, Xinyi Luo, Xueqi Sun, Hai Wu, Sheng Ao, Zhaoxing Zhu, Chenglu Wen, Cheng Wang

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2512.24603 [pdf, ps, other]: Title: Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers

Authors: Zheng Liu, Jinchao Zhu, Gao Huang

Comments: 13 tables, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34] arXiv:2512.24593 [pdf, ps, other]: Title: 3D Semantic Segmentation for Post-Disaster Assessment

Authors: Nhut Le, Maryam Rahnemoonfar

Comments: Accepted by the 2025 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[35] arXiv:2512.24592 [pdf, ps, other]: Title: SliceLens: Fine-Grained and Grounded Error Slice Discovery for Multi-Instance Vision Tasks

Authors: Wei Zhang, Chaoqun Wang, Zixuan Guan, Sam Kao, Pengfei Zhao, Peng Wu, Sifeng He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2512.24591 [pdf, ps, other]: Title: Improving Few-Shot Change Detection Visual Question Answering via Decision-Ambiguity-guided Reinforcement Fine-Tuning

Authors: Fuyu Dong, Ke Li, Di Wang, Nan Luo, Yiming Zhang, Kaiyu Li, Jianfei Yang, Quan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2512.24561 [pdf, ps, other]: Title: RGBT-Ground Benchmark: Visual Grounding Beyond RGB in Complex Real-World Scenarios

Authors: Tianyi Zhao, Jiawen Xi, Linhui Xiao, Junnan Li, Xue Yang, Maoxun Yuan, Xingxing Wei

Comments: 27 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38] arXiv:2512.24552 [pdf, ps, other]: Title: OCP-LS: An Efficient Algorithm for Visual Localization

Authors: Jindi Zhong, Hongxia Wang, Huanshui Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[39] arXiv:2512.24551 [pdf, ps, other]: Title: PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

Authors: Yuanhao Cai, Kunpeng Li, Menglin Jia, Jialiang Wang, Junzhe Sun, Feng Liang, Weifeng Chen, Felix Juefei-Xu, Chu Wang, Ali Thabet, Xiaoliang Dai, Xuan Ju, Alan Yuille, Ji Hou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40] arXiv:2512.24547 [pdf, ps, other]: Title: Hierarchical Vector-Quantized Latents for Perceptual Low-Resolution Video Compression

Authors: Manikanta Kotthapalli, Banafsheh Rekabdar

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2512.24518 [pdf, ps, other]: Title: Using Large Language Models To Translate Machine Results To Human Results

Authors: Trishna Niraula, Jonathan Stubblefield

Comments: 11 pages, 7 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2512.24473 [pdf, ps, other]: Title: F2IDiff: Real-world Image Super-resolution using Feature to Image Diffusion Foundation Model

Authors: Devendra K. Jangid, Ripon K. Saha, Dilshan Godaliyadda, Jing Li, Seok-Jun Lee, Hamid R. Sheikh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[43] arXiv:2512.24463 [pdf, ps, other]: Title: Spectral and Spatial Graph Learning for Multispectral Solar Image Compression

Authors: Prasiddha Siwakoti, Atefeh Khoshkhahtinat, Piyush M. Mehta, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Comments: 8 pages, 6 figures, 1 table. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[44] arXiv:2512.24438 [pdf, ps, other]: Title: Exploring Compositionality in Vision Transformers using Wavelet Representations

Authors: Akshad Shyam Purushottamdas, Pranav K Nayak, Divya Mehul Rajparia, Deekshith Patel, Yashmitha Gogineni, Konda Reddy Mopuri, Sumohana S. Channappayya

Comments: 9 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2512.24411 [pdf, ps, other]: Title: AI-Driven Evaluation of Surgical Skill via Action Recognition

Authors: Yan Meng, Daniel A. Donoho, Marcelle Altshuler, Omar Arnaout

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46] arXiv:2512.24408 [pdf, ps, other]: Title: DyStream: Streaming Dyadic Talking Heads Generation via Flow Matching-based Autoregressive Model

Authors: Bohong Chen, Haiyang Liu

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2512.24386 [pdf, ps, other]: Title: RedunCut: Measurement-Driven Sampling and Accuracy Performance Modeling for Low-Cost Live Video Analytics

Authors: Gur-Eyal Sela, Kumar Krishna Agrawal, Bharathan Balaji, Joseph Gonzalez, Ion Stoica

Comments: 21 pages, 23 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[48] arXiv:2512.24385 [pdf, ps, other]: Title: Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Authors: Song Wang, Lingdong Kong, Xiaolu Liu, Hao Shi, Wentong Li, Jianke Zhu, Steven C. H. Hoi

Comments: Preprint; 38 pages, 7 figures, 9 tables; GitHub at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[49] arXiv:2512.24340 [pdf, ps, other]: Title: DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images

Authors: Wen-wai Yim, Yujuan Fu, Asma Ben Abacha, Meliha Yetisgen, Noel Codella, Roberto Andres Novoa, Josep Malvehy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[50] arXiv:2512.24338 [pdf, ps, other]: Title: The Mechanics of CNN Filtering with Rectification

Authors: Liam Frija-Altrac, Matthew Toews

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51] arXiv:2512.24331 [pdf, ps, other]: Title: Spatial-aware Vision Language Model for Autonomous Driving

Authors: Weijie Wei, Zhipeng Luo, Ling Feng, Venice Erin Liong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2512.24330 [pdf, ps, other]: Title: SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Authors: Yong Xien Chng, Tao Hu, Wenwen Tong, Xueheng Li, Jiandong Chen, Haojia Yu, Jiefan Lu, Hewei Guo, Hanming Deng, Chengjun Xie, Gao Huang, Dahua Lin, Lewei Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2512.24323 [pdf, ps, other]: Title: Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention

Authors: Haijing Liu, Zhiyuan Song, Hefeng Wu, Tao Pu, Keze Wang, Liang Lin

Comments: NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2512.24321 [pdf, ps, other]: Title: UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots

Authors: Nan Jiang, Zimo He, Wanhe Yu, Lexi Pang, Yunhao Li, Hongjie Li, Jieming Cui, Yuhan Li, Yizhou Wang, Yixin Zhu, Siyuan Huang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[55] arXiv:2512.24294 [pdf, ps, other]: Title: Virtual-Eyes: Quantitative Validation of a Lung CT Quality-Control Pipeline for Foundation-Model Cancer Risk Prediction

Authors: Md. Enamul Hoq, Linda Larson-Prior, Fred Prior

Comments: 23 pages, and Under Review-MIDL-2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[56] arXiv:2512.24278 [pdf, ps, other]: Title: One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training

Authors: Jia Yu, Yan Zhu, Peiyao Fu, Tianyi Chen, Zhihua Wang, Fei Wu, Quanlin Li, Pinghong Zhou, Shuo Wang, Xian Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57] arXiv:2512.24276 [pdf, ps, other]: Title: LiftProj: Space Lifting and Projection-Based Panorama Stitching

Authors: Yuan Jia, Ruimin Wu, Rui Song, Jiaojiao Li, Bin Song

Comments: 16 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[58] arXiv:2512.24271 [pdf, ps, other]: Title: Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Authors: Zhe Huang, Hao Wen, Aiming Hao, Bingze Song, Meiqi Wu, Jiahong Wu, Xiangxiang Chu, Sheng Lu, Haoqian Wang

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59] arXiv:2512.24260 [pdf, ps, other]: Title: Physically-Grounded Manifold Projection with Foundation Priors for Metal Artifact Reduction in Dental CBCT

Authors: Zhi Li, Yaqi Wang, Bingtao Ma, Yifan Zhang, Huiyu Zhou, Shuai Wang

Comments: This manuscript has been submitted to Medical Image Analysis for peer review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2512.24243 [pdf, ps, other]: Title: MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation

Authors: Fuqiang Gu, Yuanke Li, Xianlei Long, Kangping Ji, Chao Chen, Qingyi Gu, Zhenliang Ni

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2512.24231 [pdf, ps, other]: Title: MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model

Authors: Rahul Medicharla, Alper Yilmaz

Comments: 6 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[62] arXiv:2512.24227 [pdf, ps, other]: Title: Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Authors: Shuyun Wang, Haiyang Sun, Bing Wang, Hangjun Ye, Xin Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2512.24224 [pdf, ps, other]: Title: ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation

Authors: Ziquan Liu, Zhewei Zhu, Xuyang Shi

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2512.24214 [pdf, ps, other]: Title: Medical Image Classification on Imbalanced Data Using ProGAN and SMA-Optimized ResNet: Application to COVID-19

Authors: Sina Jahromi, Farshid Hajati, Alireza Rezaee, Javaher Nourian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[65] arXiv:2512.24195 [pdf, ps, other]: Title: CorGi: Contribution-Guided Block-Wise Interval Caching for Training-Free Acceleration of Diffusion Transformers

Authors: Yonglak Son, Suhyeok Kim, Seungryong Kim, Young Geun Kim

Comments: 16 pages, 20 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2512.24193 [pdf, ps, other]: Title: PointRAFT: 3D deep learning for high-throughput prediction of potato tuber weight from partial point clouds

Authors: Pieter M. Blok, Haozhou Wang, Hyun Kwon Suh, Peicheng Wang, James Burridge, Wei Guo

Comments: 14 pages, 7 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67] arXiv:2512.24176 [pdf, ps, other]: Title: Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Authors: Xingyu Zhou, Qifan Li, Xiaobin Hu, Hai Chen, Shuhang Gu

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[68] arXiv:2512.24172 [pdf, ps, other]: Title: Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges

Authors: Yu-Tang Chang, Pin-Wei Chen, Shih-Fang Chen

Comments: 10 pages, 4 figures. Technical report extending ACPA 2025 conference paper. Code and data available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69] arXiv:2512.24165 [pdf, ps, other]: Title: DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Authors: Zefeng He, Xiaoye Qu, Yafu Li, Tong Zhu, Siyuan Huang, Yu Cheng

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2512.24162 [pdf, ps, other]: Title: Bayesian Self-Distillation for Image Classification

Authors: Anton Adelöw, Matteo Gamba, Atsuto Maki

Comments: 17 pages, 17 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2512.24160 [pdf, ps, other]: Title: Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset

Authors: TsaiChing Ni, ZhenQi Chen, YuanFu Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2512.24146 [pdf, ps, other]: Title: Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Authors: Chubin Chen, Sujie Hu, Jiashu Zhu, Meiqi Wu, Jintao Chen, Yanxun Li, Nisha Huang, Chengyu Fang, Jiahong Wu, Xiangxiang Chu, Xiu Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2512.24120 [pdf, ps, other]: Title: Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design

Authors: Chandini Vysyaraju, Raghuvir Duvvuri, Avi Goyal, Dmitry Ignatov, Radu Timofte

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[74] arXiv:2512.24119 [pdf, ps, other]: Title: GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Authors: Yuan Feng, Yue Yang, Xiaohan He, Jiatong Zhao, Jianlong Chen, Zijun Chen, Daocheng Fu, Qi Liu, Renqiu Xia, Bo Zhang, Junchi Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2512.24111 [pdf, ps, other]: Title: Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks

Authors: Yongtao Chen, Yanbo Wang, Wentao Zhao, Guole Shen, Tianchen Deng, Jingchuan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[76] arXiv:2512.24100 [pdf, ps, other]: Title: Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation

Authors: Yijie Qian, Juncheng Wang, Yuxiang Feng, Chao Xu, Wang Lu, Yang Liu, Baigui Sun, Yiqiang Chen, Yong Liu, Shujun Wang

Comments: project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2512.24097 [pdf, ps, other]: Title: Factorized Learning for Temporally Grounded Video-Language Models

Authors: Wenzheng Zeng, Difei Gao, Mike Zheng Shou, Hwee Tou Ng

Comments: ICCV 2025 paper. This arXiv version updates Figure 1 to include the concurrent work Qwen2.5-VL to ensure consistency with Table 1

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[78] arXiv:2512.24086 [pdf, ps, other]: Title: RainFusion2.0: Temporal-Spatial Awareness and Hardware-Efficient Block-wise Sparse Attention

Authors: Aiyue Chen, Yaofu Liu, Junjian Huang, Guang Lian, Yiwu Yao, Wangli Lan, Jing Lin, Zhixin Ma, Tingting Zhou, Harry Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2512.24074 [pdf, ps, other]: Title: Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images

Authors: Jingzhou Chen, Dexin Chen, Fengchao Xiong, Yuntao Qian, Liang Xiao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2512.24066 [pdf, ps, other]: Title: Pathology Context Recalibration Network for Ocular Disease Recognition

Authors: Zunjie Xiao, Xiaoqing Zhang, Risa Higashita, Jiang Liu

Comments: The article has been accepted for publication at Machine Intelligence Research (MIR)

Journal-ref: Machine Intelligence Research 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[81] arXiv:2512.24064 [pdf, ps, other]: Title: Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval

Authors: Yizhi Liu, Ruitao Pu, Shilin Xu, Yingke Chen, Quan-Hui Liu, Yuan Sun

Comments: 9 pages, 4 figures, and AAAI-26 conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[82] arXiv:2512.24035 [pdf, ps, other]: Title: Reinforced Diffusion: Learning to Push the Limits of Anisotropic Diffusion for Image Denoising

Authors: Xinran Qin, Yuhui Quan, Ruotao Xu, Hui Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2512.24026 [pdf, ps, other]: Title: PipeFlow: Pipelined Processing and Motion-Aware Frame Selection for Long-Form Video Editing

Authors: Mustafa Munir, Md Mostafijur Rahman, Kartikeya Bhardwaj, Paul Whatmough, Radu Marculescu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[84] arXiv:2512.24023 [pdf, ps, other]: Title: RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations

Authors: Xingqi He, Yujie Zhang, Shuyong Gao, Wenjie Li, Lingyi Hong, Mingxi Chen, Kaixun Jiang, Jiyuan Fu, Wenqiang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[85] arXiv:2512.24022 [pdf, ps, other]: Title: FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Authors: Yunkai Dang, Donghao Wang, Jiacheng Yang, Yifan Jiang, Meiyi Zhu, Yuekun Yang, Cong Wang, Qi Fan, Wenbin Li, Yang Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[86] arXiv:2512.24018 [pdf, ps, other]: Title: Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression

Authors: Huanxiong Liang, Yunuo Chen, Yicheng Pan, Sixian Wang, Jincheng Dai, Guo Lu, Wenjun Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2512.24016 [pdf, ps, other]: Title: FitControler: Toward Fit-Aware Virtual Try-On

Authors: Lu Yang, Yicheng Liu, Yanan Li, Xiang Bai, Hao Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2512.24015 [pdf, ps, other]: Title: On Exact Editing of Flow-Based Diffusion Models

Authors: Zixiang Li, Yue Song, Jianing Peng, Ting Liu, Jun Huang, Xiaochao Qu, Luoqi Liu, Wei Wang, Yao Zhao, Yunchao Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2512.24013 [pdf, ps, other]: Title: Bridging the Perception-Cognition Gap: Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis

Authors: Hao Wu, Hui Li, Yiyun Su

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2512.23998 [pdf, ps, other]: Title: Improved 3D Gaussian Splatting of Unknown Spacecraft Structure Using Space Environment Illumination Knowledge

Authors: Tae Ha Park, Simone D'Amico

Comments: Presented at 2025 IEEE International Conference on Space Robotics (iSpaRo)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2512.23997 [pdf, ps, other]: Title: Bridging Structure and Appearance: Topological Features for Robust Self-Supervised Segmentation

Authors: Haotang Li, Zhenyu Qi, Hao Qin, Huanrui Yang, Sen He, Kebin Peng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2512.23990 [pdf, ps, other]: Title: GCA-ResUNet: Medical Image Segmentation Using Grouped Coordinate Attention

Authors: Jun Ding, Shang Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2512.23986 [pdf, ps, other]: Title: Anomaly detection in satellite imagery through temporal inpainting

Authors: Bertrand Rouet-Leduc, Claudia Hulbert

Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[94] arXiv:2512.23983 [pdf, ps, other]: Title: DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation

Authors: Yuang Jia, Jinlong Wang, Jiayi Zhao, Chunlam Li, Shunzhou Wang, Wei Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2512.23953 [pdf, ps, other]: Title: T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models

Authors: Changzhen Li, Yuecong Min, Jie Zhang, Zheng Yuan, Shiguang Shan, Xilin Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96] arXiv:2512.23950 [pdf, ps, other]: Title: U-Net-Like Spiking Neural Networks for Single Image Dehazing

Authors: Huibin Li, Haoran Liu, Mingzhe Liu, Yulong Xiao, Peng Li, Guibin Zan

Comments: 9 pages, 4 figures. Accepted at IJCNN 2025 (Rome, Italy). To appear in IEEE/IJCNN 2025 proceedings

Journal-ref: 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, pp. 1-9, 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2512.23942 [pdf, ps, other]: Title: Kinematic-Based Assessment of Surgical Actions in Microanastomosis

Authors: Yan Meng, Daniel Donoho, Marcelle Altshuler, Omar Arnaout

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2512.23938 [pdf, ps, other]: Title: Learnable Query Aggregation with KV Routing for Cross-view Geo-localisation

Authors: Hualin Ye, Bingxi Liu, Jixiang Du, Yu Qin, Ziyi Chen, Hong Zhang

Comments: 7 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2512.23936 [pdf, ps, other]: Title: MGML: A Plug-and-Play Meta-Guided Multi-Modal Learning Framework for Incomplete Multimodal Brain Tumor Segmentation

Authors: Yulong Zou, Bo Liu, Cun-Jing Zheng, Yuan-ming Geng, Siyue Li, Qiankun Zuo, Shuihua Wang, Yudong Zhang, Jin Hong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2512.23920 [pdf, ps, other]: Title: Learning to learn skill assessment for fetal ultrasound scanning

Authors: Yipei Wang, Qianye Yang, Lior Drukker, Aris T. Papageorghiou, Yipeng Hu, J. Alison Noble

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2512.23903 [pdf, ps, other]: Title: Scaling Remote Sensing Foundation Models: Data Domain Tradeoffs at the Peta-Scale

Authors: Charith Wickrema, Eliza Mace, Hunter Brown, Heidys Cabrera, Nick Krall, Matthew O'Neill, Shivangi Sarkar, Lowell Weissman, Eric Hughes, Guido Zarrella

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2512.23894 [pdf, ps, other]: Title: MRI-to-CT Synthesis With Cranial Suture Segmentations Using A Variational Autoencoder Framework

Authors: Krithika Iyer, Austin Tapp, Athelia Paulli, Gabrielle Dickerson, Syed Muhammad Anwar, Natasha Lepore, Marius George Linguraru

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2512.23860 [pdf, ps, other]: Title: Lifelong Domain Adaptive 3D Human Pose Estimation

Authors: Qucheng Peng, Hongfei Xue, Pu Wang, Chen Chen

Comments: Accepted by AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104] arXiv:2512.23851 [pdf, ps, other]: Title: Pretraining Frame Preservation in Autoregressive Video Memory Compression

Authors: Lvmin Zhang, Shengqu Cai, Muyang Li, Chong Zeng, Beijia Lu, Anyi Rao, Song Han, Gordon Wetzstein, Maneesh Agrawala

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2512.23819 [pdf, ps, other]: Title: Video-Based Performance Evaluation for ECR Drills in Synthetic Training Environments

Authors: Surya Rayala, Marcos Quinones-Grueiro, Naveeduddin Mohammed, Ashwin T S, Benjamin Goldberg, Randall Spain, Paige Lawton, Gautam Biswas

Comments: 14 pages, 9 figures, I/ITSEC-2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[106] arXiv:2512.23786 [pdf, ps, other]: Title: Leveraging Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments

Authors: Ankan Aich, Yangming Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[107] arXiv:2512.25034 (cross-list from cs.LG) [pdf, ps, other]: Title: Generative Classifiers Avoid Shortcut Solutions

Authors: Alexander C. Li, Ananya Kumar, Deepak Pathak

Comments: ICLR 2025. Code: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[108] arXiv:2512.24986 (cross-list from cs.GR) [pdf, ps, other]: Title: PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes

Authors: Luca Collorone, Mert Kiray, Indro Spinelli, Fabio Galasso, Benjamin Busam

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2512.24894 (cross-list from cond-mat.mes-hall) [pdf, ps, other]: Title: Towards autonomous time-calibration of large quantum-dot devices: Detection, real-time feedback, and noise spectroscopy

Authors: Anantha S. Rao, Barnaby van Straaten, Valentin John, Cécile X. Yu, Stefan D. Oosterhout, Lucas Stehouwer, Giordano Scappucci, M. D. Stewart, Jr., Menno Veldhorst, Francesco Borsoi, Justyna P. Zwolak

Comments: 12 pages, 4 figures

Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Quantum Physics (quant-ph)
[110] arXiv:2512.24766 (cross-list from cs.RO) [pdf, ps, other]: Title: Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Authors: Karthik Dharmarajan, Wenlong Huang, Jiajun Wu, Li Fei-Fei, Ruohan Zhang

Comments: Project website: this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2512.24499 (cross-list from cs.CR) [pdf, ps, other]: Title: Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways

Authors: Vladimir Frants, Sos Agaian

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2512.24492 (cross-list from eess.IV) [pdf, ps, other]: Title: Automated Classification of First-Trimester Fetal Heart Views Using Ultrasound-Specific Self-Supervised Learning

Authors: Youssef Megahed, Aylin Erman, Robin Ducharme, Mark C. Walker, Steven Hawken, Adrian D. C. Chan

Comments: 7 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2512.24404 (cross-list from cs.LG) [pdf, ps, other]: Title: Lifting Vision: Ground to Aerial Localization with Reasoning Guided Planning

Authors: Soham Pahari, M. Srinivas

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2512.24384 (cross-list from cs.RO) [pdf, ps, other]: Title: Geometric Multi-Session Map Merging with Learned Local Descriptors

Authors: Yanlong Ma, Nakul S. Joshi, Christa S. Robison, Philip R. Osteen, Brett T. Lopez

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2512.24212 (cross-list from cs.RO) [pdf, ps, other]: Title: RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Contextual Adaptation

Authors: Ming-Ming Yu, Yi Chen, Börje F. Karlsson, Wenjun Wu

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2512.24138 (cross-list from cs.LG) [pdf, ps, other]: Title: GARDO: Reinforcing Diffusion Models without Reward Hacking

Authors: Haoran He, Yuxiao Ye, Jie Liu, Jiajun Liang, Zhiyong Wang, Ziyang Yuan, Xintao Wang, Hangyu Mao, Pengfei Wan, Ling Pan

Comments: 17 pages. Project: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2512.24117 (cross-list from eess.IV) [pdf, ps, other]: Title: Targeted Semantic Segmentation of Himalayan Glacial Lakes Using Time-Series SAR: Towards Automated GLOF Early Warning

Authors: Pawan Adhikari, Satish Raj Regmi, Hari Ram Shrestha

Comments: 12 pages, 6 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2512.24019 (cross-list from quant-ph) [pdf, ps, other]: Title: One-Shot Structured Pruning of Quantum Neural Networks via $q$-Group Engineering and Quantum Geometric Metrics

Authors: Haijian Shao, Wei Liu, Xing Deng, Yingtao Jiang

Comments: 10 pages, 2 figures

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2512.23906 (cross-list from eess.SP) [pdf, ps, other]: Title: A multimodal Transformer for InSAR-based ground deformation forecasting with cross-site generalization across Europe

Authors: Wendong Yao, Binhua Huang, Soumyabrata Dev

Comments: submitted to ISPRS Journal of Photogrammetry and Remote Sensing for review

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120] arXiv:2512.23864 (cross-list from cs.RO) [pdf, ps, other]: Title: Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation

Authors: Guo Ye, Zexi Zhang, Xu Zhao, Shang Wu, Haoran Lu, Shihan Lu, Han Liu

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2512.23766 (cross-list from cs.LG) [pdf, ps, other]: Title: A Granular Grassmannian Clustering Framework via the Schubert Variety of Best Fit

Authors: Karim Salta, Michael Kirby, Chris Peterson

Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[122] arXiv:2512.23757 (cross-list from eess.IV) [pdf, ps, other]: Title: Leveraging Machine Learning for Early Detection of Lung Diseases

Authors: Bahareh Rahmani, Harsha Reddy Bindela, Rama Kanth Reddy Gosula, Krishna Yedubati, Mohammad Amir Salari, Leslie Hinyard, Payam Norouzzadeh, Eli Snir, Martin Schoen

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2512.23739 (cross-list from cs.CL) [pdf, ps, other]: Title: Break Out the Silverware -- Semantic Understanding of Stored Household Items

Authors: Michaela Levi-Richter, Reuth Mirsky, Oren Glickman

Comments: Poster presented at the Israeli Seminar on Computational Linguistics 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[124] arXiv:2512.23726 (cross-list from physics.med-ph) [pdf, ps, other]: Title: q3-MuPa: Quick, Quiet, Quantitative Multi-Parametric MRI using Physics-Informed Diffusion Models

Authors: Shishuai Wang, Florian Wiesinger, Noemi Sgambelluri, Carolin Pirkl, Stefan Klein, Juan A. Hernandez-Tamames, Dirk H.J. Poot

Subjects: Medical Physics (physics.med-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Tue, 30 Dec 2025 (showing first 5 of 220 entries)

[125] arXiv:2512.23709 [pdf, ps, other]: Title: Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion

Authors: Hau-Shiang Shiu, Chin-Yang Lin, Zhixiang Wang, Chi-Wei Hsiao, Po-Fan Yu, Yu-Chih Chen, Yu-Lun Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2512.23705 [pdf, ps, other]: Title: Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Authors: Shaocong Xu, Songlin Wei, Qizhe Wei, Zheng Geng, Hong Li, Licheng Shen, Qianpu Sun, Shu Han, Bin Ma, Bohan Li, Chongjie Ye, Yuhang Zheng, Nan Wang, Saining Zhang, Hao Zhao

Comments: Project Page: this https URL; Code: this https URL; Dataset: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2512.23667 [pdf, ps, other]: Title: IDT: A Physically Grounded Transformer for Feed-Forward Multi-View Intrinsic Decomposition

Authors: Kang Du, Yirui Guan, Zeyu Wang

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2512.23646 [pdf, ps, other]: Title: OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

Authors: Keda Tao, Wenjie Du, Bohan Yu, Weiqiang Wang, Jian Liu, Huan Wang

Comments: Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129] arXiv:2512.23635 [pdf, ps, other]: Title: Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception

Authors: Xiaoyu Li, Peidong Li, Xian Wu, Long Shi, Dedong Liu, Yitao Wu, Jiajia Fu, Dixiao Cui, Lijun Zhao, Lining Sun

Comments: Accepted to AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
