Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 678

[ total of 778 entries: 1-50 | ... | 529-578 | 579-628 | 629-678 | 679-728 | 729-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[679] arXiv:2512.00456 [pdf, ps, other]: Title: CausalAffect: Causal Discovery for Facial Affective Understanding

Authors: Guanyu Hu, Tangzheng Lian, Dimitrios Kollias, Oya Celiktutan, Xinyu Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[680] arXiv:2512.00450 [pdf, ps, other]: Title: RecruitView: A Multimodal Dataset for Predicting Personality and Interview Performance for Human Resources Applications

Authors: Amit Kumar Gupta, Farhan Sheth, Hammad Shaikh, Dheeraj Kumar, Angkul Puniya, Deepak Panwar, Sandeep Chaurasia, Priya Mathur

Comments: 20 pages, 10 figures, 10 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681] arXiv:2512.00438 [pdf, ps, other]: Title: FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal

Authors: Hang Xu, Linjiang Huang, Feng Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[682] arXiv:2512.00428 [pdf, ps, other]: Title: Recognizing Pneumonia in Real-World Chest X-rays with a Classifier Trained with Images Synthetically Generated by Nano Banana

Authors: Jiachuan Peng, Kyle Lam, Jianing Qiu

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683] arXiv:2512.00425 [pdf, ps, other]: Title: What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

Authors: Minh-Quan Le, Yuanzhi Zhu, Vicky Kalogeiton, Dimitris Samaras

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684] arXiv:2512.00424 [pdf, ps, other]: Title: Recovering Origin Destination Flows from Bus CCTV: Early Results from Nairobi and Kigali

Authors: Nthenya Kyatha, Jay Taneja

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685] arXiv:2512.00422 [pdf, ps, other]: Title: PhysGen: Physically Grounded 3D Shape Generation for Industrial Design

Authors: Yingxuan You, Chen Zhao, Hantao Zhang, Mingda Xu, Pascal Fua

Comments: 14 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686] arXiv:2512.00413 [pdf, ps, other]: Title: SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control

Authors: Ji Gan, Lingxu Chen, Jiaxu Leng, Xinbo Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[687] arXiv:2512.00408 [pdf, ps, other]: Title: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

Authors: Lingdong Wang, Guan-Ming Su, Divya Kothandaraman, Tsung-Wei Huang, Mohammad Hajiesmaili, Ramesh K. Sitaraman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[688] arXiv:2512.00395 [pdf, ps, other]: Title: Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction

Authors: Jiazhen Liu, Mingkuan Feng, Long Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689] arXiv:2512.00387 [pdf, ps, other]: Title: WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing

Authors: Kaihang Pan, Weile Chen, Haiyi Qiu, Qifan Yu, Wendong Bu, Zehan Wang, Yun Zhu, Juncheng Li, Siliang Tang

Comments: 32 pages, 20 figures. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690] arXiv:2512.00385 [pdf, ps, other]: Title: EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation

Authors: Louis Geist, Loic Landrieu, Damien Robert

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691] arXiv:2512.00381 [pdf, ps, other]: Title: Pore-scale Image Patch Dataset and A Comparative Evaluation of Pore-scale Facial Features

Authors: Dong Li, HuaLiang Lin, JiaYu Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692] arXiv:2512.00369 [pdf, ps, other]: Title: POLARIS: Projection-Orthogonal Least Squares for Robust and Adaptive Inversion in Diffusion Models

Authors: Wenshuo Chen, Haosen Li, Shaofeng Liang, Lei Wang, Haozhe Jia, Kaishen Yuan, Jieming Wu, Bowen Tian, Yutao Yue

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[693] arXiv:2512.00368 [pdf, ps, other]: Title: THCRL: Trusted Hierarchical Contrastive Representation Learning for Multi-View Clustering

Authors: Jian Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[694] arXiv:2512.00365 [pdf, ps, other]: Title: Towards aligned body representations in vision models

Authors: Andrey Gizdov, Andrea Procopio, Yichen Li, Daniel Harari, Tomer Ullman

Comments: Andrea Procopio and Andrey Gizdov have equal contributions

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[695] arXiv:2512.00363 [pdf, ps, other]: Title: MM-DETR: An Efficient Multimodal Detection Transformer with Mamba-Driven Dual-Granularity Fusion and Frequency-Aware Modality Adapters

Authors: Jianhong Han, Yupei Wang, Yuan Zhang, Liang Chen

Comments: Manuscript submitted to IEEE Transactions on Geoscience and Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[696] arXiv:2512.00355 [pdf, ps, other]: Title: SMamDiff: Spatial Mamba for Stochastic Human Motion Prediction

Authors: Junqiao Fan, Pengfei Liu, Haocong Rao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697] arXiv:2512.00345 [pdf, ps, other]: Title: mmPred: Radar-based Human Motion Prediction in the Dark

Authors: Junqiao Fan, Haocong Rao, Jiarui Zhang, Jianfei Yang, Lihua Xie

Comments: This paper is accepted by AAAI-2026

Journal-ref: AAAI-2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698] arXiv:2512.00343 [pdf, ps, other]: Title: Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models

Authors: Zhongqi Wang, Jie Zhang, Shiguang Shan, Xilin Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699] arXiv:2512.00336 [pdf, ps, other]: Title: MVAD : A Comprehensive Multimodal Video-Audio Dataset for AIGC Detection

Authors: Mengxue Hu, Yunfeng Diao, Changtao Miao, Jianshu Li, Zhe Li, Joey Tianyi Zhou

Comments: 7 pages,2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700] arXiv:2512.00327 [pdf, ps, other]: Title: Odometry Without Correspondence from Inertially Constrained Ruled Surfaces

Authors: Chenqi Zhu, Levi Burner, Yiannis Aloimonos

Comments: 14 pages, 13 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701] arXiv:2512.00310 [pdf, ps, other]: Title: ART-ASyn: Anatomy-aware Realistic Texture-based Anomaly Synthesis Framework for Chest X-Rays

Authors: Qinyi Cao, Jianan Fan, Weidong Cai

Comments: Accepted in WACV2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[702] arXiv:2512.00308 [pdf, ps, other]: Title: Optimizing Distributional Geometry Alignment with Optimal Transport for Generative Dataset Distillation

Authors: Xiao Cui, Yulei Qin, Wengang Zhou, Hongsheng Li, Houqiang Li

Comments: NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703] arXiv:2512.00300 [pdf, ps, other]: Title: TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion

Authors: Rui Qian, Haozhi Cao, Tianchen Deng, Tianxin Hu, Weixiang Guo, Shenghai Yuan, Lihua Xie

Comments: 14 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704] arXiv:2512.00294 [pdf, ps, other]: Title: Words into World: A Task-Adaptive Agent for Language-Guided Spatial Retrieval in AR

Authors: Lixing Guo, Tobias Höllerer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[705] arXiv:2512.00281 [pdf, ps, other]: Title: Rethinking Lung Cancer Screening: AI Nodule Detection and Diagnosis Outperforms Radiologists, Leading Models, and Standards Beyond Size and Growth

Authors: Sylvain Bodard, Pierre Baudot, Benjamin Renoust, Charles Voyton, Gwendoline De Bie, Ezequiel Geremia, Van-Khoa Le, Danny Francis, Pierre-Henri Siot, Yousra Haddou, Vincent Bobin, Jean-Christophe Brisset, Carey C. Thomson, Valerie Bourdes, Benoit Huet

Comments: 25 pages, 8 figures, with supplementary information containing 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[706] arXiv:2512.00275 [pdf, ps, other]: Title: HIMOSA: Efficient Remote Sensing Image Super-Resolution with Hierarchical Mixture of Sparse Attention

Authors: Yi Liu, Yi Wan, Xinyi Liu, Qiong Wu, Panwang Xia, Xuejun Huang, Yongjun Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[707] arXiv:2512.00269 [pdf, ps, other]: Title: USB: Unified Synthetic Brain Framework for Bidirectional Pathology-Healthy Generation and Editing

Authors: Jun Wang, Peirong Liu

Comments: 16 pages, 17 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[708] arXiv:2512.00264 [pdf, ps, other]: Title: HeartFormer: Semantic-Aware Dual-Structure Transformers for 3D Four-Chamber Cardiac Point Cloud Reconstruction

Authors: Zhengda Ma, Abhirup Banerjee

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[709] arXiv:2512.00261 [pdf, ps, other]: Title: UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations

Authors: Yuzhen Hu, Saurabh Prasad

Comments: Camera-ready for WACV 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[710] arXiv:2512.00255 [pdf, ps, other]: Title: Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views

Authors: Kunwar Maheep Singh, Jianchun Chen, Vladislav Golyanik, Stephan J. Garbin, Thabo Beeler, Rishabh Dabral, Marc Habermann, Christian Theobalt

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[711] arXiv:2512.00226 [pdf, ps, other]: Title: DenseScan: Advancing 3D Scene Understanding with 2D Dense Annotation

Authors: Zirui Wang, Tao Zhang

Comments: Workshop on Space in Vision, Language, and Embodied AI at NeurIPS 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[712] arXiv:2512.00208 [pdf, ps, other]: Title: ReactionMamba: Generating Short &Long Human Reaction Sequences

Authors: Hajra Anwar Beg, Baptiste Chopin, Hao Tang, Mohamed Daoudi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713] arXiv:2512.00198 [pdf, ps, other]: Title: Mammo-FM: Breast-specific foundational model for Integrated Mammographic Diagnosis, Prognosis, and Reporting

Authors: Shantanu Ghosh, Vedant Parthesh Joshi, Rayan Syed, Aya Kassem, Abhishek Varshney, Payel Basak, Weicheng Dai, Judy Wawira Gichoya, Hari M. Trivedi, Imon Banerjee, Shyam Visweswaran, Clare B. Poynton, Kayhan Batmanghelich

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714] arXiv:2512.00194 [pdf, ps, other]: Title: AutocleanEEG ICVision: Automated ICA Artifact Classification Using Vision-Language AI

Authors: Zag ElSayed, Grace Westerkamp, Gavin Gammoh, Yanchen Liu, Peyton Siekierski, Craig Erickson, Ernest Pedapati

Comments: 6 pages, 8 figures

Journal-ref: Conference ICMI2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[715] arXiv:2512.00179 [pdf, ps, other]: Title: Efficient Edge-Compatible CNN for Speckle-Based Material Recognition in Laser Cutting Systems

Authors: Mohamed Abdallah Salem (North Dakota State University), Nourhan Zein Diab (New Mansoura University)

Comments: Copyright 2025 IEEE. This is the author's version of the work that has been Accepted for publication in the Proceedings of the 2025 IEEE The 35th International Conference on Computer Theory and Applications (ICCTA 2025). Final published version will be available on IEEE Xplore

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[716] arXiv:2512.00130 [pdf, ps, other]: Title: Local and Global Context-and-Object-part-Aware Superpixel-based Data Augmentation for Deep Visual Recognition

Authors: Fadi Dornaika, Danyang Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[717] arXiv:2512.00129 [pdf, ps, other]: Title: Analysis of Incursive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation

Authors: Jayan Adhikari, Prativa Joshi, Susish Baral

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[718] arXiv:2512.00125 [pdf, ps, other]: Title: Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance

Authors: Ruo-Syuan Mei, Sixian Jia, Guangze Li, Soo Yeon Lee, Brian Musser, William Keller, Sreten Zakula, Jorge Arinez, Chenhui Shao

Comments: Submitted to the NAMRC 54

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[719] arXiv:2512.00117 [pdf, ps, other]: Title: TinyViT: Field Deployable Transformer Pipeline for Solar Panel Surface Fault and Severity Screening

Authors: Ishwaryah Pandiarajan, Mohamed Mansoor Roomi Sindha, Uma Maheswari Pandyan, Sharafia N

Comments: 3pages, 2figures,ICGVIP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[720] arXiv:2512.00103 [pdf, ps, other]: Title: Comparative Analysis of Vision Transformer, Convolutional, and Hybrid Architectures for Mental Health Classification Using Actigraphy-Derived Images

Authors: Ifeanyi Okala

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[721] arXiv:2512.00091 [pdf, ps, other]: Title: Deep Filament Extraction for 3D Concrete Printing

Authors: Karam Mawas, Mehdi Maboudi, Pedro Achanccaray, Markus Gerke

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722] arXiv:2512.00089 [pdf, ps, other]: Title: TeleViT1.0: Teleconnection-aware Vision Transformers for Subseasonal to Seasonal Wildfire Pattern Forecasts

Authors: Ioannis Prapas, Nikolaos Papadopoulos, Nikolaos-Ioannis Bountos, Dimitrios Michail, Gustau Camps-Valls, Ioannis Papoutsis

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723] arXiv:2512.00088 [pdf, ps, other]: Title: SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features

Authors: Mohammad Zare

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[724] arXiv:2512.00087 [pdf, ps, other]: Title: Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data

Authors: Ivo Bueno, Ruikun Hou, Babette Bühler, Tim Fütterer, James Drimalla, Jonathan Kyle Foster, Peter Youngs, Peter Gerjets, Ulrich Trautwein, Enkelejda Kasneci

Comments: This article has been accepted for publication in the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725] arXiv:2512.00086 [pdf, ps, other]: Title: Multi-modal On-Device Learning for Monocular Depth Estimation on Ultra-low-power MCUs

Authors: Davide Nadalini, Manuele Rusci, Elia Cereda, Luca Benini, Francesco Conti, Daniele Palossi

Comments: 14 pages, 9 figures, 3 tables. Associated open-source release available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726] arXiv:2512.00084 [pdf, ps, other]: Title: A Fast and Efficient Modern BERT based Text-Conditioned Diffusion Model for Medical Image Segmentation

Authors: Venkata Siddharth Dhara, Pawan Kumar

Comments: 15 pages, 3 figures, Accepted in Slide 3 10th International Conference on Computer Vision & Image Processing (CVIP 2026)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[727] arXiv:2512.00082 [pdf, ps, other]: Title: Exploring Diagnostic Prompting Approach for Multimodal LLM-based Visual Complexity Assessment: A Case Study of Amazon Search Result Pages

Authors: Divendar Murtadak, Yoon Kim, Trilokya Akula

Comments: 9 pages, 4 figures, 9 tables. Study on diagnostic prompting for multimodal LLM-based visual complexity assessment of Amazon search result pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[728] arXiv:2512.00080 [pdf, ps, other]: Title: Conceptual Evaluation of Deep Visual Stereo Odometry for the MARWIN Radiation Monitoring Robot in Accelerator Tunnels

Authors: André Dehne, Juri Zach, Peer Stelldinger

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)

[ total of 778 entries: 1-50 | ... | 529-578 | 579-628 | 629-678 | 679-728 | 729-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 678

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)