We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 703

[ total of 778 entries: 1-25 | ... | 629-653 | 654-678 | 679-703 | 704-728 | 729-753 | 754-778 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 25 of 278 entries)

[704]  arXiv:2512.00294 [pdf, ps, other]
Title: Words into World: A Task-Adaptive Agent for Language-Guided Spatial Retrieval in AR
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[705]  arXiv:2512.00281 [pdf, ps, other]
Title: Rethinking Lung Cancer Screening: AI Nodule Detection and Diagnosis Outperforms Radiologists, Leading Models, and Standards Beyond Size and Growth
Comments: 25 pages, 8 figures, with supplementary information containing 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[706]  arXiv:2512.00275 [pdf, ps, other]
Title: HIMOSA: Efficient Remote Sensing Image Super-Resolution with Hierarchical Mixture of Sparse Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[707]  arXiv:2512.00269 [pdf, ps, other]
Title: USB: Unified Synthetic Brain Framework for Bidirectional Pathology-Healthy Generation and Editing
Authors: Jun Wang, Peirong Liu
Comments: 16 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[708]  arXiv:2512.00264 [pdf, ps, other]
Title: HeartFormer: Semantic-Aware Dual-Structure Transformers for 3D Four-Chamber Cardiac Point Cloud Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[709]  arXiv:2512.00261 [pdf, ps, other]
Title: UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations
Comments: Camera-ready for WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[710]  arXiv:2512.00255 [pdf, ps, other]
Title: Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[711]  arXiv:2512.00226 [pdf, ps, other]
Title: DenseScan: Advancing 3D Scene Understanding with 2D Dense Annotation
Authors: Zirui Wang, Tao Zhang
Comments: Workshop on Space in Vision, Language, and Embodied AI at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[712]  arXiv:2512.00208 [pdf, ps, other]
Title: ReactionMamba: Generating Short &Long Human Reaction Sequences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713]  arXiv:2512.00198 [pdf, ps, other]
Title: Mammo-FM: Breast-specific foundational model for Integrated Mammographic Diagnosis, Prognosis, and Reporting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714]  arXiv:2512.00194 [pdf, ps, other]
Title: AutocleanEEG ICVision: Automated ICA Artifact Classification Using Vision-Language AI
Comments: 6 pages, 8 figures
Journal-ref: Conference ICMI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[715]  arXiv:2512.00179 [pdf, ps, other]
Title: Efficient Edge-Compatible CNN for Speckle-Based Material Recognition in Laser Cutting Systems
Authors: Mohamed Abdallah Salem (North Dakota State University), Nourhan Zein Diab (New Mansoura University)
Comments: Copyright 2025 IEEE. This is the author's version of the work that has been Accepted for publication in the Proceedings of the 2025 IEEE The 35th International Conference on Computer Theory and Applications (ICCTA 2025). Final published version will be available on IEEE Xplore
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[716]  arXiv:2512.00130 [pdf, ps, other]
Title: Local and Global Context-and-Object-part-Aware Superpixel-based Data Augmentation for Deep Visual Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[717]  arXiv:2512.00129 [pdf, ps, other]
Title: Analysis of Incursive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[718]  arXiv:2512.00125 [pdf, ps, other]
Title: Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance
Comments: Submitted to the NAMRC 54
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[719]  arXiv:2512.00117 [pdf, ps, other]
Title: TinyViT: Field Deployable Transformer Pipeline for Solar Panel Surface Fault and Severity Screening
Comments: 3pages, 2figures,ICGVIP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[720]  arXiv:2512.00103 [pdf, ps, other]
Title: Comparative Analysis of Vision Transformer, Convolutional, and Hybrid Architectures for Mental Health Classification Using Actigraphy-Derived Images
Authors: Ifeanyi Okala
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[721]  arXiv:2512.00091 [pdf, ps, other]
Title: Deep Filament Extraction for 3D Concrete Printing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722]  arXiv:2512.00089 [pdf, ps, other]
Title: TeleViT1.0: Teleconnection-aware Vision Transformers for Subseasonal to Seasonal Wildfire Pattern Forecasts
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723]  arXiv:2512.00088 [pdf, ps, other]
Title: SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
Authors: Mohammad Zare
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[724]  arXiv:2512.00087 [pdf, ps, other]
Title: Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data
Comments: This article has been accepted for publication in the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725]  arXiv:2512.00086 [pdf, ps, other]
Title: Multi-modal On-Device Learning for Monocular Depth Estimation on Ultra-low-power MCUs
Comments: 14 pages, 9 figures, 3 tables. Associated open-source release available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726]  arXiv:2512.00084 [pdf, ps, other]
Title: A Fast and Efficient Modern BERT based Text-Conditioned Diffusion Model for Medical Image Segmentation
Comments: 15 pages, 3 figures, Accepted in Slide 3 10th International Conference on Computer Vision & Image Processing (CVIP 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[727]  arXiv:2512.00082 [pdf, ps, other]
Title: Exploring Diagnostic Prompting Approach for Multimodal LLM-based Visual Complexity Assessment: A Case Study of Amazon Search Result Pages
Comments: 9 pages, 4 figures, 9 tables. Study on diagnostic prompting for multimodal LLM-based visual complexity assessment of Amazon search result pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[728]  arXiv:2512.00080 [pdf, ps, other]
Title: Conceptual Evaluation of Deep Visual Stereo Odometry for the MARWIN Radiation Monitoring Robot in Accelerator Tunnels
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[ total of 778 entries: 1-25 | ... | 629-653 | 654-678 | 679-703 | 704-728 | 729-753 | 754-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)