Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 678

[ total of 778 entries: 1-50 | ... | 529-578 | 579-628 | 629-678 | 679-728 | 729-778 ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[679]  arXiv:2512.00456 [pdf, ps, other]
Title: CausalAffect: Causal Discovery for Facial Affective Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[680]  arXiv:2512.00450 [pdf, ps, other]
Title: RecruitView: A Multimodal Dataset for Predicting Personality and Interview Performance for Human Resources Applications
Comments: 20 pages, 10 figures, 10 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681]  arXiv:2512.00438 [pdf, ps, other]
Title: FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[682]  arXiv:2512.00428 [pdf, ps, other]
Title: Recognizing Pneumonia in Real-World Chest X-rays with a Classifier Trained with Images Synthetically Generated by Nano Banana
Comments: 9 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683]  arXiv:2512.00425 [pdf, ps, other]
Title: What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684]  arXiv:2512.00424 [pdf, ps, other]
Title: Recovering Origin Destination Flows from Bus CCTV: Early Results from Nairobi and Kigali
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685]  arXiv:2512.00422 [pdf, ps, other]
Title: PhysGen: Physically Grounded 3D Shape Generation for Industrial Design
Comments: 14 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686]  arXiv:2512.00413 [pdf, ps, other]
Title: SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[687]  arXiv:2512.00408 [pdf, ps, other]
Title: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[688]  arXiv:2512.00395 [pdf, ps, other]
Title: Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689]  arXiv:2512.00387 [pdf, ps, other]
Title: WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing
Comments: 32 pages, 20 figures. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690]  arXiv:2512.00385 [pdf, ps, other]
Title: EZ-SP: Fast and Lightweight Superpoint-Based 3D Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691]  arXiv:2512.00381 [pdf, ps, other]
Title: Pore-scale Image Patch Dataset and A Comparative Evaluation of Pore-scale Facial Features
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692]  arXiv:2512.00369 [pdf, ps, other]
Title: POLARIS: Projection-Orthogonal Least Squares for Robust and Adaptive Inversion in Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[693]  arXiv:2512.00368 [pdf, ps, other]
Title: THCRL: Trusted Hierarchical Contrastive Representation Learning for Multi-View Clustering
Authors: Jian Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[694]  arXiv:2512.00365 [pdf, ps, other]
Title: Towards aligned body representations in vision models
Comments: Andrea Procopio and Andrey Gizdov have equal contributions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[695]  arXiv:2512.00363 [pdf, ps, other]
Title: MM-DETR: An Efficient Multimodal Detection Transformer with Mamba-Driven Dual-Granularity Fusion and Frequency-Aware Modality Adapters
Comments: Manuscript submitted to IEEE Transactions on Geoscience and Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[696]  arXiv:2512.00355 [pdf, ps, other]
Title: SMamDiff: Spatial Mamba for Stochastic Human Motion Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697]  arXiv:2512.00345 [pdf, ps, other]
Title: mmPred: Radar-based Human Motion Prediction in the Dark
Comments: This paper is accepted by AAAI-2026
Journal-ref: AAAI-2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698]  arXiv:2512.00343 [pdf, ps, other]
Title: Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699]  arXiv:2512.00336 [pdf, ps, other]
Title: MVAD: A Comprehensive Multimodal Video-Audio Dataset for AIGC Detection
Comments: 7 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700]  arXiv:2512.00327 [pdf, ps, other]
Title: Odometry Without Correspondence from Inertially Constrained Ruled Surfaces
Comments: 14 pages, 13 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701]  arXiv:2512.00310 [pdf, ps, other]
Title: ART-ASyn: Anatomy-aware Realistic Texture-based Anomaly Synthesis Framework for Chest X-Rays
Comments: Accepted in WACV2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[702]  arXiv:2512.00308 [pdf, ps, other]
Title: Optimizing Distributional Geometry Alignment with Optimal Transport for Generative Dataset Distillation
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703]  arXiv:2512.00300 [pdf, ps, other]
Title: TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
Comments: 14 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704]  arXiv:2512.00294 [pdf, ps, other]
Title: Words into World: A Task-Adaptive Agent for Language-Guided Spatial Retrieval in AR
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[705]  arXiv:2512.00281 [pdf, ps, other]
Title: Rethinking Lung Cancer Screening: AI Nodule Detection and Diagnosis Outperforms Radiologists, Leading Models, and Standards Beyond Size and Growth
Comments: 25 pages, 8 figures, with supplementary information containing 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[706]  arXiv:2512.00275 [pdf, ps, other]
Title: HIMOSA: Efficient Remote Sensing Image Super-Resolution with Hierarchical Mixture of Sparse Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[707]  arXiv:2512.00269 [pdf, ps, other]
Title: USB: Unified Synthetic Brain Framework for Bidirectional Pathology-Healthy Generation and Editing
Authors: Jun Wang, Peirong Liu
Comments: 16 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[708]  arXiv:2512.00264 [pdf, ps, other]
Title: HeartFormer: Semantic-Aware Dual-Structure Transformers for 3D Four-Chamber Cardiac Point Cloud Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[709]  arXiv:2512.00261 [pdf, ps, other]
Title: UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations
Comments: Camera-ready for WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[710]  arXiv:2512.00255 [pdf, ps, other]
Title: Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[711]  arXiv:2512.00226 [pdf, ps, other]
Title: DenseScan: Advancing 3D Scene Understanding with 2D Dense Annotation
Authors: Zirui Wang, Tao Zhang
Comments: Workshop on Space in Vision, Language, and Embodied AI at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[712]  arXiv:2512.00208 [pdf, ps, other]
Title: ReactionMamba: Generating Short & Long Human Reaction Sequences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713]  arXiv:2512.00198 [pdf, ps, other]
Title: Mammo-FM: Breast-specific foundational model for Integrated Mammographic Diagnosis, Prognosis, and Reporting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714]  arXiv:2512.00194 [pdf, ps, other]
Title: AutocleanEEG ICVision: Automated ICA Artifact Classification Using Vision-Language AI
Comments: 6 pages, 8 figures
Journal-ref: Conference ICMI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[715]  arXiv:2512.00179 [pdf, ps, other]
Title: Efficient Edge-Compatible CNN for Speckle-Based Material Recognition in Laser Cutting Systems
Authors: Mohamed Abdallah Salem (North Dakota State University), Nourhan Zein Diab (New Mansoura University)
Comments: Copyright 2025 IEEE. This is the author's version of the work that has been accepted for publication in the Proceedings of the 2025 IEEE 35th International Conference on Computer Theory and Applications (ICCTA 2025). The final published version will be available on IEEE Xplore.
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[716]  arXiv:2512.00130 [pdf, ps, other]
Title: Local and Global Context-and-Object-part-Aware Superpixel-based Data Augmentation for Deep Visual Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[717]  arXiv:2512.00129 [pdf, ps, other]
Title: Analysis of Incursive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[718]  arXiv:2512.00125 [pdf, ps, other]
Title: Hybrid Synthetic Data Generation with Domain Randomization Enables Zero-Shot Vision-Based Part Inspection Under Extreme Class Imbalance
Comments: Submitted to the NAMRC 54
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[719]  arXiv:2512.00117 [pdf, ps, other]
Title: TinyViT: Field Deployable Transformer Pipeline for Solar Panel Surface Fault and Severity Screening
Comments: 3 pages, 2 figures, ICGVIP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[720]  arXiv:2512.00103 [pdf, ps, other]
Title: Comparative Analysis of Vision Transformer, Convolutional, and Hybrid Architectures for Mental Health Classification Using Actigraphy-Derived Images
Authors: Ifeanyi Okala
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[721]  arXiv:2512.00091 [pdf, ps, other]
Title: Deep Filament Extraction for 3D Concrete Printing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722]  arXiv:2512.00089 [pdf, ps, other]
Title: TeleViT1.0: Teleconnection-aware Vision Transformers for Subseasonal to Seasonal Wildfire Pattern Forecasts
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723]  arXiv:2512.00088 [pdf, ps, other]
Title: SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features
Authors: Mohammad Zare
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[724]  arXiv:2512.00087 [pdf, ps, other]
Title: Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data
Comments: This article has been accepted for publication in the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725]  arXiv:2512.00086 [pdf, ps, other]
Title: Multi-modal On-Device Learning for Monocular Depth Estimation on Ultra-low-power MCUs
Comments: 14 pages, 9 figures, 3 tables. Associated open-source release available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726]  arXiv:2512.00084 [pdf, ps, other]
Title: A Fast and Efficient Modern BERT based Text-Conditioned Diffusion Model for Medical Image Segmentation
Comments: 15 pages, 3 figures, Accepted in Slide 3 10th International Conference on Computer Vision & Image Processing (CVIP 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[727]  arXiv:2512.00082 [pdf, ps, other]
Title: Exploring Diagnostic Prompting Approach for Multimodal LLM-based Visual Complexity Assessment: A Case Study of Amazon Search Result Pages
Comments: 9 pages, 4 figures, 9 tables. Study on diagnostic prompting for multimodal LLM-based visual complexity assessment of Amazon search result pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[728]  arXiv:2512.00080 [pdf, ps, other]
Title: Conceptual Evaluation of Deep Visual Stereo Odometry for the MARWIN Radiation Monitoring Robot in Accelerator Tunnels
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)