We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 29

[ total of 754 entries: 1-50 | 30-79 | 80-129 | 130-179 | 180-229 | ... | 730-754 ]
[ showing 50 entries per page: fewer | more | all ]

Thu, 11 Dec 2025 (continued, showing 50 of 135 entries)

[30]  arXiv:2512.09580 [pdf, ps, other]
Title: Content-Adaptive Image Retouching Guided by Attribute-Based Text Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2512.09579 [pdf, ps, other]
Title: Hands-on Evaluation of Visual Transformers for Object Recognition and Detection
Journal-ref: 37th International Conference on Tools with Artificial Intelligence (ICTAI 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[32]  arXiv:2512.09576 [pdf, ps, other]
Title: Seeing Soil from Space: Towards Robust and Scalable Remote Soil Nutrient Analysis
Authors: David Seu (1), Nicolas Longepe (2), Gabriel Cioltea (1), Erik Maidik (1), Calin Andrei (1) ((1) CO2 Angels, Cluj-Napoca, Romania, (2) European Space Agency Phi-Lab, Frascati, Italy)
Comments: 23 pages, 13 figures, 13 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[33]  arXiv:2512.09573 [pdf, ps, other]
Title: Investigate the Low-level Visual Perception in Vision-Language based Image Quality Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2512.09565 [pdf, ps, other]
Title: From Graphs to Gates: DNS-HyXNet, A Lightweight and Deployable Sequential Model for Real-Time DNS Tunnel Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2512.09555 [pdf, ps, other]
Title: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment
Comments: Accepted to the ICONIP (International Conference on Neural Information Processing), 2025
Journal-ref: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment. In: Taniguchi, T., et al. Neural Information Processing. ICONIP 2025. Lecture Notes in Computer Science, vol 16310. Springer, Singapore
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2512.09546 [pdf, ps, other]
Title: A Dual-Domain Convolutional Network for Hyperspectral Single-Image Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2512.09525 [pdf, ps, other]
Title: Masked Registration and Autoencoding of CT Images for Predictive Tibia Reconstruction
Comments: DGM4MICCAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2512.09497 [pdf, ps, other]
Title: Gradient-Guided Learning Network for Infrared Small Target Detection
Comments: Accepted by GRSL 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2512.09492 [pdf, ps, other]
Title: StateSpace-SSL: Linear-Time Self-supervised Learning for Plant Disease Detectio
Comments: Accepted to AAAI workshop (AgriAI 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2512.09489 [pdf, ps, other]
Title: MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images
Comments: 8 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2512.09477 [pdf, ps, other]
Title: Color encoding in Latent Space of Stable Diffusion Models
Comments: 6 pages, 8 figures, Color Imaging Conference 33
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[42]  arXiv:2512.09471 [pdf, ps, other]
Title: Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[43]  arXiv:2512.09463 [pdf, ps, other]
Title: Privacy-Preserving Computer Vision for Industry: Three Case Studies in Human-Centric Manufacturing
Comments: Accepted to the AAAI26 HCM workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[44]  arXiv:2512.09461 [pdf, ps, other]
Title: Cytoplasmic Strings Analysis in Human Embryo Time-Lapse Videos using Deep Learning Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[45]  arXiv:2512.09446 [pdf, ps, other]
Title: Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2512.09441 [pdf, ps, other]
Title: Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[47]  arXiv:2512.09435 [pdf, ps, other]
Title: UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48]  arXiv:2512.09423 [pdf, ps, other]
Title: FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2512.09422 [pdf, ps, other]
Title: InfoMotion: A Graph-Based Approach to Video Dataset Distillation for Echocardiography
Comments: Accepted at MICAD 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2512.09418 [pdf, ps, other]
Title: Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis
Comments: Accepted at MICAD 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2512.09417 [pdf, ps, other]
Title: DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2512.09407 [pdf, ps, other]
Title: Generative Point Cloud Registration
Comments: 14 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2512.09402 [pdf, ps, other]
Title: Wasserstein-Aligned Hyperbolic Multi-View Clustering
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2512.09393 [pdf, ps, other]
Title: Detection and Localization of Subdural Hematoma Using Deep Learning on Computed Tomography
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[55]  arXiv:2512.09383 [pdf, ps, other]
Title: Perception-Inspired Color Space Design for Photo White Balance Editing
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2512.09375 [pdf, ps, other]
Title: Log NeRF: Comparing Spaces for Learning Radiance Fields
Authors: Sihe Chen (Northeastern University), Luv Verma (Northeastern University), Bruce A. Maxwell (Northeastern University)
Comments: The 36th British Machine Vision Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57]  arXiv:2512.09373 [pdf, ps, other]
Title: FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement
Comments: 13 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2512.09364 [pdf, ps, other]
Title: ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[59]  arXiv:2512.09363 [pdf, ps, other]
Title: StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2512.09354 [pdf, ps, other]
Title: Video-QTR: Query-Driven Temporal Reasoning Framework for Lightweight Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2512.09350 [pdf, ps, other]
Title: TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2512.09335 [pdf, ps, other]
Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video
Comments: 8 pages, 9 figures, published in ACM MM 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[63]  arXiv:2512.09327 [pdf, ps, other]
Title: UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[64]  arXiv:2512.09315 [pdf, ps, other]
Title: Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2512.09311 [pdf, ps, other]
Title: Transformer-Driven Multimodal Fusion for Explainable Suspiciousness Estimation in Visual Surveillance
Comments: 12 pages, 10 figures, IEEE Transaction on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[66]  arXiv:2512.09307 [pdf, ps, other]
Title: From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67]  arXiv:2512.09299 [pdf, ps, other]
Title: VABench: A Comprehensive Benchmark for Audio-Video Generation
Comments: 24 pages, 25 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[68]  arXiv:2512.09296 [pdf, ps, other]
Title: Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving
Authors: Songhan Wu
Comments: 6 pages, 7 figures, 1 table. Accepted to The 2025 IEEE 3rd International Conference on Electrical, Automation and Computer Engineering (ICEACE), 2025. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2512.09289 [pdf, ps, other]
Title: MelanomaNet: Explainable Deep Learning for Skin Lesion Classification
Comments: 7 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2512.09282 [pdf, ps, other]
Title: FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2512.09278 [pdf, ps, other]
Title: LoGoColor: Local-Global 3D Colorization for 360° Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2512.09276 [pdf, ps, other]
Title: Dynamic Facial Expressions Analysis Based Parkinson's Disease Auxiliary Diagnosis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2512.09271 [pdf, ps, other]
Title: LongT2IBench: A Benchmark for Evaluating Long Text-to-Image Generation with Graph-structured Annotations
Comments: The paper has been accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2512.09270 [pdf, ps, other]
Title: MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification
Comments: Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2512.09258 [pdf, ps, other]
Title: ROI-Packing: Efficient Region-Based Compression for Machine Vision
Journal-ref: International Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA, 2025, pp. 233-238
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76]  arXiv:2512.09251 [pdf, ps, other]
Title: GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[77]  arXiv:2512.09247 [pdf, ps, other]
Title: OmniPSD: Layered PSD Generation with Diffusion Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2512.09244 [pdf, ps, other]
Title: A Clinically Interpretable Deep CNN Framework for Early Chronic Kidney Disease Prediction Using Grad-CAM-Based Explainable AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79]  arXiv:2512.09235 [pdf, ps, other]
Title: Efficient Feature Compression for Machines with Global Statistics Preservation
Journal-ref: 2025 IEEE International Symposium on Circuits and Systems (ISCAS), London, United Kingdom, 2025, pp. 1-5
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 754 entries: 1-50 | 30-79 | 80-129 | 130-179 | 180-229 | ... | 730-754 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)