Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 9

[ total of 749 entries: 1-25 | 10-34 | 35-59 | 60-84 | 85-109 | ... | 735-749 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 10 Dec 2025 (continued, showing 25 of 131 entries)

[10]  arXiv:2512.08881 [pdf, ps, other]
Title: SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11]  arXiv:2512.08873 [pdf, ps, other]
Title: Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning
Comments: 6 pages
Journal-ref: 2024 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[12]  arXiv:2512.08860 [pdf, ps, other]
Title: Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference
Authors: Amit Bendkhale
Comments: 6 pages, 3 figures. Code and data: this https URL. Accepted to the AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2512.08854 [pdf, ps, other]
Title: Generation is Required for Data-Efficient Perception
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[14]  arXiv:2512.08829 [pdf, ps, other]
Title: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
Comments: 16 pages, 8 figures, conference or other essential info
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[15]  arXiv:2512.08820 [pdf, ps, other]
Title: Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning
Comments: Accepted in IEEE Transactions on Multimedia (TMM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[16]  arXiv:2512.08789 [pdf, ps, other]
Title: MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
Comments: 10 pages, 7 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[17]  arXiv:2512.08785 [pdf, ps, other]
Title: LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2512.08774 [pdf, ps, other]
Title: Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps
Comments: 10 pages, 9 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[19]  arXiv:2512.08765 [pdf, ps, other]
Title: Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Comments: NeurIPS 2025. Code and data available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[20]  arXiv:2512.08751 [pdf, ps, other]
Title: Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[21]  arXiv:2512.08747 [pdf, ps, other]
Title: A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation
Comments: 20 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22]  arXiv:2512.08738 [pdf, ps, other]
Title: Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture
Comments: To appear at AACL-IJCNLP 2025 Workshop WSLP
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[23]  arXiv:2512.08733 [pdf, ps, other]
Title: Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[24]  arXiv:2512.08730 [pdf, ps, other]
Title: SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2512.08700 [pdf, ps, other]
Title: Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth
Comments: Accepted at IEEE Robotics and Automation Letters (RA-L) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2512.08697 [pdf, ps, other]
Title: What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[27]  arXiv:2512.08673 [pdf, ps, other]
Title: Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds
Comments: 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28]  arXiv:2512.08648 [pdf, ps, other]
Title: Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Comments: 19 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29]  arXiv:2512.08647 [pdf, ps, other]
Title: C-DIRA: Computationally Efficient Dynamic ROI Routing and Domain-Invariant Adversarial Learning for Lightweight Driver Behavior Recognition
Authors: Keito Inoshita
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[30]  arXiv:2512.08645 [pdf, ps, other]
Title: Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation
Comments: 19 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2512.08639 [pdf, ps, other]
Title: Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning
Comments: Under Review, 12 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[32]  arXiv:2512.08627 [pdf, ps, other]
Title: Trajectory Densification and Depth from Perspective-based Blur
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2512.08625 [pdf, ps, other]
Title: OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics
Comments: 8 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2512.08606 [pdf, ps, other]
Title: Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning
Comments: 14 pages, 8 figures, Association for the Advancement of Artificial Intelligence (AAAI2026, poster)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
