Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 625

[ total of 778 entries: 1-50 | ... | 476-525 | 526-575 | 576-625 | 626-675 | 676-725 | 726-775 | 776-778 ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[626]  arXiv:2512.00904 [pdf, ps, other]
Title: Hierarchical Semantic Alignment for Image Clustering
Comments: AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[627]  arXiv:2512.00903 [pdf, ps, other]
Title: SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[628]  arXiv:2512.00891 [pdf, ps, other]
Title: Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
Comments: Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629]  arXiv:2512.00887 [pdf, ps, other]
Title: Multilingual Training-Free Remote Sensing Image Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[630]  arXiv:2512.00885 [pdf, ps, other]
Title: HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631]  arXiv:2512.00882 [pdf, ps, other]
Title: Look, Recite, Then Answer: Enhancing VLM Performance via Self-Generated Knowledge Hints
Authors: Xisheng Feng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[632]  arXiv:2512.00880 [pdf, ps, other]
Title: Quantum-Inspired Spectral Geometry for Neural Operator Equivalence and Structured Pruning
Comments: 6 pages, 1 figure, preliminary version; concepts and simulation experiments only
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[633]  arXiv:2512.00877 [pdf, ps, other]
Title: Feed-Forward 3D Gaussian Splatting Compression with Long-Context Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634]  arXiv:2512.00873 [pdf, ps, other]
Title: Neural Discrete Representation Learning for Sparse-View CBCT Reconstruction: From Algorithm Design to Prospective Multicenter Clinical Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[635]  arXiv:2512.00872 [pdf, ps, other]
Title: TAP-CT: 3D Task-Agnostic Pretraining of Computed Tomography Foundation Models
Comments: 22 pages, 4 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[636]  arXiv:2512.00850 [pdf, ps, other]
Title: Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637]  arXiv:2512.00846 [pdf, ps, other]
Title: AFRAgent: An Adaptive Feature Renormalization Based High Resolution Aware GUI agent
Comments: Accepted at WACV 2026 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638]  arXiv:2512.00832 [pdf, ps, other]
Title: PanFlow: Decoupled Motion Control for Panoramic Video Generation
Comments: Accepted by AAAI. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639]  arXiv:2512.00814 [pdf, ps, other]
Title: IRPO: Boosting Image Restoration via Post-training GRPO
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[640]  arXiv:2512.00805 [pdf, ps, other]
Title: Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[641]  arXiv:2512.00796 [pdf, ps, other]
Title: CircleFlow: Flow-Guided Camera Blur Estimation using a Circle Grid Target
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2512.00794 [pdf, ps, other]
Title: PolarGS: Polarimetric Cues for Ambiguity-Free Gaussian Splatting with Accurate Geometry Recovery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2512.00773 [pdf, ps, other]
Title: DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644]  arXiv:2512.00771 [pdf, ps, other]
Title: EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Comments: Accepted at NeurIPS 2025 (spotlight)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[645]  arXiv:2512.00765 [pdf, ps, other]
Title: The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[646]  arXiv:2512.00762 [pdf, ps, other]
Title: Seeing the Wind from a Falling Leaf
Comments: Accepted at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647]  arXiv:2512.00752 [pdf, ps, other]
Title: Charts Are Not Images: On the Challenges of Scientific Chart Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648]  arXiv:2512.00748 [pdf, ps, other]
Title: Probabilistic Modeling of Multi-rater Medical Image Segmentation for Diversity and Personalization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[649]  arXiv:2512.00744 [pdf, ps, other]
Title: Joint Multi-scale Gated Transformer and Prior-guided Convolutional Network for Learned Image Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[650]  arXiv:2512.00743 [pdf, ps, other]
Title: Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Comments: 20 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[651]  arXiv:2512.00723 [pdf, ps, other]
Title: TrajDiff: End-to-end Autonomous Driving without Perception Annotation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[652]  arXiv:2512.00718 [pdf, ps, other]
Title: RS-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[653]  arXiv:2512.00714 [pdf, ps, other]
Title: Deep Learning-Based Computer Vision Models for Early Cancer Detection Using Multimodal Medical Imaging and Radiogenomic Integration Frameworks
Journal-ref: International Journal of Computer Applications Technology and Research, vol. 14, no. 11, pp. 1-14, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[654]  arXiv:2512.00706 [pdf, ps, other]
Title: Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[655]  arXiv:2512.00700 [pdf, ps, other]
Title: CAR-Net: A Cascade Refinement Network for Rotational Motion Deblurring under Angle Information Uncertainty
Comments: Accepted to AAIML 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656]  arXiv:2512.00694 [pdf, ps, other]
Title: Affordance-First Decomposition for Continual Learning in Video-Language Understanding
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657]  arXiv:2512.00691 [pdf, ps, other]
Title: Silhouette-based Gait Foundation Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[658]  arXiv:2512.00677 [pdf, ps, other]
Title: Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Comments: 4D Scene Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[659]  arXiv:2512.00676 [pdf, ps, other]
Title: Realistic Handwritten Multi-Digit Writer (MDW) Number Recognition Challenges
Authors: Kiri L. Wagstaff
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[660]  arXiv:2512.00647 [pdf, ps, other]
Title: MambaScope: Coarse-to-Fine Scoping for Efficient Vision Mamba
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[661]  arXiv:2512.00641 [pdf, ps, other]
Title: Graph-Attention Network with Adversarial Domain Alignment for Robust Cross-Domain Facial Expression Recognition
Comments: 17 pages, 5 figures. Accepted at the 17th Asian Conference on Machine Learning (ACML 2025), Taipei, Taiwan, December 9-12, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[662]  arXiv:2512.00639 [pdf, ps, other]
Title: Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Performance (cs.PF)
[663]  arXiv:2512.00626 [pdf, ps, other]
Title: XAI-Driven Skin Disease Classification: Leveraging GANs to Augment ResNet-50 Performance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[664]  arXiv:2512.00625 [pdf, ps, other]
Title: Automatic Pith Detection in Tree Cross-Section Images Using Deep Learning
Comments: 8 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[665]  arXiv:2512.00597 [pdf, ps, other]
Title: Scaling Down to Scale Up: Towards Operationally-Efficient and Deployable Clinical Models via Cross-Modal Low-Rank Adaptation for Medical Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666]  arXiv:2512.00582 [pdf, ps, other]
Title: SatireDecoder: Visual Cascaded Decoupling for Enhancing Satirical Image Comprehension
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667]  arXiv:2512.00572 [pdf, ps, other]
Title: Integrating Skeleton Based Representations for Robust Yoga Pose Classification Using Deep Learning Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[668]  arXiv:2512.00565 [pdf, ps, other]
Title: Describe Anything Anywhere At Any Moment
Comments: 14 pages, 5 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[669]  arXiv:2512.00557 [pdf, ps, other]
Title: NeuroVolve: Evolving Visual Stimuli toward Programmable Neural Objectives
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670]  arXiv:2512.00547 [pdf, ps, other]
Title: Asset-Driven Sematic Reconstruction of Dynamic Scene with Multi-Human-Object Interactions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671]  arXiv:2512.00539 [pdf, ps, other]
Title: SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning
Comments: 17 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[672]  arXiv:2512.00534 [pdf, ps, other]
Title: Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update
Comments: Accepted at AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[673]  arXiv:2512.00532 [pdf, ps, other]
Title: Image Generation as a Visual Planner for Robotic Manipulation
Authors: Ye Pang
Comments: 11 pages, 9 figures. Under review at CVPR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[674]  arXiv:2512.00514 [pdf, ps, other]
Title: Terrain Sensing with Smartphone Structured Light: 2D Dynamic Time Warping for Grid Pattern Matching
Authors: Tanaka Nobuaki
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[675]  arXiv:2512.00493 [pdf, ps, other]
Title: CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration
Subjects: Computer Vision and Pattern Recognition (cs.CV)