We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 111

[ total of 732 entries: 1-25 | ... | 37-61 | 62-86 | 87-111 | 112-136 | 137-161 | 162-186 | 187-211 | ... | 712-732 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 16 Dec 2025 (continued, showing 25 of 244 entries)

[112]  arXiv:2512.12664 [pdf, ps, other]
Title: InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2512.12662 [pdf, ps, other]
Title: Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[114]  arXiv:2512.12658 [pdf, ps, other]
Title: CogDoc: Towards Unified thinking in Documents
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.12657 [pdf, ps, other]
Title: Cross-modal Fundus Image Registration under Large FoV Disparity
Comments: Accepted as a regular paper at MMM 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.12633 [pdf, ps, other]
Title: DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[117]  arXiv:2512.12623 [pdf, ps, other]
Title: Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[118]  arXiv:2512.12622 [pdf, ps, other]
Title: D3D-VLP: Dynamic 3D Vision-Language-Planning Model for Embodied Grounding and Navigation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[119]  arXiv:2512.12610 [pdf, ps, other]
Title: Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[120]  arXiv:2512.12604 [pdf, ps, other]
Title: No Cache Left Idle: Accelerating diffusion model via Extreme-slimming Caching
Comments: Project page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2512.12598 [pdf, ps, other]
Title: Geometry-Aware Scene-Consistent Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.12596 [pdf, ps, other]
Title: Content-Aware Ad Banner Layout Generation with Two-Stage Chain-of-Thought in Vision Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[123]  arXiv:2512.12595 [pdf, ps, other]
Title: Vision-Enhanced Large Language Models for High-Resolution Image Synthesis and Multimodal Data Interpretation
Authors: Karthikeya KV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.12590 [pdf, ps, other]
Title: Automatic Wire-Harness Color Sequence Detector
Comments: 6 pages, 20 figures, IEEE ICIIS 2025 Conference - Accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[125]  arXiv:2512.12586 [pdf, ps, other]
Title: StegaVAR: Privacy-Preserving Video Action Recognition via Steganographic Domain Analysis
Comments: 13 pages, 10 figures. This is the extended version of the paper accepted at AAAI 2026, including related works and appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.12571 [pdf, ps, other]
Title: From Tokens to Photons: Test-Time Physical Prompting for Vison-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.12560 [pdf, ps, other]
Title: StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[128]  arXiv:2512.12549 [pdf, ps, other]
Title: Supervised Contrastive Frame Aggregation for Video Representation Learning
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[129]  arXiv:2512.12539 [pdf, ps, other]
Title: Anatomy Guided Coronary Artery Segmentation from CCTA Using Spatial Frequency Joint Modeling
Comments: 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2512.12534 [pdf, ps, other]
Title: Animus3D: Text-driven 3D Animation via Motion Score Distillation
Comments: SIGGRAPH Asia 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[131]  arXiv:2512.12508 [pdf, ps, other]
Title: Generative Spatiotemporal Data Augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[132]  arXiv:2512.12498 [pdf, ps, other]
Title: Advancing Cache-Based Few-Shot Classification via Patch-Driven Relational Gated Graph Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.12492 [pdf, ps, other]
Title: Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[134]  arXiv:2512.12487 [pdf, ps, other]
Title: More Than the Final Answer: Improving Visual Extraction and Logical Consistency in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2512.12459 [pdf, ps, other]
Title: From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[136]  arXiv:2512.12430 [pdf, ps, other]
Title: Endless World: Real-Time 3D-Aware Long Video Generation
Comments: 10 pages,7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 732 entries: 1-25 | ... | 37-61 | 62-86 | 87-111 | 112-136 | 137-161 | 162-186 | 187-211 | ... | 712-732 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)