We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 684

[ total of 925 entries: 1-10 | ... | 655-664 | 665-674 | 675-684 | 685-694 | 695-704 | 705-714 | 715-724 | ... | 925 ]
[ showing 10 entries per page: fewer | more | all ]

Mon, 1 Dec 2025 (showing first 10 of 241 entries)

[685]  arXiv:2511.23478 [pdf, ps, other]
Title: Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
Comments: Video-R2 Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686]  arXiv:2511.23477 [pdf, ps, other]
Title: Video-CoM: Interactive Video Reasoning via Chain of Manipulations
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[687]  arXiv:2511.23475 [pdf, ps, other]
Title: AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement
Comments: Homepage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[688]  arXiv:2511.23469 [pdf, ps, other]
Title: Visual Generation Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689]  arXiv:2511.23450 [pdf, ps, other]
Title: Object-Centric Data Synthesis for Category-level Object Detection
Comments: 10 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690]  arXiv:2511.23429 [pdf, ps, other]
Title: Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model
Comments: Technical Report, Project page:this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691]  arXiv:2511.23428 [pdf, ps, other]
Title: DisMo: Disentangled Motion Representations for Open-World Motion Transfer
Comments: Accepted at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692]  arXiv:2511.23405 [pdf, ps, other]
Title: MANTA: Physics-Informed Generalized Underwater Object Tracking
Comments: Accepted to the IEEE/CVF WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[693]  arXiv:2511.23386 [pdf, ps, other]
Title: VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
Comments: 19 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[694]  arXiv:2511.23377 [pdf, ps, other]
Title: DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline
Comments: 13pages,12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 925 entries: 1-10 | ... | 655-664 | 665-674 | 675-684 | 685-694 | 695-704 | 705-714 | 715-724 | ... | 925 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)