Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 500

[ total of 778 entries: 1-25 | ... | 426-450 | 451-475 | 476-500 | 501-525 | 526-550 | 551-575 | 576-600 | ... | 776-778 ]

Tue, 2 Dec 2025 (showing first 25 of 278 entries)

[501]  arXiv:2512.02018 [pdf, ps, other]
Title: Data-Centric Visual Development for Self-Driving Labs
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[502]  arXiv:2512.02017 [pdf, ps, other]
Title: Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion
Comments: Accepted to NeurIPS 2025. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[503]  arXiv:2512.02016 [pdf, ps, other]
Title: Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504]  arXiv:2512.02015 [pdf, ps, other]
Title: Generative Video Motion Editing with 3D Point Tracks
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[505]  arXiv:2512.02014 [pdf, ps, other]
Title: TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506]  arXiv:2512.02012 [pdf, ps, other]
Title: Improved Mean Flows: On the Challenges of Fastforward Generative Models
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[507]  arXiv:2512.02009 [pdf, ps, other]
Title: AirSim360: A Panoramic Simulation Platform within Drone View
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508]  arXiv:2512.02006 [pdf, ps, other]
Title: MV-TAP: Tracking Any Point in Multi-View Videos
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2512.02005 [pdf, ps, other]
Title: Learning Visual Affordance from Audio
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510]  arXiv:2512.01989 [pdf, ps, other]
Title: PAI-Bench: A Comprehensive Benchmark For Physical AI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511]  arXiv:2512.01988 [pdf, ps, other]
Title: Artemis: Structured Visual Reasoning for Perception Policy Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2512.01975 [pdf, ps, other]
Title: SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning
Comments: Accepted by AAAI-2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513]  arXiv:2512.01960 [pdf, ps, other]
Title: SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[514]  arXiv:2512.01952 [pdf, ps, other]
Title: GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[515]  arXiv:2512.01949 [pdf, ps, other]
Title: Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models
Comments: Published in Transactions on Machine Learning Research, Project in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516]  arXiv:2512.01934 [pdf, ps, other]
Title: Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory
Comments: Accepted to Annual Computer Security Applications Conference (ACSAC) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2512.01922 [pdf, ps, other]
Title: Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding
Journal-ref: Computers in Biology and Medicine (2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[518]  arXiv:2512.01908 [pdf, ps, other]
Title: SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2512.01895 [pdf, ps, other]
Title: StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data
Comments: 15 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520]  arXiv:2512.01889 [pdf, ps, other]
Title: KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[521]  arXiv:2512.01885 [pdf, ps, other]
Title: TransientTrack: Advanced Multi-Object Tracking and Classification of Cancer Cells with Transient Fluorescent Signals
Comments: 13 pages, 7 figures, 2 tables. This work has been submitted to IEEE Transactions on Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cell Behavior (q-bio.CB); Quantitative Methods (q-bio.QM)
[522]  arXiv:2512.01853 [pdf, ps, other]
Title: COACH: Collaborative Agents for Contextual Highlighting -- A Multi-Agent Framework for Sports Video Analysis
Comments: Accepted by AAAI 2026 Workshop LaMAS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523]  arXiv:2512.01850 [pdf, ps, other]
Title: Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524]  arXiv:2512.01843 [pdf, ps, other]
Title: PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models
Comments: 17 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2512.01830 [pdf, ps, other]
Title: OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic
Subjects: Computer Vision and Pattern Recognition (cs.CV)
