Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 486

[ total of 778 entries: 1-25 | ... | 412-436 | 437-461 | 462-486 | 487-511 | 512-536 | 537-561 | 562-586 | ... | 762-778 ]

Wed, 3 Dec 2025 (continued, showing last 14 of 141 entries)

[487]  arXiv:2512.02920 (cross-list from cs.LG) [pdf, ps, other]
Title: Learning Multimodal Embeddings for Traffic Accident Prediction and Causal Estimation
Comments: 17 pages. To appear in KDD'26 Datasets
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[488]  arXiv:2512.02787 (cross-list from cs.RO) [pdf, ps, other]
Title: Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[489]  arXiv:2512.02719 (cross-list from cs.CL) [pdf, ps, other]
Title: Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[490]  arXiv:2512.02651 (cross-list from cs.HC) [pdf, ps, other]
Title: Real-Time Multimodal Data Collection Using Smartwatches and Its Visualization in Education
Comments: Accepted in Technological Ecosystems for Enhancing Multiculturality (TEEM) 2025
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
[491]  arXiv:2512.02636 (cross-list from cs.LG) [pdf, ps, other]
Title: Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[492]  arXiv:2512.02609 (cross-list from cs.RO) [pdf, ps, other]
Title: SAM2Grasp: Resolve Multi-modal Grasping via Prompt-conditioned Temporal Action Prediction
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[493]  arXiv:2512.02340 (cross-list from cs.AI) [pdf, ps, other]
Title: Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective
Comments: 23 pages, 37 figures
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[494]  arXiv:2512.02306 (cross-list from cs.AI) [pdf, ps, other]
Title: OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[495]  arXiv:2512.02293 (cross-list from cs.RO) [pdf, ps, other]
Title: VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[496]  arXiv:2512.02280 (cross-list from cs.AI) [pdf, ps, other]
Title: Bridging the Gap: Toward Cognitive Autonomy in Artificial Intelligence
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[497]  arXiv:2512.02243 (cross-list from cs.CR) [pdf, ps, other]
Title: PhishSnap: Image-Based Phishing Detection Using Perceptual Hashing
Comments: IEEE standard formatting, 3 pages, 3 figures
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[498]  arXiv:2512.02143 (cross-list from cs.GR) [pdf, ps, other]
Title: CoatFusion: Controllable Material Coating in Images
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[499]  arXiv:2512.02088 (cross-list from eess.IV) [pdf, ps, other]
Title: Comparing Baseline and Day-1 Diffusion MRI Using Multimodal Deep Embeddings for Stroke Outcome Prediction
Comments: 5 pages, 5 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[500]  arXiv:2512.02062 (cross-list from cs.CR) [pdf, ps, other]
Title: Superpixel Attack: Enhancing Black-box Adversarial Attack with Image-driven Division Areas
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Tue, 2 Dec 2025 (showing first 11 of 278 entries)

[501]  arXiv:2512.02018 [pdf, ps, other]
Title: Data-Centric Visual Development for Self-Driving Labs
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[502]  arXiv:2512.02017 [pdf, ps, other]
Title: Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion
Comments: Accepted to NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[503]  arXiv:2512.02016 [pdf, ps, other]
Title: Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504]  arXiv:2512.02015 [pdf, ps, other]
Title: Generative Video Motion Editing with 3D Point Tracks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[505]  arXiv:2512.02014 [pdf, ps, other]
Title: TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506]  arXiv:2512.02012 [pdf, ps, other]
Title: Improved Mean Flows: On the Challenges of Fastforward Generative Models
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[507]  arXiv:2512.02009 [pdf, ps, other]
Title: AirSim360: A Panoramic Simulation Platform within Drone View
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508]  arXiv:2512.02006 [pdf, ps, other]
Title: MV-TAP: Tracking Any Point in Multi-View Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2512.02005 [pdf, ps, other]
Title: Learning Visual Affordance from Audio
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510]  arXiv:2512.01989 [pdf, ps, other]
Title: PAI-Bench: A Comprehensive Benchmark For Physical AI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511]  arXiv:2512.01988 [pdf, ps, other]
Title: Artemis: Structured Visual Reasoning for Perception Policy Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)