Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 496

[ total of 778 entries: 1-25 | ... | 422-446 | 447-471 | 472-496 | 497-521 | 522-546 | 547-571 | 572-596 | ... | 772-778 ]

Wed, 3 Dec 2025 (continued, showing last 4 of 141 entries)

[497]  arXiv:2512.02243 (cross-list from cs.CR) [pdf, ps, other]
Title: PhishSnap: Image-Based Phishing Detection Using Perceptual Hashing
Comments: IEEE Standard Formatting, 3 pages, 3 figures
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[498]  arXiv:2512.02143 (cross-list from cs.GR) [pdf, ps, other]
Title: CoatFusion: Controllable Material Coating in Images
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[499]  arXiv:2512.02088 (cross-list from eess.IV) [pdf, ps, other]
Title: Comparing Baseline and Day-1 Diffusion MRI Using Multimodal Deep Embeddings for Stroke Outcome Prediction
Comments: 5 pages, 5 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[500]  arXiv:2512.02062 (cross-list from cs.CR) [pdf, ps, other]
Title: Superpixel Attack: Enhancing Black-box Adversarial Attack with Image-driven Division Areas
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Tue, 2 Dec 2025 (showing first 21 of 278 entries)

[501]  arXiv:2512.02018 [pdf, ps, other]
Title: Data-Centric Visual Development for Self-Driving Labs
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[502]  arXiv:2512.02017 [pdf, ps, other]
Title: Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion
Comments: Accepted to NeurIPS 2025. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[503]  arXiv:2512.02016 [pdf, ps, other]
Title: Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504]  arXiv:2512.02015 [pdf, ps, other]
Title: Generative Video Motion Editing with 3D Point Tracks
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[505]  arXiv:2512.02014 [pdf, ps, other]
Title: TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506]  arXiv:2512.02012 [pdf, ps, other]
Title: Improved Mean Flows: On the Challenges of Fastforward Generative Models
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[507]  arXiv:2512.02009 [pdf, ps, other]
Title: AirSim360: A Panoramic Simulation Platform within Drone View
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508]  arXiv:2512.02006 [pdf, ps, other]
Title: MV-TAP: Tracking Any Point in Multi-View Videos
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2512.02005 [pdf, ps, other]
Title: Learning Visual Affordance from Audio
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510]  arXiv:2512.01989 [pdf, ps, other]
Title: PAI-Bench: A Comprehensive Benchmark For Physical AI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511]  arXiv:2512.01988 [pdf, ps, other]
Title: Artemis: Structured Visual Reasoning for Perception Policy Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2512.01975 [pdf, ps, other]
Title: SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning
Comments: Accepted by AAAI-2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513]  arXiv:2512.01960 [pdf, ps, other]
Title: SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[514]  arXiv:2512.01952 [pdf, ps, other]
Title: GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[515]  arXiv:2512.01949 [pdf, ps, other]
Title: Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models
Comments: Published in Transactions on Machine Learning Research, Project in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516]  arXiv:2512.01934 [pdf, ps, other]
Title: Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory
Comments: Accepted to Annual Computer Security Applications Conference (ACSAC) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2512.01922 [pdf, ps, other]
Title: Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding
Journal-ref: Computers in Biology and Medicine (2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[518]  arXiv:2512.01908 [pdf, ps, other]
Title: SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2512.01895 [pdf, ps, other]
Title: StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data
Comments: 15 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520]  arXiv:2512.01889 [pdf, ps, other]
Title: KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[521]  arXiv:2512.01885 [pdf, ps, other]
Title: TransientTrack: Advanced Multi-Object Tracking and Classification of Cancer Cells with Transient Fluorescent Signals
Comments: 13 pages, 7 figures, 2 tables. This work has been submitted to IEEE Transactions on Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cell Behavior (q-bio.CB); Quantitative Methods (q-bio.QM)