We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 506

[ total of 778 entries: 1-25 | ... | 432-456 | 457-481 | 482-506 | 507-531 | 532-556 | 557-581 | 582-606 | ... | 757-778 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 25 of 278 entries)

[507]  arXiv:2512.02009 [pdf, ps, other]
Title: AirSim360: A Panoramic Simulation Platform within Drone View
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508]  arXiv:2512.02006 [pdf, ps, other]
Title: MV-TAP: Tracking Any Point in Multi-View Videos
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2512.02005 [pdf, ps, other]
Title: Learning Visual Affordance from Audio
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510]  arXiv:2512.01989 [pdf, ps, other]
Title: PAI-Bench: A Comprehensive Benchmark For Physical AI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511]  arXiv:2512.01988 [pdf, ps, other]
Title: Artemis: Structured Visual Reasoning for Perception Policy Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2512.01975 [pdf, ps, other]
Title: SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning
Comments: Accept by AAAI-2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513]  arXiv:2512.01960 [pdf, ps, other]
Title: SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[514]  arXiv:2512.01952 [pdf, ps, other]
Title: GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[515]  arXiv:2512.01949 [pdf, ps, other]
Title: Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models
Comments: Published in Transactions on Machine Learning Research, Project in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516]  arXiv:2512.01934 [pdf, ps, other]
Title: Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory
Comments: Accepted to Annual Computer Security Applications Conference (ACSAC) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2512.01922 [pdf, ps, other]
Title: Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding
Journal-ref: Computers in Biology and Medicine (2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[518]  arXiv:2512.01908 [pdf, ps, other]
Title: SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2512.01895 [pdf, ps, other]
Title: StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data
Comments: 15 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520]  arXiv:2512.01889 [pdf, ps, other]
Title: KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[521]  arXiv:2512.01885 [pdf, ps, other]
Title: TransientTrack: Advanced Multi-Object Tracking and Classification of Cancer Cells with Transient Fluorescent Signals
Comments: 13 pages, 7 figures, 2 tables. This work has been submitted to IEEE Transactions on Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cell Behavior (q-bio.CB); Quantitative Methods (q-bio.QM)
[522]  arXiv:2512.01853 [pdf, ps, other]
Title: COACH: Collaborative Agents for Contextual Highlighting -- A Multi-Agent Framework for Sports Video Analysis
Comments: Accepted by AAAI 2026 Workshop LaMAS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523]  arXiv:2512.01850 [pdf, ps, other]
Title: Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524]  arXiv:2512.01843 [pdf, ps, other]
Title: PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models
Comments: 17 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2512.01830 [pdf, ps, other]
Title: OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[526]  arXiv:2512.01827 [pdf, ps, other]
Title: CauSight: Learning to Supersense for Visual Causal Discovery
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[527]  arXiv:2512.01821 [pdf, ps, other]
Title: Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[528]  arXiv:2512.01816 [pdf, ps, other]
Title: Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights
Comments: 35 pages, 12 figures, 10 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[529]  arXiv:2512.01803 [pdf, ps, other]
Title: Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530]  arXiv:2512.01789 [pdf, ps, other]
Title: SAM3-UNet: Simplified Adaptation of Segment Anything Model 3
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[531]  arXiv:2512.01788 [pdf, ps, other]
Title: Learned Image Compression for Earth Observation: Implications for Downstream Segmentation Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-25 | ... | 432-456 | 457-481 | 482-506 | 507-531 | 532-556 | 557-581 | 582-606 | ... | 757-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)