Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 580

[ total of 603 entries: 1-50 | ... | 431-480 | 481-530 | 531-580 | 581-603 ]

Wed, 24 Dec 2025 (continued, showing last 23 of 86 entries)

[581]  arXiv:2512.19928 [pdf, ps, other]
Title: Unified Brain Surface and Volume Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[582]  arXiv:2512.19918 [pdf, ps, other]
Title: Widget2Code: From Visual Widgets to UI Code via Multimodal LLMs
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583]  arXiv:2512.19871 [pdf, ps, other]
Title: HyGE-Occ: Hybrid View-Transformation with 3D Gaussian and Edge Priors for 3D Panoptic Occupancy Prediction
Comments: 11 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[584]  arXiv:2512.19850 [pdf, ps, other]
Title: RANSAC Scoring Functions: Analysis and Reality Check
Authors: A. Shekhovtsov
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[585]  arXiv:2512.19823 [pdf, ps, other]
Title: Learning to Refocus with Video Diffusion Models
Comments: Code and data are available at this https URL . SIGGRAPH Asia 2025, Dec. 2025
Journal-ref: Proceedings of the SIGGRAPH Asia 2025, pp. 1-11, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[586]  arXiv:2512.19817 [pdf, ps, other]
Title: Generating the Past, Present and Future from a Motion-Blurred Image
Comments: Code and data are available at this https URL
Journal-ref: ACM Trans. Graph. (SIGGRAPH Asia 2025), vol. 44, no. 6, pp. 1-15, Dec. 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[587]  arXiv:2512.19711 [pdf, ps, other]
Title: PHANTOM: PHysical ANamorphic Threats Obstructing Connected Vehicle Mobility
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[588]  arXiv:2512.20618 (cross-list from cs.AI) [pdf, ps, other]
Title: LongVideoAgent: Multi-Agent Reasoning with Long Videos
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[589]  arXiv:2512.20595 (cross-list from cs.CL) [pdf, ps, other]
Title: Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs
Comments: 27 pages, 5 figures, 9 tables. Cube available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[590]  arXiv:2512.20464 (cross-list from physics.optics) [pdf, ps, other]
Title: Snapshot 3D image projection using a diffractive decoder
Comments: 22 Pages, 8 Figures
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
[591]  arXiv:2512.20436 (cross-list from eess.IV) [pdf, ps, other]
Title: Dual-Encoder Transformer-Based Multimodal Learning for Ischemic Stroke Lesion Segmentation Using Diffusion MRI
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[592]  arXiv:2512.20420 (cross-list from cs.LG) [pdf, ps, other]
Title: Simplifying Multi-Task Architectures Through Task-Specific Normalization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[593]  arXiv:2512.20387 (cross-list from cs.AI) [pdf, ps, other]
Title: Generative Digital Twins: Vision-Language Simulation Models for Executable Industrial Systems
Comments: 10 pages, 9 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[594]  arXiv:2512.20374 (cross-list from eess.IV) [pdf, ps, other]
Title: CLIP Based Region-Aware Feature Fusion for Automated BBPS Scoring in Colonoscopy Images
Comments: 12 pages, 9 figures, BMVC 2025 submission
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[595]  arXiv:2512.20350 (cross-list from cs.LG) [pdf, ps, other]
Title: Field-Space Attention for Structure-Preserving Earth System Transformers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Mathematical Physics (math-ph)
[596]  arXiv:2512.20299 (cross-list from cs.RO) [pdf, ps, other]
Title: KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[597]  arXiv:2512.20249 (cross-list from cs.LG) [pdf, ps, other]
Title: Unified Multimodal Brain Decoding via Cross-Subject Soft-ROI Fusion
Authors: Xuanyu Hu
Comments: 15 pages, 2 figures, 4 tables. Submitted to ICPR 2026
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[598]  arXiv:2512.20233 (cross-list from cs.LG) [pdf, ps, other]
Title: How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[599]  arXiv:2512.20145 (cross-list from cs.CL) [pdf, ps, other]
Title: Retrieval-augmented Prompt Learning for Pre-trained Foundation Models
Comments: IEEE/ACM Transactions on Audio, Speech and Language Processing
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[600]  arXiv:2512.20129 (cross-list from cs.HC) [pdf, ps, other]
Title: Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs
Comments: CHI 2025, Project page: this https URL
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[601]  arXiv:2512.20056 (cross-list from cs.AI) [pdf, ps, other]
Title: Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[602]  arXiv:2512.19731 (cross-list from cs.LG) [pdf, ps, other]
Title: Exploring Deep-to-Shallow Transformable Neural Networks for Intelligent Embedded Systems
Comments: Accepted by IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[603]  arXiv:2512.18099 (cross-list from eess.AS) [pdf, ps, other]
Title: SAM Audio: Segment Anything in Audio
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV)