We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

[ total of 1054 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 1051-1054 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 20 Mar 2026 (showing first 25 of 147 entries)

[1]  arXiv:2603.19235 [pdf, ps, other]
Title: Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
Comments: 31 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[2]  arXiv:2603.19234 [pdf, ps, other]
Title: Matryoshka Gaussian Splatting
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[3]  arXiv:2603.19232 [pdf, ps, other]
Title: Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
Comments: Accepted by CVPR 2026 main track; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2603.19231 [pdf, ps, other]
Title: MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2603.19228 [pdf, ps, other]
Title: SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
Comments: 24 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2603.19227 [pdf, ps, other]
Title: Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
Comments: Project Page: this https URL GitHub: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2603.19226 [pdf, ps, other]
Title: Under One Sun: Multi-Object Generative Perception of Materials and Illumination
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2603.19224 [pdf, ps, other]
Title: EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing
Comments: CVPR 2026, Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2603.19222 [pdf, ps, other]
Title: Spectrally-Guided Diffusion Noise Schedules
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[10]  arXiv:2603.19219 [pdf, ps, other]
Title: DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding
Comments: Project Page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[11]  arXiv:2603.19218 [pdf, ps, other]
Title: Rethinking Vector Field Learning for Generative Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2603.19217 [pdf, ps, other]
Title: LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2603.19216 [pdf, ps, other]
Title: DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[14]  arXiv:2603.19209 [pdf, ps, other]
Title: Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
Comments: Project page: this https URL ; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[15]  arXiv:2603.19206 [pdf, ps, other]
Title: RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2603.19203 [pdf, ps, other]
Title: Tinted Frames: Question Framing Blinds Vision-Language Models
Comments: Preprint. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17]  arXiv:2603.19193 [pdf, ps, other]
Title: Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2603.19169 [pdf, ps, other]
Title: ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis
Comments: 28 pages, 5 figures . arXiv:submit/7385738 [cs.AI]
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[19]  arXiv:2603.19158 [pdf, ps, other]
Title: Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation
Comments: Accepted in CVPR 2026 (main track). 10 pages, 6 figures; supplementary material included (14 pages, 11 figures)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[20]  arXiv:2603.19157 [pdf, ps, other]
Title: ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation
Comments: Accepted in CVPR 2026 (findings). 10 pages, 4 figures; supplementary material included (8 pages, 10 figures)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21]  arXiv:2603.19137 [pdf, ps, other]
Title: GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[22]  arXiv:2603.19122 [pdf, ps, other]
Title: Revisiting Autoregressive Models for Generative Image Classification
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2603.19121 [pdf, ps, other]
Title: CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization
Comments: Accepted to CVPR 2026. This version integrates the main paper and supplementary material
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[24]  arXiv:2603.19098 [pdf, ps, other]
Title: TAU-R1: Visual Language Model for Traffic Anomaly Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2603.19092 [pdf, ps, other]
Title: SAVeS: Steering Safety Judgments in Vision-Language Models via Semantic Cues
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[ total of 1054 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 1051-1054 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2603, contact, help  (Access key information)