Computer Science

Authors and titles for recent submissions

[ total of 3959 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 3951-3959 ]

Fri, 20 Mar 2026 (showing first 25 of 677 entries)

[1]  arXiv:2603.19235 [pdf, ps, other]
Title: Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
Comments: 31 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[2]  arXiv:2603.19234 [pdf, ps, other]
Title: Matryoshka Gaussian Splatting
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[3]  arXiv:2603.19233 [pdf, ps, other]
Title: Not All Features Are Created Equal: A Mechanistic Study of Vision-Language-Action Models
Comments: Accepted to Multimodal Intelligence Workshop @ ICLR
Subjects: Robotics (cs.RO)
[4]  arXiv:2603.19232 [pdf, ps, other]
Title: Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
Comments: Accepted by CVPR 2026 main track; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2603.19231 [pdf, ps, other]
Title: MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2603.19229 [pdf, ps, other]
Title: NavTrust: Benchmarking Trustworthiness for Embodied Navigation
Comments: Project Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[7]  arXiv:2603.19228 [pdf, ps, other]
Title: SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
Comments: 24 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2603.19227 [pdf, ps, other]
Title: Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
Comments: Project Page: this https URL GitHub: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2603.19226 [pdf, ps, other]
Title: Under One Sun: Multi-Object Generative Perception of Materials and Illumination
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2603.19225 [pdf, ps, other]
Title: FinTradeBench: A Financial Reasoning Benchmark for LLMs
Authors: Yogesh Agrawal, Aniruddha Dutta, Md Mahadi Hasan, Santu Karmaker, Aritra Dutta (University of Central Florida)
Comments: 8 pages main text, 22 pages total (including references and appendix). 5 figures, 14 tables. Preprint under review. Code and data will be made available upon publication
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Computational Finance (q-fin.CP)
[11]  arXiv:2603.19224 [pdf, ps, other]
Title: EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing
Comments: CVPR 2026, Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2603.19223 [pdf, ps, other]
Title: F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[13]  arXiv:2603.19222 [pdf, ps, other]
Title: Spectrally-Guided Diffusion Noise Schedules
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[14]  arXiv:2603.19221 [pdf, ps, other]
Title: Online Learning and Equilibrium Computation with Ranking Feedback
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
[15]  arXiv:2603.19220 [pdf, ps, other]
Title: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Comments: We release the model and data at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[16]  arXiv:2603.19219 [pdf, ps, other]
Title: DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding
Comments: Project Page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[17]  arXiv:2603.19218 [pdf, ps, other]
Title: Rethinking Vector Field Learning for Generative Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2603.19217 [pdf, ps, other]
Title: LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[19]  arXiv:2603.19216 [pdf, ps, other]
Title: DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[20]  arXiv:2603.19213 [pdf, ps, other]
Title: Constitutive vs. Corrective: A Causal Taxonomy of Human Runtime Involvement in AI Systems
Subjects: Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[21]  arXiv:2603.19209 [pdf, ps, other]
Title: Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
Comments: Project page: this https URL ; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[22]  arXiv:2603.19206 [pdf, ps, other]
Title: RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2603.19204 [pdf, ps, other]
Title: Robustness, Cost, and Attack-Surface Concentration in Phishing Detection
Comments: 14 pages, 4 figures, 9 tables
Subjects: Machine Learning (cs.LG)
[24]  arXiv:2603.19203 [pdf, ps, other]
Title: Tinted Frames: Question Framing Blinds Vision-Language Models
Comments: Preprint. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2603.19201 [pdf, ps, other]
Title: OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation
Comments: TARS Robotics Project Page: this https URL
Subjects: Robotics (cs.RO)