We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

[ total of 1061 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 1051-1061 ]
[ showing 25 entries per page: fewer | more | all ]

Mon, 18 May 2026 (showing first 25 of 134 entries)

[1]  arXiv:2605.16258 [pdf, ps, other]
Title: IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[2]  arXiv:2605.16241 [pdf, ps, other]
Title: Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3]  arXiv:2605.16179 [pdf, ps, other]
Title: MAgSeg: Segmentation of Agricultural Landscapes in High-Resolution Satellite Imagery using Multimodal Large Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2605.16171 [pdf, ps, other]
Title: Res$^2$CLIP: Few-Shot Generalist Anomaly Detection with Residual-to-Residual Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2605.16165 [pdf, ps, other]
Title: Second-Order Multi-Level Variance Correction for Modality Competition in Multimodal Models
Authors: Yishun Lu, Wes Armour
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[6]  arXiv:2605.16147 [pdf, ps, other]
Title: Registers Matter for Pixel-Space Diffusion Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2605.16137 [pdf, ps, other]
Title: STABLE: Simulation-Ready Tabletop Layout Generation via a Semantics-Physics Dual System
Comments: ICML 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[8]  arXiv:2605.16127 [pdf, ps, other]
Title: WeatherOcc3D: VLM-Assisted Adverse Weather Aware 3D Semantic Occupancy Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2605.16122 [pdf, ps, other]
Title: GenShield: Unified Detection and Artifact Correction for AI-Generated Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[10]  arXiv:2605.16080 [pdf, ps, other]
Title: ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation
Comments: Accepted by CVPR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11]  arXiv:2605.16079 [pdf, ps, other]
Title: VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[12]  arXiv:2605.16076 [pdf, ps, other]
Title: AgriMind: An Ensemble Deep Learning Framework for Multi-Class Plant Disease Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[13]  arXiv:2605.16065 [pdf, ps, other]
Title: Robust Prior-Guided Segmentation for Editable 3D Gaussian Splatting
Comments: Accepted at IEEE International Conference on Image Processing 2026, 6 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[14]  arXiv:2605.16022 [pdf, ps, other]
Title: EndoGSim: Physics-Aware 4D Dynamic Endoscopic Scene Simulations via MLLM-Guided Gaussian Splatting
Comments: Early Accepted by MICCAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15]  arXiv:2605.16008 [pdf, ps, other]
Title: End-to-end plaque counting and virus titration from laboratory plate images with deep learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2605.16003 [pdf, ps, other]
Title: Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17]  arXiv:2605.15997 [pdf, ps, other]
Title: Segmentation, Detection and Explanation: A Unified Framework for CT Appearance Reasoning
Comments: 8 pages, 4 figures, submitted to IEEE Transactions on Medical Imaging (TMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2605.15980 [pdf, ps, other]
Title: Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[19]  arXiv:2605.15961 [pdf, ps, other]
Title: Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[20]  arXiv:2605.15951 [pdf, ps, other]
Title: From Failure to Feedback: Group Revision Unlocks Hard Cases in Object-Level Grounding
Comments: 8 pages, 5 figures, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21]  arXiv:2605.15942 [pdf, ps, other]
Title: Decomposed Vision-Language Alignment for Fine-Grained Open-Vocabulary Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[22]  arXiv:2605.15923 [pdf, ps, other]
Title: Invaria: Learning Scale and Density Invariance in Point Clouds via Next-Resolution Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2605.15921 [pdf, ps, other]
Title: AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
Authors: Dingming Liu
Comments: Accepted by ICML 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2605.15908 [pdf, ps, other]
Title: RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25]  arXiv:2605.15906 [pdf, ps, other]
Title: A Causally Grounded Taxonomy for Image Degradation Robustness Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 1061 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 1051-1061 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2605, contact, help  (Access key information)