We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 70

[ total of 747 entries: 1-50 | 21-70 | 71-120 | 121-170 | 171-220 | 221-270 | ... | 721-747 ]
[ showing 50 entries per page: fewer | more | all ]

Mon, 15 Dec 2025 (continued, showing last 34 of 104 entries)

[71]  arXiv:2512.11226 [pdf, ps, other]
Title: FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2512.11225 [pdf, ps, other]
Title: VFMF: World Modeling by Forecasting Vision Foundation Model Features
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[73]  arXiv:2512.11215 [pdf, ps, other]
Title: SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2512.11203 [pdf, ps, other]
Title: AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2512.11199 [pdf, ps, other]
Title: CADKnitter: Compositional CAD Generation from Text and Geometry Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[76]  arXiv:2512.11189 [pdf, ps, other]
Title: Multi-task Learning with Extended Temporal Shift Module for Temporal Action Localization
Comments: BinEgo360@ICCV25
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2512.11186 [pdf, ps, other]
Title: Lightweight 3D Gaussian Splatting Compression via Video Codec
Comments: Accepted by DCC2026 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2512.11167 [pdf, ps, other]
Title: Image Tiling for High-Resolution Reasoning: Balancing Local Detail with Global Context
Comments: Accepted in AAAI 2025 Workshop on Reproducible AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79]  arXiv:2512.11141 [pdf, ps, other]
Title: Learning complete and explainable visual representations from itemized text supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80]  arXiv:2512.11130 [pdf, ps, other]
Title: Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[81]  arXiv:2512.11121 [pdf, ps, other]
Title: Learning from a Generative Oracle: Domain Adaptation for Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[82]  arXiv:2512.11104 [pdf, ps, other]
Title: Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization
Comments: 29 Pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2512.11099 [pdf, ps, other]
Title: VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2512.11098 [pdf, ps, other]
Title: Vision-Language Models for Infrared Industrial Sensing in Additive Manufacturing Scene Description
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[85]  arXiv:2512.11076 [pdf, ps, other]
Title: E-CHUM: Event-based Cameras for Human Detection and Urban Monitoring
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[86]  arXiv:2512.11061 [pdf, ps, other]
Title: VDAWorld: World Modelling via VLM-Directed Abstraction and Simulation
Comments: Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87]  arXiv:2512.11060 [pdf, ps, other]
Title: Synthetic Vasculature and Pathology Enhance Vision-Language Model Reasoning
Comments: 23 pages, 8 figures, 6 tables. Full paper under review for MIDL 2026 (Medical Imaging with Deep Learning)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2512.11057 [pdf, ps, other]
Title: Weakly Supervised Tuberculosis Localization in Chest X-rays through Knowledge Distillation
Comments: 18 pages, 9 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2512.11016 [pdf, ps, other]
Title: SoccerMaster: A Vision Foundation Model for Soccer Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[90]  arXiv:2512.11015 [pdf, ps, other]
Title: Leveraging Text Guidance for Enhancing Demographic Fairness in Gender Classification
Authors: Anoop Krishnan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[91]  arXiv:2512.11797 (cross-list from cs.RO) [pdf, ps, other]
Title: AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis
Comments: Project page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2512.11745 (cross-list from eess.IV) [pdf, ps, other]
Title: mViSE: A Visual Search Engine for Analyzing Multiplex IHC Brain Tissue Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2512.11695 (cross-list from physics.flu-dyn) [pdf, ps, other]
Title: Particle Image Velocimetry Refinement via Consensus ADMM
Comments: Code: this https URL
Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
[94]  arXiv:2512.11676 (cross-list from math.PR) [pdf, ps, other]
Title: Stochastics of shapes and Kunita flows
Subjects: Probability (math.PR); Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2512.11582 (cross-list from cs.LG) [pdf, ps, other]
Title: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model
Comments: Code and pretrained models available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[96]  arXiv:2512.11532 (cross-list from cs.DC) [pdf, ps, other]
Title: Parallax: Runtime Parallelization for Operator Fallbacks in Heterogeneous Edge Systems
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.11433 (cross-list from cs.AI) [pdf, other]
Title: Back to the Baseline: Examining Baseline Effects on Explainability Metrics
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.11399 (cross-list from cs.CL) [pdf, ps, other]
Title: Minimal Clips, Maximum Salience: Long Video Summarization via Key Moment Extraction
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2512.11243 (cross-list from cs.LG) [pdf, ps, other]
Title: Task-Aware Multi-Expert Architecture For Lifelong Deep Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2512.11218 (cross-list from cs.RO) [pdf, ps, other]
Title: Seeing to Act, Prompting to Specify: A Bayesian Factorization of Vision Language Action Policy
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2512.11194 (cross-list from cs.LG) [pdf, ps, other]
Title: Beyond Memorization: Gradient Projection Enables Selective Learning in Diffusion Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.11145 (cross-list from cs.LG) [pdf, ps, other]
Title: Autoencoder-based Semi-Supervised Dimensionality Reduction and Clustering for Scientific Ensembles
Comments: Research Internship Project
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2512.11047 (cross-list from cs.RO) [pdf, ps, other]
Title: WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2512.10966 (cross-list from cs.LG) [pdf, ps, other]
Title: Multimodal Fusion of Regional Brain Experts for Interpretable Alzheimer's Disease Diagnosis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Fri, 12 Dec 2025 (showing first 16 of 118 entries)

[105]  arXiv:2512.10959 [pdf, ps, other]
Title: StereoSpace: Depth-Free Synthesis of Stereo Geometry via End-to-End Diffusion in a Canonical Space
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2512.10958 [pdf, ps, other]
Title: WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Comments: Preprint; 80 pages, 37 figures, 29 tables; Project Page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107]  arXiv:2512.10957 [pdf, ps, other]
Title: SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[108]  arXiv:2512.10956 [pdf, ps, other]
Title: Empowering Dynamic Urban Navigation with Stereo and Mid-Level Vision
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2512.10955 [pdf, ps, other]
Title: Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110]  arXiv:2512.10954 [pdf, ps, other]
Title: Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2512.10950 [pdf, ps, other]
Title: E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2512.10949 [pdf, ps, other]
Title: Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
Comments: Code is released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113]  arXiv:2512.10948 [pdf, ps, other]
Title: ClusIR: Towards Cluster-Guided All-in-One Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2512.10947 [pdf, ps, other]
Title: Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.10945 [pdf, ps, other]
Title: MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation
Comments: IEEE TPAMI, Project Page: this https URL
Journal-ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 12, pp. 11400-11416, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.10943 [pdf, ps, other]
Title: AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[117]  arXiv:2512.10942 [pdf, ps, other]
Title: VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.10941 [pdf, ps, other]
Title: Mull-Tokens: Modality-Agnostic Latent Thinking
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[119]  arXiv:2512.10940 [pdf, ps, other]
Title: OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[120]  arXiv:2512.10939 [pdf, ps, other]
Title: GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting
Comments: IEEE/CVF Winter Conference on Applications of Computer Vision 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 747 entries: 1-50 | 21-70 | 71-120 | 121-170 | 171-220 | 221-270 | ... | 721-747 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)