We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 309

[ total of 778 entries: 1-25 | ... | 235-259 | 260-284 | 285-309 | 310-334 | 335-359 | 360-384 | 385-409 | ... | 760-778 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 4 Dec 2025 (continued, showing 25 of 130 entries)

[310]  arXiv:2512.03477 [pdf, ps, other]
Title: Fairness-Aware Fine-Tuning of Vision-Language Models for Medical Glaucoma Diagnosis
Comments: 10 pages, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[311]  arXiv:2512.03474 [pdf, ps, other]
Title: Procedural Mistake Detection via Action Effect Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[312]  arXiv:2512.03470 [pdf, ps, other]
Title: Difference Decomposition Networks for Infrared Small Target Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2512.03463 [pdf, ps, other]
Title: Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[314]  arXiv:2512.03454 [pdf, ps, other]
Title: Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[315]  arXiv:2512.03453 [pdf, ps, other]
Title: GeoVideo: Introducing Geometric Regularization into Video Generation Model
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316]  arXiv:2512.03451 [pdf, ps, other]
Title: GalaxyDiT: Efficient Video Generation with Guidance Alignment and Adaptive Proxy in Diffusion Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[317]  arXiv:2512.03450 [pdf, ps, other]
Title: KeyPointDiffuser: Unsupervised 3D Keypoint Learning via Latent Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[318]  arXiv:2512.03449 [src]
Title: LM-CartSeg: Automated Segmentation of Lateral and Medial Cartilage and Subchondral Bone for Radiomics Analysis
Authors: Tongxu Zhang
Comments: The manuscript represents only a preliminary and substantially incompleted exploration. The author has decided not to stand by these results, and a thoroughly revised and significantly different version will be developed separately. Therefore this version is withdrawn and should not be cited
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2512.03445 [pdf, ps, other]
Title: Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation
Comments: 10 pages. Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[320]  arXiv:2512.03430 [pdf, ps, other]
Title: Label-Efficient Hyperspectral Image Classification via Spectral FiLM Modulation of Low-Level Pretrained Diffusion Features
Comments: Accepted to the ICML 2025 TerraBytes Workshop (June 9, 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2512.03427 [pdf, ps, other]
Title: Generalization Evaluation of Deep Stereo Matching Methods for UAV-Based Forestry Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[322]  arXiv:2512.03424 [pdf, ps, other]
Title: DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2512.03418 [pdf, ps, other]
Title: YOLOA: Real-Time Affordance Detection via LLM Adapter
Comments: 13 pages, 9 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[324]  arXiv:2512.03405 [pdf, ps, other]
Title: ViDiC: Video Difference Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2512.03404 [pdf, ps, other]
Title: MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326]  arXiv:2512.03370 [pdf, ps, other]
Title: ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[327]  arXiv:2512.03369 [pdf, ps, other]
Title: FireSentry: A Multi-Modal Spatio-temporal Benchmark Dataset for Fine-Grained Wildfire Spread Forecasting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[328]  arXiv:2512.03359 [pdf, ps, other]
Title: A Hybrid Deep Learning Framework with Explainable AI for Lung Cancer Classification with DenseNet169 and SVM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[329]  arXiv:2512.03350 [pdf, ps, other]
Title: SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2512.03346 [pdf, ps, other]
Title: Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus
Comments: 16 pages, 7 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2512.03345 [pdf, ps, other]
Title: HalluGen: Synthesizing Realistic and Controllable Hallucinations for Evaluating Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[332]  arXiv:2512.03339 [pdf, ps, other]
Title: ProtoEFNet: Dynamic Prototype Learning for Inherently Interpretable Ejection Fraction Estimation in Echocardiography
Comments: 11 pages, Accepted in IMIMIC Workshop at MICCAI 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[333]  arXiv:2512.03335 [pdf, ps, other]
Title: Step-by-step Layered Design Generation
Journal-ref: AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[334]  arXiv:2512.03317 [pdf, ps, other]
Title: NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
Comments: Accepted to 2026 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[ total of 778 entries: 1-25 | ... | 235-259 | 260-284 | 285-309 | 310-334 | 335-359 | 360-384 | 385-409 | ... | 760-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)