We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 221

[ total of 732 entries: 1-25 | ... | 147-171 | 172-196 | 197-221 | 222-246 | 247-271 | 272-296 | 297-321 | ... | 722-732 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 16 Dec 2025 (continued, showing last 23 of 244 entries)

[222]  arXiv:2512.12694 (cross-list from cs.DL) [pdf, ps, other]
Title: Hybrid Retrieval-Augmented Generation for Robust Multilingual Document Question Answering
Comments: Preprint
Subjects: Digital Libraries (cs.DL); Computer Vision and Pattern Recognition (cs.CV)
[223]  arXiv:2512.12690 (cross-list from cs.LG) [pdf, ps, other]
Title: Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2512.12683 (cross-list from quant-ph) [pdf, ps, other]
Title: Quantum Implicit Neural Representations for 3D Scene Reconstruction and Novel View Synthesis
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2512.12663 (cross-list from cs.LG) [pdf, ps, other]
Title: PerNodeDrop: A Method Balancing Specialized Subnets and Regularization in Deep Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2512.12367 (cross-list from physics.optics) [pdf, ps, other]
Title: JPEG-Inspired Cloud-Edge Holography
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
[227]  arXiv:2512.12284 (cross-list from eess.IV) [pdf, ps, other]
Title: V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval
Comments: 14 pages, 20 figures, conference
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[228]  arXiv:2512.12236 (cross-list from eess.IV) [pdf, ps, other]
Title: Resolution-Independent Neural Operators for Multi-Rate Sparse-View CT
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2512.12203 (cross-list from cs.RO) [pdf, ps, other]
Title: Navigation Around Unknown Space Objects Using Visible-Thermal Image Fusion
Comments: 18 pages, 11 figures. To be published in proceedings of AIAA SCITECH 2026 Forum
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[230]  arXiv:2512.12196 (cross-list from cs.MM) [pdf, ps, other]
Title: AutoMV: An Automatic Multi-Agent System for Music Video Generation
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[231]  arXiv:2512.11982 (cross-list from astro-ph.IM) [pdf, ps, other]
Title: Semantic search for 100M+ galaxy images using AI-generated captions
Comments: Presented at the NeurIPS 2025 AI4Science Workshop
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[232]  arXiv:2512.11957 (cross-list from astro-ph.IM) [pdf, ps, other]
Title: Pre-training vision models for the classification of alerts from wide-field time-domain surveys
Comments: To be submitted to PASP
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2512.11903 (cross-list from cs.RO) [pdf, ps, other]
Title: Aion: Towards Hierarchical 4D Scene Graphs with Temporal Flow Dynamics
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[234]  arXiv:2512.11883 (cross-list from cs.CY) [pdf, ps, other]
Title: Aesthetic Alignment Risks Assimilation: How Image Generation and Reward Models Reinforce Beauty Bias and Ideological "Censorship"
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2512.11867 (cross-list from cs.LG) [pdf, ps, other]
Title: On the Dangers of Bootstrapping Generation for Continual Learning and Beyond
Comments: DAGM German Conference on Pattern Recognition, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[236]  arXiv:2512.11849 (cross-list from cs.CL) [pdf, ps, other]
Title: KH-FUNSD: A Hierarchical and Fine-Grained Layout Analysis Dataset for Low-Resource Khmer Business Document
Authors: Nimol Thuon, Jun Du
Journal-ref: 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[237]  arXiv:2512.11837 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Vision Foundry: A System for Training Foundational Vision AI Models
Comments: 10 pages, 4 figures, 3 tables, submitted to AMIA 2026 Informatics Summit
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[238]  arXiv:2512.11833 (cross-list from cs.LG) [pdf, ps, other]
Title: Soft Decision Tree classifier: explainable and extendable PyTorch implementation
Authors: Reuben R Shamir
Comments: Keywords: Soft Decision Tree, Short-term Memory Soft Decision Tree, Classification, Explainability
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[239]  arXiv:2512.11831 (cross-list from cs.LG) [pdf, ps, other]
Title: On the Design of One-step Diffusion via Shortcutting Flow Paths
Comments: 10 pages of main body, conference paper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[240]  arXiv:2512.11827 (cross-list from cs.CY) [pdf, ps, other]
Title: Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[241]  arXiv:2512.11824 (cross-list from cs.RO) [pdf, ps, other]
Title: ReGlove: A Soft Pneumatic Glove for Activities of Daily Living Assistance via Wrist-Mounted Vision
Authors: Rosh Ho, Jian Zhang
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[242]  arXiv:2512.11817 (cross-list from cs.CY) [pdf, ps, other]
Title: A Reproducible Workflow for Scraping, Structuring, and Segmenting Legacy Archaeological Artifact Images
Comments: 12 Pages, 5 figures
Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
[243]  arXiv:2512.11811 (cross-list from cs.CL) [pdf, ps, other]
Title: Enhancing Urban Visual Place Recognition for Crowdsourced Flood Imagery via LLM-Guided Attention
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[244]  arXiv:2512.11802 (cross-list from cs.RO) [pdf, ps, other]
Title: Benchmarking Tesla's Traffic Light and Stop Sign Control: Field Dataset and Behavior Insights
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)

Mon, 15 Dec 2025 (showing first 2 of 104 entries)

[245]  arXiv:2512.11800 [pdf, ps, other]
Title: Moment-Based 3D Gaussian Splatting: Resolving Volumetric Occlusion with Order-Independent Transmittance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[246]  arXiv:2512.11799 [pdf, ps, other]
Title: V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 732 entries: 1-25 | ... | 147-171 | 172-196 | 197-221 | 222-246 | 247-271 | 272-296 | 297-321 | ... | 722-732 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)