Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 86

[ total of 754 entries: 1-50 | 37-86 | 87-136 | 137-186 | 187-236 | 237-286 | ... | 737-754 ]

Thu, 11 Dec 2025 (continued, showing last 49 of 135 entries)

[87]  arXiv:2512.09115 [pdf, ps, other]
Title: SuperF: Neural Implicit Fields for Multi-Image Super-Resolution
Comments: 23 pages, 13 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2512.09112 [pdf, ps, other]
Title: GimbalDiffusion: Gravity-Aware Camera Control for Video Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2512.09095 [pdf, ps, other]
Title: Food Image Generation on Multi-Noun Categories
Comments: Accepted by WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.09092 [pdf, ps, other]
Title: Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2512.09081 [pdf, ps, other]
Title: AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2512.09071 [pdf, ps, other]
Title: Adaptive Thresholding for Visual Place Recognition using Negative Gaussian Mixture Statistics
Comments: Accepted and presented at IEEE RoboticCC 2025. 4 pages short paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[93]  arXiv:2512.09069 [pdf, ps, other]
Title: KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification
Comments: 7 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[94]  arXiv:2512.09062 [pdf, ps, other]
Title: SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[95]  arXiv:2512.09056 [pdf, ps, other]
Title: ConceptPose: Training-Free Zero-Shot Object Pose Estimation using Concept Vectors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.09016 [pdf, ps, other]
Title: Learning to Remove Lens Flare in Event Camera
Comments: Preprint; 29 pages, 14 figures, 4 tables; Project Page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.09011 [pdf, ps, other]
Title: An Approach for Detection of Entities in Dynamic Media Contents
Comments: 12 pages, 8 figures
Journal-ref: Journal of Computer Science and Technology Studies, Vol. 5, No. 3, pp. 13-24, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.09010 [pdf, ps, other]
Title: Towards Lossless Ultimate Vision Token Compression for VLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[99]  arXiv:2512.09005 [pdf, ps, other]
Title: A Survey of Body and Face Motion: Datasets, Performance Evaluation Metrics and Generative Techniques
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[100]  arXiv:2512.09001 [pdf, ps, other]
Title: A Physics-Constrained, Design-Driven Methodology for Defect Dataset Generation in Optical Lithography
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[101]  arXiv:2512.08999 [pdf, ps, other]
Title: Diffusion Model Regularized Implicit Neural Representation for CT Metal Artifact Reduction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.08996 [pdf, ps, other]
Title: Demo: Generative AI helps Radiotherapy Planning with User Preference
Comments: Best paper in GenAI4Health at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2512.08991 [pdf, ps, other]
Title: Deterministic World Models for Verification of Closed-loop Vision-based Systems
Comments: 22 pages, 10 figures. Submitted to FM 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104]  arXiv:2512.08989 [pdf, ps, other]
Title: Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2512.08987 [pdf, ps, other]
Title: 3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization
Comments: Accepted at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[106]  arXiv:2512.08986 [pdf, ps, other]
Title: Explainable Fundus Image Curation and Lesion Detection in Diabetic Retinopathy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[107]  arXiv:2512.08985 [pdf, ps, other]
Title: An Efficient Test-Time Scaling Approach for Image Generation
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2512.08984 [pdf, ps, other]
Title: RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[109]  arXiv:2512.08983 [pdf, ps, other]
Title: HSCP: A Two-Stage Spectral Clustering Framework for Resource-Constrained UAV Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[110]  arXiv:2512.08982 [pdf, ps, other]
Title: Consist-Retinex: One-Step Noise-Emphasized Consistency Training Accelerates High-Quality Retinex Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[111]  arXiv:2512.08981 [pdf, ps, other]
Title: Mitigating Bias with Words: Inducing Demographic Ambiguity in Face Recognition Templates by Text Encoding
Comments: Accepted at BMVC workshop (SRBS) 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[112]  arXiv:2512.08980 [pdf, ps, other]
Title: Training Multi-Image Vision Agents via End2End Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[113]  arXiv:2512.08979 [pdf, ps, other]
Title: What Happens When: Learning Temporal Orders of Events in Videos
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[114]  arXiv:2512.09920 (cross-list from cs.RO) [pdf, ps, other]
Title: LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating
Comments: 8 pages
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.09903 (cross-list from cs.RO) [pdf, ps, other]
Title: YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.09898 (cross-list from cs.RO) [pdf, ps, other]
Title: Visual Heading Prediction for Autonomous Aerial Vehicles
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
[117]  arXiv:2512.09851 (cross-list from cs.RO) [pdf, ps, other]
Title: Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.09841 (cross-list from cs.CL) [pdf, ps, other]
Title: ChronusOmni: Improving Time Awareness of Omni Large Language Models
Comments: Code available at this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[119]  arXiv:2512.09779 (cross-list from eess.IV) [pdf, ps, other]
Title: PathCo-LatticE: Pathology-Constrained Lattice-Of Experts Framework for Fully-supervised Few-Shot Cardiac MRI Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120]  arXiv:2512.09664 (cross-list from cs.DC) [pdf, ps, other]
Title: SynthPix: A lightspeed PIV images generator
Comments: Code: this https URL
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[121]  arXiv:2512.09610 (cross-list from cs.HC) [pdf, ps, other]
Title: ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation
Comments: 24 pages, 10 figures
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.09607 (cross-list from cs.RO) [pdf, ps, other]
Title: UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories
Comments: 9 pages, 5 figures, accepted to AAAI 2026
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.09510 (cross-list from cs.RO) [pdf, ps, other]
Title: ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.09469 (cross-list from quant-ph) [pdf, ps, other]
Title: LiePrune: Lie Group and Quantum Geometric Dual Representation for One-Shot Structured Pruning of Quantum Neural Networks
Comments: 7 pages, 2 figures
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.09447 (cross-list from cs.RO) [pdf, ps, other]
Title: Sequential Testing for Descriptor-Agnostic LiDAR Loop Closure in Repetitive Environments
Comments: 8 pages, 4 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.09406 (cross-list from cs.RO) [pdf, ps, other]
Title: H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Comments: 13 pages, 6 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.09376 (cross-list from cs.LG) [pdf, ps, other]
Title: Rates and architectures for learning geometrically non-trivial operators
Comments: 26 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Differential Geometry (math.DG)
[128]  arXiv:2512.09343 (cross-list from cs.RO) [pdf, ps, other]
Title: Development and Testing for Perception Based Autonomous Landing of a Long-Range QuadPlane
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.09340 (cross-list from cs.AI) [pdf, ps, other]
Title: Visual Categorization Across Minds and Models: Cognitive Analysis of Human Labeling and Neuro-Symbolic Integration
Comments: 12 pages, 3 figures. Research manuscript based on the final project for CS6795 (Introduction to Cognitive Science), Georgia Tech
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[130]  arXiv:2512.09309 (cross-list from cs.DC) [pdf, ps, other]
Title: A Distributed Framework for Privacy-Enhanced Vision Transformers on the Edge
Comments: 16 pages, 7 figures. Published in the Proceedings of the Tenth ACM/IEEE Symposium on Edge Computing (SEC '25), Dec 3-6, 2025, Washington, D.C., USA
Journal-ref: Proceedings of the Tenth ACM/IEEE Symposium on Edge Computing (SEC '25), 2025, Article 8, pp. 1-16
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.09201 (cross-list from cs.GR) [pdf, ps, other]
Title: Residual Primitive Fitting of 3D Shapes with SuperFrusta
Comments: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2512.09094 (cross-list from eess.IV) [pdf, ps, other]
Title: Causal Attribution of Model Performance Gaps in Medical Imaging Under Distribution Shifts
Comments: Medical Imaging meets EurIPS Workshop: MedEurIPS 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Methodology (stat.ME)
[133]  arXiv:2512.08998 (cross-list from eess.IV) [pdf, ps, other]
Title: DermETAS-SNA LLM: A Dermatology Focused Evolutionary Transformer Architecture Search with StackNet Augmented LLM Assistant
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2512.08992 (cross-list from eess.IV) [pdf, ps, other]
Title: Enhanced Chest Disease Classification Using an Improved CheXNet Framework with EfficientNetV2-M and Optimization-Driven Learning
Comments: 23 pages, 6 figures, 7 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[135]  arXiv:2512.08990 (cross-list from eess.IV) [pdf, ps, other]
Title: Agreement Disagreement Guided Knowledge Transfer for Cross-Scene Hyperspectral Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

Wed, 10 Dec 2025 (showing first 1 of 131 entries)

[136]  arXiv:2512.08931 [pdf, ps, other]
Title: Astra: General Interactive World Model with Autoregressive Denoising
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
