We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 79

[ total of 754 entries: 1-100 | 80-179 | 180-279 | 280-379 | 380-479 | ... | 680-754 ]
[ showing 100 entries per page: fewer | more | all ]

Thu, 11 Dec 2025 (continued, showing last 56 of 135 entries)

[80]  arXiv:2512.09232 [pdf, ps, other]
Title: Enabling Next-Generation Consumer Experience with Feature Coding for Machines
Journal-ref: 2025 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2025, pp. 1-4
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2512.09215 [pdf, ps, other]
Title: View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[82]  arXiv:2512.09185 [pdf, ps, other]
Title: Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[83]  arXiv:2512.09172 [pdf, ps, other]
Title: Prompt-Based Continual Compositional Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[84]  arXiv:2512.09164 [pdf, ps, other]
Title: WonderZoom: Multi-Scale 3D World Generation
Comments: Project website: this https URL The first two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[85]  arXiv:2512.09162 [pdf, ps, other]
Title: GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[86]  arXiv:2512.09134 [pdf, ps, other]
Title: Integrated Pipeline for Coronary Angiography With Automated Lesion Profiling, Virtual Stenting, and 100-Vessel FFR Validation
Comments: 22 pages, 10 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[87]  arXiv:2512.09115 [pdf, ps, other]
Title: SuperF: Neural Implicit Fields for Multi-Image Super-Resolution
Comments: 23 pages, 13 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2512.09112 [pdf, ps, other]
Title: GimbalDiffusion: Gravity-Aware Camera Control for Video Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2512.09095 [pdf, ps, other]
Title: Food Image Generation on Multi-Noun Categories
Comments: Accepted by WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.09092 [pdf, ps, other]
Title: Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2512.09081 [pdf, ps, other]
Title: AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2512.09071 [pdf, ps, other]
Title: Adaptive Thresholding for Visual Place Recognition using Negative Gaussian Mixture Statistics
Comments: Accepted and presented at IEEE RoboticCC 2025. 4 pages short paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[93]  arXiv:2512.09069 [pdf, ps, other]
Title: KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification
Comments: 7 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[94]  arXiv:2512.09062 [pdf, ps, other]
Title: SIP: Site in Pieces- A Dataset of Disaggregated Construction-Phase 3D Scans for Semantic Segmentation and Scene Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[95]  arXiv:2512.09056 [pdf, ps, other]
Title: ConceptPose: Training-Free Zero-Shot Object Pose Estimation using Concept Vectors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.09016 [pdf, ps, other]
Title: Learning to Remove Lens Flare in Event Camera
Comments: Preprint; 29 pages, 14 figures, 4 tables; Project Page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.09011 [pdf, ps, other]
Title: An Approach for Detection of Entities in Dynamic Media Contents
Comments: 12 pages, 8 figures
Journal-ref: Journal of Computer Science and Technology Studies, Vol. 5, No. 3, pp. 13-24, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.09010 [pdf, ps, other]
Title: Towards Lossless Ultimate Vision Token Compression for VLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[99]  arXiv:2512.09005 [pdf, ps, other]
Title: A Survey of Body and Face Motion: Datasets, Performance Evaluation Metrics and Generative Techniques
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[100]  arXiv:2512.09001 [pdf, ps, other]
Title: A Physics-Constrained, Design-Driven Methodology for Defect Dataset Generation in Optical Lithography
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[101]  arXiv:2512.08999 [pdf, ps, other]
Title: Diffusion Model Regularized Implicit Neural Representation for CT Metal Artifact Reduction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.08996 [pdf, ps, other]
Title: Demo: Generative AI helps Radiotherapy Planning with User Preference
Comments: Best paper in GenAI4Health at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2512.08991 [pdf, ps, other]
Title: Deterministic World Models for Verification of Closed-loop Vision-based Systems
Comments: 22 pages, 10 figures. Submitted to FM 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104]  arXiv:2512.08989 [pdf, ps, other]
Title: Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2512.08987 [pdf, ps, other]
Title: 3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization
Comments: Accepted at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[106]  arXiv:2512.08986 [pdf, ps, other]
Title: Explainable Fundus Image Curation and Lesion Detection in Diabetic Retinopathy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[107]  arXiv:2512.08985 [pdf, ps, other]
Title: An Efficient Test-Time Scaling Approach for Image Generation
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2512.08984 [pdf, ps, other]
Title: RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[109]  arXiv:2512.08983 [pdf, ps, other]
Title: HSCP: A Two-Stage Spectral Clustering Framework for Resource-Constrained UAV Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[110]  arXiv:2512.08982 [pdf, ps, other]
Title: Consist-Retinex: One-Step Noise-Emphasized Consistency Training Accelerates High-Quality Retinex Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[111]  arXiv:2512.08981 [pdf, ps, other]
Title: Mitigating Bias with Words: Inducing Demographic Ambiguity in Face Recognition Templates by Text Encoding
Comments: Accepted at BMVC workshop (SRBS) 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[112]  arXiv:2512.08980 [pdf, ps, other]
Title: Training Multi-Image Vision Agents via End2End Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[113]  arXiv:2512.08979 [pdf, ps, other]
Title: What Happens When: Learning Temporal Orders of Events in Videos
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[114]  arXiv:2512.09920 (cross-list from cs.RO) [pdf, ps, other]
Title: LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating
Comments: 8 pages
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.09903 (cross-list from cs.RO) [pdf, ps, other]
Title: YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.09898 (cross-list from cs.RO) [pdf, ps, other]
Title: Visual Heading Prediction for Autonomous Aerial Vehicles
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
[117]  arXiv:2512.09851 (cross-list from cs.RO) [pdf, ps, other]
Title: Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.09841 (cross-list from cs.CL) [pdf, ps, other]
Title: ChronusOmni: Improving Time Awareness of Omni Large Language Models
Comments: Code available at this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[119]  arXiv:2512.09779 (cross-list from eess.IV) [pdf, ps, other]
Title: PathCo-LatticE: Pathology-Constrained Lattice-Of Experts Framework for Fully-supervised Few-Shot Cardiac MRI Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120]  arXiv:2512.09664 (cross-list from cs.DC) [pdf, ps, other]
Title: SynthPix: A lightspeed PIV images generator
Comments: Code: this https URL
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[121]  arXiv:2512.09610 (cross-list from cs.HC) [pdf, ps, other]
Title: ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation
Comments: 24 pages, 10 figures
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.09607 (cross-list from cs.RO) [pdf, ps, other]
Title: UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories
Comments: 9 pages, 5 figures, accepted to AAAI 2026
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.09510 (cross-list from cs.RO) [pdf, ps, other]
Title: ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.09469 (cross-list from quant-ph) [pdf, ps, other]
Title: LiePrune: Lie Group and Quantum Geometric Dual Representation for One-Shot Structured Pruning of Quantum Neural Networks
Comments: 7 pages, 2 figures
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.09447 (cross-list from cs.RO) [pdf, ps, other]
Title: Sequential Testing for Descriptor-Agnostic LiDAR Loop Closure in Repetitive Environments
Comments: 8 pages, 4 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.09406 (cross-list from cs.RO) [pdf, ps, other]
Title: H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Comments: 13 pages, 6 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.09376 (cross-list from cs.LG) [pdf, ps, other]
Title: Rates and architectures for learning geometrically non-trivial operators
Comments: 26 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Differential Geometry (math.DG)
[128]  arXiv:2512.09343 (cross-list from cs.RO) [pdf, ps, other]
Title: Development and Testing for Perception Based Autonomous Landing of a Long-Range QuadPlane
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.09340 (cross-list from cs.AI) [pdf, ps, other]
Title: Visual Categorization Across Minds and Models: Cognitive Analysis of Human Labeling and Neuro-Symbolic Integration
Comments: 12 pages, 3 figures. Research manuscript based on the final project for CS6795 (Introduction to Cognitive Science), Georgia Tech
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[130]  arXiv:2512.09309 (cross-list from cs.DC) [pdf, ps, other]
Title: A Distributed Framework for Privacy-Enhanced Vision Transformers on the Edge
Comments: 16 pages, 7 figures. Published in the Proceedings of the Tenth ACM/IEEE Symposium on Edge Computing (SEC '25), Dec 3-6, 2025, Washington, D.C., USA
Journal-ref: Proceedings of the Tenth ACM/IEEE Symposium on Edge Computing (SEC '25), 2025, Article 8, pp. 1-16
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.09201 (cross-list from cs.GR) [pdf, ps, other]
Title: Residual Primitive Fitting of 3D Shapes with SuperFrusta
Comments: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2512.09094 (cross-list from eess.IV) [pdf, ps, other]
Title: Causal Attribution of Model Performance Gaps in Medical Imaging Under Distribution Shifts
Comments: Medical Imaging meets EurIPS Workshop: MedEurIPS 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Methodology (stat.ME)
[133]  arXiv:2512.08998 (cross-list from eess.IV) [pdf, ps, other]
Title: DermETAS-SNA LLM: A Dermatology Focused Evolutionary Transformer Architecture Search with StackNet Augmented LLM Assistant
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2512.08992 (cross-list from eess.IV) [pdf, ps, other]
Title: Enhanced Chest Disease Classification Using an Improved CheXNet Framework with EfficientNetV2-M and Optimization-Driven Learning
Comments: 23 pages, 6 figures, 7 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[135]  arXiv:2512.08990 (cross-list from eess.IV) [pdf, ps, other]
Title: Agreement Disagreement Guided Knowledge Transfer for Cross-Scene Hyperspectral Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

Wed, 10 Dec 2025 (showing first 44 of 131 entries)

[136]  arXiv:2512.08931 [pdf, ps, other]
Title: Astra: General Interactive World Model with Autoregressive Denoising
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[137]  arXiv:2512.08930 [pdf, ps, other]
Title: Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[138]  arXiv:2512.08924 [pdf, ps, other]
Title: Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2512.08922 [pdf, ps, other]
Title: Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.08912 [pdf, ps, other]
Title: LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception
Comments: Preprint. 12 pages, 9 figures. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[141]  arXiv:2512.08905 [pdf, ps, other]
Title: Self-Evolving 3D Scene Generation from a Single Image
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2512.08897 [pdf, ps, other]
Title: UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.08889 [pdf, ps, other]
Title: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[144]  arXiv:2512.08888 [pdf, ps, other]
Title: Accelerated Rotation-Invariant Convolution for UAV Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145]  arXiv:2512.08881 [pdf, ps, other]
Title: SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2512.08873 [pdf, ps, other]
Title: Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning
Comments: 6 pages
Journal-ref: 2024 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[147]  arXiv:2512.08860 [pdf, ps, other]
Title: Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference
Authors: Amit Bendkhale
Comments: 6 pages, 3 figures. Code and data: this https URL Accepted to the AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.08854 [pdf, ps, other]
Title: Generation is Required for Data-Efficient Perception
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149]  arXiv:2512.08829 [pdf, ps, other]
Title: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
Comments: 16 pages, 8 figures, conference or other essential info
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150]  arXiv:2512.08820 [pdf, ps, other]
Title: Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning
Comments: Accepted in IEEE Transactions on Multimedia (TMM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2512.08789 [pdf, ps, other]
Title: MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
Comments: 10 pages, 7 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152]  arXiv:2512.08785 [pdf, ps, other]
Title: LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.08774 [pdf, ps, other]
Title: Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps
Comments: 10 pages, 9 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[154]  arXiv:2512.08765 [pdf, ps, other]
Title: Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Comments: NeurlPS 2025. Code and data available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.08751 [pdf, ps, other]
Title: Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[156]  arXiv:2512.08747 [pdf, ps, other]
Title: A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation
Comments: 20 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.08738 [pdf, ps, other]
Title: Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture
Comments: To appear at AACL-IJCNLP 2025 Workshop WSLP
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[158]  arXiv:2512.08733 [pdf, ps, other]
Title: Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[159]  arXiv:2512.08730 [pdf, ps, other]
Title: SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2512.08700 [pdf, ps, other]
Title: Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth
Comments: Accepted at IEEE Robotics and Automation Letters (RA-L) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2512.08697 [pdf, ps, other]
Title: What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2512.08673 [pdf, ps, other]
Title: Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds
Comments: 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.08648 [pdf, ps, other]
Title: Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Comments: 19 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.08647 [pdf, ps, other]
Title: C-DIRA: Computationally Efficient Dynamic ROI Routing and Domain-Invariant Adversarial Learning for Lightweight Driver Behavior Recognition
Authors: Keito Inoshita
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2512.08645 [pdf, ps, other]
Title: Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation
Comments: 19 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2512.08639 [pdf, ps, other]
Title: Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning
Comments: Under Review, 12 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.08627 [pdf, ps, other]
Title: Trajectory Densification and Depth from Perspective-based Blur
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.08625 [pdf, ps, other]
Title: OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics
Comments: 8 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2512.08606 [pdf, ps, other]
Title: Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning
Comments: 14 pages, 8 figures, Association for the Advancement of Artificial Intelligence (AAAI2026, poster)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[170]  arXiv:2512.08589 [pdf, ps, other]
Title: Automated Pollen Recognition in Optical and Holographic Microscopy Images
Comments: 08 pages, 10 figures, 04 tables, 20 references. Date of Conference: 13-14 June 2025 Date Added to IEEE Xplore: 10 July 2025 Electronic ISBN: 979-8-3315-0969-9 Print on Demand(PoD) ISBN: 979-8-3315-0970-5 DOI: 10.1109/AICCONF64766.2025.11064260 Conference Location: Prague, Czech Republic Online Access: this https URL
Journal-ref: 2025 3rd Cognitive Models and Artificial Intelligence Conference (AICCONF), vol. 1, no. 1, pp. 1-8, Prague, Czech Republic, IEEE, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.08577 [pdf, ps, other]
Title: Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[172]  arXiv:2512.08572 [pdf, ps, other]
Title: From Cells to Survival: Hierarchical Analysis of Cell Inter-Relations in Multiplex Microscopy for Lung Cancer Prognosis
Comments: 5 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2512.08569 [pdf, ps, other]
Title: Instance-Aware Test-Time Segmentation for Continual Domain Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2512.08564 [pdf, ps, other]
Title: Modular Neural Image Signal Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.08560 [pdf, ps, other]
Title: BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2512.08557 [pdf, ps, other]
Title: SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds
Comments: 22 Pages, 26 Figures, This work has been submitted to the IEEE Sensors Journal for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.08547 [pdf, ps, other]
Title: An Iteration-Free Fixed-Point Estimator for Diffusion Inversion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.08542 [pdf, ps, other]
Title: A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[179]  arXiv:2512.08537 [pdf, ps, other]
Title: Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 754 entries: 1-100 | 80-179 | 180-279 | 280-379 | 380-479 | ... | 680-754 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)