Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

[ total of 754 entries: 1-100 | 37-136 | 137-236 | 237-336 | 337-436 | 437-536 | ... | 737-754 ]

Wed, 10 Dec 2025 (continued, showing 100 of 131 entries)

[137]  arXiv:2512.08930 [pdf, ps, other]
Title: Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[138]  arXiv:2512.08924 [pdf, ps, other]
Title: Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2512.08922 [pdf, ps, other]
Title: Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.08912 [pdf, ps, other]
Title: LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception
Comments: Preprint. 12 pages, 9 figures. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[141]  arXiv:2512.08905 [pdf, ps, other]
Title: Self-Evolving 3D Scene Generation from a Single Image
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2512.08897 [pdf, ps, other]
Title: UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.08889 [pdf, ps, other]
Title: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[144]  arXiv:2512.08888 [pdf, ps, other]
Title: Accelerated Rotation-Invariant Convolution for UAV Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145]  arXiv:2512.08881 [pdf, ps, other]
Title: SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2512.08873 [pdf, ps, other]
Title: Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning
Comments: 6 pages
Journal-ref: 2024 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[147]  arXiv:2512.08860 [pdf, ps, other]
Title: Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference
Authors: Amit Bendkhale
Comments: 6 pages, 3 figures. Code and data: this https URL. Accepted to the AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.08854 [pdf, ps, other]
Title: Generation is Required for Data-Efficient Perception
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149]  arXiv:2512.08829 [pdf, ps, other]
Title: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
Comments: 16 pages, 8 figures, conference or other essential info
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150]  arXiv:2512.08820 [pdf, ps, other]
Title: Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning
Comments: Accepted in IEEE Transactions on Multimedia (TMM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2512.08789 [pdf, ps, other]
Title: MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
Comments: 10 pages, 7 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152]  arXiv:2512.08785 [pdf, ps, other]
Title: LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.08774 [pdf, ps, other]
Title: Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps
Comments: 10 pages, 9 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[154]  arXiv:2512.08765 [pdf, ps, other]
Title: Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Comments: NeurIPS 2025. Code and data available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.08751 [pdf, ps, other]
Title: Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[156]  arXiv:2512.08747 [pdf, ps, other]
Title: A Scalable Pipeline Combining Procedural 3D Graphics and Guided Diffusion for Photorealistic Synthetic Training Data Generation in White Button Mushroom Segmentation
Comments: 20 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.08738 [pdf, ps, other]
Title: Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture
Comments: To appear at AACL-IJCNLP 2025 Workshop WSLP
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[158]  arXiv:2512.08733 [pdf, ps, other]
Title: Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[159]  arXiv:2512.08730 [pdf, ps, other]
Title: SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2512.08700 [pdf, ps, other]
Title: Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth
Comments: Accepted at IEEE Robotics and Automation Letters (RA-L) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2512.08697 [pdf, ps, other]
Title: What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2512.08673 [pdf, ps, other]
Title: Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds
Comments: 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2512.08648 [pdf, ps, other]
Title: Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Comments: 19 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.08647 [pdf, ps, other]
Title: C-DIRA: Computationally Efficient Dynamic ROI Routing and Domain-Invariant Adversarial Learning for Lightweight Driver Behavior Recognition
Authors: Keito Inoshita
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2512.08645 [pdf, ps, other]
Title: Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation
Comments: 19 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2512.08639 [pdf, ps, other]
Title: Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning
Comments: Under Review, 12 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[167]  arXiv:2512.08627 [pdf, ps, other]
Title: Trajectory Densification and Depth from Perspective-based Blur
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.08625 [pdf, ps, other]
Title: OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics
Comments: 8 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2512.08606 [pdf, ps, other]
Title: Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning
Comments: 14 pages, 8 figures, Association for the Advancement of Artificial Intelligence (AAAI2026, poster)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[170]  arXiv:2512.08589 [pdf, ps, other]
Title: Automated Pollen Recognition in Optical and Holographic Microscopy Images
Comments: 08 pages, 10 figures, 04 tables, 20 references. Date of Conference: 13-14 June 2025 Date Added to IEEE Xplore: 10 July 2025 Electronic ISBN: 979-8-3315-0969-9 Print on Demand(PoD) ISBN: 979-8-3315-0970-5 DOI: 10.1109/AICCONF64766.2025.11064260 Conference Location: Prague, Czech Republic Online Access: this https URL
Journal-ref: 2025 3rd Cognitive Models and Artificial Intelligence Conference (AICCONF), vol. 1, no. 1, pp. 1-8, Prague, Czech Republic, IEEE, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.08577 [pdf, ps, other]
Title: Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[172]  arXiv:2512.08572 [pdf, ps, other]
Title: From Cells to Survival: Hierarchical Analysis of Cell Inter-Relations in Multiplex Microscopy for Lung Cancer Prognosis
Comments: 5 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2512.08569 [pdf, ps, other]
Title: Instance-Aware Test-Time Segmentation for Continual Domain Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2512.08564 [pdf, ps, other]
Title: Modular Neural Image Signal Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2512.08560 [pdf, ps, other]
Title: BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2512.08557 [pdf, ps, other]
Title: SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds
Comments: 22 Pages, 26 Figures, This work has been submitted to the IEEE Sensors Journal for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.08547 [pdf, ps, other]
Title: An Iteration-Free Fixed-Point Estimator for Diffusion Inversion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.08542 [pdf, ps, other]
Title: A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[179]  arXiv:2512.08537 [pdf, ps, other]
Title: Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2512.08535 [pdf, ps, other]
Title: Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2512.08534 [pdf, ps, other]
Title: PaintFlow: A Unified Framework for Interactive Oil Paintings Editing and Generation
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2512.08529 [pdf, ps, other]
Title: MVP: Multiple View Prediction Improves GUI Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2512.08524 [pdf, ps, other]
Title: Beyond Real Weights: Hypercomplex Representations for Stable Quantization
Comments: Accepted in Winter Conference on Applications of Computer Vision (WACV) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[184]  arXiv:2512.08511 [pdf, ps, other]
Title: Thinking with Images via Self-Calling Agent
Comments: Code will be released at this https URL soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2512.08506 [pdf, ps, other]
Title: OCCDiff: Occupancy Diffusion Model for High-Fidelity 3D Building Reconstruction from Noisy Point Clouds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2512.08505 [pdf, ps, other]
Title: Beyond the Noise: Aligning Prompts with Latent Representations in Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187]  arXiv:2512.08503 [pdf, ps, other]
Title: Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[188]  arXiv:2512.08498 [pdf, ps, other]
Title: On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2512.08486 [pdf, ps, other]
Title: Temporal Concept Dynamics in Diffusion Models via Prompt-Conditioned Interventions
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.08478 [pdf, ps, other]
Title: Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[191]  arXiv:2512.08477 [pdf, ps, other]
Title: ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192]  arXiv:2512.08467 [pdf, ps, other]
Title: Team-Aware Football Player Tracking with SAM: An Appearance-Based Approach to Occlusion Recovery
Comments: 8 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2512.08445 [pdf, ps, other]
Title: Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[194]  arXiv:2512.08441 [pdf, ps, other]
Title: Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2512.08439 [pdf, ps, other]
Title: LapFM: A Laparoscopic Segmentation Foundation Model via Hierarchical Concept Evolving Pre-training
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2512.08430 [pdf, ps, other]
Title: SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking
Comments: Accepted to WACV 2026. Preprint version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[197]  arXiv:2512.08410 [pdf, ps, other]
Title: Towards Effective and Efficient Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2512.08406 [pdf, ps, other]
Title: SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.08400 [pdf, ps, other]
Title: Towards Visual Re-Identification of Fish using Fine-Grained Classification for Electronic Monitoring in Fisheries
Comments: The paper has been accepted for publication at Northern Lights Deep Learning (NLDL) Conference 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2512.08397 [pdf, ps, other]
Title: Detection of Digital Facial Retouching utilizing Face Beauty Information
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2512.08378 [pdf, ps, other]
Title: Simultaneous Enhancement and Noise Suppression under Complex Illumination Conditions
Comments: The paper has been accepted and officially published by IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202]  arXiv:2512.08374 [pdf, ps, other]
Title: The Unseen Bias: How Norm Discrepancy in Pre-Norm MLLMs Leads to Visual Information Loss
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2512.08362 [pdf, ps, other]
Title: SCU-CGAN: Enhancing Fire Detection through Synthetic Fire Image Generation and Dataset Augmentation
Comments: Accepted for main track at MobiSec 2024 (not published in the proceedings)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2512.08358 [pdf, ps, other]
Title: TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels
Comments: Accepted by NeurIPS 2025. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2512.08337 [pdf, ps, other]
Title: DINO-BOLDNet: A DINOv3-Guided Multi-Slice Attention Network for T1-to-BOLD Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206]  arXiv:2512.08334 [pdf, ps, other]
Title: HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[207]  arXiv:2512.08331 [pdf, ps, other]
Title: Bi^2MAC: Bimodal Bi-Adaptive Mask-Aware Convolution for Remote Sensing Pansharpening
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208]  arXiv:2512.08330 [pdf, ps, other]
Title: PointDico: Contrastive 3D Representation Learning Guided by Diffusion Models
Comments: Accepted by IJCNN 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[209]  arXiv:2512.08329 [pdf, ps, other]
Title: Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models
Comments: 32 pages, 17 figures, 1 table, 5 algorithms, preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[210]  arXiv:2512.08327 [pdf, ps, other]
Title: Low Rank Support Quaternion Matrix Machine
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[211]  arXiv:2512.08325 [pdf, ps, other]
Title: GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[212]  arXiv:2512.08323 [pdf, ps, other]
Title: Detecting Dental Landmarks from Intraoral 3D Scans: the 3DTeethLand challenge
Comments: MICCAI 2024, 3DTeethLand, Challenge report, under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[213]  arXiv:2512.08317 [pdf, ps, other]
Title: GeoDM: Geometry-aware Distribution Matching for Dataset Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[214]  arXiv:2512.08309 [pdf, ps, other]
Title: Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation
Authors: Alexander Goslin
Comments: Project website: this https URL. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[215]  arXiv:2512.08294 [pdf, ps, other]
Title: OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[216]  arXiv:2512.08282 [pdf, ps, other]
Title: PAVAS: Physics-Aware Video-to-Audio Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[217]  arXiv:2512.08269 [pdf, ps, other]
Title: EgoX: Egocentric Video Generation from a Single Exocentric Video
Comments: 21 pages, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[218]  arXiv:2512.08262 [pdf, ps, other]
Title: RLCNet: An end-to-end deep learning framework for simultaneous online calibration of LiDAR, RADAR, and Camera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[219]  arXiv:2512.08254 [pdf, ps, other]
Title: SFP: Real-World Scene Recovery Using Spatial and Frequency Priors
Comments: 10 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[220]  arXiv:2512.08253 [pdf, ps, other]
Title: Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[221]  arXiv:2512.08247 [pdf, ps, other]
Title: Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection
Comments: AAAI-26
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[222]  arXiv:2512.08243 [pdf, ps, other]
Title: Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI
Authors: Saeeda Naz, Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering and Applied Sciences (UEAS), Swat, Pakistan)
Comments: 26 Pages, 10 Figures, 4 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[223]  arXiv:2512.08240 [pdf, ps, other]
Title: HybridToken-VLM: Hybrid Token Compression for Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[224]  arXiv:2512.08237 [pdf, ps, other]
Title: FastBEV++: Fast by Algorithm, Deployable by Design
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2512.08229 [pdf, ps, other]
Title: Geometry-Aware Sparse Depth Sampling for High-Fidelity RGB-D Depth Completion in Robotic Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[226]  arXiv:2512.08228 [pdf, ps, other]
Title: MM-CoT: A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[227]  arXiv:2512.08227 [pdf, ps, other]
Title: New VVC profiles targeting Feature Coding for Machines
Comments: Accepted for presentation at ICIP 2025 workshop on Coding for Machines
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228]  arXiv:2512.08223 [pdf, ps, other]
Title: SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2512.08221 [pdf, ps, other]
Title: VisKnow: Constructing Visual Knowledge Base for Object Understanding
Comments: 16 pages, 12 figures, 7 tables. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230]  arXiv:2512.08215 [pdf, ps, other]
Title: Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231]  arXiv:2512.08198 [pdf, ps, other]
Title: Animal Re-Identification on Microcontrollers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2512.08180 [pdf, ps, other]
Title: GeoLoom: High-quality Geometric Diagram Generation from Textual Input
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2512.08163 [pdf, ps, other]
Title: Accuracy Does Not Guarantee Human-Likeness in Monocular Depth Estimators
Comments: 22 pages, 12 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[234]  arXiv:2512.08161 [pdf, ps, other]
Title: Fourier-RWKV: A Multi-State Perception Network for Efficient Image Dehazing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2512.08135 [pdf, ps, other]
Title: CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[236]  arXiv:2512.08075 [pdf, ps, other]
Title: Identification of Deforestation Areas in the Amazon Rainforest Using Change Detection Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)