We gratefully acknowledge support from
the Simons Foundation and member institutions.

Image and Video Processing

Authors and titles for recent submissions

[ total of 48 entries: 1-48 ]
[ showing up to 50 entries per page: fewer | more ]

Fri, 5 Dec 2025

[1]  arXiv:2512.04709 [pdf, ps, other]
Title: Multi Task Denoiser Training for Solving Linear Inverse Problems
Comments: 9 pages, incl. 1 page references. Published at CVMP 2025
Subjects: Image and Video Processing (eess.IV)
[2]  arXiv:2512.04586 [pdf, ps, other]
Title: Structure-Aware Adaptive Kernel MPPCA Denoising for Diffusion MRI
Subjects: Image and Video Processing (eess.IV)
[3]  arXiv:2512.05114 (cross-list from cs.LG) [pdf, ps, other]
Title: Deep infant brain segmentation from multi-contrast MRI
Comments: 8 pages, 8 figures, 1 table, website at this https URL, presented at the 2025 IEEE Asilomar Conference on Signals, Systems, and Computers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Thu, 4 Dec 2025

[4]  arXiv:2512.03962 [pdf, ps, other]
Title: Tada-DIP: Input-adaptive Deep Image Prior for One-shot 3D Image Reconstruction
Comments: 6 pages, 8 figures, 2025 Asilomar Conference on Signals, Systems, and Computers. Code is available at github.com/evanbell02/Tada-DIP/
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[5]  arXiv:2512.03752 [pdf, ps, other]
Title: A BTR-Based Approach for Detection of Infrared Small Targets
Authors: Ke-Xin Li
Subjects: Image and Video Processing (eess.IV)
[6]  arXiv:2512.03202 [pdf, ps, other]
Title: Quality assurance of the Federal Interagency Traumatic Brain Injury Research (FITBIR) MRI database to enable integrated multi-site analysis
Comments: 4 pages, 4 figures. This work has been submitted to the IEEE for possible publication
Subjects: Image and Video Processing (eess.IV)
[7]  arXiv:2512.03196 [pdf, ps, other]
Title: Ultra-Strong Gradient Diffusion MRI with Self-Supervised Learning for Prostate Cancer Characterization
Comments: 24 pages, 17 figures, 7 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8]  arXiv:2512.04019 (cross-list from cs.CV) [pdf, ps, other]
Title: Ultra-lightweight Neural Video Representation Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[9]  arXiv:2512.03539 (cross-list from physics.optics) [pdf, ps, other]
Title: Real-Time Control and Automation Framework for Acousto-Holographic Microscopy
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[10]  arXiv:2512.03216 (cross-list from physics.ins-det) [pdf, ps, other]
Title: Kaleidoscopic Scintillation Event Imaging
Subjects: Instrumentation and Detectors (physics.ins-det); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Wed, 3 Dec 2025

[11]  arXiv:2512.02917 [pdf, ps, other]
Title: Maintaining SUV Accuracy in Low-Count PET with PETfectior: A Deep Learning Denoising Solution
Subjects: Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[12]  arXiv:2512.02091 [pdf, ps, other]
Title: TT-Stack: A Transformer-Based Tiered-Stacking Ensemble Framework with Meta-Learning for Automated Breast Cancer Detection in Mammography
Comments: This paper contains 15 pages with 23 figures and 4 tables. This Paper is already accepted in IEEE Computational Intelligence Magazine (CIM)
Journal-ref: IEEE Computational Intelligence Magazine (CIM) in 19th July 2025
Subjects: Image and Video Processing (eess.IV)
[13]  arXiv:2512.02088 [pdf, ps, other]
Title: Comparing Baseline and Day-1 Diffusion MRI Using Multimodal Deep Embeddings for Stroke Outcome Prediction
Comments: 5 pages, 5 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[14]  arXiv:2512.02759 (cross-list from eess.AS) [pdf, ps, other]
Title: Towards Language-Independent Face-Voice Association with Multimodal Foundation Models
Comments: This paper presents the system description of the UZH-CL team for the FAME2026 Challenge at ICASSP 2026. Our model achieved second place in the final ranking
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Image and Video Processing (eess.IV)
[15]  arXiv:2512.02268 (cross-list from cs.CV) [pdf, ps, other]
Title: Spatiotemporal Pyramid Flow Matching for Climate Emulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[16]  arXiv:2512.02066 (cross-list from quant-ph) [pdf, ps, other]
Title: Parallel Multi-Circuit Quantum Feature Fusion in Hybrid Quantum-Classical Convolutional Neural Networks for Breast Tumor Classification
Authors: Ece Yurtseven
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)

Tue, 2 Dec 2025

[17]  arXiv:2512.01913 [pdf, ps, other]
Title: Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies
Comments: Submitted to Medical Image Analysis. Journal Extension of arXiv:2407.19274
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2512.01135 [pdf, ps, other]
Title: Diffusion-Based Synthesis of 3D T1w MPRAGE Images from Multi-Echo GRE with Multi-Parametric MRI Integration
Subjects: Image and Video Processing (eess.IV)
[19]  arXiv:2512.00488 [pdf, ps, other]
Title: Large-field-of-view lensless imaging with miniaturized sensors
Subjects: Image and Video Processing (eess.IV)
[20]  arXiv:2512.00350 [pdf, ps, other]
Title: MedCondDiff: Lightweight, Robust, Semantically Guided Diffusion for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[21]  arXiv:2512.00271 [pdf, ps, other]
Title: Comparative Evaluation of Generative AI Models for Chest Radiograph Report Generation in the Emergency Department
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[22]  arXiv:2512.01702 (cross-list from cs.LG) [pdf, ps, other]
Title: A unified framework for geometry-independent operator learning in cardiac electrophysiology simulations
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[23]  arXiv:2512.01567 (cross-list from eess.SP) [pdf, ps, other]
Title: In-Context Learning for Deep Joint Source-Channel Coding Over MIMO Channels
Subjects: Signal Processing (eess.SP); Image and Video Processing (eess.IV)
[24]  arXiv:2512.00396 (cross-list from cs.LG) [pdf, ps, other]
Title: Time-Series at the Edge: Tiny Separable CNNs for Wearable Gait Detection and Optimal Sensor Placement
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[25]  arXiv:2512.00229 (cross-list from cs.LG) [pdf, ps, other]
Title: TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[26]  arXiv:2512.00203 (cross-list from stat.AP) [pdf, ps, other]
Title: Beyond Expected Goals: A Probabilistic Framework for Shot Occurrences in Soccer
Comments: 18pp main + 3pp appendix; 8 figures, 12 tables. Submitted to the Journal of Quantitative Analysis in Sports (JQAS). Data proprietary to Gradient Sports; we share derived features & scripts (code under MIT/Apache-2.0). Preprint licensed CC BY 4.0
Subjects: Applications (stat.AP); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[27]  arXiv:2512.00194 (cross-list from cs.CV) [pdf, ps, other]
Title: AutocleanEEG ICVision: Automated ICA Artifact Classification Using Vision-Language AI
Comments: 6 pages, 8 figures
Journal-ref: Conference ICMI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[28]  arXiv:2512.00191 (cross-list from cs.LG) [pdf, ps, other]
Title: Hybrid Context-Fusion Attention (CFA) U-Net and Clustering for Robust Seismic Horizon Interpretation
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Geophysics (physics.geo-ph)
[29]  arXiv:2512.00179 (cross-list from cs.CV) [pdf, ps, other]
Title: Efficient Edge-Compatible CNN for Speckle-Based Material Recognition in Laser Cutting Systems
Authors: Mohamed Abdallah Salem (North Dakota State University), Nourhan Zein Diab (New Mansoura University)
Comments: Copyright 2025 IEEE. This is the author's version of the work that has been Accepted for publication in the Proceedings of the 2025 IEEE The 35th International Conference on Computer Theory and Applications (ICCTA 2025). Final published version will be available on IEEE Xplore
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[30]  arXiv:2512.00138 (cross-list from cs.AR) [pdf, ps, other]
Title: Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS
Comments: 6 pages.12 figures & 2 table
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[31]  arXiv:2512.00117 (cross-list from cs.CV) [pdf, ps, other]
Title: TinyViT: Field Deployable Transformer Pipeline for Solar Panel Surface Fault and Severity Screening
Comments: 3pages, 2figures,ICGVIP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[32]  arXiv:2512.00075 (cross-list from cs.CV) [pdf, ps, other]
Title: Adapter Shield: A Unified Framework with Built-in Authentication for Preventing Unauthorized Zero-Shot Image-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[33]  arXiv:2512.00070 (cross-list from cs.AR) [pdf, ps, other]
Title: A CNN-Based Technique to Assist Layout-to-Generator Conversion for Analog Circuits
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)

Mon, 1 Dec 2025

[34]  arXiv:2511.23251 [pdf, ps, other]
Title: Deep Learning for Restoring MPI System Matrices Using Simulated Training Data
Subjects: Image and Video Processing (eess.IV)
[35]  arXiv:2511.22911 [pdf, ps, other]
Title: MICCAI STS 2024 Challenge: Semi-Supervised Instance-Level Tooth Segmentation in Panoramic X-ray and CBCT Images
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2511.22890 [pdf, ps, other]
Title: Two-Dimensional Tomographic Reconstruction From Projections With Unknown Angles and Unknown Spatial Shifts
Comments: 5 pages, 2 figures, 1 table, submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Subjects: Image and Video Processing (eess.IV)
[37]  arXiv:2511.22859 [pdf, ps, other]
Title: TokCom-UEP: Semantic Importance-Matched Unequal Error Protection for Resilient Image Transmission
Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR)
[38]  arXiv:2511.22606 [pdf, ps, other]
Title: Hard Spatial Gating for Precision-Driven Brain Metastasis Segmentation: Addressing the Over-Segmentation Paradox in Deep Attention Networks
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2511.22327 [pdf, ps, other]
Title: Content Adaptive Encoding For Interactive Game Streaming
Comments: 5 pages
Journal-ref: Picture Coding Symposium 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2511.22250 [pdf, ps, other]
Title: ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2511.22094 [pdf, ps, other]
Title: GACELLE: GPU-accelerated tools for model parameter estimation and image reconstruction
Authors: Kwok-Shing Chan (1 and 2), Hansol Lee (1 and 2), Yixin Ma (1 and 2), Berkin Bilgic (1 and 2), Susie Y. Huang (1 and 2), Hong-Hsi Lee (1 and 2), José P. Marques (3) ((1) Department of Radiology, Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA, United States, (2) Harvard Medical School, Boston, MA, United States, (3) Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[42]  arXiv:2511.22001 [pdf, ps, other]
Title: When Do Domain-Specific Foundation Models Justify Their Cost? A Systematic Evaluation Across Retinal Imaging Tasks
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2511.21985 [pdf, ps, other]
Title: Digital Elevation Model Estimation from RGB Satellite Imagery using Generative Deep Learning
Comments: 5 pages, 4 figures, accepted at IGARSS 2025 conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[44]  arXiv:2511.21926 [pdf, ps, other]
Title: Comparing SAM 2 and SAM 3 for Zero-Shot Segmentation of 3D Medical Data
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2511.21775 [pdf, ps, other]
Title: Attention-Guided Fair AI Modeling for Skin Cancer Diagnosis
Subjects: Image and Video Processing (eess.IV)
[46]  arXiv:2511.21767 [pdf, ps, other]
Title: LAYER: A Quantitative Explainable AI Framework for Decoding Tissue-Layer Drivers of Myofascial Low Back Pain
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[47]  arXiv:2511.22745 (cross-list from math.OC) [pdf, ps, other]
Title: A lasso-alternative to Dijkstra's algorithm for identifying short paths in networks
Comments: 25 pages, 7 figures
Subjects: Optimization and Control (math.OC); Distributed, Parallel, and Cluster Computing (cs.DC); Social and Information Networks (cs.SI); Image and Video Processing (eess.IV)
[48]  arXiv:2511.22046 (cross-list from cs.NI) [pdf, ps, other]
Title: AutoRec: Accelerating Loss Recovery for Live Streaming in a Multi-Supplier Market
Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[ total of 48 entries: 1-48 ]
[ showing up to 50 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2512, contact, help  (Access key information)