We gratefully acknowledge support from
the Simons Foundation and member institutions.

Image and Video Processing

Authors and titles for recent submissions

[ total of 71 entries: 1-25 | 26-50 | 51-71 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 5 Feb 2026

[1]  arXiv:2602.04032 [pdf, ps, other]
Title: MS-SCANet: A Multiscale Transformer-Based Architecture with Dual Attention for No-Reference Image Quality Assessment
Comments: Published in ICASSP 2025, 5 pages, 3 figures
Journal-ref: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[2]  arXiv:2602.03998 [pdf, ps, other]
Title: AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing in Computational Pathology
Comments: Under review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[3]  arXiv:2602.03910 [pdf, ps, other]
Title: CONRep: Uncertainty-Aware Vision-Language Report Drafting Using Conformal Prediction
Comments: 17 pages, 3 figures, 3 tables
Subjects: Image and Video Processing (eess.IV)
[4]  arXiv:2602.03887 [pdf, ps, other]
Title: To What Extent Do Token-Level Representations from Pathology Foundation Models Improve Dense Prediction?
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2602.03870 [pdf, ps, other]
Title: DINO-AD: Unsupervised Anomaly Detection with Frozen DINO-V3 Features
Comments: Accepted by ISBI 2026, 4 pages, 2 figures, 3 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2602.04834 (cross-list from physics.optics) [pdf, ps, other]
Title: ConvRML: High-Quality Lensless Imaging with Random Multi-Focal Lenslets
Comments: 28 pages, 11 figures
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV)
[7]  arXiv:2602.04712 (cross-list from cs.CV) [pdf, ps, other]
Title: SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation
Comments: Submitted to 2026 IEEE Radar Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[8]  arXiv:2602.04162 (cross-list from cs.CV) [pdf, ps, other]
Title: Improving 2D Diffusion Models for 3D Medical Imaging with Inter-Slice Consistent Stochasticity
Comments: Accepted by ICLR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Wed, 4 Feb 2026

[9]  arXiv:2602.02798 [pdf, ps, other]
Title: Real-time topology-aware M-mode OCT segmentation for robotic deep anterior lamellar keratoplasty (DALK) guidance
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2602.02795 [pdf, ps, other]
Title: Super-Resolution and Denoising of Corneal B-Scan OCT Imaging Using Diffusion Model Plug-and-Play Priors
Journal-ref: Proceedings of SPIE, Vol. 13865, 13865-67 (2026)
Subjects: Image and Video Processing (eess.IV)
[11]  arXiv:2602.02758 [pdf, ps, other]
Title: Wide-field high-resolution microscopy via high-speed galvo scanning and real-time mosaicking
Subjects: Image and Video Processing (eess.IV)
[12]  arXiv:2602.02755 [pdf, ps, other]
Title: Physics-based generation of multilayer corneal OCT data via Gaussian modeling and MCML for AI-driven diagnostic and surgical guidance applications
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2602.02603 [pdf, ps, other]
Title: EchoJEPA: A Latent Predictive Foundation Model for Echocardiography
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[14]  arXiv:2602.02552 [pdf, ps, other]
Title: Super-résolution non supervisée d'images hyperspectrales de télédétection utilisant un entraînement entièrement synthétique
Comments: in French language
Journal-ref: GRETSI 2025: XXXe Colloque Francophone de Traitement du Signal et des Images, Strasbourg, France, August 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[15]  arXiv:2602.03669 (cross-list from cs.CV) [pdf, ps, other]
Title: Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images
Comments: 14 pages, 9 figures, under review by IEEE T-ITS
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[16]  arXiv:2602.03294 (cross-list from cs.CV) [pdf, ps, other]
Title: LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices
Comments: This article has been accepted for publication in the IEEE Sensors Journal (JSEN)
Journal-ref: IEEE Sensors Journal ( Volume: 26, Issue: 3, 01 February 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[17]  arXiv:2602.03281 (cross-list from physics.app-ph) [pdf, ps, other]
Title: Physics-Based Learning of the Wave Speed Landscape in Complex Media
Comments: 40 pages, 8 figures, 1 table
Subjects: Applied Physics (physics.app-ph); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph); Optics (physics.optics)
[18]  arXiv:2602.03264 (cross-list from cs.CV) [pdf, ps, other]
Title: HypCBC: Domain-Invariant Hyperbolic Cross-Branch Consistency for Generalizable Medical Image Analysis
Comments: Accepted to Transactions on Machine Learning Research (TMLR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[19]  arXiv:2602.02713 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Perfusion Imaging and Single Material Reconstruction in Polychromatic Photon Counting CT
Comments: Code is available at this https URL
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[20]  arXiv:2602.02567 (cross-list from cs.LG) [pdf, ps, other]
Title: IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space
Comments: 9 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[21]  arXiv:2602.02508 (cross-list from cs.IT) [pdf, ps, other]
Title: Precoding-Oriented CSI Feedback Design with Mutual Information Regularized VQ-VAE
Comments: 5 pages, submitted to IEEE VTC conference
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Tue, 3 Feb 2026 (showing first 4 of 28 entries)

[22]  arXiv:2602.02031 [pdf, ps, other]
Title: Edge-Aligned Initialization of Kernels for Steered Mixture-of-Experts
Subjects: Image and Video Processing (eess.IV)
[23]  arXiv:2602.01681 [pdf, ps, other]
Title: Hyperspectral Image Fusion with Spectral-Band and Fusion-Scale Agnosticism
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[24]  arXiv:2602.01513 [pdf, ps, other]
Title: MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2602.01444 [pdf, ps, other]
Title: A texture-based framework for foundational ultrasound models
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[ total of 71 entries: 1-25 | 26-50 | 51-71 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2602, contact, help  (Access key information)