We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 29

[ total of 603 entries: 1-100 | 30-129 | 130-229 | 230-329 | 330-429 | ... | 530-603 ]
[ showing 100 entries per page: fewer | more | all ]

Thu, 1 Jan 2026 (continued, showing last 95 of 124 entries)

[30]  arXiv:2512.24622 [pdf, ps, other]
Title: FireRescue: A UAV-Based Dataset and Enhanced YOLO Model for Object Detection in Fire Rescue Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2512.24620 [pdf, ps, other]
Title: LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2512.24605 [pdf, ps, other]
Title: MoniRefer: A Real-world Large-scale Multi-modal Dataset based on Roadside Infrastructure for 3D Visual Grounding
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2512.24603 [pdf, ps, other]
Title: Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers
Comments: 13 tables, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2512.24593 [pdf, ps, other]
Title: 3D Semantic Segmentation for Post-Disaster Assessment
Comments: Accepted by the 2025 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[35]  arXiv:2512.24592 [pdf, ps, other]
Title: SliceLens: Fine-Grained and Grounded Error Slice Discovery for Multi-Instance Vision Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2512.24591 [pdf, ps, other]
Title: Improving Few-Shot Change Detection Visual Question Answering via Decision-Ambiguity-guided Reinforcement Fine-Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2512.24561 [pdf, ps, other]
Title: RGBT-Ground Benchmark: Visual Grounding Beyond RGB in Complex Real-World Scenarios
Comments: 27pages, 9figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2512.24552 [pdf, ps, other]
Title: OCP-LS: An Efficient Algorithm for Visual Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[39]  arXiv:2512.24551 [pdf, ps, other]
Title: PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2512.24547 [pdf, ps, other]
Title: Hierarchical Vector-Quantized Latents for Perceptual Low-Resolution Video Compression
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2512.24518 [pdf, ps, other]
Title: Using Large Language Models To Translate Machine Results To Human Results
Comments: 11 pages, 7 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2512.24473 [pdf, ps, other]
Title: F2IDiff: Real-world Image Super-resolution using Feature to Image Diffusion Foundation Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[43]  arXiv:2512.24463 [pdf, ps, other]
Title: Spectral and Spatial Graph Learning for Multispectral Solar Image Compression
Comments: 8 pages, 6 figures 1 table. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[44]  arXiv:2512.24438 [pdf, ps, other]
Title: Exploring Compositionality in Vision Transformers using Wavelet Representations
Comments: 9 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2512.24411 [pdf, ps, other]
Title: AI-Driven Evaluation of Surgical Skill via Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2512.24408 [pdf, ps, other]
Title: DyStream: Streaming Dyadic Talking Heads Generation via Flow Matching-based Autoregressive Model
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2512.24386 [pdf, ps, other]
Title: RedunCut: Measurement-Driven Sampling and Accuracy Performance Modeling for Low-Cost Live Video Analytics
Comments: 21 pages, 23 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[48]  arXiv:2512.24385 [pdf, ps, other]
Title: Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
Comments: Preprint; 38 pages, 7 figures, 9 tables; GitHub at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[49]  arXiv:2512.24340 [pdf, ps, other]
Title: DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[50]  arXiv:2512.24338 [pdf, ps, other]
Title: The Mechanics of CNN Filtering with Rectification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2512.24331 [pdf, ps, other]
Title: Spatial-aware Vision Language Model for Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2512.24330 [pdf, ps, other]
Title: SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2512.24323 [pdf, ps, other]
Title: Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2512.24321 [pdf, ps, other]
Title: UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[55]  arXiv:2512.24294 [pdf, ps, other]
Title: Virtual-Eyes: Quantitative Validation of a Lung CT Quality-Control Pipeline for Foundation-Model Cancer Risk Prediction
Comments: 23 pages, and Under Review-MIDL-2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[56]  arXiv:2512.24278 [pdf, ps, other]
Title: One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[57]  arXiv:2512.24276 [pdf, ps, other]
Title: LiftProj: Space Lifting and Projection-Based Panorama Stitching
Comments: 16 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[58]  arXiv:2512.24271 [pdf, ps, other]
Title: Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59]  arXiv:2512.24260 [pdf, ps, other]
Title: Physically-Grounded Manifold Projection with Foundation Priors for Metal Artifact Reduction in Dental CBCT
Comments: This manuscript has been submitted to Medical Image Analysis for peer review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2512.24243 [pdf, ps, other]
Title: MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2512.24231 [pdf, ps, other]
Title: MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model
Comments: 6 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[62]  arXiv:2512.24227 [pdf, ps, other]
Title: Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63]  arXiv:2512.24224 [pdf, ps, other]
Title: ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2512.24214 [pdf, ps, other]
Title: Medical Image Classification on Imbalanced Data Using ProGAN and SMA-Optimized ResNet: Application to COVID-19
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[65]  arXiv:2512.24195 [pdf, ps, other]
Title: CorGi: Contribution-Guided Block-Wise Interval Caching for Training-Free Acceleration of Diffusion Transformers
Comments: 16 pages, 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2512.24193 [pdf, ps, other]
Title: PointRAFT: 3D deep learning for high-throughput prediction of potato tuber weight from partial point clouds
Comments: 14 pages, 7 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67]  arXiv:2512.24176 [pdf, ps, other]
Title: Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[68]  arXiv:2512.24172 [pdf, ps, other]
Title: Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges
Comments: 10 pages, 4 figures. Technical report extending ACPA 2025 conference paper. Code and data available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69]  arXiv:2512.24165 [pdf, ps, other]
Title: DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2512.24162 [pdf, ps, other]
Title: Bayesian Self-Distillation for Image Classification
Comments: 17 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2512.24160 [pdf, ps, other]
Title: Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2512.24146 [pdf, ps, other]
Title: Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2512.24120 [pdf, ps, other]
Title: Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[74]  arXiv:2512.24119 [pdf, ps, other]
Title: GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2512.24111 [pdf, ps, other]
Title: Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[76]  arXiv:2512.24100 [pdf, ps, other]
Title: Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2512.24097 [pdf, ps, other]
Title: Factorized Learning for Temporally Grounded Video-Language Models
Comments: ICCV 2025 paper. This arXiv version updates Figure 1 to include the concurrent work Qwen2.5-VL to ensure consistency with Table 1
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[78]  arXiv:2512.24086 [pdf, ps, other]
Title: RainFusion2.0: Temporal-Spatial Awareness and Hardware-Efficient Block-wise Sparse Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79]  arXiv:2512.24074 [pdf, ps, other]
Title: Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80]  arXiv:2512.24066 [pdf, ps, other]
Title: Pathology Context Recalibration Network for Ocular Disease Recognition
Comments: The article has been accepted for publication at Machine Intelligence Research (MIR)
Journal-ref: Machine Intelligence Research 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[81]  arXiv:2512.24064 [pdf, ps, other]
Title: Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval
Comments: 9 pages, 4 figures, and AAAI-26 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[82]  arXiv:2512.24035 [pdf, ps, other]
Title: Reinforced Diffusion: Learning to Push the Limits of Anisotropic Diffusion for Image Denoising
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2512.24026 [pdf, ps, other]
Title: PipeFlow: Pipelined Processing and Motion-Aware Frame Selection for Long-Form Video Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[84]  arXiv:2512.24023 [pdf, ps, other]
Title: RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[85]  arXiv:2512.24022 [pdf, ps, other]
Title: FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[86]  arXiv:2512.24018 [pdf, ps, other]
Title: Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87]  arXiv:2512.24016 [pdf, ps, other]
Title: FitControler: Toward Fit-Aware Virtual Try-On
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2512.24015 [pdf, ps, other]
Title: On Exact Editing of Flow-Based Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2512.24013 [pdf, ps, other]
Title: Bridging the Perception-Cognition Gap:Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis
Authors: Hao Wu, Hui Li, Yiyun Su
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.23998 [pdf, ps, other]
Title: Improved 3D Gaussian Splatting of Unknown Spacecraft Structure Using Space Environment Illumination Knowledge
Comments: Presented at 2025 IEEE International Conference on Space Robotics (iSpaRo)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2512.23997 [pdf, ps, other]
Title: Bridging Structure and Appearance: Topological Features for Robust Self-Supervised Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2512.23990 [pdf, ps, other]
Title: GCA-ResUNet: Medical Image Segmentation Using Grouped Coordinate Attention
Authors: Jun Ding, Shang Gao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2512.23986 [pdf, ps, other]
Title: Anomaly detection in satellite imagery through temporal inpainting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[94]  arXiv:2512.23983 [pdf, ps, other]
Title: DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2512.23953 [pdf, ps, other]
Title: T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.23950 [pdf, ps, other]
Title: U-Net-Like Spiking Neural Networks for Single Image Dehazing
Comments: 9 pages, 4 figures. Accepted at IJCNN 2025 (Rome, Italy). To appear in IEEE/IJCNN 2025 proceedings
Journal-ref: 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, pp. 1-9, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2512.23942 [pdf, ps, other]
Title: Kinematic-Based Assessment of Surgical Actions in Microanastomosis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.23938 [pdf, ps, other]
Title: Learnable Query Aggregation with KV Routing for Cross-view Geo-localisation
Comments: 7 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2512.23936 [pdf, ps, other]
Title: MGML: A Plug-and-Play Meta-Guided Multi-Modal Learning Framework for Incomplete Multimodal Brain Tumor Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2512.23920 [pdf, ps, other]
Title: Learning to learn skill assessment for fetal ultrasound scanning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2512.23903 [pdf, ps, other]
Title: Scaling Remote Sensing Foundation Models: Data Domain Tradeoffs at the Peta-Scale
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2512.23894 [pdf, ps, other]
Title: MRI-to-CT Synthesis With Cranial Suture Segmentations Using A Variational Autoencoder Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2512.23860 [pdf, ps, other]
Title: Lifelong Domain Adaptive 3D Human Pose Estimation
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2512.23851 [pdf, ps, other]
Title: Pretraining Frame Preservation in Autoregressive Video Memory Compression
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2512.23819 [pdf, ps, other]
Title: Video-Based Performance Evaluation for ECR Drills in Synthetic Training Environments
Comments: 14 pages, 9 figures, I/ITSEC-2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[106]  arXiv:2512.23786 [pdf, ps, other]
Title: Leveraging Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[107]  arXiv:2512.25034 (cross-list from cs.LG) [pdf, ps, other]
Title: Generative Classifiers Avoid Shortcut Solutions
Comments: ICLR 2025. Code: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[108]  arXiv:2512.24986 (cross-list from cs.GR) [pdf, ps, other]
Title: PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2512.24894 (cross-list from cond-mat.mes-hall) [pdf, ps, other]
Title: Towards autonomous time-calibration of large quantum-dot devices: Detection, real-time feedback, and noise spectroscopy
Comments: 12 pages, 4 figures
Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Quantum Physics (quant-ph)
[110]  arXiv:2512.24766 (cross-list from cs.RO) [pdf, ps, other]
Title: Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
Comments: Project website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2512.24499 (cross-list from cs.CR) [pdf, ps, other]
Title: Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2512.24492 (cross-list from eess.IV) [pdf, ps, other]
Title: Automated Classification of First-Trimester Fetal Heart Views Using Ultrasound-Specific Self-Supervised Learning
Comments: 7 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2512.24404 (cross-list from cs.LG) [pdf, ps, other]
Title: Lifting Vision: Ground to Aerial Localization with Reasoning Guided Planning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2512.24384 (cross-list from cs.RO) [pdf, ps, other]
Title: Geometric Multi-Session Map Merging with Learned Local Descriptors
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.24212 (cross-list from cs.RO) [pdf, ps, other]
Title: RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Contextual Adaptation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.24138 (cross-list from cs.LG) [pdf, ps, other]
Title: GARDO: Reinforcing Diffusion Models without Reward Hacking
Comments: 17 pages. Project: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2512.24117 (cross-list from eess.IV) [pdf, ps, other]
Title: Targeted Semantic Segmentation of Himalayan Glacial Lakes Using Time-Series SAR: Towards Automated GLOF Early Warning
Comments: 12 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.24019 (cross-list from quant-ph) [pdf, ps, other]
Title: One-Shot Structured Pruning of Quantum Neural Networks via $q$-Group Engineering and Quantum Geometric Metrics
Comments: 10 pages, 2 figures
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[119]  arXiv:2512.23906 (cross-list from eess.SP) [pdf, ps, other]
Title: A multimodal Transformer for InSAR-based ground deformation forecasting with cross-site generalization across Europe
Comments: submitted to ISPRS Journal of Photogrammetry and Remote Sensing for review
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120]  arXiv:2512.23864 (cross-list from cs.RO) [pdf, ps, other]
Title: Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2512.23766 (cross-list from cs.LG) [pdf, ps, other]
Title: A Granular Grassmannian Clustering Framework via the Schubert Variety of Best Fit
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[122]  arXiv:2512.23757 (cross-list from eess.IV) [pdf, ps, other]
Title: Leveraging Machine Learning for Early Detection of Lung Diseases
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.23739 (cross-list from cs.CL) [pdf, ps, other]
Title: Break Out the Silverware -- Semantic Understanding of Stored Household Items
Comments: Poster presented at the Israeli Seminar on Computational Linguistics 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[124]  arXiv:2512.23726 (cross-list from physics.med-ph) [pdf, ps, other]
Title: q3-MuPa: Quick, Quiet, Quantitative Multi-Parametric MRI using Physics-Informed Diffusion Models
Subjects: Medical Physics (physics.med-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Tue, 30 Dec 2025 (showing first 5 of 220 entries)

[125]  arXiv:2512.23709 [pdf, ps, other]
Title: Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.23705 [pdf, ps, other]
Title: Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
Comments: Project Page: this https URL; Code: this https URL; Dataset: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2512.23667 [pdf, ps, other]
Title: IDT: A Physically Grounded Transformer for Feed-Forward Multi-View Intrinsic Decomposition
Comments: 10 pages 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.23646 [pdf, ps, other]
Title: OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding
Comments: Website:this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.23635 [pdf, ps, other]
Title: Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception
Comments: Accepted to AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 603 entries: 1-100 | 30-129 | 130-229 | 230-329 | 330-429 | ... | 530-603 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2601, contact, help  (Access key information)