Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 88

[ total of 737 entries: 1-100 | 89-188 | 189-288 | 289-388 | 389-488 | ... | 689-737 ]
[ showing 100 entries per page: fewer | more | all ]

Fri, 12 Dec 2025 (continued, showing last 30 of 118 entries)

[89]  arXiv:2512.10262 [pdf, ps, other]
Title: VLM-NCD: Novel Class Discovery with Vision-Based Large Language Models
Comments: 8 pages, 5 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.10252 [pdf, ps, other]
Title: GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule
Comments: Accepted to ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2512.10251 [pdf, ps, other]
Title: THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2512.10248 [pdf, ps, other]
Title: RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[93]  arXiv:2512.10244 [pdf, ps, other]
Title: Solving Semi-Supervised Few-Shot Learning from an Auto-Annotation Perspective
Comments: website and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94]  arXiv:2512.10237 [pdf, ps, other]
Title: Multi-dimensional Preference Alignment by Conditioning Reward Itself
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2512.10230 [pdf, ps, other]
Title: Emerging Standards for Machine-to-Machine Video Coding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2512.10226 [pdf, ps, other]
Title: Latent Chain-of-Thought World Modeling for End-to-End Driving
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[97]  arXiv:2512.10209 [pdf, ps, other]
Title: Feature Coding for Scalable Machine Vision
Comments: This article has been accepted for publication in IEEE Consumer Electronics Magazine
Journal-ref: 2025 IEEE Consumer Electronics Magazine
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2512.10151 [pdf, ps, other]
Title: Topological Conditioning for Mammography Models via a Stable Wavelet-Persistence Vectorization
Comments: 8 Pages, 2 Figures, submitted to IEEE Transactions on Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2512.10102 [pdf, ps, other]
Title: Hierarchical Instance Tracking to Balance Privacy Preservation with Accessible Information
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2512.10095 [pdf, ps, other]
Title: TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2512.10067 [pdf, ps, other]
Title: Independent Density Estimation
Authors: Jiahao Liu
Comments: 10 pages, 1 table, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102]  arXiv:2512.10041 [pdf, ps, other]
Title: MetaVoxel: Joint Diffusion Modeling of Imaging and Clinical Metadata
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[103]  arXiv:2512.10038 [pdf, ps, other]
Title: Diffusion Is Your Friend in Show, Suggest and Tell
Journal-ref: 2025 IEEE International Conference on Big Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[104]  arXiv:2512.10031 [pdf, ps, other]
Title: ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects
Comments: 17 pages, 11 figures, 8 tables, supplementary included. Accepted to CVPR 2025. Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[105]  arXiv:2512.09969 [pdf, ps, other]
Title: Neuromorphic Eye Tracking for Low-Latency Pupil Detection
Comments: 8 pages, 2 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[106]  arXiv:2512.10953 (cross-list from cs.LG) [pdf, ps, other]
Title: Bidirectional Normalizing Flow: From Data to Noise and Back
Comments: Tech report
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[107]  arXiv:2512.10938 (cross-list from cs.LG) [pdf, ps, other]
Title: Stronger Normalization-Free Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2512.10821 (cross-list from cs.AI) [pdf, ps, other]
Title: Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[109]  arXiv:2512.10817 (cross-list from cs.LG) [pdf, ps, other]
Title: Extrapolation of Periodic Functions Using Binary Encoding of Continuous Numerical Values
Comments: Submitted to JMLR, under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[110]  arXiv:2512.10805 (cross-list from cs.LG) [pdf, ps, other]
Title: Interpretable and Steerable Concept Bottleneck Sparse Autoencoders
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2512.10766 (cross-list from cs.CR) [pdf, ps, other]
Title: Metaphor-based Jailbreaking Attacks on Text-to-Image Models
Comments: This paper includes model-generated content that may contain offensive or distressing material
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2512.10691 (cross-list from cs.AI) [pdf, ps, other]
Title: Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning
Comments: 10 pages main text (3 figures, 3 tables), 31 pages in total
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2512.10675 (cross-list from cs.RO) [pdf, ps, other]
Title: Evaluating Gemini Robotics Policies in a Veo World Simulator
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114]  arXiv:2512.10524 (cross-list from cs.LG) [pdf, ps, other]
Title: Mode-Seeking for Inverse Problems with Diffusion Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2512.10319 (cross-list from cs.RO) [pdf, ps, other]
Title: Design of a six wheel suspension and a three-axis linear actuation mechanism for a laser weeding robot
Comments: 15 Pages, 10 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[116]  arXiv:2512.10224 (cross-list from cs.LG) [pdf, ps, other]
Title: Federated Domain Generalization with Latent Space Inversion
Comments: Accepted at ICDM 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2512.09944 (cross-list from cs.AI) [pdf, ps, other]
Title: Echo-CoPilot: A Multi-View, Multi-Task Agent for Echocardiography Interpretation and Reporting
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[118]  arXiv:2510.20875 (cross-list from cs.LG) [pdf, ps, other]
Title: CC-GRMAS: A Multi-Agent Graph Neural System for Spatiotemporal Landslide Risk Assessment in High Mountain Asia
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Thu, 11 Dec 2025 (showing first 70 of 135 entries)

[119]  arXiv:2512.09925 [pdf, ps, other]
Title: GAINS: Gaussian-based Inverse Rendering from Sparse Multi-View Captures
Comments: 23 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2512.09924 [pdf, ps, other]
Title: ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2512.09923 [pdf, ps, other]
Title: Splatent: Splatting Diffusion Latents for Novel View Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2512.09913 [pdf, ps, other]
Title: NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway
Comments: 8 pages, 2 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.09907 [pdf, ps, other]
Title: VisualActBench: Can VLMs See and Act like a Human?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.09874 [pdf, ps, other]
Title: Benchmarking Document Parsers on Mathematical Formula Extraction from PDFs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[125]  arXiv:2512.09871 [pdf, ps, other]
Title: Diffusion Posterior Sampler for Hyperspectral Unmixing with Spectral Variability Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.09867 [pdf, ps, other]
Title: MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI
Comments: Dataset and Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[127]  arXiv:2512.09864 [pdf, ps, other]
Title: UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.09847 [pdf, ps, other]
Title: From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities
Comments: Accepted by WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.09824 [pdf, ps, other]
Title: Composing Concepts from Images and Videos via Concept-prompt Binding
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[130]  arXiv:2512.09814 [pdf, ps, other]
Title: DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.09806 [pdf, ps, other]
Title: CHEM: Estimating and Understanding Hallucinations in Deep Learning for Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[132]  arXiv:2512.09801 [pdf, ps, other]
Title: Modality-Specific Enhancement and Complementary Fusion for Semi-Supervised Multi-Modal Brain Tumor Segmentation
Comments: 9 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.09792 [pdf, ps, other]
Title: FastPose-ViT: A Vision Transformer for Real-Time Spacecraft Pose Estimation
Comments: Accepted to WACV 2026. Preprint version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2512.09773 [pdf, other]
Title: Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts
Authors: Romain Mussard (UNIROUEN), Aurélien Gauffre (UGA), Ihsan Ullah, Thanh Gia Hieu Khuong (TAU, LISN), Massih-Reza Amini (UGA), Isabelle Guyon (TAU, LISN), Lisheng Sun-Hosoya (TAU, LISN)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2512.09700 [pdf, ps, other]
Title: LiM-YOLO: Less is More with Pyramid Level Shift and Normalized Auxiliary Branch for Ship Detection in Optical Remote Sensing Imagery
Comments: 16 pages, 8 figures, 9 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[136]  arXiv:2512.09687 [pdf, ps, other]
Title: Unconsciously Forget: Mitigating Memorization; Without Knowing What is being Memorized
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2512.09670 [pdf, ps, other]
Title: An Automated Tip-and-Cue Framework for Optimized Satellite Tasking and Visual Intelligence
Comments: Under review at IEEE Transactions on Geoscience and Remote Sensing (TGRS). 13 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[138]  arXiv:2512.09665 [pdf, ps, other]
Title: OxEnsemble: Fair Ensembles for Low-Data Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[139]  arXiv:2512.09663 [pdf, ps, other]
Title: IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.09646 [pdf, ps, other]
Title: VHOI: Controllable Video Generation of Human-Object Interactions from Sparse Trajectories via Motion Densification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2512.09644 [pdf, ps, other]
Title: Kaapana: A Comprehensive Open-Source Platform for Integrating AI in Medical Imaging Research Environments
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2512.09633 [pdf, ps, other]
Title: Benchmarking SAM2-based Trackers on FMOX
Journal-ref: 33rd International Conference on Artificial Intelligence and Cognitive Science (AICS 2025), December, 2025, Dublin, Ireland
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.09626 [pdf, ps, other]
Title: Beyond Sequences: A Benchmark for Atomic Hand-Object Interaction Using a Static RNN Encoder
Comments: Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2512.09617 [pdf, ps, other]
Title: FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2512.09616 [pdf, ps, other]
Title: Rethinking Chain-of-Thought Reasoning for Videos
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146]  arXiv:2512.09592 [pdf, ps, other]
Title: CS3D: An Efficient Facial Expression Recognition via Event Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[147]  arXiv:2512.09583 [pdf, ps, other]
Title: UnReflectAnything: RGB-Only Highlight Removal by Rendering Synthetic Specular Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.09580 [pdf, ps, other]
Title: Content-Adaptive Image Retouching Guided by Attribute-Based Text Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2512.09579 [pdf, ps, other]
Title: Hands-on Evaluation of Visual Transformers for Object Recognition and Detection
Journal-ref: 37th International Conference on Tools with Artificial Intelligence (ICTAI 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[150]  arXiv:2512.09576 [pdf, ps, other]
Title: Seeing Soil from Space: Towards Robust and Scalable Remote Soil Nutrient Analysis
Authors: David Seu (1), Nicolas Longepe (2), Gabriel Cioltea (1), Erik Maidik (1), Calin Andrei (1) ((1) CO2 Angels, Cluj-Napoca, Romania, (2) European Space Agency Phi-Lab, Frascati, Italy)
Comments: 23 pages, 13 figures, 13 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[151]  arXiv:2512.09573 [pdf, ps, other]
Title: Investigate the Low-level Visual Perception in Vision-Language based Image Quality Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2512.09565 [pdf, ps, other]
Title: From Graphs to Gates: DNS-HyXNet, A Lightweight and Deployable Sequential Model for Real-Time DNS Tunnel Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.09555 [pdf, ps, other]
Title: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment
Comments: Accepted to the ICONIP (International Conference on Neural Information Processing), 2025
Journal-ref: Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment. In: Taniguchi, T., et al. Neural Information Processing. ICONIP 2025. Lecture Notes in Computer Science, vol 16310. Springer, Singapore
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2512.09546 [pdf, ps, other]
Title: A Dual-Domain Convolutional Network for Hyperspectral Single-Image Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.09525 [pdf, ps, other]
Title: Masked Registration and Autoencoding of CT Images for Predictive Tibia Reconstruction
Comments: DGM4MICCAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2512.09497 [pdf, ps, other]
Title: Gradient-Guided Learning Network for Infrared Small Target Detection
Comments: Accepted by GRSL 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2512.09492 [pdf, ps, other]
Title: StateSpace-SSL: Linear-Time Self-supervised Learning for Plant Disease Detection
Comments: Accepted to AAAI workshop (AgriAI 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2512.09489 [pdf, ps, other]
Title: MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial Images
Comments: 8 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2512.09477 [pdf, ps, other]
Title: Color encoding in Latent Space of Stable Diffusion Models
Comments: 6 pages, 8 figures, Color Imaging Conference 33
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[160]  arXiv:2512.09471 [pdf, ps, other]
Title: Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[161]  arXiv:2512.09463 [pdf, ps, other]
Title: Privacy-Preserving Computer Vision for Industry: Three Case Studies in Human-Centric Manufacturing
Comments: Accepted to the AAAI26 HCM workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[162]  arXiv:2512.09461 [pdf, ps, other]
Title: Cytoplasmic Strings Analysis in Human Embryo Time-Lapse Videos using Deep Learning Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[163]  arXiv:2512.09446 [pdf, ps, other]
Title: Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2512.09441 [pdf, ps, other]
Title: Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[165]  arXiv:2512.09435 [pdf, ps, other]
Title: UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2512.09423 [pdf, ps, other]
Title: FunPhase: A Periodic Functional Autoencoder for Motion Generation via Phase Manifolds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2512.09422 [pdf, ps, other]
Title: InfoMotion: A Graph-Based Approach to Video Dataset Distillation for Echocardiography
Comments: Accepted at MICAD 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2512.09418 [pdf, ps, other]
Title: Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis
Comments: Accepted at MICAD 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2512.09417 [pdf, ps, other]
Title: DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2512.09407 [pdf, ps, other]
Title: Generative Point Cloud Registration
Comments: 14 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2512.09402 [pdf, ps, other]
Title: Wasserstein-Aligned Hyperbolic Multi-View Clustering
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172]  arXiv:2512.09393 [pdf, ps, other]
Title: Detection and Localization of Subdural Hematoma Using Deep Learning on Computed Tomography
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[173]  arXiv:2512.09383 [pdf, ps, other]
Title: Perception-Inspired Color Space Design for Photo White Balance Editing
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2512.09375 [pdf, ps, other]
Title: Log NeRF: Comparing Spaces for Learning Radiance Fields
Authors: Sihe Chen (Northeastern University), Luv Verma (Northeastern University), Bruce A. Maxwell (Northeastern University)
Comments: The 36th British Machine Vision Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[175]  arXiv:2512.09373 [pdf, ps, other]
Title: FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement
Comments: 13 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2512.09364 [pdf, ps, other]
Title: ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.09363 [pdf, ps, other]
Title: StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.09354 [pdf, ps, other]
Title: Video-QTR: Query-Driven Temporal Reasoning Framework for Lightweight Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2512.09350 [pdf, ps, other]
Title: TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2512.09335 [pdf, ps, other]
Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video
Comments: 8 pages, 9 figures, published in ACM MM 2025
Journal-ref: In Proceedings of the 33rd ACM International Conference on Multimedia. 2025. p. 7405-7414
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[181]  arXiv:2512.09327 [pdf, ps, other]
Title: UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[182]  arXiv:2512.09315 [pdf, ps, other]
Title: Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2512.09311 [pdf, ps, other]
Title: Transformer-Driven Multimodal Fusion for Explainable Suspiciousness Estimation in Visual Surveillance
Comments: 12 pages, 10 figures, IEEE Transaction on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[184]  arXiv:2512.09307 [pdf, ps, other]
Title: From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2512.09299 [pdf, ps, other]
Title: VABench: A Comprehensive Benchmark for Audio-Video Generation
Comments: 24 pages, 25 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[186]  arXiv:2512.09296 [pdf, ps, other]
Title: Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving
Authors: Songhan Wu
Comments: 6 pages, 7 figures, 1 table. Accepted to The 2025 IEEE 3rd International Conference on Electrical, Automation and Computer Engineering (ICEACE), 2025. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187]  arXiv:2512.09289 [pdf, ps, other]
Title: MelanomaNet: Explainable Deep Learning for Skin Lesion Classification
Comments: 7 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2512.09282 [pdf, ps, other]
Title: FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)