We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 550

[ total of 778 entries: 1-50 | ... | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[551]  arXiv:2512.01563 [pdf, ps, other]
Title: MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration
Comments: The 14th International Symposium on Information and Communication Technology Conference SoICT 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[552]  arXiv:2512.01540 [pdf, ps, other]
Title: FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention
Authors: Zipeng Wang, Dan Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553]  arXiv:2512.01534 [pdf, ps, other]
Title: Deep Unsupervised Anomaly Detection in Brain Imaging: Large-Scale Benchmarking and Bias Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[554]  arXiv:2512.01533 [pdf, ps, other]
Title: Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555]  arXiv:2512.01519 [pdf, ps, other]
Title: QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci); Quantum Physics (quant-ph)
[556]  arXiv:2512.01510 [pdf, ps, other]
Title: Semantic-aware Random Convolution and Source Matching for Domain Generalization in Medical Image Segmentation
Comments: Preprint submitted to Computer Methods and Programs in Biomedicine (currently under revision)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[557]  arXiv:2512.01495 [pdf, ps, other]
Title: ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558]  arXiv:2512.01494 [pdf, other]
Title: A variational method for curve extraction with curvature-dependent energies
Authors: Majid Arthaud (ENPC, MOKAPLAN, UMich), Antonin Chambolle (CEREMADE, MOKAPLAN), Vincent Duval (MOKAPLAN)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559]  arXiv:2512.01481 [pdf, ps, other]
Title: ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[560]  arXiv:2512.01478 [pdf, ps, other]
Title: CourtMotion: Learning Event-Driven Motion Representations from Skeletal Data for Basketball
Authors: Omer Sela (1 and 2), Michael Chertok (1), Lior Wolf (2) ((1) Amazon, (2) Tel Aviv University)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[561]  arXiv:2512.01444 [pdf, ps, other]
Title: FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation
Comments: 9 pages,4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562]  arXiv:2512.01427 [pdf, ps, other]
Title: Language-Guided Open-World Anomaly Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[563]  arXiv:2512.01426 [pdf, ps, other]
Title: ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564]  arXiv:2512.01424 [pdf, ps, other]
Title: ViRectify: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models
Comments: 22 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[565]  arXiv:2512.01422 [pdf, ps, other]
Title: MDiff4STR: Mask Diffusion Model for Scene Text Recognition
Comments: Accepted by AAAI 2026 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[566]  arXiv:2512.01419 [pdf, ps, other]
Title: Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[567]  arXiv:2512.01390 [pdf, ps, other]
Title: FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Comments: Comments: Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[568]  arXiv:2512.01383 [pdf, ps, other]
Title: PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Comments: Accepted by WACV2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[569]  arXiv:2512.01382 [pdf, ps, other]
Title: Reversible Inversion for Training-Free Exemplar-guided Image Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570]  arXiv:2512.01380 [pdf, ps, other]
Title: Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network
Comments: Accepted by AAAI26
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571]  arXiv:2512.01373 [pdf, ps, other]
Title: SRAM: Shape-Realism Alignment Metric for No Reference 3D Shape Evaluation
Comments: Accepted by AAAI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572]  arXiv:2512.01366 [pdf, ps, other]
Title: BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud
Comments: This is the author-accepted version of the paper published in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Vol. 9, No. 4, Article 191, 2025. Final published version: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[573]  arXiv:2512.01352 [pdf, ps, other]
Title: OpenBox: Annotate Any Bounding Boxes in 3D
Comments: Accepted by NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574]  arXiv:2512.01348 [pdf, ps, other]
Title: Handwritten Text Recognition for Low Resource Languages
Comments: 21 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575]  arXiv:2512.01342 [pdf, ps, other]
Title: InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576]  arXiv:2512.01340 [pdf, ps, other]
Title: EvalTalker: Learning to Evaluate Real-Portrait-Driven Multi-Subject Talking Humans
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577]  arXiv:2512.01334 [pdf, ps, other]
Title: AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578]  arXiv:2512.01333 [pdf, ps, other]
Title: Optimizing Stroke Risk Prediction: A Machine Learning Pipeline Combining ROS-Balanced Ensembles and XAI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[579]  arXiv:2512.01319 [pdf, ps, other]
Title: Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications
Comments: 18 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[580]  arXiv:2512.01315 [pdf, ps, other]
Title: FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection
Comments: 8 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[581]  arXiv:2512.01314 [pdf, ps, other]
Title: TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582]  arXiv:2512.01312 [pdf, ps, other]
Title: IVCR-200K: A Large-Scale Multi-turn Dialogue Benchmark for Interactive Video Corpus Retrieval
Comments: Accepted by SIGIR2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583]  arXiv:2512.01310 [pdf, ps, other]
Title: Lost in Distortion: Uncovering the Domain Gap Between Computer Vision and Brain Imaging - A Study on Pretraining for Age Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[584]  arXiv:2512.01306 [pdf, ps, other]
Title: Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[585]  arXiv:2512.01302 [pdf, ps, other]
Title: DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[586]  arXiv:2512.01298 [pdf, ps, other]
Title: TBT-Former: Learning Temporal Boundary Distributions for Action Localization
Comments: 8 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[587]  arXiv:2512.01296 [pdf, ps, other]
Title: EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly
Comments: SIGGRAPH ASIA 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[588]  arXiv:2512.01292 [pdf, ps, other]
Title: Diffusion Model in Latent Space for Medical Image Segmentation Task
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[589]  arXiv:2512.01291 [pdf, ps, other]
Title: Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI
Comments: Accepted to CVIP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590]  arXiv:2512.01273 [pdf, ps, other]
Title: nnMobileNet++: Towards Efficient Hybrid Networks for Retinal Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591]  arXiv:2512.01268 [pdf, ps, other]
Title: ViscNet: Vision-Based In-line Viscometry for Fluid Mixing Process
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[592]  arXiv:2512.01248 [pdf, ps, other]
Title: TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593]  arXiv:2512.01242 [pdf, ps, other]
Title: Generative Adversarial Gumbel MCTS for Abstract Visual Composition Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[594]  arXiv:2512.01236 [pdf, ps, other]
Title: PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[595]  arXiv:2512.01223 [pdf, ps, other]
Title: S$^2$-MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance
Comments: 18 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[596]  arXiv:2512.01214 [pdf, ps, other]
Title: M4-BLIP: Advancing Multi-Modal Media Manipulation Detection through Face-Enhanced Local Analysis
Comments: 12 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[597]  arXiv:2512.01213 [pdf, ps, other]
Title: Closing the Approximation Gap of Partial AUC Optimization: A Tale of Two Formulations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[598]  arXiv:2512.01204 [pdf, ps, other]
Title: TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[599]  arXiv:2512.01178 [pdf, ps, other]
Title: VSRD++: Autolabeling for 3D Object Detection via Instance-Aware Volumetric Silhouette Rendering
Comments: arXiv admin note: text overlap with arXiv:2404.00149
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[600]  arXiv:2512.01165 [pdf, ps, other]
Title: Real-Time On-the-Go Annotation Framework Using YOLO for Automated Dataset Generation
Authors: Mohamed Abdallah Salem (1), Ahmed Harb Rabia (1) ((1) North Dakota State University)
Comments: Copyright 2025 IEEE. This is the author's version of the work that has been accepted for publication in Proceedings of the 5. Interdisciplinary Conference on Electrics and Computer (INTCEC 2025) 15-16 September 2025, Chicago-USA. The final version of record is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[ total of 778 entries: 1-50 | ... | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)