Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 384

Wed, 3 Dec 2025 (continued, showing 50 of 141 entries)

[385]  arXiv:2512.02932 [pdf, ps, other]
Title: EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[386]  arXiv:2512.02931 [pdf, ps, other]
Title: DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation
Comments: 23 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2512.02906 [pdf, ps, other]
Title: MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[388]  arXiv:2512.02899 [pdf, ps, other]
Title: Glance: Accelerating Diffusion Models with 1 Sample
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2512.02897 [pdf, ps, other]
Title: Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models
Comments: 13 pages, 5 figures, 2 tables. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[390]  arXiv:2512.02895 [pdf, ps, other]
Title: MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm
Comments: 33 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2512.02870 [pdf, ps, other]
Title: Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Comments: 11 pages, 4 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2512.02867 [pdf, ps, other]
Title: MICCAI STSR 2025 Challenge: Semi-Supervised Teeth and Pulp Segmentation and CBCT-IOS Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2512.02860 [pdf, ps, other]
Title: RFOP: Rethinking Fusion and Orthogonal Projection for Face-Voice Association
Comments: Ranked 3rd in Fame 2026 Challenge, ICASSP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394]  arXiv:2512.02850 [pdf, ps, other]
Title: Are Detectors Fair to Indian IP-AIGC? A Cross-Generator Study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[395]  arXiv:2512.02846 [pdf, ps, other]
Title: Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?
Comments: Accepted in WACV 2026 - Applications Track
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396]  arXiv:2512.02835 [pdf, ps, other]
Title: ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[397]  arXiv:2512.02830 [pdf, ps, other]
Title: Defense That Attacks: How Robust Models Become Better Attackers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398]  arXiv:2512.02794 [pdf, ps, other]
Title: PhyCustom: Towards Realistic Physical Customization in Text-to-Image Generation
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2512.02793 [pdf, ps, other]
Title: IC-World: In-Context Generation for Shared World Modeling
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2512.02792 [pdf, ps, other]
Title: HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval
Comments: Accepted by ACM MM 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[401]  arXiv:2512.02790 [pdf, ps, other]
Title: UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
Comments: 31 pages, 15 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2512.02789 [pdf, ps, other]
Title: TrackNetV5: Residual-Driven Spatio-Temporal Refinement and Motion Direction Decoupling for Fast Object Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403]  arXiv:2512.02781 [pdf, ps, other]
Title: LumiX: Structured and Coherent Text-to-Intrinsic Generation
Comments: The code will be available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[404]  arXiv:2512.02780 [pdf, ps, other]
Title: Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset
Comments: 12 pages, 15 figures. Accepted to AAAI-26 (Main Technical Track)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2512.02751 [pdf, ps, other]
Title: AttMetNet: Attention-Enhanced Deep Neural Network for Methane Plume Detection in Sentinel-2 Satellite Imagery
Comments: 15 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2512.02743 [pdf, ps, other]
Title: Reasoning-Aware Multimodal Fusion for Hateful Video Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[407]  arXiv:2512.02737 [pdf, ps, other]
Title: Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[408]  arXiv:2512.02727 [pdf, ps, other]
Title: DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Comments: Accepted to WACV 2026. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[409]  arXiv:2512.02715 [pdf, ps, other]
Title: GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2512.02702 [pdf, ps, other]
Title: Tissue-mask supported inter-subject whole-body image registration in the UK Biobank -- A method benchmarking study
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2512.02700 [pdf, ps, other]
Title: VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[412]  arXiv:2512.02697 [pdf, ps, other]
Title: GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2512.02696 [pdf, ps, other]
Title: ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection
Comments: Submitted to ICASSP 2026 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[414]  arXiv:2512.02686 [pdf, ps, other]
Title: ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data
Authors: Yuxing Liu, Yong Liu
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415]  arXiv:2512.02685 [pdf, ps, other]
Title: Unsupervised Structural Scene Decomposition via Foreground-Aware Slot Attention with Pseudo-Mask Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416]  arXiv:2512.02681 [pdf, ps, other]
Title: PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2512.02668 [pdf, ps, other]
Title: UAUTrack: Towards Unified Multimodal Anti-UAV Visual Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418]  arXiv:2512.02664 [pdf, ps, other]
Title: PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2512.02660 [pdf, ps, other]
Title: Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation
Comments: 13 pages, 1 figure, 2 tables. Open-source implementation available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[420]  arXiv:2512.02650 [pdf, ps, other]
Title: Hear What Matters! Text-conditioned Selective Video-to-Audio Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[421]  arXiv:2512.02648 [pdf, ps, other]
Title: PoreTrack3D: A Benchmark for Dynamic 3D Gaussian Splatting in Pore-Scale Facial Trajectory Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422]  arXiv:2512.02643 [pdf, ps, other]
Title: Leveraging Large-Scale Pretrained Spatial-Spectral Priors for General Zero-Shot Pansharpening
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423]  arXiv:2512.02624 [pdf, ps, other]
Title: PPTBench: Towards Holistic Evaluation of Large Language Models for PowerPoint Layout and Design Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[424]  arXiv:2512.02622 [pdf, ps, other]
Title: RULER-Bench: Probing Rule-based Reasoning Abilities of Next-level Video Generation Models for Vision Foundation Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425]  arXiv:2512.02621 [pdf, ps, other]
Title: Content-Aware Texturing for Gaussian Splatting
Comments: Project Page: this https URL
Journal-ref: Eurographics Symposium on Rendering (Symposium Track), 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[426]  arXiv:2512.02576 [pdf, ps, other]
Title: Co-speech Gesture Video Generation via Motion-Based Graph Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427]  arXiv:2512.02566 [pdf, ps, other]
Title: From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[428]  arXiv:2512.02554 [pdf, ps, other]
Title: OmniPerson: Unified Identity-Preserving Pedestrian Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2512.02541 [pdf, ps, other]
Title: AVGGT: Rethinking Global Attention for Accelerating VGGT
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2512.02536 [pdf, ps, other]
Title: WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431]  arXiv:2512.02520 [pdf, ps, other]
Title: On the Problem of Consistent Anomalies in Zero-Shot Anomaly Detection
Authors: Tai Le-Gia
Comments: PhD Dissertation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[432]  arXiv:2512.02517 [pdf, ps, other]
Title: SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2512.02512 [pdf, ps, other]
Title: Two-Stage Vision Transformer for Image Restoration: Colorization Pretraining + Residual Upsampling
Comments: Accepted as a Tiny Paper at the 13th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 2025), IIT Mandi, India. 3 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2512.02505 [pdf, ps, other]
Title: GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)