We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 399

[ total of 737 entries: 1-50 | ... | 250-299 | 300-349 | 350-399 | 400-449 | 450-499 | 500-549 | 550-599 | ... | 700-737 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 50 of 259 entries)

[400]  arXiv:2512.07738 [pdf, ps, other]
Title: HLTCOE Evaluation Team at TREC 2025: VQA Track
Comments: 7 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401]  arXiv:2512.07733 [pdf, ps, other]
Title: SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2512.07730 [pdf, ps, other]
Title: SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[403]  arXiv:2512.07729 [pdf, ps, other]
Title: Improving action classification with brain-inspired deep networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[404]  arXiv:2512.07720 [pdf, ps, other]
Title: ViSA: 3D-Aware Video Shading for Real-Time Upper-Body Avatar Creation
Comments: Project page: \url{this https URL}
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2512.07712 [pdf, ps, other]
Title: UnCageNet: Tracking and Pose Estimation of Caged Animal
Comments: 9 pages, 2 figures, 2 tables. Accepted to the Indian Conference on Computer Vision, Graphics, and Image Processing (ICVGIP 2025), Mandi, India
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2512.07703 [pdf, ps, other]
Title: PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[407]  arXiv:2512.07702 [pdf, ps, other]
Title: Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[408]  arXiv:2512.07698 [pdf, ps, other]
Title: sim2art: Accurate Articulated Object Modeling from a Single Video using Synthetic Training Data Only
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[409]  arXiv:2512.07674 [pdf, ps, other]
Title: DIST-CLIP: Arbitrary Metadata and Image Guided MRI Harmonization via Disentangled Anatomy-Contrast Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[410]  arXiv:2512.07668 [pdf, ps, other]
Title: EgoCampus: Egocentric Pedestrian Eye Gaze Model and Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2512.07661 [pdf, ps, other]
Title: Optimization-Guided Diffusion for Interactive Scene Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2512.07652 [pdf, ps, other]
Title: An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[413]  arXiv:2512.07651 [pdf, ps, other]
Title: Liver Fibrosis Quantification and Analysis: The LiQA Dataset and Baseline Method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414]  arXiv:2512.07628 [pdf, ps, other]
Title: MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415]  arXiv:2512.07606 [pdf, ps, other]
Title: Decomposition Sampling for Efficient Region Annotations in Active Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416]  arXiv:2512.07599 [pdf, ps, other]
Title: Online Segment Any 3D Thing as Instance Tracking
Comments: NeurIPS 2025, Code is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2512.07596 [pdf, ps, other]
Title: More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[418]  arXiv:2512.07590 [pdf, ps, other]
Title: Robust Variational Model Based Tailored UNet: Leveraging Edge Detector and Mean Curvature for Improved Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2512.07584 [pdf, ps, other]
Title: LongCat-Image Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2512.07580 [pdf, ps, other]
Title: All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421]  arXiv:2512.07568 [pdf, ps, other]
Title: Dual-Stream Cross-Modal Representation Learning via Residual Semantic Decorrelation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[422]  arXiv:2512.07564 [pdf, ps, other]
Title: Toward More Reliable Artificial Intelligence: Reducing Hallucinations in Vision-Language Models
Comments: 24 pages, 3 figures, 2 tables. Training-free self-correction framework for vision-language models. Code and implementation details will be released at: this https URL
Journal-ref: The 4th National and International Academic Conference Celebrating the 20th Anniversary of Rajapruk University (2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[423]  arXiv:2512.07527 [pdf, ps, other]
Title: From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[424]  arXiv:2512.07514 [pdf, ps, other]
Title: MeshRipple: Structured Autoregressive Generation of Artist-Meshes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425]  arXiv:2512.07504 [pdf, ps, other]
Title: ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points
Comments: Accepted to WACV 2026, 8 pages, supplementary included. Dataset and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426]  arXiv:2512.07503 [pdf, ps, other]
Title: SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427]  arXiv:2512.07500 [pdf, ps, other]
Title: MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428]  arXiv:2512.07498 [pdf, ps, other]
Title: Towards Robust DeepFake Detection under Unstable Face Sequences: Adaptive Sparse Graph Embedding with Order-Free Representation and Explicit Laplacian Spectral Prior
Comments: 16 pages (including appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2512.07480 [pdf, ps, other]
Title: Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2512.07469 [pdf, ps, other]
Title: Unified Video Editing with Temporal Reasoner
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431]  arXiv:2512.07426 [pdf, ps, other]
Title: When normalization hallucinates: unseen risks in AI-powered whole slide image processing
Comments: 4 pages, accepted for oral presentation at SPIE Medical Imaging, 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[432]  arXiv:2512.07415 [pdf, ps, other]
Title: Data-driven Exploration of Mobility Interaction Patterns
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[433]  arXiv:2512.07410 [pdf, ps, other]
Title: InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2512.07394 [pdf, ps, other]
Title: Reconstructing Objects along Hand Interaction Timelines in Egocentric Video
Comments: webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435]  arXiv:2512.07391 [pdf, ps, other]
Title: GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436]  arXiv:2512.07385 [pdf, ps, other]
Title: How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2512.07383 [pdf, ps, other]
Title: LogicCBMs: Logic-Enhanced Concept-Based Learning
Comments: 18 pages, 19 figures, WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438]  arXiv:2512.07381 [pdf, ps, other]
Title: Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2512.07379 [pdf, ps, other]
Title: Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency
Comments: 22 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440]  arXiv:2512.07360 [pdf, ps, other]
Title: Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation
Comments: Accepted to WACV2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[441]  arXiv:2512.07351 [pdf, ps, other]
Title: DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD)
[442]  arXiv:2512.07348 [pdf, ps, other]
Title: MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2512.07345 [pdf, ps, other]
Title: Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting
Comments: 15 pages, 8 figures, 5 tables, 2 algorithms, Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444]  arXiv:2512.07338 [pdf, ps, other]
Title: Generalized Referring Expression Segmentation on Aerial Photos
Comments: Submitted to IEEE J-STARS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2512.07331 [pdf, ps, other]
Title: The Inductive Bottleneck: Data-Driven Emergence of Representational Sparsity in Vision Transformers
Authors: Kanishk Awadhiya
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446]  arXiv:2512.07328 [pdf, ps, other]
Title: ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[447]  arXiv:2512.07305 [pdf, ps, other]
Title: Reevaluating Automated Wildlife Species Detection: A Reproducibility Study on a Custom Image Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448]  arXiv:2512.07302 [pdf, ps, other]
Title: Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[449]  arXiv:2512.07276 [pdf, ps, other]
Title: Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery
Comments: Accepted to WACV 2026. Camera-ready-based version with minor edits for readability (no change in the contents)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 737 entries: 1-50 | ... | 250-299 | 300-349 | 350-399 | 400-449 | 450-499 | 500-549 | 550-599 | ... | 700-737 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)