Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 619

[ total of 749 entries: 1-50 | ... | 470-519 | 520-569 | 570-619 | 620-669 | 670-719 | 720-749 ]
[ showing 50 entries per page: fewer | more | all ]

Thu, 4 Dec 2025 (showing first 50 of 130 entries)

[620]  arXiv:2512.04085 [pdf, ps, other]
Title: Unique Lives, Shared World: Learning from Single-Life Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621]  arXiv:2512.04084 [pdf, ps, other]
Title: SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622]  arXiv:2512.04082 [pdf, ps, other]
Title: PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623]  arXiv:2512.04069 [pdf, ps, other]
Title: SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[624]  arXiv:2512.04048 [pdf, ps, other]
Title: Stable Signer: Hierarchical Sign Language Generative Model
Comments: 12 pages, 7 figures. More Demo at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY)
[625]  arXiv:2512.04040 [pdf, ps, other]
Title: RELIC: Interactive Video World Model with Long-Horizon Memory
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626]  arXiv:2512.04039 [pdf, ps, other]
Title: Fast & Efficient Normalizing Flows and Applications of Image Generative Models
Authors: Sandeep Nagar
Comments: PhD Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[627]  arXiv:2512.04025 [pdf, ps, other]
Title: PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[628]  arXiv:2512.04021 [pdf, ps, other]
Title: C3G: Learning Compact 3D Representations with 2K Gaussians
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629]  arXiv:2512.04019 [pdf, ps, other]
Title: Ultra-lightweight Neural Video Representation Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[630]  arXiv:2512.04015 [pdf, ps, other]
Title: Learning Group Actions In Disentangled Latent Image Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631]  arXiv:2512.04012 [pdf, ps, other]
Title: Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[632]  arXiv:2512.04007 [pdf, ps, other]
Title: On the Temporality for Sketch Representation Learning
Comments: Preprint submitted to Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[633]  arXiv:2512.04000 [pdf, ps, other]
Title: Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[634]  arXiv:2512.03996 [pdf, ps, other]
Title: Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[635]  arXiv:2512.03992 [pdf, ps, other]
Title: DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[636]  arXiv:2512.03981 [pdf, ps, other]
Title: DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637]  arXiv:2512.03979 [pdf, ps, other]
Title: BlurDM: A Blur Diffusion Model for Image Deblurring
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[638]  arXiv:2512.03964 [pdf, ps, other]
Title: Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization
Comments: 17 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639]  arXiv:2512.03963 [pdf, ps, other]
Title: TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[640]  arXiv:2512.03939 [pdf, ps, other]
Title: MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[641]  arXiv:2512.03932 [pdf, ps, other]
Title: Beyond the Ground Truth: Enhanced Supervision for Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2512.03918 [pdf, ps, other]
Title: UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2512.03905 [pdf, ps, other]
Title: Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence
Comments: Code: this https URL, Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644]  arXiv:2512.03883 [pdf, ps, other]
Title: Dual Cross-Attention Siamese Transformer for Rectal Tumor Regrowth Assessment in Watch-and-Wait Endoscopy
Comments: 6 pages, 5 figures, 1 table, submitted to ISBI conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[645]  arXiv:2512.03869 [pdf, ps, other]
Title: An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis
Comments: Submitted to ISBI 2026. 6 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[646]  arXiv:2512.03862 [pdf, ps, other]
Title: Diminishing Returns in Self-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647]  arXiv:2512.03854 [pdf, ps, other]
Title: Prostate biopsy whole slide image dataset from an underrepresented Middle Eastern population
Comments: 13 pages, 2 figures and 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648]  arXiv:2512.03852 [pdf, ps, other]
Title: Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba
Comments: 12 pages, 13 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[649]  arXiv:2512.03848 [pdf, ps, other]
Title: PULSE: A Unified Multi-Task Architecture for Cardiac Segmentation, Diagnosis, and Few-Shot Cross-Modality Clinical Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[650]  arXiv:2512.03844 [pdf, ps, other]
Title: CoDA: From Text-to-Image Diffusion Models to Training-Free Dataset Distillation
Comments: 34 pages, 24 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[651]  arXiv:2512.03837 [pdf, ps, other]
Title: Heatmap Pooling Network for Action Recognition from RGB Videos
Comments: Final Version of IEEE Transactions on Pattern Analysis and Machine Intelligence
Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[652]  arXiv:2512.03834 [pdf, ps, other]
Title: Lean Unet: A Compact Model for Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[653]  arXiv:2512.03827 [pdf, ps, other]
Title: A Robust Camera-based Method for Breath Rate Measurement
Comments: 9 pages, 4 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654]  arXiv:2512.03817 [pdf, ps, other]
Title: HieroGlyphTranslator: Automatic Recognition and Translation of Egyptian Hieroglyphs to English
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[655]  arXiv:2512.03796 [pdf, ps, other]
Title: LSRS: Latent Scale Rejection Sampling for Visual Autoregressive Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656]  arXiv:2512.03794 [pdf, ps, other]
Title: AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[657]  arXiv:2512.03751 [pdf, ps, other]
Title: Research on Brain Tumor Classification Method Based on Improved ResNet34 Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[658]  arXiv:2512.03749 [pdf, ps, other]
Title: Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659]  arXiv:2512.03746 [pdf, ps, other]
Title: Thinking with Programming Vision: Towards a Unified View for Thinking with Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[660]  arXiv:2512.03745 [pdf, ps, other]
Title: Dual-level Modality Debiasing Learning for Unsupervised Visible-Infrared Person Re-Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661]  arXiv:2512.03730 [pdf, ps, other]
Title: Out-of-the-box: Black-box Causal Attacks on Object Detectors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[662]  arXiv:2512.03724 [pdf, ps, other]
Title: PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[663]  arXiv:2512.03715 [pdf, ps, other]
Title: DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction
Comments: 9 pages, 5 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664]  arXiv:2512.03701 [pdf, ps, other]
Title: Structured Uncertainty Similarity Score (SUSS): Learning a Probabilistic, Interpretable, Perceptual Metric Between Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[665]  arXiv:2512.03687 [pdf, ps, other]
Title: Active Visual Perception: Opportunities and Challenges
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666]  arXiv:2512.03683 [pdf, ps, other]
Title: GaussianBlender: Instant Stylization of 3D Gaussians with Disentangled Latent Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667]  arXiv:2512.03673 [pdf, ps, other]
Title: ConvRot: Rotation-Based Plug-and-Play 4-bit Quantization for Diffusion Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[668]  arXiv:2512.03667 [pdf, ps, other]
Title: Colon-X: Advancing Intelligent Colonoscopy from Multimodal Understanding to Clinical Reasoning
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669]  arXiv:2512.03666 [pdf, ps, other]
Title: ToG-Bench: Task-Oriented Spatio-Temporal Grounding in Egocentric Videos
Comments: 26 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)