We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 105

[ total of 778 entries: 1-50 | 6-55 | 56-105 | 106-155 | 156-205 | 206-255 | 256-305 | ... | 756-778 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing 50 of 135 entries)

[106]  arXiv:2512.05076 [pdf, ps, other]
Title: BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107]  arXiv:2512.05060 [pdf, ps, other]
Title: 4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer
Comments: Code: this https URL, Webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2512.05044 [pdf, ps, other]
Title: Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
Comments: 18 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2512.05039 [pdf, ps, other]
Title: Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding
Comments: Submitted for review CVPR-2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110]  arXiv:2512.05025 [pdf, ps, other]
Title: RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2512.05021 [pdf, ps, other]
Title: HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[112]  arXiv:2512.05016 [pdf, ps, other]
Title: Generative Neural Video Compression via Video Diffusion Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2512.05006 [pdf, ps, other]
Title: Self-Supervised Learning for Transparent Object Depth Completion Using Depth from Non-Transparent Objects
Comments: conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2512.05000 [pdf, ps, other]
Title: Reflection Removal through Efficient Adaptation of Diffusion Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[115]  arXiv:2512.04996 [pdf, ps, other]
Title: A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2512.04981 [pdf, ps, other]
Title: Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[117]  arXiv:2512.04970 [pdf, ps, other]
Title: Stable Single-Pixel Contrastive Learning for Semantic and Geometric Tasks
Comments: UniReps Workshop 2025, 12 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2512.04969 [pdf, ps, other]
Title: Rethinking the Use of Vision Transformers for AI-Generated Image Detection
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[119]  arXiv:2512.04967 [pdf, ps, other]
Title: Balanced Few-Shot Episodic Learning for Accurate Retinal Disease Diagnosis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[120]  arXiv:2512.04963 [pdf, ps, other]
Title: GeoPE:A Unified Geometric Positional Embedding for Structured Tensors
Authors: Yupu Yao, Bowen Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[121]  arXiv:2512.04952 [pdf, ps, other]
Title: FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via neural Action Tokenization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[122]  arXiv:2512.04943 [pdf, ps, other]
Title: Towards Adaptive Fusion of Multimodal Deep Networks for Human Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2512.04939 [pdf, ps, other]
Title: LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124]  arXiv:2512.04927 [pdf, ps, other]
Title: Virtually Unrolling the Herculaneum Papyri by Diffeomorphic Spiral Fitting
Authors: Paul Henderson
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2512.04926 [pdf, ps, other]
Title: Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2512.04904 [pdf, ps, other]
Title: ReflexFlow: Rethinking Learning Objective for Exposure Bias Alleviation in Flow Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[127]  arXiv:2512.04890 [pdf, ps, other]
Title: Equivariant Symmetry-Aware Head Pose Estimation for Fetal MRI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2512.04888 [pdf, ps, other]
Title: You Only Train Once (YOTO): A Retraining-Free Object Detection Framework
Comments: This manuscript was first submitted to the Engineering (Elsevier Journal). The preprint version was posted to arXiv afterwards to facilitate open access and community feedback
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2512.04883 [pdf, ps, other]
Title: SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2512.04875 [pdf, ps, other]
Title: SP-Det: Self-Prompted Dual-Text Fusion for Generalized Multi-Label Lesion Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2512.04862 [pdf, ps, other]
Title: Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing
Comments: * Equal contribution. Minor figure corrections compared to the ICCV 2025 version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2512.04857 [pdf, ps, other]
Title: Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2512.04837 [pdf, ps, other]
Title: A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2512.04832 [pdf, ps, other]
Title: Tokenizing Buildings: A Transformer for Layout Synthesis
Comments: 8 pages, 1 page References, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[135]  arXiv:2512.04830 [pdf, ps, other]
Title: FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis
Comments: Novel View Synthesis, Driving Scene, Free Trajectory, Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2512.04821 [pdf, ps, other]
Title: LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2512.04815 [pdf, ps, other]
Title: RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS
Comments: arXiv admin note: substantial text overlap with arXiv:2506.02751
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2512.04810 [pdf, ps, other]
Title: EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2512.04786 [pdf, ps, other]
Title: LaFiTe: A Generative Latent Field for 3D Native Texturing
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2512.04784 [pdf, ps, other]
Title: PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2512.04761 [pdf, ps, other]
Title: Order Matters: 3D Shape Generation from Sequential VR Sketches
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2512.04734 [pdf, ps, other]
Title: MT-Depth: Multi-task Instance feature analysis for the Depth Completion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2512.04733 [pdf, ps, other]
Title: E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[144]  arXiv:2512.04728 [pdf, ps, other]
Title: Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[145]  arXiv:2512.04699 [pdf, ps, other]
Title: OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution
Comments: Accepted as TCSVT, 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2512.04686 [pdf, ps, other]
Title: Towards Cross-View Point Correspondence in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2512.04678 [pdf, ps, other]
Title: Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2512.04677 [pdf, ps, other]
Title: Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2512.04660 [pdf, ps, other]
Title: I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150]  arXiv:2512.04643 [pdf, ps, other]
Title: SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[151]  arXiv:2512.04619 [pdf, ps, other]
Title: Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2512.04599 [pdf, ps, other]
Title: Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2512.04597 [pdf, ps, other]
Title: When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[154]  arXiv:2512.04585 [pdf, ps, other]
Title: SAM3-I: Segment Anything with Instructions
Comments: Preliminary results; work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2512.04581 [pdf, ps, other]
Title: Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation
Comments: Accepted by IEEE TMM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-50 | 6-55 | 56-105 | 106-155 | 156-205 | 206-255 | 256-305 | ... | 756-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)