We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 424

[ total of 759 entries: 1-25 | ... | 350-374 | 375-399 | 400-424 | 425-449 | 450-474 | 475-499 | 500-524 | ... | 750-759 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing 25 of 135 entries)

[425]  arXiv:2512.04532 [pdf, ps, other]
Title: PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[426]  arXiv:2512.04528 [pdf, ps, other]
Title: Auto3R: Automated 3D Reconstruction and Scanning via Data-driven Uncertainty Quantification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427]  arXiv:2512.04522 [pdf, ps, other]
Title: Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
Comments: 14 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428]  arXiv:2512.04521 [pdf, ps, other]
Title: WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[429]  arXiv:2512.04520 [pdf, ps, other]
Title: Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2512.04519 [pdf, ps, other]
Title: VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431]  arXiv:2512.04515 [pdf, ps, other]
Title: EgoLCD: Egocentric Video Generation with Long Context Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432]  arXiv:2512.04511 [pdf, ps, other]
Title: DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2512.04504 [pdf, ps, other]
Title: UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2512.04499 [pdf, ps, other]
Title: Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[435]  arXiv:2512.04496 [pdf, ps, other]
Title: Shift-Window Meets Dual Attention: A Multi-Model Architecture for Specular Highlight Removal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436]  arXiv:2512.04487 [pdf, ps, other]
Title: Controllable Long-term Motion Generation with Extended Joint Targets
Comments: WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2512.04485 [pdf, ps, other]
Title: Not All Birds Look The Same: Identity-Preserving Generation For Birds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438]  arXiv:2512.04483 [pdf, ps, other]
Title: DeRA: Decoupled Representation Alignment for Video Tokenization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2512.04461 [pdf, ps, other]
Title: UniTS: Unified Time Series Generative Model for Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440]  arXiv:2512.04459 [pdf, ps, other]
Title: dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441]  arXiv:2512.04456 [pdf, ps, other]
Title: GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis
Comments: AAAI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[442]  arXiv:2512.04451 [pdf, ps, other]
Title: StreamEQA: Towards Streaming Video Understanding for Embodied Scenarios
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2512.04441 [pdf, ps, other]
Title: MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444]  arXiv:2512.04426 [pdf, ps, other]
Title: Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2512.04425 [pdf, ps, other]
Title: Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[446]  arXiv:2512.04421 [pdf, ps, other]
Title: UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes
Comments: 13 pages, 10 figures, submitted to CVPR2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[447]  arXiv:2512.04413 [pdf, ps, other]
Title: Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection
Comments: 12 pages, 8 figures, 11 tables
Journal-ref: IEEE Transactions on Geoscience and Remote Sensing 63 (2025) 1-11
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[448]  arXiv:2512.04397 [pdf, ps, other]
Title: Performance Evaluation of Transfer Learning Based Medical Image Classification Techniques for Disease Detection
Journal-ref: 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Copenhagen, Denmark, 2025, pp. 1-5
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449]  arXiv:2512.04395 [pdf, ps, other]
Title: Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 759 entries: 1-25 | ... | 350-374 | 375-399 | 400-424 | 425-449 | 450-474 | 475-499 | 500-524 | ... | 750-759 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)