We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 178

[ total of 737 entries: 1-25 | ... | 104-128 | 129-153 | 154-178 | 179-203 | 204-228 | 229-253 | 254-278 | ... | 729-737 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 11 Dec 2025 (continued, showing 25 of 135 entries)

[179]  arXiv:2512.09350 [pdf, ps, other]
Title: TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2512.09335 [pdf, ps, other]
Title: Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video
Comments: 8 pages, 9 figures, published in ACM MM 2025
Journal-ref: In Proceedings of the 33rd ACM International Conference on Multimedia. 2025. p. 7405-7414
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[181]  arXiv:2512.09327 [pdf, ps, other]
Title: UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[182]  arXiv:2512.09315 [pdf, ps, other]
Title: Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2512.09311 [pdf, ps, other]
Title: Transformer-Driven Multimodal Fusion for Explainable Suspiciousness Estimation in Visual Surveillance
Comments: 12 pages, 10 figures, IEEE Transaction on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[184]  arXiv:2512.09307 [pdf, ps, other]
Title: From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2512.09299 [pdf, ps, other]
Title: VABench: A Comprehensive Benchmark for Audio-Video Generation
Comments: 24 pages, 25 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[186]  arXiv:2512.09296 [pdf, ps, other]
Title: Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving
Authors: Songhan Wu
Comments: 6 pages, 7 figures, 1 table. Accepted to The 2025 IEEE 3rd International Conference on Electrical, Automation and Computer Engineering (ICEACE), 2025. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187]  arXiv:2512.09289 [pdf, ps, other]
Title: MelanomaNet: Explainable Deep Learning for Skin Lesion Classification
Comments: 7 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2512.09282 [pdf, ps, other]
Title: FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2512.09278 [pdf, ps, other]
Title: LoGoColor: Local-Global 3D Colorization for 360° Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.09276 [pdf, ps, other]
Title: Dynamic Facial Expressions Analysis Based Parkinson's Disease Auxiliary Diagnosis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2512.09271 [pdf, ps, other]
Title: LongT2IBench: A Benchmark for Evaluating Long Text-to-Image Generation with Graph-structured Annotations
Comments: The paper has been accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[192]  arXiv:2512.09270 [pdf, ps, other]
Title: MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification
Comments: Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2512.09258 [pdf, ps, other]
Title: ROI-Packing: Efficient Region-Based Compression for Machine Vision
Journal-ref: International Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA, 2025, pp. 233-238
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[194]  arXiv:2512.09251 [pdf, ps, other]
Title: GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[195]  arXiv:2512.09247 [pdf, ps, other]
Title: OmniPSD: Layered PSD Generation with Diffusion Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2512.09244 [pdf, ps, other]
Title: A Clinically Interpretable Deep CNN Framework for Early Chronic Kidney Disease Prediction Using Grad-CAM-Based Explainable AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[197]  arXiv:2512.09235 [pdf, ps, other]
Title: Efficient Feature Compression for Machines with Global Statistics Preservation
Journal-ref: 2025 IEEE International Symposium on Circuits and Systems (ISCAS), London, United Kingdom, 2025, pp. 1-5
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2512.09232 [pdf, ps, other]
Title: Enabling Next-Generation Consumer Experience with Feature Coding for Machines
Journal-ref: 2025 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2025, pp. 1-4
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.09215 [pdf, ps, other]
Title: View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2512.09185 [pdf, ps, other]
Title: Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[201]  arXiv:2512.09172 [pdf, ps, other]
Title: Prompt-Based Continual Compositional Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[202]  arXiv:2512.09164 [pdf, ps, other]
Title: WonderZoom: Multi-Scale 3D World Generation
Comments: Project website: this https URL The first two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[203]  arXiv:2512.09162 [pdf, ps, other]
Title: GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[ total of 737 entries: 1-25 | ... | 104-128 | 129-153 | 154-178 | 179-203 | 204-228 | 229-253 | 254-278 | ... | 729-737 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)