Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 70

[ total of 732 entries: 1-25 | 21-45 | 46-70 | 71-95 | 96-120 | 121-145 | 146-170 | ... | 721-732 ]

Tue, 16 Dec 2025 (continued, showing 25 of 244 entries)

[71]  arXiv:2512.13039 [pdf, ps, other]
Title: Bi-Erasing: A Bidirectional Framework for Concept Removal in Diffusion Models
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[72]  arXiv:2512.13031 [pdf, ps, other]
Title: Comprehensive Evaluation of Rule-Based, Machine Learning, and Deep Learning in Human Estimation Using Radio Wave Sensing: Accuracy, Spatial Generalization, and Output Granularity Trade-offs
Comments: 10 pages, 5 figures. A comprehensive comparison of rule-based, machine learning, and deep learning approaches for human estimation using FMCW MIMO radar, focusing on accuracy, spatial generalization, and output granularity
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2512.13030 [pdf, ps, other]
Title: Motus: A Unified Latent Action World Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[74]  arXiv:2512.13019 [pdf, ps, other]
Title: SneakPeek: Future-Guided Instructional Streaming Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2512.13018 [pdf, ps, other]
Title: Comprehensive Deployment-Oriented Assessment for Cross-Environment Generalization in Deep Learning-Based mmWave Radar Sensing
Comments: 8 pages, 6 figures. Comprehensive evaluation of preprocessing, data augmentation, and transfer learning for cross-environment generalization in deep learning-based mmWave radar sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[76]  arXiv:2512.13015 [pdf, ps, other]
Title: What Happens Next? Next Scene Prediction with a Unified Video Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2512.13014 [pdf, ps, other]
Title: JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion
Comments: Accepted at AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2512.13008 [pdf, ps, other]
Title: TWLR: Text-Guided Weakly-Supervised Lesion Localization and Severity Regression for Explainable Diabetic Retinopathy Grading
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79]  arXiv:2512.13007 [pdf, ps, other]
Title: Light Field Based 6DoF Tracking of Previously Unobserved Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80]  arXiv:2512.13006 [pdf, ps, other]
Title: Few-Step Distillation for Text-to-Image Generation: A Practical Guide
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2512.12997 [pdf, ps, other]
Title: Calibrating Uncertainty for Zero-Shot Adversarial CLIP
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[82]  arXiv:2512.12982 [pdf, ps, other]
Title: Scaling Up AI-Generated Image Detection via Generator-Aware Prototypes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2512.12977 [pdf, ps, other]
Title: VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2512.12963 [pdf, ps, other]
Title: SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[85]  arXiv:2512.12941 [pdf, ps, other]
Title: UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction
Comments: IEEE TGRS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86]  arXiv:2512.12936 [pdf, ps, other]
Title: Content Adaptive based Motion Alignment Framework for Learned Video Compression
Comments: Accepted to Data Compression Conference (DCC) 2026 as a poster paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[87]  arXiv:2512.12935 [pdf, ps, other]
Title: Unified Interactive Multimodal Moment Retrieval via Cascaded Embedding-Reranking and Temporal-Aware Score Fusion
Comments: Accepted at AAAI Workshop 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[88]  arXiv:2512.12929 [pdf, ps, other]
Title: MADTempo: An Interactive System for Multi-Event Temporal Video Retrieval with Query Augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[89]  arXiv:2512.12925 [pdf, ps, other]
Title: Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery
Comments: Accepted by TMM2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2512.12906 [pdf, ps, other]
Title: Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
Comments: Accepted by TCSVT2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2512.12898 [pdf, ps, other]
Title: Qonvolution: Towards Learning High-Frequency Signals with Queried Convolution
Comments: 28 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[92]  arXiv:2512.12887 [pdf, ps, other]
Title: Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
Comments: 1st Place in VLM3D Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2512.12885 [pdf, ps, other]
Title: SignRAG: A Retrieval-Augmented System for Scalable Zero-Shot Road Sign Recognition
Comments: Submitted to IV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Robotics (cs.RO)
[94]  arXiv:2512.12884 [pdf, ps, other]
Title: Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection
Comments: 6 pages, 3 figures, accepted at IV2025
Journal-ref: 2025 IEEE Intelligent Vehicles Symposium (IV)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[95]  arXiv:2512.12875 [pdf, ps, other]
Title: Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)