We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 600

[ total of 778 entries: 1-50 | ... | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[601]  arXiv:2512.01153 [pdf, ps, other]
Title: DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[602]  arXiv:2512.01148 [pdf, ps, other]
Title: SocialFusion: Addressing Social Degradation in Pre-trained Vision-Language Models
Comments: 22 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[603]  arXiv:2512.01145 [pdf, ps, other]
Title: Weakly Supervised Continuous Micro-Expression Intensity Estimation Using Temporal Deep Neural Network
Authors: Riyadh Mohammed Almushrafy (Majmaah University, Saudi Arabia)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[604]  arXiv:2512.01128 [pdf, ps, other]
Title: OmniFD: A Unified Model for Versatile Face Forgery Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605]  arXiv:2512.01116 [pdf, ps, other]
Title: Structural Prognostic Event Modeling for Multimodal Cancer Survival Analysis
Comments: 37 pages, 14 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[606]  arXiv:2512.01103 [pdf, ps, other]
Title: Learning Eigenstructures of Unstructured Data Manifolds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[607]  arXiv:2512.01095 [pdf, ps, other]
Title: CycliST: A Video Language Model Benchmark for Reasoning on Cyclical State Transitions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[608]  arXiv:2512.01094 [pdf, ps, other]
Title: Accelerating Inference of Masked Image Generators via Reinforcement Learning
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609]  arXiv:2512.01085 [pdf, ps, other]
Title: Generalized Medical Phrase Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[610]  arXiv:2512.01059 [pdf, ps, other]
Title: Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
Authors: Anantha Padmanaban Krishna Kumar (Boston University)
Comments: 7 pages total (6 pages main text, 1 page references), 1 figures, 2 tables. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[611]  arXiv:2512.01048 [pdf, ps, other]
Title: TRoVe: Discovering Error-Inducing Static Feature Biases in Temporal Vision-Language Models
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612]  arXiv:2512.01030 [pdf, ps, other]
Title: Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model
Comments: Work done at the Hong Kong University of Science and Technology (Guangzhou). Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[613]  arXiv:2512.01008 [pdf, ps, other]
Title: LISA-3D: Lifting Language-Image Segmentation to 3D via Multi-View Consistency
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[614]  arXiv:2512.00999 [pdf, ps, other]
Title: Provenance-Driven Reliable Semantic Medical Image Vector Reconstruction via Lightweight Blockchain-Verified Latent Fingerprints
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[615]  arXiv:2512.00995 [pdf, ps, other]
Title: S2AM3D: Scale-controllable Part Segmentation of 3D Point Cloud
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616]  arXiv:2512.00993 [pdf, ps, other]
Title: PhotoFramer: Multi-modal Image Composition Instruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[617]  arXiv:2512.00975 [pdf, ps, other]
Title: MM-ACT: Learn from Multimodal Parallel Generation to Act
Comments: 17 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[618]  arXiv:2512.00960 [pdf, ps, other]
Title: Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[619]  arXiv:2512.00953 [pdf, ps, other]
Title: Adaptive Evidential Learning for Temporal-Semantic Robustness in Moment Retrieval
Comments: Accepted by AAAI 2026, 10 pages, 9 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[620]  arXiv:2512.00944 [pdf, ps, other]
Title: Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation
Journal-ref: AAAI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621]  arXiv:2512.00936 [pdf, ps, other]
Title: SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622]  arXiv:2512.00927 [pdf, ps, other]
Title: LAHNet: Local Attentive Hashing Network for Point Cloud Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623]  arXiv:2512.00912 [pdf, ps, other]
Title: ForamDeepSlice: A High-Accuracy Deep Learning Framework for Foraminifera Species Classification from 2D Micro-CT Slices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[624]  arXiv:2512.00911 [pdf, ps, other]
Title: Dual-Projection Fusion for Accurate Upright Panorama Generation in Robotic Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[625]  arXiv:2512.00909 [pdf, ps, other]
Title: TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Comments: WACV 2026, Project page available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626]  arXiv:2512.00904 [pdf, ps, other]
Title: Hierarchical Semantic Alignment for Image Clustering
Comments: AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[627]  arXiv:2512.00903 [pdf, ps, other]
Title: SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[628]  arXiv:2512.00891 [pdf, ps, other]
Title: Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
Comments: Code is avaliable at \url{this https URL}
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629]  arXiv:2512.00887 [pdf, ps, other]
Title: Multilingual Training-Free Remote Sensing Image Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[630]  arXiv:2512.00885 [pdf, ps, other]
Title: HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631]  arXiv:2512.00882 [pdf, ps, other]
Title: Look, Recite, Then Answer: Enhancing VLM Performance via Self-Generated Knowledge Hints
Authors: Xisheng Feng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[632]  arXiv:2512.00880 [pdf, ps, other]
Title: Quantum-Inspired Spectral Geometry for Neural Operator Equivalence and Structured Pruning
Comments: 6 pages, 1 figure, preliminary version; concepts and simulation experiments only
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[633]  arXiv:2512.00877 [pdf, ps, other]
Title: Feed-Forward 3D Gaussian Splatting Compression with Long-Context Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634]  arXiv:2512.00873 [pdf, ps, other]
Title: Neural Discrete Representation Learning for Sparse-View CBCT Reconstruction: From Algorithm Design to Prospective Multicenter Clinical Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[635]  arXiv:2512.00872 [pdf, ps, other]
Title: TAP-CT: 3D Task-Agnostic Pretraining of Computed Tomography Foundation Models
Comments: 22 pages, 4 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[636]  arXiv:2512.00850 [pdf, ps, other]
Title: Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637]  arXiv:2512.00846 [pdf, ps, other]
Title: AFRAgent : An Adaptive Feature Renormalization Based High Resolution Aware GUI agent
Comments: Accepted at WACV 2026 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638]  arXiv:2512.00832 [pdf, ps, other]
Title: PanFlow: Decoupled Motion Control for Panoramic Video Generation
Comments: Accepted by AAAI. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639]  arXiv:2512.00814 [pdf, ps, other]
Title: IRPO: Boosting Image Restoration via Post-training GRPO
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[640]  arXiv:2512.00805 [pdf, ps, other]
Title: Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[641]  arXiv:2512.00796 [pdf, ps, other]
Title: CircleFlow: Flow-Guided Camera Blur Estimation using a Circle Grid Target
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2512.00794 [pdf, ps, other]
Title: PolarGS: Polarimetric Cues for Ambiguity-Free Gaussian Splatting with Accurate Geometry Recovery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2512.00773 [pdf, ps, other]
Title: DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644]  arXiv:2512.00771 [pdf, ps, other]
Title: EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Comments: Accepted at NeurIPS 2025 (spotlight)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[645]  arXiv:2512.00765 [pdf, ps, other]
Title: The Outline of Deception: Physical Adversarial Attacks on Traffic Signs Using Edge Patches
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[646]  arXiv:2512.00762 [pdf, ps, other]
Title: Seeing the Wind from a Falling Leaf
Comments: Accepted at NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647]  arXiv:2512.00752 [pdf, ps, other]
Title: Charts Are Not Images: On the Challenges of Scientific Chart Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648]  arXiv:2512.00748 [pdf, ps, other]
Title: Probabilistic Modeling of Multi-rater Medical Image Segmentation for Diversity and Personalization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[649]  arXiv:2512.00744 [pdf, ps, other]
Title: Joint Multi-scale Gated Transformer and Prior-guided Convolutional Network for Learned Image Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[650]  arXiv:2512.00743 [pdf, ps, other]
Title: Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Comments: 20 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-50 | ... | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-778 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)