We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 725

[ total of 778 entries: 1-25 | ... | 651-675 | 676-700 | 701-725 | 726-750 | 751-775 | 776-778 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 2 Dec 2025 (continued, showing 25 of 278 entries)

[726]  arXiv:2512.00084 [pdf, ps, other]
Title: A Fast and Efficient Modern BERT based Text-Conditioned Diffusion Model for Medical Image Segmentation
Comments: 15 pages, 3 figures, Accepted in Slide 3 10th International Conference on Computer Vision & Image Processing (CVIP 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[727]  arXiv:2512.00082 [pdf, ps, other]
Title: Exploring Diagnostic Prompting Approach for Multimodal LLM-based Visual Complexity Assessment: A Case Study of Amazon Search Result Pages
Comments: 9 pages, 4 figures, 9 tables. Study on diagnostic prompting for multimodal LLM-based visual complexity assessment of Amazon search result pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[728]  arXiv:2512.00080 [pdf, ps, other]
Title: Conceptual Evaluation of Deep Visual Stereo Odometry for the MARWIN Radiation Monitoring Robot in Accelerator Tunnels
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[729]  arXiv:2512.00078 [pdf, ps, other]
Title: Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[730]  arXiv:2512.00075 [pdf, ps, other]
Title: Adapter Shield: A Unified Framework with Built-in Authentication for Preventing Unauthorized Zero-Shot Image-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[731]  arXiv:2512.00073 [pdf, ps, other]
Title: ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[732]  arXiv:2512.00065 [pdf, ps, other]
Title: Satellite to Street : Disaster Impact Estimator
Comments: 11 pages,9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[733]  arXiv:2512.00061 [pdf, ps, other]
Title: DL-CapsNet: A Deep and Light Capsule Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734]  arXiv:2512.00060 [pdf, ps, other]
Title: PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[735]  arXiv:2512.00042 [pdf, ps, other]
Title: Closing the Gap: Data-Centric Fine-Tuning of Vision Language Models for the Standardized Exam Questions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[736]  arXiv:2512.00008 [pdf, ps, other]
Title: MOTION: ML-Assisted On-Device Low-Latency Motion Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[737]  arXiv:2512.02020 (cross-list from cs.RO) [pdf, ps, other]
Title: EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI
Comments: Accepted by AAAI 2026. Project Page: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[738]  arXiv:2512.01993 (cross-list from cs.RO) [pdf, ps, other]
Title: RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies
Comments: Preprint
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[739]  arXiv:2512.01979 (cross-list from cs.AI) [pdf, ps, other]
Title: Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[740]  arXiv:2512.01946 (cross-list from cs.RO) [pdf, ps, other]
Title: Guardian: Detecting Robotic Planning and Execution Errors with Vision-Language Models
Comments: Code, Data, and Models available at this https URL The paper contains 8 pages, 9 figures, 6 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[741]  arXiv:2512.01913 (cross-list from eess.IV) [pdf, ps, other]
Title: Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies
Comments: Submitted to Medical Image Analysis. Journal Extension of arXiv:2407.19274
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[742]  arXiv:2512.01822 (cross-list from cs.CL) [pdf, ps, other]
Title: InnoGym: Benchmarking the Innovation Potential of AI Agents
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[743]  arXiv:2512.01818 (cross-list from cs.LG) [pdf, ps, other]
Title: Forget Less, Retain More: A Lightweight Regularizer for Rehearsal-Based Continual Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[744]  arXiv:2512.01687 (cross-list from cs.NE) [pdf, ps, other]
Title: Revisiting Direct Encoding: Learnable Temporal Dynamics for Static Image Spiking Neural Networks
Authors: Huaxu He
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[745]  arXiv:2512.01550 (cross-list from cs.RO) [pdf, ps, other]
Title: NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[746]  arXiv:2512.01461 (cross-list from cs.LG) [pdf, ps, other]
Title: Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[747]  arXiv:2512.01329 (cross-list from cs.GR) [pdf, ps, other]
Title: TagSplat: Topology-Aware Gaussian Splatting for Dynamic Mesh Modeling and Tracking
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[748]  arXiv:2512.01324 (cross-list from hep-ex) [pdf, ps, other]
Title: Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics
Comments: 23 pages, 15 figures, preprint. Project page at this https URL
Subjects: High Energy Physics - Experiment (hep-ex); Computer Vision and Pattern Recognition (cs.CV)
[749]  arXiv:2512.01252 (cross-list from cs.LG) [pdf, ps, other]
Title: Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
Comments: 9 pages, 7 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[750]  arXiv:2512.01181 (cross-list from cs.LG) [pdf, ps, other]
Title: First On-Orbit Demonstration of a Geospatial Foundation Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[ total of 778 entries: 1-25 | ... | 651-675 | 676-700 | 701-725 | 726-750 | 751-775 | 776-778 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)