We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 288

[ total of 603 entries: 1-50 | ... | 139-188 | 189-238 | 239-288 | 289-338 | 339-388 | 389-438 | 439-488 | ... | 589-603 ]
[ showing 50 entries per page: fewer | more | all ]

Tue, 30 Dec 2025 (continued, showing 50 of 220 entries)

[289]  arXiv:2512.22226 [pdf, ps, other]
Title: VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[290]  arXiv:2512.22220 [pdf, ps, other]
Title: On Extending Semantic Abstraction for Efficient Search of Hidden Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[291]  arXiv:2512.22218 [pdf, ps, other]
Title: Towards Signboard-Oriented Visual Question Answering: ViSignVQA Dataset, Method and Benchmark
Comments: Dataset paper; code and data will be released
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[292]  arXiv:2512.22217 [pdf, ps, other]
Title: VLM-PAR: A Vision Language Model for Pedestrian Attribute Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[293]  arXiv:2512.22214 [pdf, ps, other]
Title: Signal-SGN++: Topology-Enhanced Time-Frequency Spiking Graph Network for Skeleton-Based Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[294]  arXiv:2512.22205 [pdf, ps, other]
Title: A CNN-Based Malaria Diagnosis from Blood Cell Images with SHAP and LIME Explainability
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[295]  arXiv:2512.22203 [pdf, ps, other]
Title: TCFormer: A 5M-Parameter Transformer with Density-Guided Aggregation for Weakly-Supervised Crowd Counting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[296]  arXiv:2512.22197 [pdf, ps, other]
Title: Quadrant Segmentation VLM with Few-Shot Adaptation and OCT Learning-based Explainability Methods for Diabetic Retinopathy
Authors: Shivum Telang
Comments: 4 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297]  arXiv:2512.22193 [pdf, ps, other]
Title: Tiny-YOLOSAM: Fast Hybrid Image Segmentation
Comments: 7 pages, 6 figures, 5 tables. Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298]  arXiv:2512.22188 [pdf, ps, other]
Title: HookMIL: Revisiting Context Modeling in Multiple Instance Learning for Computational Pathology
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[299]  arXiv:2512.22185 [pdf, ps, other]
Title: SAMM2D: Scale-Aware Multi-Modal 2D Dual-Encoder for High-Sensitivity Intracrania Aneurysm Screening
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[300]  arXiv:2512.22183 [pdf, ps, other]
Title: Unbiased Visual Reasoning with Controlled Visual Inputs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[301]  arXiv:2512.22182 [pdf, ps, other]
Title: Enhancing Medical Data Analysis through AI-Enhanced Locally Linear Embedding: Applications in Medical Point Location and Imagery
Comments: 5 pages, 3 figures. Accepted and published at 2024 19th International Conference on Emerging Technologies (ICET)
Journal-ref: Proc. IEEE ICET 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[302]  arXiv:2512.22177 [pdf, ps, other]
Title: Real-Time American Sign Language Recognition Using 3D Convolutional Neural Networks and LSTM: Architecture, Training, and Deployment
Authors: Dawnena Key
Comments: 10 pages, 1 figure, 2 tables. Patent pending (US 63/918,518). Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303]  arXiv:2512.22175 [pdf, ps, other]
Title: Characterizing Motion Encoding in Video Diffusion Timesteps
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[304]  arXiv:2512.23676 (cross-list from cs.AI) [pdf, ps, other]
Title: Web World Models
Comments: Project Page: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[305]  arXiv:2512.23649 (cross-list from cs.RO) [pdf, ps, other]
Title: RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[306]  arXiv:2512.23572 (cross-list from cs.CL) [pdf, ps, other]
Title: Instruction-Following Evaluation of Large Vision-Language Models
Comments: 21 pages, 7 figures
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[307]  arXiv:2512.23441 (cross-list from cs.LG) [pdf, ps, other]
Title: Stochastic Siamese MAE Pretraining for Longitudinal Medical Images
Comments: Under review. Code is available in this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[308]  arXiv:2512.23380 (cross-list from cs.LG) [pdf, ps, other]
Title: A unified framework for detecting point and collective anomalies in operating system logs via collaborative transformers
Comments: 72 pages, 19 figures, 19 tables, accepted in scientific reports on 5 November 2025
Journal-ref: Scientific Reports 15, 45698 (2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI); Operating Systems (cs.OS)
[309]  arXiv:2512.23343 (cross-list from cs.CL) [pdf, ps, other]
Title: AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Comments: 57 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2512.23328 (cross-list from cs.AI) [pdf, ps, other]
Title: CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations
Comments: Webpage: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[311]  arXiv:2512.23318 (cross-list from cs.RO) [pdf, ps, other]
Title: PCR-ORB: Enhanced ORB-SLAM3 with Point Cloud Refinement Using Deep Learning-Based Dynamic Object Filtering
Comments: 17 pages, 2 figures, 1 table
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[312]  arXiv:2512.23185 (cross-list from eess.IV) [pdf, ps, other]
Title: EIR: Enhanced Image Representations for Medical Report Generation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2512.23177 (cross-list from cs.LG) [pdf, ps, other]
Title: Machine Learning-Assisted Vocal Cord Ultrasound Examination: Project VIPR
Comments: Won Best Undergraduate Research Paper at the 2025 Midwest Instruction & Computing Symposium (MICS)
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Computer Vision and Pattern Recognition (cs.CV)
[314]  arXiv:2512.23162 (cross-list from cs.RO) [pdf, ps, other]
Title: SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[315]  arXiv:2512.23078 (cross-list from q-fin.GN) [pdf, ps, other]
Title: Deep Learning for Art Market Valuation
Subjects: General Finance (q-fin.GN); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); General Economics (econ.GN)
[316]  arXiv:2512.23073 (cross-list from cs.LG) [pdf, ps, other]
Title: Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2512.23033 (cross-list from cs.SE) [pdf, ps, other]
Title: Interpretable Gallbladder Ultrasound Diagnosis: A Lightweight Web-Mobile Software Platform with Real-Time XAI
Subjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2512.22899 (cross-list from cs.AI) [pdf, ps, other]
Title: HiSciBench: A Hierarchical Multi-disciplinary Benchmark for Scientific Intelligence from Reading to Discovery
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2512.22855 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: A Rapid GeoSAM-Based Workflow for Multi-Temporal Glacier Delineation: Case Study from Svalbard
Authors: Alexandru Hegyi
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2512.22802 (cross-list from cs.LG) [pdf, ps, other]
Title: ReDiF: Reinforced Distillation for Few Step Diffusion
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2512.22774 (cross-list from cs.LG) [pdf, ps, other]
Title: Schrodinger AI: A Unified Spectral-Dynamical Framework for Classification, Reasoning, and Operator-Based Generalization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[322]  arXiv:2512.22766 (cross-list from eess.IV) [pdf, ps, other]
Title: SwinCCIR: An end-to-end deep network for Compton camera imaging reconstruction
Comments: 10 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Nuclear Experiment (nucl-ex)
[323]  arXiv:2512.22716 (cross-list from cs.AI) [pdf, ps, other]
Title: Memento-II: Learning by Stateful Reflective Memory
Authors: Jun Wang
Comments: 32 pages, three figures
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[324]  arXiv:2512.22690 (cross-list from cs.MM) [pdf, ps, other]
Title: Mesquite MoCap: Democratizing Real-Time Motion Capture with Affordable, Bodyworn IoT Sensors and WebXR SLAM
Comments: submitted to IEEE Journal of IoT
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2512.22674 (cross-list from eess.IV) [pdf, ps, other]
Title: Semantic contrastive learning for orthogonal X-ray computed tomography reconstruction
Comments: This paper is accepted by Fully3D 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[326]  arXiv:2512.22605 (cross-list from cs.AI) [pdf, ps, other]
Title: Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[327]  arXiv:2512.22539 (cross-list from cs.RO) [pdf, ps, other]
Title: VLA-Arena: An Open-Source Framework for Benchmarking Vision-Language-Action Models
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2512.22488 (cross-list from cs.LG) [pdf, ps, other]
Title: Toward Real-World IoT Security: Concept Drift-Resilient IoT Botnet Detection via Latent Space Representation Learning and Alignment
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[329]  arXiv:2512.22485 (cross-list from q-bio.NC) [pdf, ps, other]
Title: JParc: Joint cortical surface parcellation with registration
Comments: A. V. Dalca and B. Fischl are co-senior authors with equal contributions
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[330]  arXiv:2512.22463 (cross-list from eess.IV) [pdf, ps, other]
Title: MEGA-PCC: A Mamba-based Efficient Approach for Joint Geometry and Attribute Point Cloud Compression
Comments: Accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision 2026 (WACV 2026)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2512.22385 (cross-list from cs.CL) [pdf, ps, other]
Title: LLM-Guided Exemplar Selection for Few-Shot Wearable-Sensor Human Activity Recognition
Comments: This paper has been accepted for presentation at ABC 2026. The manuscript is under revision prior to camera-ready submission
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[332]  arXiv:2512.22322 (cross-list from cs.CL) [pdf, ps, other]
Title: SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[333]  arXiv:2512.22317 (cross-list from cs.LG) [pdf, ps, other]
Title: LangPrecip: Language-Aware Multimodal Precipitation Nowcasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[334]  arXiv:2512.22288 (cross-list from cs.LG) [pdf, ps, other]
Title: Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model
Comments: 17 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[335]  arXiv:2512.22249 (cross-list from cs.LG) [pdf, ps, other]
Title: Temporal Visual Semantics-Induced Human Motion Understanding with Large Language Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[336]  arXiv:2512.22242 (cross-list from cs.LG) [pdf, ps, other]
Title: Fairness Evaluation of Risk Estimation Models for Lung Cancer Screening
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 3 (2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
[337]  arXiv:2512.22238 (cross-list from cs.LG) [pdf, ps, other]
Title: Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2512.22209 (cross-list from eess.IV) [pdf, ps, other]
Title: Super-Resolution Enhancement of Medical Images Based on Diffusion Model: An Optimization Scheme for Low-Resolution Gastric Images
Authors: Haozhe Jia
Comments: 19 pages, 16 figures. Undergraduate final year project
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[ total of 603 entries: 1-50 | ... | 139-188 | 189-238 | 239-288 | 289-338 | 339-388 | 389-438 | 439-488 | ... | 589-603 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2601, contact, help  (Access key information)