Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 534

[ total of 778 entries: 1-50 | ... | 385-434 | 435-484 | 485-534 | 535-584 | 585-634 | 635-684 | 685-734 | 735-778 ]

Tue, 2 Dec 2025 (continued, showing 50 of 278 entries)

[535]  arXiv:2512.01763 [pdf, ps, other]
Title: HiconAgent: History Context-aware Policy Optimization for GUI Agents
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[536]  arXiv:2512.01755 [pdf, ps, other]
Title: FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537]  arXiv:2512.01707 [pdf, ps, other]
Title: StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[538]  arXiv:2512.01701 [pdf, ps, other]
Title: SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation
Comments: Accepted in AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539]  arXiv:2512.01686 [pdf, ps, other]
Title: DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[540]  arXiv:2512.01681 [pdf, ps, other]
Title: Cross-Domain Validation of a Resection-Trained Self-Supervised Model on Multicentre Mesothelioma Biopsies
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541]  arXiv:2512.01677 [pdf, ps, other]
Title: Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542]  arXiv:2512.01675 [pdf, ps, other]
Title: GRASP: Guided Residual Adapters with Sample-wise Partitioning
Comments: 10 pages, 4 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[543]  arXiv:2512.01665 [pdf, ps, other]
Title: Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544]  arXiv:2512.01657 [pdf, ps, other]
Title: DB-KAUNet: An Adaptive Dual Branch Kolmogorov-Arnold UNet for Retinal Vessel Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545]  arXiv:2512.01643 [pdf, ps, other]
Title: ViT$^3$: Unlocking Test-Time Training in Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[546]  arXiv:2512.01636 [pdf, ps, other]
Title: Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[547]  arXiv:2512.01629 [pdf, ps, other]
Title: SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge
Comments: Project page: this https URL. 17 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[548]  arXiv:2512.01611 [pdf, ps, other]
Title: Depth Matching Method Based on ShapeDTW for Oil-Based Mud Imager
Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[549]  arXiv:2512.01589 [pdf, ps, other]
Title: Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation
Comments: The 2025 IEEE International Conference on Content-Based Multimedia Indexing (IEEE CBMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550]  arXiv:2512.01582 [pdf, ps, other]
Title: RoleMotion: A Large-Scale Dataset towards Robust Scene-Specific Role-Playing Motion Synthesis with Fine-grained Descriptions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[551]  arXiv:2512.01563 [pdf, ps, other]
Title: MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration
Comments: The 14th International Symposium on Information and Communication Technology Conference SoICT 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[552]  arXiv:2512.01540 [pdf, ps, other]
Title: FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention
Authors: Zipeng Wang, Dan Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553]  arXiv:2512.01534 [pdf, ps, other]
Title: Deep Unsupervised Anomaly Detection in Brain Imaging: Large-Scale Benchmarking and Bias Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[554]  arXiv:2512.01533 [pdf, ps, other]
Title: Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555]  arXiv:2512.01519 [pdf, ps, other]
Title: QuantumCanvas: A Multimodal Benchmark for Visual Learning of Atomic Interactions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci); Quantum Physics (quant-ph)
[556]  arXiv:2512.01510 [pdf, ps, other]
Title: Semantic-aware Random Convolution and Source Matching for Domain Generalization in Medical Image Segmentation
Comments: Preprint submitted to Computer Methods and Programs in Biomedicine (currently under revision)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[557]  arXiv:2512.01495 [pdf, ps, other]
Title: ELVIS: Enhance Low-Light for Video Instance Segmentation in the Dark
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558]  arXiv:2512.01494 [pdf, other]
Title: A variational method for curve extraction with curvature-dependent energies
Authors: Majid Arthaud (ENPC, MOKAPLAN, UMich), Antonin Chambolle (CEREMADE, MOKAPLAN), Vincent Duval (MOKAPLAN)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559]  arXiv:2512.01481 [pdf, ps, other]
Title: ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[560]  arXiv:2512.01478 [pdf, ps, other]
Title: CourtMotion: Learning Event-Driven Motion Representations from Skeletal Data for Basketball
Authors: Omer Sela (1 and 2), Michael Chertok (1), Lior Wolf (2) ((1) Amazon, (2) Tel Aviv University)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[561]  arXiv:2512.01444 [pdf, ps, other]
Title: FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation
Comments: 9 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562]  arXiv:2512.01427 [pdf, ps, other]
Title: Language-Guided Open-World Anomaly Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[563]  arXiv:2512.01426 [pdf, ps, other]
Title: ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564]  arXiv:2512.01424 [pdf, ps, other]
Title: ViRectify: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models
Comments: 22 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[565]  arXiv:2512.01422 [pdf, ps, other]
Title: MDiff4STR: Mask Diffusion Model for Scene Text Recognition
Comments: Accepted by AAAI 2026 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[566]  arXiv:2512.01419 [pdf, ps, other]
Title: Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[567]  arXiv:2512.01390 [pdf, ps, other]
Title: FRAMER: Frequency-Aligned Self-Distillation with Adaptive Modulation Leveraging Diffusion Priors for Real-World Image Super-Resolution
Comments: Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[568]  arXiv:2512.01383 [pdf, ps, other]
Title: PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Comments: Accepted by WACV2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[569]  arXiv:2512.01382 [pdf, ps, other]
Title: Reversible Inversion for Training-Free Exemplar-guided Image Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570]  arXiv:2512.01380 [pdf, ps, other]
Title: Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry Network
Comments: Accepted by AAAI26
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571]  arXiv:2512.01373 [pdf, ps, other]
Title: SRAM: Shape-Realism Alignment Metric for No Reference 3D Shape Evaluation
Comments: Accepted by AAAI2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572]  arXiv:2512.01366 [pdf, ps, other]
Title: BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single Earbud
Comments: This is the author-accepted version of the paper published in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Vol. 9, No. 4, Article 191, 2025. Final published version: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[573]  arXiv:2512.01352 [pdf, ps, other]
Title: OpenBox: Annotate Any Bounding Boxes in 3D
Comments: Accepted by NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574]  arXiv:2512.01348 [pdf, ps, other]
Title: Handwritten Text Recognition for Low Resource Languages
Comments: 21 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575]  arXiv:2512.01342 [pdf, ps, other]
Title: InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576]  arXiv:2512.01340 [pdf, ps, other]
Title: EvalTalker: Learning to Evaluate Real-Portrait-Driven Multi-Subject Talking Humans
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577]  arXiv:2512.01334 [pdf, ps, other]
Title: AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578]  arXiv:2512.01333 [pdf, ps, other]
Title: Optimizing Stroke Risk Prediction: A Machine Learning Pipeline Combining ROS-Balanced Ensembles and XAI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[579]  arXiv:2512.01319 [pdf, ps, other]
Title: Rethinking Intracranial Aneurysm Vessel Segmentation: A Perspective from Computational Fluid Dynamics Applications
Comments: 18 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[580]  arXiv:2512.01315 [pdf, ps, other]
Title: FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection
Comments: 8 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[581]  arXiv:2512.01314 [pdf, ps, other]
Title: TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582]  arXiv:2512.01312 [pdf, ps, other]
Title: IVCR-200K: A Large-Scale Multi-turn Dialogue Benchmark for Interactive Video Corpus Retrieval
Comments: Accepted by SIGIR2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583]  arXiv:2512.01310 [pdf, ps, other]
Title: Lost in Distortion: Uncovering the Domain Gap Between Computer Vision and Brain Imaging - A Study on Pretraining for Age Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[584]  arXiv:2512.01306 [pdf, ps, other]
Title: Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)