We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 605

[ total of 749 entries: 1-100 | ... | 306-405 | 406-505 | 506-605 | 606-705 | 706-749 ]
[ showing 100 entries per page: fewer | more | all ]

Fri, 5 Dec 2025 (continued, showing last 14 of 135 entries)

[606]  arXiv:2512.05116 (cross-list from cs.LG) [pdf, ps, other]
Title: Value Gradient Guidance for Flow Matching Alignment
Comments: Accepted at NeurIPS 2025; 26 pages, 20 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[607]  arXiv:2512.05114 (cross-list from cs.LG) [pdf, ps, other]
Title: Deep infant brain segmentation from multi-contrast MRI
Comments: 8 pages, 8 figures, 1 table, website at this https URL, presented at the 2025 IEEE Asilomar Conference on Signals, Systems, and Computers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[608]  arXiv:2512.05103 (cross-list from cs.LG) [pdf, ps, other]
Title: TV2TV: A Unified Framework for Interleaved Language and Video Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[609]  arXiv:2512.05094 (cross-list from cs.RO) [pdf, ps, other]
Title: From Generated Human Videos to Physically Plausible Robot Trajectories
Comments: For project website, see this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[610]  arXiv:2512.04814 (cross-list from cs.SD) [pdf, ps, other]
Title: Shared Multi-modal Embedding Space for Face-Voice Association
Comments: Ranked 1st in Fame 2026 Challenge, ICASSP
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[611]  arXiv:2512.04763 (cross-list from cs.LG) [pdf, ps, other]
Title: MemLoRA: Distilling Expert Adapters for On-Device Memory Systems
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[612]  arXiv:2512.04705 (cross-list from cs.CC) [pdf, ps, other]
Title: Hardware-aware Neural Architecture Search of Early Exiting Networks on Edge Accelerators
Comments: Submitted to IEEE Transactions on Emerging Topics in Computing
Subjects: Computational Complexity (cs.CC); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[613]  arXiv:2512.04625 (cross-list from cs.LG) [pdf, ps, other]
Title: Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective
Comments: Accepted to IEEE TNNLS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[614]  arXiv:2512.04556 (cross-list from cs.GR) [pdf, ps, other]
Title: Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex
Comments: 10 pages, 7 figures
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[615]  arXiv:2512.04464 (cross-list from cs.LG) [pdf, ps, other]
Title: Feature Engineering vs. Deep Learning for Automated Coin Grading: A Comparative Study on Saint-Gaudens Double Eagles
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[616]  arXiv:2512.04385 (cross-list from cs.LG) [pdf, ps, other]
Title: STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[617]  arXiv:2512.04264 (cross-list from cs.LG) [pdf, ps, other]
Title: Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[618]  arXiv:2512.04092 (cross-list from physics.soc-ph) [pdf, ps, other]
Title: The changing surface of the world's roads
Subjects: Physics and Society (physics.soc-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[619]  arXiv:2512.04087 (cross-list from q-bio.NC) [pdf, ps, other]
Title: Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
Authors: Sui He, Shenbin Qian
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)

Thu, 4 Dec 2025 (showing first 86 of 130 entries)

[620]  arXiv:2512.04085 [pdf, ps, other]
Title: Unique Lives, Shared World: Learning from Single-Life Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621]  arXiv:2512.04084 [pdf, ps, other]
Title: SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622]  arXiv:2512.04082 [pdf, ps, other]
Title: PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623]  arXiv:2512.04069 [pdf, ps, other]
Title: SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[624]  arXiv:2512.04048 [pdf, ps, other]
Title: Stable Signer: Hierarchical Sign Language Generative Model
Comments: 12 pages, 7 figures. More Demo at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY)
[625]  arXiv:2512.04040 [pdf, ps, other]
Title: RELIC: Interactive Video World Model with Long-Horizon Memory
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626]  arXiv:2512.04039 [pdf, ps, other]
Title: Fast & Efficient Normalizing Flows and Applications of Image Generative Models
Authors: Sandeep Nagar
Comments: PhD Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[627]  arXiv:2512.04025 [pdf, ps, other]
Title: PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[628]  arXiv:2512.04021 [pdf, ps, other]
Title: C3G: Learning Compact 3D Representations with 2K Gaussians
Comments: Project Page : this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629]  arXiv:2512.04019 [pdf, ps, other]
Title: Ultra-lightweight Neural Video Representation Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[630]  arXiv:2512.04015 [pdf, ps, other]
Title: Learning Group Actions In Disentangled Latent Image Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631]  arXiv:2512.04012 [pdf, ps, other]
Title: Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[632]  arXiv:2512.04007 [pdf, ps, other]
Title: On the Temporality for Sketch Representation Learning
Comments: Preprint submitted to Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[633]  arXiv:2512.04000 [pdf, ps, other]
Title: Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[634]  arXiv:2512.03996 [pdf, ps, other]
Title: Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[635]  arXiv:2512.03992 [pdf, ps, other]
Title: DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[636]  arXiv:2512.03981 [pdf, ps, other]
Title: DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637]  arXiv:2512.03979 [pdf, ps, other]
Title: BlurDM: A Blur Diffusion Model for Image Deblurring
Comments: NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[638]  arXiv:2512.03964 [pdf, ps, other]
Title: Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization
Comments: 17 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639]  arXiv:2512.03963 [pdf, ps, other]
Title: TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[640]  arXiv:2512.03939 [pdf, ps, other]
Title: MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[641]  arXiv:2512.03932 [pdf, ps, other]
Title: Beyond the Ground Truth: Enhanced Supervision for Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2512.03918 [pdf, ps, other]
Title: UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2512.03905 [pdf, ps, other]
Title: Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence
Comments: Code: this https URL, Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644]  arXiv:2512.03883 [pdf, ps, other]
Title: Dual Cross-Attention Siamese Transformer for Rectal Tumor Regrowth Assessment in Watch-and-Wait Endoscopy
Comments: 6 pages, 5 figures, 1 table, submitted to ISBI conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[645]  arXiv:2512.03869 [pdf, ps, other]
Title: An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis
Comments: Submitted to ISBI 2026. 6 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[646]  arXiv:2512.03862 [pdf, ps, other]
Title: Diminishing Returns in Self-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647]  arXiv:2512.03854 [pdf, ps, other]
Title: Prostate biopsy whole slide image dataset from an underrepresented Middle Eastern population
Comments: 13 pages, 2 figures and 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648]  arXiv:2512.03852 [pdf, ps, other]
Title: Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba
Comments: 12pages, 13 figures, 5tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[649]  arXiv:2512.03848 [pdf, ps, other]
Title: PULSE: A Unified Multi-Task Architecture for Cardiac Segmentation, Diagnosis, and Few-Shot Cross-Modality Clinical Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[650]  arXiv:2512.03844 [pdf, ps, other]
Title: CoDA: From Text-to-Image Diffusion Models to Training-Free Dataset Distillation
Comments: 34 pages, 24 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[651]  arXiv:2512.03837 [pdf, ps, other]
Title: Heatmap Pooling Network for Action Recognition from RGB Videos
Comments: Final Version of IEEE Transactions on Pattern Analysis and Machine Intelligence
Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[652]  arXiv:2512.03834 [pdf, ps, other]
Title: Lean Unet: A Compact Model for Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[653]  arXiv:2512.03827 [pdf, ps, other]
Title: A Robust Camera-based Method for Breath Rate Measurement
Comments: 9 pages, 4 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654]  arXiv:2512.03817 [pdf, ps, other]
Title: HieroGlyphTranslator: Automatic Recognition and Translation of Egyptian Hieroglyphs to English
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[655]  arXiv:2512.03796 [pdf, ps, other]
Title: LSRS: Latent Scale Rejection Sampling for Visual Autoregressive Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656]  arXiv:2512.03794 [pdf, ps, other]
Title: AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[657]  arXiv:2512.03751 [pdf, ps, other]
Title: Research on Brain Tumor Classification Method Based on Improved ResNet34 Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[658]  arXiv:2512.03749 [pdf, ps, other]
Title: Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models
Comments: Accepted at WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659]  arXiv:2512.03746 [pdf, ps, other]
Title: Thinking with Programming Vision: Towards a Unified View for Thinking with Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[660]  arXiv:2512.03745 [pdf, ps, other]
Title: Dual-level Modality Debiasing Learning for Unsupervised Visible-Infrared Person Re-Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661]  arXiv:2512.03730 [pdf, ps, other]
Title: Out-of-the-box: Black-box Causal Attacks on Object Detectors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[662]  arXiv:2512.03724 [pdf, ps, other]
Title: PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[663]  arXiv:2512.03715 [pdf, ps, other]
Title: DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction
Comments: 9 pages, 5 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664]  arXiv:2512.03701 [pdf, ps, other]
Title: Structured Uncertainty Similarity Score (SUSS): Learning a Probabilistic, Interpretable, Perceptual Metric Between Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[665]  arXiv:2512.03687 [pdf, ps, other]
Title: Active Visual Perception: Opportunities and Challenges
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666]  arXiv:2512.03683 [pdf, ps, other]
Title: GaussianBlender: Instant Stylization of 3D Gaussians with Disentangled Latent Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667]  arXiv:2512.03673 [pdf, ps, other]
Title: ConvRot: Rotation-Based Plug-and-Play 4-bit Quantization for Diffusion Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[668]  arXiv:2512.03667 [pdf, ps, other]
Title: Colon-X: Advancing Intelligent Colonoscopy from Multimodal Understanding to Clinical Reasoning
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669]  arXiv:2512.03666 [pdf, ps, other]
Title: ToG-Bench: Task-Oriented Spatio-Temporal Grounding in Egocentric Videos
Comments: 26 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[670]  arXiv:2512.03663 [pdf, ps, other]
Title: Multi-Scale Visual Prompting for Lightweight Small-Image Classification
Authors: Salim Khazem
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671]  arXiv:2512.03643 [pdf, ps, other]
Title: Optical Context Compression Is Just (Bad) Autoencoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[672]  arXiv:2512.03640 [pdf, ps, other]
Title: MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms
Journal-ref: MultiMedia Modeling. MMM 2025. Lecture Notes in Computer Science, vol 15521. Springer, Singapore
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[673]  arXiv:2512.03625 [pdf, ps, other]
Title: FeatureLens: A Highly Generalizable and Interpretable Framework for Detecting Adversarial Examples Based on Image Features
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[674]  arXiv:2512.03621 [pdf, ps, other]
Title: ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[675]  arXiv:2512.03619 [pdf, ps, other]
Title: LAMP: Language-Assisted Motion Planning for Controllable Video Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[676]  arXiv:2512.03601 [pdf, ps, other]
Title: Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding
Comments: Accepted to NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[677]  arXiv:2512.03598 [pdf, ps, other]
Title: Memory-Guided Point Cloud Completion for Dental Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[678]  arXiv:2512.03597 [pdf, ps, other]
Title: HBFormer: A Hybrid-Bridge Transformer for Microtumor and Miniature Organ Segmentation
Comments: 6 pages, 4 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[679]  arXiv:2512.03593 [pdf, ps, other]
Title: CloseUpAvatar: High-Fidelity Animatable Full-Body Avatars with Mixture of Multi-Scale Textures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[680]  arXiv:2512.03592 [pdf, ps, other]
Title: Harnessing Hypergraphs in Geometric Deep Learning for 3D RNA Inverse Folding
Authors: Guang Yang, Lei Fan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[681]  arXiv:2512.03590 [pdf, ps, other]
Title: Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[682]  arXiv:2512.03580 [pdf, ps, other]
Title: Dynamic Optical Test for Bot Identification (DOT-BI): A simple check to identify bots in surveys and online processes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[683]  arXiv:2512.03577 [pdf, ps, other]
Title: Cross-Stain Contrastive Learning for Paired Immunohistochemistry and Histopathology Slide Representation Learning
Comments: 6 pages, 2 figures. Camera-ready version accepted for IEEE BIBM 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684]  arXiv:2512.03575 [pdf, ps, other]
Title: UniComp: Rethinking Video Compression Through Informational Uniqueness
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685]  arXiv:2512.03574 [pdf, ps, other]
Title: Global-Local Aware Scene Text Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686]  arXiv:2512.03566 [pdf, ps, other]
Title: GAOT: Generating Articulated Objects Through Text-Guided Diffusion Models
Comments: Accepted by ACM MM Asia2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[687]  arXiv:2512.03558 [pdf, ps, other]
Title: CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
Comments: Accepted at SIGSPATIAL 2025 (Best paper candidates), 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[688]  arXiv:2512.03553 [pdf, ps, other]
Title: Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching
Comments: Accepted at KDD 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[689]  arXiv:2512.03542 [pdf, ps, other]
Title: V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[690]  arXiv:2512.03540 [pdf, ps, other]
Title: CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
Comments: Accepted by ACM Multimedia 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[691]  arXiv:2512.03534 [pdf, ps, other]
Title: Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation
Comments: Visualizations are available at the website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[692]  arXiv:2512.03532 [pdf, ps, other]
Title: OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[693]  arXiv:2512.03520 [pdf, ps, other]
Title: FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
Comments: 15 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[694]  arXiv:2512.03510 [pdf, ps, other]
Title: CSMapping: Scalable Crowdsourced Semantic Mapping and Topology Inference for Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[695]  arXiv:2512.03509 [pdf, ps, other]
Title: AfroBeats Dance Movement Analysis Using Computer Vision: A Proof-of-Concept Framework Combining YOLO and Segment Anything Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[696]  arXiv:2512.03508 [pdf, ps, other]
Title: Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation
Comments: ICCV 2025 (poster)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697]  arXiv:2512.03500 [pdf, ps, other]
Title: EEA: Exploration-Exploitation Agent for Long Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698]  arXiv:2512.03499 [pdf, ps, other]
Title: NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[699]  arXiv:2512.03479 [pdf, ps, other]
Title: Towards Object-centric Understanding for Instructional Videos
Authors: Wenliang Guo, Yu Kong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700]  arXiv:2512.03477 [pdf, ps, other]
Title: Fairness-Aware Fine-Tuning of Vision-Language Models for Medical Glaucoma Diagnosis
Comments: 10 pages, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[701]  arXiv:2512.03474 [pdf, ps, other]
Title: Procedural Mistake Detection via Action Effect Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[702]  arXiv:2512.03470 [pdf, ps, other]
Title: Difference Decomposition Networks for Infrared Small Target Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703]  arXiv:2512.03463 [pdf, ps, other]
Title: Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[704]  arXiv:2512.03454 [pdf, ps, other]
Title: Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[705]  arXiv:2512.03453 [pdf, ps, other]
Title: GeoVideo: Introducing Geometric Regularization into Video Generation Model
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 749 entries: 1-100 | ... | 306-405 | 406-505 | 506-605 | 606-705 | 706-749 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)