We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 175

[ total of 749 entries: 1-100 | 76-175 | 176-275 | 276-375 | 376-475 | 476-575 | ... | 676-749 ]
[ showing 100 entries per page: fewer | more | all ]

Tue, 9 Dec 2025 (continued, showing 100 of 259 entries)

[176]  arXiv:2512.07480 [pdf, ps, other]
Title: Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2512.07469 [pdf, ps, other]
Title: Unified Video Editing with Temporal Reasoner
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2512.07426 [pdf, ps, other]
Title: When normalization hallucinates: unseen risks in AI-powered whole slide image processing
Comments: 4 pages, accepted for oral presentation at SPIE Medical Imaging, 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[179]  arXiv:2512.07415 [pdf, ps, other]
Title: Data-driven Exploration of Mobility Interaction Patterns
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[180]  arXiv:2512.07410 [pdf, ps, other]
Title: InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2512.07394 [pdf, ps, other]
Title: Reconstructing Objects along Hand Interaction Timelines in Egocentric Video
Comments: webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2512.07391 [pdf, ps, other]
Title: GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2512.07385 [pdf, ps, other]
Title: How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2512.07383 [pdf, ps, other]
Title: LogicCBMs: Logic-Enhanced Concept-Based Learning
Comments: 18 pages, 19 figures, WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2512.07381 [pdf, ps, other]
Title: Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2512.07379 [pdf, ps, other]
Title: Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency
Comments: 22 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187]  arXiv:2512.07360 [pdf, ps, other]
Title: Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation
Comments: Accepted to WACV2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[188]  arXiv:2512.07351 [pdf, ps, other]
Title: DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD)
[189]  arXiv:2512.07348 [pdf, ps, other]
Title: MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2512.07345 [pdf, ps, other]
Title: Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting
Comments: 15 pages, 8 figures, 5 tables, 2 algorithms, Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2512.07338 [pdf, ps, other]
Title: Generalized Referring Expression Segmentation on Aerial Photos
Comments: Submitted to IEEE J-STARS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[192]  arXiv:2512.07331 [pdf, ps, other]
Title: The Inductive Bottleneck: Data-Driven Emergence of Representational Sparsity in Vision Transformers
Authors: Kanishk Awadhiya
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2512.07328 [pdf, ps, other]
Title: ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[194]  arXiv:2512.07305 [pdf, ps, other]
Title: Reevaluating Automated Wildlife Species Detection: A Reproducibility Study on a Custom Image Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2512.07302 [pdf, ps, other]
Title: Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[196]  arXiv:2512.07276 [pdf, ps, other]
Title: Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery
Comments: Accepted to WACV 2026. Camera-ready-based version with minor edits for readability (no change in the contents)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[197]  arXiv:2512.07275 [pdf, ps, other]
Title: Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation
Comments: The paper has been accepted by BIBM 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[198]  arXiv:2512.07273 [pdf, ps, other]
Title: RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199]  arXiv:2512.07269 [pdf, ps, other]
Title: A graph generation pipeline for critical infrastructures based on heuristics, images and depth data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[200]  arXiv:2512.07253 [pdf, ps, other]
Title: DGGAN: Degradation Guided Generative Adversarial Network for Real-time Endoscopic Video Enhancement
Comments: 18 pages, 8 figures, and 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[201]  arXiv:2512.07251 [pdf, ps, other]
Title: See More, Change Less: Anatomy-Aware Diffusion for Contrast Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202]  arXiv:2512.07247 [pdf, ps, other]
Title: AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing
Comments: 40 pages, 34 figures, 18 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[203]  arXiv:2512.07245 [pdf, ps, other]
Title: Zero-Shot Textual Explanations via Translating Decision-Critical Features
Comments: 11+6 pages, 8 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2512.07241 [pdf, ps, other]
Title: Squeezed-Eff-Net: Edge-Computed Boost of Tomography Based Brain Tumor Classification leveraging Hybrid Neural Network Architecture
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2512.07237 [pdf, ps, other]
Title: Unified Camera Positional Encoding for Controlled Video Generation
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206]  arXiv:2512.07234 [pdf, ps, other]
Title: Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[207]  arXiv:2512.07230 [pdf, ps, other]
Title: STRinGS: Selective Text Refinement in Gaussian Splatting
Comments: Accepted to WACV 2026. Project Page, see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208]  arXiv:2512.07229 [pdf, ps, other]
Title: ReLKD: Inter-Class Relation Learning with Knowledge Distillation for Generalized Category Discovery
Comments: Accepted to the Main Track of the 28th European Conference on Artificial Intelligence (ECAI 2025). To appear in the proceedings published by IOS Press (DOI: 10.3233/FAIA413)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[209]  arXiv:2512.07228 [pdf, ps, other]
Title: Towards Robust Protective Perturbation against DeepFake Face Swapping
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[210]  arXiv:2512.07215 [pdf, ps, other]
Title: VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[211]  arXiv:2512.07211 [pdf, ps, other]
Title: Object Pose Distribution Estimation for Determining Revolution and Reflection Uncertainty in Point Clouds
Comments: 8 pages, 8 figures, 5 tables, ICCR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[212]  arXiv:2512.07206 [pdf, ps, other]
Title: AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation and Lugano Staging on FDG-PET/CT
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[213]  arXiv:2512.07203 [pdf, ps, other]
Title: MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning
Comments: 7 pages, 1 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[214]  arXiv:2512.07201 [pdf, ps, other]
Title: Understanding Diffusion Models via Code Execution
Authors: Cheng Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[215]  arXiv:2512.07198 [pdf, ps, other]
Title: Generating Storytelling Images with Rich Chains-of-Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[216]  arXiv:2512.07197 [pdf, ps, other]
Title: SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting
Comments: The first three authors contributed equally to this work. The last two authors are co-corresponding authors. Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217]  arXiv:2512.07192 [pdf, ps, other]
Title: HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression
Comments: 12 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[218]  arXiv:2512.07191 [pdf, ps, other]
Title: RefLSM: Linearized Structural-Prior Reflectance Model for Medical Image Segmentation and Bias-Field Correction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219]  arXiv:2512.07190 [pdf, ps, other]
Title: Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[220]  arXiv:2512.07186 [pdf, ps, other]
Title: START: Spatial and Textual Learning for Chart Understanding
Comments: WACV2026 Camera Ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[221]  arXiv:2512.07171 [pdf, ps, other]
Title: TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
Comments: 21 pages, 11 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[222]  arXiv:2512.07170 [pdf, ps, other]
Title: Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[223]  arXiv:2512.07166 [pdf, ps, other]
Title: When Privacy Meets Recovery: The Overlooked Half of Surrogate-Driven Privacy Preservation for MLLM Editing
Comments: 9 pages,7figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2512.07165 [pdf, ps, other]
Title: MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2512.07155 [pdf, ps, other]
Title: CHIMERA: Adaptive Cache Injection and Semantic Anchor Prompting for Zero-shot Image Morphing with Morphing-oriented Metrics
Comments: Please visit our project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2512.07141 [pdf, ps, other]
Title: Think-Reflect-Revise: A Policy-Guided Reflective Framework for Safety Alignment in Large Vision Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[227]  arXiv:2512.07136 [pdf, ps, other]
Title: A Large-Scale Multimodal Dataset and Benchmarks for Human Activity Scene Understanding and Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[228]  arXiv:2512.07135 [pdf, ps, other]
Title: TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[229]  arXiv:2512.07128 [pdf, ps, other]
Title: MulCLIP: A Multi-level Alignment Framework for Enhancing Fine-grained Long-context CLIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230]  arXiv:2512.07126 [pdf, ps, other]
Title: Training-free Clothing Region of Interest Self-correction for Virtual Try-On
Comments: 16 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231]  arXiv:2512.07110 [pdf, ps, other]
Title: MSN: Multi-directional Similarity Network for Hand-crafted and Deep-synthesized Copy-Move Forgery Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2512.07107 [pdf, ps, other]
Title: COREA: Coarse-to-Fine 3D Representation Alignment Between Relightable 3D Gaussians and SDF via Bidirectional 3D-to-3D Supervision
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2512.07078 [pdf, ps, other]
Title: DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234]  arXiv:2512.07076 [pdf, ps, other]
Title: Context-measure: Contextualizing Metric for Camouflage
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2512.07065 [pdf, ps, other]
Title: Persistent Homology-Guided Frequency Filtering for Image Compression
Comments: 17 pages, 8 figures, code available at github.com/RMATH3/persistent-homology-compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[236]  arXiv:2512.07062 [pdf, ps, other]
Title: $\mathrm{D}^{\mathrm{3}}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[237]  arXiv:2512.07052 [pdf, ps, other]
Title: RAVE: Rate-Adaptive Visual Encoding for 3D Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[238]  arXiv:2512.07051 [pdf, ps, other]
Title: DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation
Comments: 11 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[239]  arXiv:2512.07037 [pdf, ps, other]
Title: Evaluating and Preserving High-level Fidelity in Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[240]  arXiv:2512.07034 [pdf, ps, other]
Title: Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[241]  arXiv:2512.06981 [pdf, ps, other]
Title: Selective Masking based Self-Supervised Learning for Image Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[242]  arXiv:2512.06949 [pdf, ps, other]
Title: Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
Comments: 19 pages, 5 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[243]  arXiv:2512.06921 [pdf, ps, other]
Title: NeuroABench: A Multimodal Evaluation Benchmark for Neurosurgical Anatomy Identification
Comments: Accepted by IEEE ICIA 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[244]  arXiv:2512.06905 [pdf, ps, other]
Title: Scaling Zero-Shot Reference-to-Video Generation
Comments: Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[245]  arXiv:2512.06888 [pdf, ps, other]
Title: Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246]  arXiv:2512.06886 [pdf, ps, other]
Title: Balanced Learning for Domain Adaptive Semantic Segmentation
Comments: Accepted by International Conference on Machine Learning (ICML 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2512.06885 [pdf, ps, other]
Title: JoPano: Unified Panorama Generation via Joint Modeling
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[248]  arXiv:2512.06882 [pdf, ps, other]
Title: Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion
Comments: Accepted to BMVC 2025 (Sheffield, UK, Nov 24-27, 2025). Supplementary video and poster available upon request
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249]  arXiv:2512.06877 [pdf, ps, other]
Title: SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification
Comments: Accepted and presented in ICSPIS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250]  arXiv:2512.06870 [pdf, ps, other]
Title: Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective
Comments: Accepted by Conference on Neural Information Processing Systems (NeurIPS 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251]  arXiv:2512.06866 [pdf, ps, other]
Title: Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior
Comments: Accepted by NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[252]  arXiv:2512.06865 [pdf, ps, other]
Title: Spatial Retrieval Augmented Autonomous Driving
Comments: Demo Page: this https URL with open sourced code, dataset, and checkpoints
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253]  arXiv:2512.06864 [pdf, ps, other]
Title: Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training
Comments: Accepted to WACV 2026. arXiv admin note: substantial text overlap with arXiv:2508.19808
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254]  arXiv:2512.06862 [pdf, ps, other]
Title: Omni-Referring Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255]  arXiv:2512.06849 [pdf, ps, other]
Title: Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT
Comments: In submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[256]  arXiv:2512.06845 [pdf, ps, other]
Title: Pseudo Anomalies Are All You Need: Diffusion-Based Generation for Weakly-Supervised Video Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257]  arXiv:2512.06840 [pdf, ps, other]
Title: CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles
Comments: Accepted to WACV 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258]  arXiv:2512.06838 [pdf, ps, other]
Title: SparseCoop: Cooperative Perception with Kinematic-Grounded Queries
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259]  arXiv:2512.06818 [pdf, ps, other]
Title: MeshSplatting: Differentiable Rendering with Opaque Meshes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[260]  arXiv:2512.06811 [pdf, ps, other]
Title: RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models
Comments: Accepted by AAAI 2026(Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[261]  arXiv:2512.06810 [pdf, ps, other]
Title: MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[262]  arXiv:2512.06802 [pdf, ps, other]
Title: VDOT: Efficient Unified Video Creation via Optimal Transport Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263]  arXiv:2512.06793 [pdf, ps, other]
Title: Generalized Geometry Encoding Volume for Real-time Stereo Matching
Comments: Accepted by AAAI 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264]  arXiv:2512.06783 [pdf, ps, other]
Title: Physics Informed Human Posture Estimation Based on 3D Landmarks from Monocular RGB-Videos
Comments: 16 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265]  arXiv:2512.06774 [pdf, ps, other]
Title: RDSplat: Robust Watermarking Against Diffusion Editing for 3D Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[266]  arXiv:2512.06769 [pdf, ps, other]
Title: Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[267]  arXiv:2512.06763 [pdf, ps, other]
Title: JOCA: Task-Driven Joint Optimisation of Camera Hardware and Adaptive Camera Control Algorithms
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268]  arXiv:2512.06759 [pdf, ps, other]
Title: VisChainBench: A Benchmark for Multi-Turn, Multi-Image Visual Reasoning Beyond Language Priors
Comments: 12 pages,13figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[269]  arXiv:2512.06750 [pdf, ps, other]
Title: UARE: A Unified Vision-Language Model for Image Quality Assessment, Restoration, and Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270]  arXiv:2512.06746 [pdf, ps, other]
Title: Task-Model Alignment: A Simple Path to Generalizable AI-Generated Image Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[271]  arXiv:2512.06738 [pdf, ps, other]
Title: FedSCAl: Leveraging Server and Client Alignment for Unsupervised Federated Source-Free Domain Adaptation
Comments: Accepted to Winter Conference on Applications of Computer Vision (WACV) 2026, Round 1
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272]  arXiv:2512.06736 [pdf, ps, other]
Title: Graph Convolutional Long Short-Term Memory Attention Network for Post-Stroke Compensatory Movement Detection Based on Skeleton Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273]  arXiv:2512.06726 [pdf, ps, other]
Title: The Role of Entropy in Visual Grounding: Analysis and Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[274]  arXiv:2512.06689 [pdf, ps, other]
Title: Lightweight Wasserstein Audio-Visual Model for Unified Speech Enhancement and Separation
Comments: Accepted to ASRU 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[275]  arXiv:2512.06684 [pdf, ps, other]
Title: EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 749 entries: 1-100 | 76-175 | 176-275 | 276-375 | 376-475 | 476-575 | ... | 676-749 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2512, contact, help  (Access key information)