Electrical Engineering and Systems Science

New submissions

Submissions received from Tue 12 May 26 to Wed 13 May 26, announced Thu, 14 May 26

[ total of 97 entries: 1-97 ]

New submissions for Thu, 14 May 26

[1] arXiv:2605.12533 [pdf, ps, other]: Title: Investigation of Chaotic Behavior in Clapp Oscillator

Authors: Ivana Vasiljević, Nikola Petrović, Aleksandra Lekić

Journal-ref: IcETRAN-2018, Palić, Serbia, 2018

Subjects: Systems and Control (eess.SY); Chaotic Dynamics (nlin.CD)

In this paper, we investigate the chaotic behavior of the class of oscillators known as Clapp oscillators. The Clapp oscillator is a simple oscillator containing one transistor and a few reactive elements (inductors and capacitors). This oscillator is chosen for its design simplicity and good performance. An oscillator with chaotic behavior can be used to construct a chaotic radar. To that end, this paper investigates an approach for constructing a chaotic Clapp oscillator, which can be further verified experimentally using microstrip technology.
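As background to the abstract above, the nominal (non-chaotic) oscillation frequency of a Clapp oscillator is set by its inductor in series with three capacitors. The sketch below uses the standard textbook relation for an ideal tank; the function name and component values are illustrative assumptions, not taken from the paper.

```python
import math

def clapp_frequency(L, C1, C2, C3):
    """Ideal resonant frequency (Hz) of a Clapp tank: L in series with C1, C2, C3."""
    # Series combination of the three capacitors
    c_eq = 1.0 / (1.0 / C1 + 1.0 / C2 + 1.0 / C3)
    return 1.0 / (2.0 * math.pi * math.sqrt(L * c_eq))

# Illustrative (assumed) component values: 100 nH, 10 pF, 10 pF, 1 pF
f0 = clapp_frequency(L=100e-9, C1=10e-12, C2=10e-12, C3=1e-12)
```

When C3 is much smaller than C1 and C2, it dominates the series combination, which is why the Clapp topology is less sensitive to transistor capacitances than a Colpitts stage.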
[2] arXiv:2605.12541 [pdf, ps, other]: Title: PG-LRF: Physiology-Guided Latent Rectified Flow for Electro-Hemodynamic PPG-to-ECG Generation

Authors: Xiaoda Wang, Minxiao Wang, Kaiqiao Han, Defu Cao, Ching Chang, Yidan Shi, Runze Yan, Xiao Luo, Yan Liu, Xiao Hu, Yizhou Sun, Wei Wang, Carl Yang

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Electrocardiography (ECG) is the clinical standard for cardiac assessment but requires dedicated hardware that does not scale to daily-life monitoring. Photoplethysmography (PPG) is ubiquitous in wearables but lacks ECG-specific diagnostic morphology and is corrupted by motion and sensor noise. PPG-to-ECG generation aims to bridge this gap by recovering electrical morphology and timing from peripheral pulse signals. However, existing methods largely rely on statistical alignment and data-driven generation. They fail to explicitly structure the latent space around physiology-aware electro-hemodynamic factors and lack constraints from forward physiological dynamics. To address these challenges, we propose PG-LRF, a physiology-guided latent rectified flow framework. PG-LRF introduces an electro-hemodynamic simulator that co-models ECG and PPG through shared cardiac phase dynamics. Guided by this simulator, a Physiology-Aware AutoEncoder learns a structured electro-hemodynamic latent space. Then we integrate this simulator guidance into a PPG-conditioned latent rectified flow, enforcing ECG-side morphology consistency and ECG-to-PPG forward hemodynamic consistency during generative transport. Experiments on the large-scale MC-MED dataset demonstrate that PG-LRF significantly improves PPG-to-ECG generation and downstream cardiovascular disease classification, proving its ability to generate ECGs that are both signal-faithful and physiologically plausible under the ECG-to-PPG hemodynamic pathway.
[3] arXiv:2605.12553 [pdf, ps, other]: Title: ChannelKAN: Multi-Scale Dual-Domain Channel Prediction via Hybrid CNN-KAN Architecture

Authors: Nanqing Jiang, Zhangyao Song, Tao Guo, Xiaoyu Zhao, Yinfei Xu

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)

Accurate channel state information (CSI) prediction is essential for improving the reliability and spectral efficiency of massive MIMO-OFDM systems in high-mobility scenarios. Existing deep learning methods struggle to jointly capture short-term local variations and long-range nonlinear dependencies in CSI sequences. To address this challenge, we propose ChannelKAN, a hybrid CNN-KAN channel prediction model with multi-scale frequency domain information enhancement. The key insight is that CNNs and Kolmogorov-Arnold Networks (KANs) are naturally complementary: CNNs extract intra-time-step local spatial-frequency correlations, while KANs with learnable Chebyshev polynomial activations fit inter-time-step nonlinear temporal evolution in a holistic manner. Specifically, a dual-domain expansion module first generates complementary frequency-domain and delay-domain CSI representations. A multi-scale frequency information enhancement module then retains dominant spectral components at multiple scales to strengthen key features and suppress noise. Next, a CNN-KAN feature extraction module captures local correlations via cascaded convolutions and models long-range dependencies via Chebyshev KAN layers. Finally, a dual-domain fusion module adaptively integrates features from both branches to produce the prediction. Experiments on 3GPP-compliant QuaDRiGa datasets demonstrate that ChannelKAN outperforms RNN, LSTM, GRU, CNN, and Transformer baselines in normalized mean square error (NMSE), spectral efficiency (SE), and bit error rate (BER) across various velocities and signal-to-noise ratios. Ablation studies further confirm the effectiveness of each proposed module.
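To make the phrase "learnable Chebyshev polynomial activations" concrete, here is a minimal sketch of a single such activation built from the three-term recurrence T_{k+1}(x) = 2x T_k(x) - T_{k-1}(x). The function names, the tanh input squashing, and the coefficient layout are assumptions made for illustration; the abstract does not specify how ChannelKAN parameterises its layers.

```python
import numpy as np

def chebyshev_basis(x, degree):
    """Return T_0..T_degree evaluated at x (expected in [-1, 1]), shape (..., degree + 1)."""
    x = np.asarray(x, dtype=float)
    basis = [np.ones_like(x), x]
    for _ in range(2, degree + 1):
        # Three-term recurrence for Chebyshev polynomials of the first kind
        basis.append(2.0 * x * basis[-1] - basis[-2])
    return np.stack(basis[: degree + 1], axis=-1)

def chebyshev_activation(x, coeffs):
    """phi(x) = sum_k coeffs[k] * T_k(tanh(x)); coeffs would be the learnable part."""
    coeffs = np.asarray(coeffs, dtype=float)
    # tanh maps any real input into [-1, 1], the natural domain of T_k
    return chebyshev_basis(np.tanh(x), len(coeffs) - 1) @ coeffs
```

In a KAN-style layer, one such coefficient vector would typically be learned per input-output edge, replacing the fixed nonlinearity of a conventional MLP.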
[4] arXiv:2605.12557 [pdf, ps, other]: Title: Localization in OFDM Passive Distributed Antenna Systems with Pilots and Unknown Data Payloads: A Marginal Maximum Likelihood Approach

Authors: Mathieu Reniers, Martin Willame, Jérôme Louveaux, Luc Vandendorpe

Comments: 13 pages, 10 figures, submitted to IEEE Transactions on Radar Systems

Subjects: Signal Processing (eess.SP)

Integrated Sensing and Communications (ISAC) is emerging as a key paradigm for future Sixth-Generation (6G) networks, with communication-centric designs favored for their compatibility with existing standards. Communication signals contain both known deterministic pilot symbols and unknown random data payloads. Most localization approaches rely solely on pilots, discarding the position information contained in the data symbols, which constitute the majority of each transmitted frame. Alternatively, Decision-Directed (DD) approaches exploit data decisions, thereby inherently limiting positioning performance to that of the communication system. In this paper, we derive a Marginal Maximum Likelihood (MML) estimator that jointly leverages pilot and data payloads without requiring data decoding, enabling operation with high-order constellations and under challenging noise conditions. We consider an opportunistic scenario in which an Orthogonal Frequency-Division Multiplexing (OFDM) signal transmitted by a User Equipment (UE) is captured by a distributed receiver array. Through numerical simulations, we demonstrate that the proposed method achieves superior localization performance compared to existing approaches and consistently converges to the genie bound (where data symbols are assumed perfectly known) at a lower Signal-to-Noise Ratio (SNR) than any DD method. Furthermore, the proposed method remains robust to constellation size, unlike DD approaches, whose performance degrades with increasing modulation order. Finally, we provide a computational complexity analysis of the proposed method and the considered baselines, highlighting the impact of system parameters on their respective computational costs.
[5] arXiv:2605.12560 [pdf, ps, other]: Title: Brain Tumor Classification in MRI Images: A Computationally Efficient Convolutional Neural Network

Authors: Md Fahimul Kabir Chowdhury, Jannatul Ferdous

Journal-ref: 2025 IEEE International Conference on Biomedical Engineering, Computer and Information Technology for Health (BECITHCON), pp. 633-638, 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Improving patient outcomes depends on the prompt and accurate diagnosis of brain tumors, but manual MRI scan analysis is still time-consuming and unreliable. Although deep learning has shown promise, many of the models that are now in use are computationally intensive and have difficulty handling the intrinsic complexity and variety of different types of brain tumors. In this work, we propose a lightweight yet high-performing Convolutional Neural Network (CNN) for multi-class brain tumor classification, employing MRI images to target gliomas, meningiomas, pituitary tumors, and healthy (no tumor) instances. The model was rigorously evaluated on two publicly accessible datasets from Figshare and Kaggle. Leveraging efficient feature extraction and optimized training strategies, our CNN achieved classification accuracies of 99.03% and 99.28%, along with ROC scores of 99.88% and 99.94% on Dataset 1 and Dataset 2, respectively, all while utilizing significantly fewer parameters than popular pre-trained architectures. In contrast to cutting-edge models like DenseNet201, MobileNetV2, VGG19, Xception, InceptionV3, and ResNet50, our approach consistently demonstrated superior performance with reduced computational overhead. These findings highlight the potential of the proposed model as a practical and reliable diagnostic aid in clinical environments.
[6] arXiv:2605.12562 [pdf, ps, other]: Title: Uncovering Latent Pathological Signatures in Pulmonary CT via Cross-Window Knowledge Distillation

Authors: Bo Peng, Wujian Xu, Kun Wang, Ximing Liao, Na Wang, Daqian Shi, Tian Li, Jing Gao, Johan Thygesen, Yingqun Ji, Honghan Wu

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Multi-window CT imaging captures complementary pathological information across anatomical structures of differing densities, yet existing deep learning methods fuse representations only at later stages, missing cross-density interactions. We propose a cross-window knowledge distillation framework in which student encoders learn latent clinical priors from a teacher trained on the most informative window. Evaluated retrospectively on three cohorts - COPD-CT-DF (n=719), RSNA PE (n=1,433), and an in-house CTEPD dataset (n=161) - distillation improved per-window AUC by 10.1-16.5 percentage points on COPD-CT-DF (0.75-0.81 to 0.90-0.94; all P<0.001), with ensemble AUC reaching 0.9960. Similar gains were observed on RSNA PE (0.80-0.83 to 0.90-0.92) and CTEPD (AUC 0.7481 vs. 0.6264). Cross-window distillation internalises pathological signatures invisible to supervised approaches, offering a generalisable solution for multi-window pulmonary CT analysis.
[7] arXiv:2605.12564 [pdf, ps, other]: Title: Multiport Antenna Q-factor

Authors: Vojtech Neuman, Miloslav Capek, Lukas Jelinek

Subjects: Signal Processing (eess.SP); Information Theory (cs.IT)

This article proposes an estimate of multiport antenna bandwidth based on a generalization of a single-port Q-factor. The explicit derivation is based on converting the stored energy matrix to its port equivalent and on the port parameters themselves. The work discusses the bandwidth dependencies on feeding and matching. Derived formulas are shown to utilize the total active reflection coefficient and allow for a single-frequency bandwidth evaluation. Examples comprising two different dipole arrays and electrically large patch antenna arrays validate the theory.
[8] arXiv:2605.12566 [pdf, ps, other]: Title: On Privacy-Preserving Image Transmission in Low-Altitude Networks: A Swin Transformer-Based Framework with Federated Learning

Authors: Kexin Zhang, Lixin Li, Yuna Yan, Xin Zhang, Wensheng Lin, Rui Li, Dongwei Zhao, Zhu Han

Comments: 13 pages, 10 figures, 2 tables

Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)

The rapid development of low-altitude economy has driven the proliferation of Unmanned Aerial Vehicle (UAV) applications, including logistics, inspection, and emergency response. However, transmitting high-volume image data from UAVs to ground stations faces significant challenges due to limited bandwidth and stringent privacy requirements. To address these issues, a Semantic Communication (SC) framework based on Federated Learning (FL) is proposed for efficient and privacy-preserving image transmission. A Swin Transformer-based Semantic Communication (STSC) architecture is designed to extract multi-scale semantic features under constrained bandwidth conditions. Dedicated communication and computing nodes are deployed on UAVs to enhance real-time coverage and flexibility. Meanwhile, a FL mechanism enables global model training across distributed devices without sharing raw data, thus preserving user privacy. Simulation experiments conducted on the CIFAR-10 dataset demonstrate that the proposed STSC framework achieves at least 5.7 dB improvement in Peak Signal-to-Noise Ratio (PSNR) compared to DeepJSCC baselines, while also showing superior convergence and generalization performance. The framework effectively integrates UAV-assisted deployment with SC and privacy protection, offering a practical solution for bandwidth-constrained image transmission in low-altitude networks.
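For readers unfamiliar with the metric quoted above, the following sketch computes PSNR from its standard definition and converts a PSNR gain in dB into the equivalent reduction in mean squared error. The function names and the 8-bit peak value of 255 are assumptions for illustration; the abstract does not state the evaluation details.

```python
import numpy as np

def psnr(reference, reconstruction, max_value=255.0):
    """Peak signal-to-noise ratio in dB: 10 * log10(max_value^2 / MSE)."""
    reference = np.asarray(reference, dtype=float)
    reconstruction = np.asarray(reconstruction, dtype=float)
    mse = np.mean((reference - reconstruction) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)

def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain (same peak value)."""
    return 10.0 ** (gain_db / 10.0)

ratio = mse_ratio_for_gain(5.7)
```

Under this definition, a 5.7 dB PSNR improvement corresponds to a mean squared error roughly 3.7 times smaller than the baseline's.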
[9] arXiv:2605.12569 [pdf, ps, other]: Title: Active Sensing with Meta-Reinforcement Learning for Emitter Localization from RF Observations

Authors: M. Shamail J. Khan, Nisha L. Raichur, Lucas Heublein, Christian Wielenberg, Alexander Mattick, Tobias Feigl, Christopher Mutschler, Felix Ott

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)

Global navigation satellite system (GNSS) interference poses a serious threat to reliable positioning, especially in indoor and multipath-rich environments where source localization is highly challenging. In this paper, we formulate GNSS interference localization as an active sensing problem and propose a reinforcement learning (RL) framework in which an agent sequentially explores the environment to infer the position of an emitter source from radio frequency (RF) observations acquired with a 2x2 patch antenna. The localization task is modeled as a partially observable decision process, since single-snapshot measurements are often ambiguous under multipath propagation and changing channel conditions. To address this, the proposed framework combines high-dimensional RF sensing with deep RL and recurrent policy learning. We investigate both value-based and policy-based approaches, namely Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO), and study their behavior under domain shift. The approach is evaluated on a simulated dataset generated with the Sionna ray-tracing module, which provides realistic propagation effects and diverse environment configurations. Experimental results show that the proposed method achieves a localization success rate of 80.1%, demonstrating the potential of RL for adaptive GNSS interference localization. Overall, the results highlight simulation-assisted training as a promising direction for robust interference localization in challenging propagation environments.
[10] arXiv:2605.12575 [pdf, ps, other]: Title: Are Compact Rationales Free? Measuring Tile Selection Headroom in Frozen WSI-MIL

Authors: Hyun Do Jung, Jungwon Choi, Soojung Choi, Yujin Oh, Hwiyoung Kim

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Whole-slide image (WSI) multiple instance learning (MIL) classifiers can achieve strong slide-level AUC while leaving the full-bag prediction opaque. Attention scores are widely reused as post-hoc explanations, but high attention can reflect aggregation preference rather than a compact, model-sufficient rationale. We study post-hoc rationale highlighting for frozen WSI-MIL: given a trained classifier, can its slide-level prediction be recovered from a compact, output-consistent tile subset without retraining the backbone? We instantiate this with Finding Optimal Contextual Instances (FOCI), a lightweight rationale-readout layer over a frozen MIL backbone. FOCI is trained with model-output sufficiency and exclusion objectives over keep/drop tile subsets, evaluated with an insertion-style Sequential Reveal Protocol (SRP) adapted to WSI-MIL, and summarized by the Selection Headroom Index (SHI). Across three WSI benchmarks and seven MIL backbones, FOCI reveals that compact rationales are selection-headroom dependent: transformer and multi-branch attention aggregators can admit compact rationales, near-minimal attention-pooling baselines enter a selection-saturation regime, and hard-selection backbones can conflict with an external readout. For TransMIL, relative to its documented CLS-proxy ranking, FOCI reduces the Minimum Sufficient K (MSK) tile count by 32-56% across benchmarks, while ACMIL+FOCI attains the highest mean SHI (+0.465). Deletion-based perturbation and selected-only downstream evaluation provide complementary checks. These results position FOCI as a model-level interpretability and audit layer: selected tiles are not claims of clinical or pathologist-level diagnostic sufficiency, but candidate rationales that offer a compact, reviewable view of when a frozen MIL prediction can be localized to a small output-consistent subset.
[11] arXiv:2605.12578 [pdf, ps, other]: Title: Recurrent Transformer-Based Near- and Far-Field THz Wideband Channel Estimation for UM-MIMO

Authors: Dmitry Artemasov (1), Alexander Shmatok (1), Kirill Andreev (1), Alexey Frolov (1), Manjesh K. Hanawal (2), Nikola Zlatanov (3) ((1) Center for Next Generation Wireless and IoT, Skolkovo Institute of Science and Technology, Moscow, Russia, (2) Department of IEOR, Indian Institute of Technology Bombay, India, (3) Faculty of Computer and Engineering Sciences, Innopolis University, Innopolis, Russia)

Comments: 15 pages, 15 figures

Journal-ref: IEEE Access, vol. 13, pp. 205396-205411, 2025

Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG)

The integration of terahertz communications and ultra-massive multiple-input multiple-output (UM-MIMO) systems in 6G networks is motivated by their ability to enable unprecedented data rates, mitigate spectrum congestion, and enhance overall network performance. However, the enlarged antenna apertures and higher carrier frequencies in these systems increase the Rayleigh distance, causing users to span both the near-field and conventional far-field regions. Accurate spatial precoding thus requires exact channel estimation at the base station - a task made more challenging by the hybrid coexistence of near- and far-field effects and the limited number of digital chains available in hybrid beamforming architectures. In this paper, we propose a block recurrent transformer model to address this challenge. We demonstrate that a single transformer block equipped with state memory can be trained once and then iteratively applied for hybrid-field channel estimation. Furthermore, we train the model such that it generalizes to wireless channels with varying scatterer distances, different numbers of propagation paths, and wideband operation. Simulation results show that the proposed method achieves performance gains of approximately 5 dB and 7.5 dB in normalized mean squared error (NMSE) over state-of-the-art solutions in narrowband and wideband scenarios, respectively.
[12] arXiv:2605.12728 [pdf, ps, other]: Title: Grid-Orch: An LLM-Powered Orchestrator for Distribution Grid Simulation and Analytics

Authors: Boming Liu, Jin Dong, Jamie Lian

Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)

The power distribution engineering workforce faces a projected shortage of up to 1.5 million engineers by 2030, creating urgent demand for more accessible analysis tools. This paper introduces Grid-Orch, a framework that bridges Large Language Models (LLMs) and power system simulation through the Model Context Protocol (MCP), enabling engineers to perform complex distribution analyses via natural language. Using OpenDSS as the reference implementation, Grid-Orch provides 36 domain-specific tools across eleven categories, covering power flow, voltage analysis, quasi-static time series (QSTS) simulation, and automated optimization. A provider-agnostic LLM layer supports both cloud-hosted (Gemini, Claude) and locally deployed (Ollama, llama-cpp) models, enabling air-gapped operation for security-sensitive utility environments. Three optimization skills, capacitor placement, voltage violation analysis, and overvoltage mitigation, extend the platform beyond single-tool queries to multi-step engineering workflows. Grid-Orch is delivered as an interactive web platform with chat-based interaction, a QSTS dashboard, and feeder topology visualization, and renders simulation results inline. Workflow demonstrations show that distribution analyses formerly requiring hours of scripting, such as distributed energy resource (DER) interconnection screening, complete in under two minutes through natural language, producing numerically identical results to direct OpenDSS scripting.
[13] arXiv:2605.12753 [pdf, ps, other]: Title: Optimization in Sparse 2D to Dense 3D Weakly Supervised Learning: Application to Multi-Label Segmentation of Large ex vivo MRI Data

Authors: Paul Hoareau, Kuan Yi Wang, Brandon Bujak, Roy Sun, Govind Nair, Irene Cortese, Charidimos Tsagkas, Daniel Reich, Julien Cohen-Adad

Comments: 19 pages. Submitted to Machine Learning for Biomedical Imaging (MELBA). Code and models: https://github.com/ivadomed/model_seg_sc-gm-lesion_human_ms_exvivo_t2star

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

INTRODUCTION | Fully supervised 3D segmentation of high-resolution ex vivo MRI is limited by the prohibitive cost of volumetric annotation, forcing reliance on sparse 2D slices. Weakly supervised Sparse-to-Dense frameworks bridge this gap, but guidelines remain ambiguous regarding human-centric visual enhancements and transferring optimization strategies across dimensions. We analyze divergent regularization needs for multi-class segmentation of high-resolution ex vivo spinal cord MRI.
METHODS | We used 9.4T MRI of multiple sclerosis spinal cords (>104,000 slices) with sparse annotations (428 slices). A 2D Teacher trained on sparse slices generated dense pseudo-labels to train a 3D Student. We systematically evaluated the impact of human-centric preprocessing, spatial augmentation, and soft-label regularization on both architectures.
RESULTS | We identified a critical divergence in training dynamics. The 2D Teacher required strong spatial augmentation and soft-labeling to overcome data scarcity, improving White Matter Lesion Dice scores by >11 points. However, propagating these techniques to the 3D Student degraded its performance. Furthermore, human-centric preprocessing (e.g., CLAHE) disrupted global statistical cues, dropping Gray Matter Lesion Dice scores by ~25 points.
DISCUSSION | Our study highlights a perception divergence (human-centric contrast enhancement harms machine models) and a regularization conflict across dimensions. 3D architectures trained on dense pseudo-labels exhibit fundamentally different optimization landscapes than 2D counterparts and require distinct, conservative regularization. Code and models: https://github.com/ivadomed/model_seg_sc-gm-lesion_human_ms_exvivo_t2star.
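The results above are reported as Dice scores per tissue class. The sketch below shows the standard per-label Dice coefficient, 2|A ∩ B| / (|A| + |B|), on integer label maps. The function name and the empty-mask convention are assumptions; reading the reported "points" as Dice on a 0-100 scale is also an interpretation, not something the abstract states.

```python
import numpy as np

def dice_score(pred, target, label):
    """Dice coefficient for one label in two integer segmentation maps."""
    pred_mask = np.asarray(pred) == label
    target_mask = np.asarray(target) == label
    denom = pred_mask.sum() + target_mask.sum()
    if denom == 0:
        # Convention assumed here: label absent from both maps counts as perfect agreement
        return 1.0
    return 2.0 * np.logical_and(pred_mask, target_mask).sum() / denom

# Toy 1D label maps (0 = background, 1 and 2 = two tissue classes)
pred = [0, 1, 1, 2]
target = [0, 1, 2, 2]
```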
[14] arXiv:2605.12779 [pdf, ps, other]: Title: Safe and Energy-Aware Decentralized PDE-Constrained Optimization-Based Control of Multi-UAVs for Persistent Wildfire Suppression

Authors: Longchen Niu, Gennaro Notomista

Subjects: Systems and Control (eess.SY)

This paper presents a safe and energy-aware optimization-based control framework for multi-UAV wildfire suppression under localization and motion uncertainties. We first develop a centralized density-based controller that couples UAV motion and water deployment in a wildfire-specific control Lyapunov function. This framework is then extended to a decentralized setting suitable for large-scale operations using only local information. The controllers use control barrier function constraints to enforce both danger zone avoidance and the ability to reach a charging region. Simulations and real quadcopter experiments demonstrate the controller's effectiveness in fire suppression while preserving safety and energy sufficiency over multiple charge cycles.
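The danger-zone avoidance described above relies on control barrier function (CBF) constraints. As a heavily simplified illustration, the sketch below filters a nominal velocity command so that a circular zone is avoided. It assumes single-integrator dynamics (x_dot = u), a barrier h(x) = ||x - c||^2 - r^2, and a linear class-K term alpha * h; none of these choices, nor the function name, come from the paper.

```python
import numpy as np

def cbf_filter(x, u_nom, center, radius, alpha=1.0):
    """Minimally modify u_nom so that h(x) = ||x - center||^2 - radius^2 stays >= 0.

    Solves  min ||u - u_nom||^2  s.t.  grad_h . u + alpha * h >= 0  in closed form.
    Assumes x_dot = u and x != center (so grad_h is nonzero).
    """
    x, u_nom, center = (np.asarray(v, dtype=float) for v in (x, u_nom, center))
    h = np.dot(x - center, x - center) - radius ** 2
    grad_h = 2.0 * (x - center)
    slack = np.dot(grad_h, u_nom) + alpha * h
    if slack >= 0.0:
        return u_nom  # nominal command already satisfies the constraint
    # Project u_nom onto the boundary of the half-space defined by the constraint
    return u_nom - slack * grad_h / np.dot(grad_h, grad_h)

# A UAV at (2, 0) commanded straight toward a unit-radius zone at the origin
u_safe = cbf_filter(x=[2.0, 0.0], u_nom=[-10.0, 0.0], center=[0.0, 0.0], radius=1.0)
```

With a single affine constraint the quadratic program has this closed-form projection; with several constraints (for example, avoidance plus reachability of a charging region) a QP solver would be needed instead.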
[15] arXiv:2605.12806 [pdf, ps, other]: Title: Cross-Harmonic Ambiguity-Aligned Multiport Parameter Estimation for Time-Floquet RIS

Authors: Philipp del Hougne

Comments: 5 pages with 4 figures

Subjects: Signal Processing (eess.SP); Applied Physics (physics.app-ph)

A time-Floquet reconfigurable intelligent surface (TF-RIS) periodically modulates its elements within a signaling interval, enabling frequency conversion and additional degrees of freedom compared with a conventional RIS. Time-Floquet multiport-network theory (TF-MNT) provides a physics-consistent model for TF-RISs that accounts for inter-element coupling, but its practical use requires estimating the underlying parameters when the TF-RIS design and radio environment are (partially) unknown. In this Letter, we propose a segmented estimation approach for constructing an accurate proxy TF-MNT model from end-to-end measurements. First, with the TF-RIS operated as a conventional RIS, we estimate conventional proxy MNT parameters independently at each considered time-Floquet harmonic. Second, under periodic time modulation, we align the inherent ambiguities among the per-harmonic conventional proxy MNT parameters, considering three measurement setups with different access to phase and harmonic information. Based on full-wave numerical simulations, we quantify the impact of the number of measurements and the noise level on the proxy-model accuracy. Finally, we demonstrate the performance loss incurred without the proposed ambiguity alignment in a canonical harmonic backscatter communications scenario.
[16] arXiv:2605.12971 [pdf, ps, other]: Title: Port-Hamiltonian Systems with Dissipation Potential: Modelling and Trajectory Tracking Control

Authors: Jinjun Jia, Yuchen Liao, Kang An, Xun Yan, Tiedong Zhang, Dapeng Jiang

Comments: 7 pages, 2 figures

Subjects: Systems and Control (eess.SY)

Port-Hamiltonian systems (PHS) and interconnection and damping assignment passivity-based control (IDA-PBC) have achieved broad success in modelling and stabilisation of physical systems. However, the absence of a dedicated scalar potential for the momentum channel forces any modification of the momentum-dependent dynamics to proceed indirectly through the interconnection and damping matrices, rendering the matching partial differential equation (PDE) difficult to solve and complicating extensions to trajectory tracking. This paper proposes a port-Hamiltonian system with dissipation potential (PHS-DP), in which the damping matrix is replaced by scalar convex dissipation potentials, providing independent scalar objects for the momentum and auxiliary state channels and restoring the variational symmetry between stored and dissipated energy. Building on this framework, Dual Potential Shaping Control (DPSC) achieves trajectory tracking by sequentially shaping the potential energy and dissipation potentials without modifying the interconnection structure. Contraction of the closed-loop cascade is established via a hierarchical contraction argument, and the matching condition is satisfied automatically for any admissible choice of shaped potentials, requiring no PDE to be solved. In contrast to existing PDE-free energy shaping approaches, which achieve this by abandoning the port-Hamiltonian closed-loop structure and sacrificing physical interpretability, the proposed framework preserves the interconnection structure and retains a transparent energy-based interpretation at every stage of the design. Validation on a magnetic levitation system demonstrates tracking performance comparable to timed IDA-PBC with substantially reduced design complexity.
[17] arXiv:2605.13005 [pdf, ps, other]: Title: Geometry-Aware Multi-Armed Bandits for Antenna Beam Selection on Spheres, Tori, SO(3), and Reconfigurable Intelligent Surfaces

Authors: Yuriy Dorn, Changsheng Chen, Ning Xie

Comments: Prepared to submit to IEEE Transactions on Wireless Communications

Subjects: Signal Processing (eess.SP)

Beam alignment in mmWave phased arrays and RIS-assisted links is a stochastic bandit under both short TTI budgets and Doppler-induced non-stationarity. The arm space is a Riemannian manifold: $\mathbb{S}^2$ for steering, $\mathbb{T}^n$ for phase combining, $\mathrm{SO}(3)$ for panel orientation, or the discrete torus $(\mathbb{Z}_B)^M$ with up to $K \sim 10^{90}$ configurations for $B$-level RIS ($B = 2^b$, $b$ bits/element); the intrinsic Matérn kernel of Borovitskiy et al. provides the base GP. We contribute two algorithmic pieces. (C1) A Kronecker-factorised intrinsic-product Matérn kernel on $(\mathbb{Z}_B)^M$ evaluating in $O(M)$ table lookups, making GP-UCB tractable at $K \sim 10^{90}$ where the extrinsic alternative is infeasible. (C2) AdaptiveGP-v2, an online sliding-window controller that selects $W$ by per-sample marginal likelihood, with predictive-variance and drift $z$-score reset triggers and a post-reset $\beta$-boost. On a four-speed ($v \in \{0.02, 0.08, 0.12, 0.20\}$ km/h), 20-seed paired campaign at $T = 3000$, AdaptiveGP-v2 is statistically indistinguishable from the hand-tuned fixed-window oracle at every speed (Holm-Bonferroni-corrected paired differences cross zero); the operational benefit is the absence of a deployment-time per-speed calibration step, not a mean-regret improvement. On four static 3GPP-style mmWave benchmarks, intrinsic-kernel GP-UCB reduces cumulative regret by 25-45% vs. codebook UCB1/Thompson and by 10-33% vs. Euclidean-ambient GP-UCB on the toroidal arm spaces; a wideband OFDM ablation on a 100 MHz channel confirms the advantage persists under frequency-selective fading ($\sim 32$ Mbps/UE at initial access vs. UCB1). A third-party-simulator sanity check on Sionna CDL is reported in Section V.
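The idea behind contribution (C1), a product kernel over per-element phase states that is evaluated with one table lookup per RIS element, can be sketched as follows. The per-element table here is an assumed graph-Matérn-style kernel on the cycle graph of the $B$ phase levels; the function names, the spectral form, and the parameters nu and kappa are illustrative assumptions and may differ from the kernel actually used in the paper.

```python
import numpy as np

def cycle_matern_table(B, nu=1.5, kappa=1.0):
    """Per-element kernel values k1[d], d = 0..B-1, on the cycle graph Z_B.

    Assumed form: spectrum (2*nu/kappa^2 + lambda_j)^(-nu) on the cycle-graph
    Laplacian eigenvalues lambda_j = 2 - 2*cos(2*pi*j/B), normalised so k1[0] == 1.
    """
    j = np.arange(B)
    lam = 2.0 - 2.0 * np.cos(2.0 * np.pi * j / B)
    spectrum = (2.0 * nu / kappa ** 2 + lam) ** (-nu)
    d = np.arange(B)
    # Inverse Fourier sum of the (real, even) spectrum over the cyclic group
    table = (spectrum[None, :] * np.cos(2.0 * np.pi * np.outer(d, j) / B)).sum(axis=1)
    return table / table[0]

def product_kernel(x, y, table):
    """k(x, y) = prod_m table[(x_m - y_m) mod B], i.e. M table lookups."""
    B = len(table)
    diff = (np.asarray(x) - np.asarray(y)) % B
    return float(np.prod(table[diff]))

table = cycle_matern_table(B=4)
x = [0, 1, 2, 3, 0, 1]
y = [1, 1, 0, 3, 2, 1]
k_xy = product_kernel(x, y, table)
```

The cost of one evaluation is linear in the number of elements M and independent of the number of configurations B^M, which is the property the abstract highlights. For very large M, summing log-magnitudes instead of multiplying directly would avoid floating-point underflow.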
[18] arXiv:2605.13015 [pdf, ps, other]: Title: A General Bézier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease Analysis

Authors: Tan Su, Ethan Elio Meidinger, Lin Gu, Ruogu Fang

Comments: 33 pages, 6 figures; preprint

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

The geometry of the retinal vessel is a key biomarker of vascular diseases, yet clinical evidence remains primarily observational. Existing generative counterfactuals intervene only at the image-level disease label, failing to isolate explicit anatomical structure. To address this limitation, we propose the Bézier Tree Encoding Counterfactual Framework (BTECF). By abstracting vascular networks into interconnected cubic-Bézier segments, BTECF establishes a disease-agnostic representation in which structural topology is explicitly preserved and atomically perturbable. Coupling this encoding with a diffusion-based generator enables parameter-level do-interventions on explicit geometric axes (e.g., tortuosity, caliber) while preserving background fundus textures. We validate BTECF on diabetic retinopathy, together with independent cohorts for ischemic stroke and Alzheimer's disease. Isolated counterfactual interventions produce dose-responsive shifts in classifier predictions; a matched pixel-drop control attenuates this response by an order of magnitude or more, ruling out out-of-distribution generation artifacts. By enforcing causal isolation between vessel topology and pixel-level confounders, BTECF provides a unified generative paradigm for hypothesis verification across systemic diseases. To support reproducibility, the code will be publicly released upon acceptance.
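To illustrate what a cubic-Bézier vessel segment and a geometric axis such as tortuosity might look like in practice, the sketch below evaluates one segment from its four control points and computes tortuosity as arc length divided by chord length. That tortuosity definition is one common choice and is assumed here; the function names and control points are hypothetical, and the abstract does not say which definition BTECF uses.

```python
import numpy as np

def cubic_bezier(p0, p1, p2, p3, t):
    """Evaluate a cubic Bezier curve at parameter(s) t in [0, 1]."""
    p0, p1, p2, p3 = (np.asarray(p, dtype=float) for p in (p0, p1, p2, p3))
    t = np.asarray(t, dtype=float)[..., None]  # broadcast over coordinates
    return ((1 - t) ** 3 * p0 + 3 * (1 - t) ** 2 * t * p1
            + 3 * (1 - t) * t ** 2 * p2 + t ** 3 * p3)

def tortuosity(p0, p1, p2, p3, samples=200):
    """Arc length / chord length (assumed definition); requires distinct endpoints."""
    pts = cubic_bezier(p0, p1, p2, p3, np.linspace(0.0, 1.0, samples))
    arc = np.linalg.norm(np.diff(pts, axis=0), axis=1).sum()  # polyline approximation
    chord = np.linalg.norm(pts[-1] - pts[0])
    return arc / chord

straight = tortuosity((0, 0), (1, 0), (2, 0), (3, 0))
curved = tortuosity((0, 0), (1, 2), (2, -2), (3, 0))
```

Because the curve is fully determined by its control points, moving the two interior points changes tortuosity while leaving the segment's endpoints, and hence its connections to neighbouring segments, fixed.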
[19] arXiv:2605.13031 [pdf, ps, other]: Title: Relative Pose-Velocity Estimation Using Dual IMU Measurements and Relative Position Sensing

Authors: Alessandro Melis, Tarek Bouazza, Soulaimane Berkane, Tarek Hamel

Subjects: Systems and Control (eess.SY); Robotics (cs.RO)

This paper addresses the problem of estimating the relative pose (position and orientation) and velocity of a vehicle with respect to a moving target, where both are equipped with Inertial Measurement Units (IMUs), assuming the availability of relative position or bearing measurements. The body-target relative dynamics are formulated on $\mathbf{SE}_2(3)$ and recast into a linear time-varying (LTV) model in the ambient space $\mathbb{R}^{15}$, on which a deterministic Riccati observer is designed. We analyze the uniform observability (UO) conditions required to guarantee global exponential convergence of the estimation error in the ambient space for both measurement cases. In the case of relative position measurements, UO requires only a persistence-of-excitation condition on the target acceleration, whereas for bearing measurements, additional conditions are required. Building on this, a nonlinear complementary filter on $\mathbf{SO}(3)$ is designed to provide a smooth estimate of the orientation component of the state with almost global asymptotic stability. Finally, simulation results are provided to validate the proposed solution.
[20] arXiv:2605.13061 [pdf, ps, other]: Title: Revisiting Voltage and Synchronization Stability Analysis in Converter-Integrated Weak Grids: Insights from Non-Minimum-Phase Zeros

Authors: Fuyilong Ma, Lidong Zhang, Wangqianyun Tang, Waisheng Zheng, Huanhai Xin, Linbin Huang, Lennart Harnefors

Comments: 10 pages

Subjects: Systems and Control (eess.SY)

The increasing penetration of converter-interfaced generators (CIGs) intensifies concerns over small-signal voltage and synchronization stability. While existing theories treat these two stability issues distinctly, practical wisdom in contrast employs a unified and static metric, short-circuit ratio (SCR), to assess both in weak grids. This paper aims to bridge this theory-practice gap by introducing the insight of non-minimum phase (NMP) zeros. First, we demonstrate that the two stability issues in weak grids originate from NMP zeros in the grid Jacobian transfer matrix: a zero at the origin corresponds to voltage instability, while low-frequency zeros impose fundamental constraints on synchronization dynamics. The traditional SCR is proven to be a special case of our proposed novel stability metric, NMP-zero (NMP-Z) factor, evaluated at the rated operating point. This establishes the theoretical foundation for the empirical success of SCR. Building on this insight, we then develop a unified stability assessment method for multi-converter systems. The method retains the simplicity of SCR, requiring only the NMP-Z factor together with individual CIG dynamic models and enabling stability margin assessment under various operating points. Our work provides a simple yet theoretically rigorous framework for stability analysis in CIG-integrated weak grids, with all theoretical findings and the proposed method validated through detailed time-domain simulations.
[21] arXiv:2605.13098 [pdf, ps, other]: Title: Impact of Terrestrial Blockage on the Coverage of Integrated Satellite-Terrestrial Networks

Authors: Joon-Young Park, Byungju Lim, Young-Chai Ko

Comments: 7 pages, 4 figures, to be published in Proc. IEEE 103rd Vehicular Technology Conference (VTC2026-Spring)

Subjects: Signal Processing (eess.SP)

The integration of non-terrestrial networks (NTNs) with terrestrial networks (TNs) is an important step toward ubiquitous connectivity in sixth-generation (6G). Despite growing interest, the geometric impact of urban blockages on an integrated satellite-terrestrial network (ISTN) has not been rigorously quantified. In this paper, we develop a stochastic geometry-based analytical framework that incorporates a Boolean blockage model to characterize the downlink coverage probability of the ISTN and to provide insights for blockage-aware system design. Our analysis reveals that blockages affect satellite links in two competing ways: while they attenuate desired signals, they can also act as spatial shields that suppress aggregate interference. Leveraging this observation, we analytically show that satellite-terrestrial integration can enhance coverage probability across diverse environments ranging from open areas to dense urban deployments, offering a resilient and mathematically tractable approach to maintaining connectivity under heterogeneous blockage conditions.
[22] arXiv:2605.13120 [pdf, ps, other]: Title: D-Optimized Sampling Design for System Identification

Authors: Enrico Dozzi, Tom Oomen, Rodrigo A. González

Comments: 6 pages, 2 figures. Accepted for IFAC 2026. Final author version

Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)

Traditional system identification with multisine inputs relies on uniform sampling and periodic excitation to preserve Fourier orthogonality and avoid spectral leakage, limiting its use in scenarios with irregular sampling or nonperiodic inputs. This work investigates continuous-time system identification under nonperiodic multisine excitation and nonuniform sampling. We develop a nonparametric frequency response function estimator suited to such conditions and design irregular sampling schemes that enhance the informativeness of measurements and reduce spectral leakage. The proposed sampling schemes improve the statistical accuracy of system identification in settings where periodic excitation is impractical.
[23] arXiv:2605.13134 [pdf, ps, other]: Title: Security-Aware Planning and Control of Multi-Agent Systems with LTL Tasks

Authors: Georgios Mitsos, Dimos V. Dimarogonas, Siyuan Liu

Comments: 8 pages, 2 figures; This paper has been accepted at the IFAC World Congress 2026

Subjects: Systems and Control (eess.SY)

This paper presents a secure-by-construction planning and control framework for multi-agent systems subject to linear temporal logic (LTL) specifications. The framework protects sensitive information from a passive intruder with partial observations of the agents' motion. Security in multi-agent coordination is captured by two notions that prevent the intruder from inferring whether a secret task has been executed and from identifying the agent responsible for its execution. The proposed framework incorporates the security constraints directly into the LTL synthesis procedure by constructing a secure finite transition system that removes all paths violating these constraints. Standard LTL synthesis is then applied to this secure abstraction to generate discrete plans, which are then refined into dynamically feasible continuous trajectories. This synthesis procedure provides formal guarantees that the resulting behavior of the multi-agent system satisfies both the global LTL specification and the security constraints. The effectiveness of the proposed framework is demonstrated through a two-drone case study.
[24] arXiv:2605.13135 [pdf, ps, other]: Title: Subspace Pruning via Principal Vectors for Accurate Koopman-Based Approximations

Authors: Dhruv Shah, Jorge Cortés

Subjects: Systems and Control (eess.SY)

The accuracy of Koopman operator approximations over finite-dimensional spaces relies critically on their invariance properties. These can be rigorously quantified via the principal angles between a candidate subspace and its image under the Koopman operator. This paper proposes a unified algebraic framework for subspace pruning designed to systematically refine the invariance error. We establish the geometric equivalence between consistency-based methods and principal-vector pruning, and build on this insight to introduce a hybrid strategy that balances between multiple and single principal vector pruning for improved numerical stability and scalability. We derive error bounds for the retention of approximate and external eigenfunctions, demonstrating that the multi-vector approach mitigates the numerical drift inherent to sequential pruning. To ensure scalability, we develop an efficient numerical update scheme based on rank-one modifications that reduces the computational complexity of tracking principal angles by an order of magnitude. Finally, we exploit the subspace obtained from the pruning algorithms to build a lifted linear model for state prediction that accounts for the trade-offs between improving invariance and minimizing state reconstruction error. Simulations demonstrate the effectiveness of our approach.
[25] arXiv:2605.13220 [pdf, ps, other]: Title: Real-time Gaussian Process based Approximate Model Predictive Trajectory Tracking Control for Autonomous Vehicles

Authors: Alexander Rose, Lukas Theiner, Rolf Findeisen

Comments: accepted for the 24th European Control Conference (ECC) 2026

Subjects: Systems and Control (eess.SY)

Applying model predictive control on embedded systems remains challenging due to the high computational cost of solving optimal control problems. To address this limitation, computationally efficient Gaussian process approximations of the implicit model predictive control law can be employed. However, for trajectory-tracking applications, the large amount of training data required for successful generalization across distinct reference trajectories poses a significant challenge. To improve data efficiency, we first propose to transform the model into curvilinear coordinates around the reference trajectory. Second, we use a nominal feedforward component, allowing the Gaussian process to learn only the residual control input, making the approximation of a trajectory-tracking controller feasible. To underline the applicability of the approach, we deploy the controller on a Raspberry Pi in a small-scale vehicle and validate it experimentally. Compared to a model predictive control implementation using real-time iterations, the Gaussian process based approximation computes control inputs about five times faster while achieving similar closed-loop tracking performance.
[26] arXiv:2605.13243 [pdf, ps, other]: Title: Spatial Competition for Low-Complexity Learned Image Compression

Authors: Théophile Blard, Pierrick Philippe, Théo Ladune, Xiaoran Jiang, Olivier Déforges

Comments: Accepted at ICIP 2026

Subjects: Image and Video Processing (eess.IV)

Autoencoder-based image codecs achieve state-of-the-art compression performance but often incur high computational complexity, particularly at decoding time. This work introduces a low-complexity learned image compression framework based on spatial competition between multiple specialized neural codecs. For each image region, the encoder selects the codec that best matches the local content according to a rate-distortion cost. A mode map is transmitted as side information to indicate the per-region codec selection. At decoding time, this mode map-based selection guides reconstruction while preserving the complexity of a single codec. This design enables per-image adaptation with low decoding complexity and fast encoding. On the CLIC 2020 dataset, our method achieves a rate reduction of up to 14.5% compared to a single codec and reaches HEVC-level performance with a decoding complexity of 1433 MACs per pixel.
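The per-region selection step described above can be sketched as a Lagrangian rate-distortion decision. This is a minimal illustration assuming the usual cost J = D + λR; the abstract does not give the exact cost, the region layout, or how the mode map itself is coded, so `select_modes`, the candidate numbers, and the omission of mode-map signaling cost are all assumptions.

```python
def select_modes(candidates, lam):
    """Pick one codec per region by minimizing J = D + lam * R.

    candidates[r][k] = (rate_bits, distortion) for region r coded with codec k.
    Returns (mode_map, total_cost); mode_map[r] is the index of the chosen codec.
    The cost of transmitting the mode map itself is ignored in this sketch.
    """
    mode_map = []
    total_cost = 0.0
    for region in candidates:
        costs = [dist + lam * rate for (rate, dist) in region]
        best = min(range(len(costs)), key=costs.__getitem__)
        mode_map.append(best)
        total_cost += costs[best]
    return mode_map, total_cost

# Hypothetical measurements for two regions and two codecs.
candidates = [
    [(100, 4.0), (60, 9.0)],   # region 0: codec 0 is costlier but more accurate
    [(80, 5.0), (120, 0.5)],   # region 1: codec 1 is costlier but more accurate
]
low_lambda_map, low_lambda_cost = select_modes(candidates, lam=0.1)
high_lambda_map, high_lambda_cost = select_modes(candidates, lam=0.2)
```

With the smaller λ the more accurate codec wins in both regions; doubling λ shifts both decisions to the cheaper codec. The decoder only reads the mode map and runs the indicated codec for each region, which is why decoding cost stays at that of a single codec.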
[27] arXiv:2605.13248 [pdf, ps, other]: Title: Compact Latent Manifold Translation: A Parameter-Efficient Foundation Model for Cross-Modal and Cross-Frequency Physiological Signal Synthesis

Authors: Bo Cui, Xiaowen Song, Yaowen Zhang, Shunzhe Zhang, B.J.F. van Beijnum, Monique Tabak, Ying Wang

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)

The analysis of physiological time series, such as electrocardiograms (ECG) and photoplethysmograms (PPG), is persistently hindered by modality and frequency gaps stemming from heterogeneous recording devices. Existing foundation models typically rely on continuous latent spaces, which frequently suffer from severe modality entanglement, lack high-fidelity cross-frequency generative capacity, and impose high computational costs that prohibit edge-device deployment. In this paper, we propose Compact Latent Manifold Translation (CLMT), a highly parameter-efficient (0.09B) unified framework that bridges these gaps through a novel two-stage discrete translation paradigm. First, we introduce a Universal Tokenizer utilizing Hierarchical Residual Vector Quantization (RVQ) to decouple heterogeneous signals into isolated, well-structured discrete latent manifolds, effectively preventing inter-modality interference. Second, a Context-Prompted Latent Translator maps these discrete tokens across modalities by integrating static physiological priors, reframing complex signal synthesis as a pure latent sequence translation task. Extensive evaluations demonstrate that our 0.09B model significantly outperforms massive baselines. In cross-modal PPG-to-ECG synthesis, it resolves temporal phase drift and dramatically improves the clinical R-peak detection F1-score from 0.37 (baseline) to 0.83. Furthermore, in extreme cross-frequency super-resolution (25Hz to 100Hz), it successfully recovers high-frequency diagnostic landmarks, achieving an unprecedented Pearson correlation of 0.9956. By learning a universal discrete language for biological signals with a fraction of the computational footprint, our approach sets a new trajectory for edge-deployable, multi-modal medical foundation models.
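For readers unfamiliar with residual vector quantization, the sketch below shows the generic mechanism: each stage quantizes whatever the previous stages left unexplained, and the decoder sums the selected codewords. It is not the paper's Universal Tokenizer; the codebooks are tiny hand-written placeholders rather than learned ones, and the function names are hypothetical.

```python
def rvq_encode(x, codebooks):
    """Return one codeword index per stage; each stage quantizes the running residual."""
    residual = list(x)
    indices = []
    for codebook in codebooks:
        # Nearest codeword to the current residual in squared Euclidean distance.
        idx = min(
            range(len(codebook)),
            key=lambda i: sum((r - c) ** 2 for r, c in zip(residual, codebook[i])),
        )
        indices.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return indices

def rvq_decode(indices, codebooks):
    """Reconstruct by summing the selected codeword from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out

# Placeholder codebooks: a coarse first stage and a finer second stage.
codebooks = [
    [(0.0, 0.0), (1.0, 1.0), (-1.0, 1.0)],
    [(0.0, 0.0), (0.25, 0.0), (0.0, 0.25), (-0.25, -0.25)],
]
tokens = rvq_encode((1.2, 1.05), codebooks)
reconstruction = rvq_decode(tokens, codebooks)
```

The discrete `tokens` list is what a downstream sequence model would operate on; adding stages refines the reconstruction without enlarging any single codebook.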
[28] arXiv:2605.13269 [pdf, ps, other]: Title: Submodular Multi-Agent Policy Learning for Online Distributed Task Allocation in Open Multi-Agent Systems

Authors: Jing Liu, Yangyang Yang, Luca Ballotta, Fangfei Li, Yang Tang, Ruggero Carli

Subjects: Systems and Control (eess.SY)

This paper studies multi-agent reinforcement learning with submodular team utilities for online distributed task allocation. In this setting, each agent selects one action from a local categorical policy, so feasible joint actions form a partition matroid over agent-action pairs. Classical multilinear extensions use independent Bernoulli sampling and therefore do not match the categorical policies executed by decentralized agents. To address this mismatch, we introduce the Partition Multilinear Extension (PME), a continuous relaxation whose value equals the expected team utility under factorized categorical policies. We prove that submodular difference rewards provide unbiased PME marginal-gradient information and yield a stagewise score-function policy-gradient estimator. Based on this connection, we propose SubMAPG, a centralized-training decentralized-execution policy-gradient framework with masked categorical policies and submodular difference-reward training signals. For the associated PME marginal-space projected stochastic-gradient dynamics, we prove a stagewise 1/2-approximation guarantee and sublinear dynamic regret in slowly varying environments, measured by the path length of the optimal PME marginals. To handle open systems with time-varying agents and targets, we instantiate SubMAPG with graph neural network policies. Experiments on multi-robot coverage and multi-target tracking show that SubMAPG outperforms local greedy and shared-reward baselines and is competitive with centralized myopic greedy strategies.
[29] arXiv:2605.13286 [pdf, ps, other]: Title: Implementing Fluid Antennas in the Beamspace: Performance Evaluation and Codebook Design

Authors: Pablo Ramírez-Espinosa, F. Javier López-Martínez, David Morales-Jiménez

Subjects: Signal Processing (eess.SP)

Metasurface-based fluid antenna systems (FASs) have been recently proposed as an inexpensive, scalable and practical alternative implementation for the fluid-antenna concept. This work thoroughly evaluates the performance of metasurface-based FASs in the context of multi-user communications. We extend the state-of-the-art signal model of FASs to electronically-reconfigurable designs, explicitly including the antenna response in the equivalent channel and resulting correlation structure. A general codebook design procedure, accounting for practical aspects like reflections and radiation efficiency, is presented and used to design the different antenna configurations (regarded as FAS ports). Importantly, we show that, with proper design, metasurface-based FASs can significantly outperform conceptual ones. While state-of-the-art theoretical embodiments of FAS rely on spatial flexibility for constructive/destructive interference, metasurface-based FASs exploit interference cancellation through projection onto the interference null space. Numerical results show a remarkable improvement when the system is dominated by interference (i.e., the natural operational regime of FASs), regardless of spatial propagation characteristics.
[30] arXiv:2605.13309 [pdf, ps, other]: Title: SimART: A Unified and Open Real-world Multimodal Simulation Platform for 6G Integrated Sensing and Communication

Authors: Kang Yan, Yuqi Cao, Jiaqi Li, Luping Xiang, Kun Yang

Subjects: Signal Processing (eess.SP)

Research on sixth-generation (6G) integrated sensing and communication (ISAC) increasingly depends on multimodal datasets. These datasets need to jointly characterize wireless propagation, onboard sensing, and platform mobility. Existing tools cover only part of these aspects. Robotics simulators model physics and perception but not site-specific channels, while ray tracing and link level tools lack vehicle dynamics and onboard sensors. Combining them manually leads to workflows that are fragile and hard to reproduce. Rather than introducing another standalone simulator, this article presents SimART. It integrates mature robotics, ray tracing, and wireless evaluation engines into a single reproducible pipeline. The key idea is a robot operating system (ROS) backbone that both synchronizes and organizes all multimodal streams. A shared clock, a common coordinate frame, and timestamped messages keep the streams aligned in time and space, and a single rosbag recording captures the full session into one reproducible file. This design decouples the sensing front end from the wireless back end, so that any ROS-compatible simulator can be plugged in while reusing the same back end across aerial, ground, indoor, and maritime ISAC settings. On top of this backbone, SimART contributes a scene construction pipeline that converts both OpenStreetMap extracts and user-defined layouts into spatially aligned visual and electromagnetic assets, and a channel knowledge map (CKM) generator that aggregates ray tracing and system level outputs into spatial priors for ISAC algorithms. A case study on vision and position aided beam prediction demonstrates the utility of the platform. The code is publicly available at https://github.com/guchuanv-alt/SimART.
[31] arXiv:2605.13355 [pdf, ps, other]: Title: Impedance-Based VSC Unit Commitment with STATCOM Support under High IBG Penetration

Authors: Aoun Abbas, Zhongda Chu, Charalambos Konstantinou

Comments: Electric Power Systems Research

Subjects: Systems and Control (eess.SY)

The large-scale replacement of synchronous machines with inverter-based generation (IBG) introduces critical challenges to both voltage and frequency stability. This work builds on a mixed-integer second-order cone programming (MISOCP) framework whose unit commitment (UC) model embeds frequency-nadir constraints through synthetic inertia (SI) dispatch and an SOC voltage stability boundary for IBG buses. The formulation is extended by modeling a STATCOM as a reactive-power decision variable in the same MISOCP model. A modified IEEE 30-bus system is used to assess three scheduling strategies: (i) baseline UC with SI only, (ii) voltage-stability-constrained (VSC) UC with SI, and (iii) the joint UC with SI and reactive power support from IBGs. The impact of incorporating a 30 MVAr STATCOM at a weak grid location near the IBG buses is investigated. Simulation results show that the proposed framework enhances voltage security, maintains frequency-nadir compliance, and reduces operating cost, while STATCOM integration further improves dispatch feasibility under high IBG penetration.
[32] arXiv:2605.13390 [pdf, ps, other]: Title: Sensitivity Quantification for Distribution System State Estimation

Authors: Betül Mamudi, Jochen Stiasny, Jochen Cremer

Comments: Submitted for peer review

Subjects: Systems and Control (eess.SY)

Pseudo-measurements are the dominant source of uncertainty in distribution system state estimation (DSSE), yet their distributional assumptions are treated as fixed inputs by existing uncertainty quantification methods. This paper investigates whether the uncertainty bounds assumed by weighted least squares (WLS)-based DSSE are sensitive to these distributional assumptions, and whether this sensitivity is quantifiable using the Fisher Information Matrix (FIM). We propose a diagnostic framework that compares the true Cramér-Rao Bound (CRB) against the WLS-assumed CRB via a per-bus, per-scenario ratio, computed directly from the converged WLS solution. Pseudo-measurement distributions are varied across five types in 22 variants matched at equal spread to isolate shape effects from variance. Experiments on the CIGRE MV network across 100 operating scenarios yield three findings. First, heavy-tailed and skewed distributions show consistently that WLS systematically overstates its uncertainty bounds. Second, the degree of miscalibration varies across buses and operating scenarios, confirming that distributional sensitivity is not uniform. Third, the CRB ratio is structurally blind to mean-shift bias, exposing a fundamental limitation of variance-based uncertainty diagnostics. Together, these results confirm the hypothesis and show that the choice of pseudo-measurement distribution directly distorts the confidence limits under WLS-based assumptions, which must be explicitly accounted for in any uncertainty-aware DSSE method.
[33] arXiv:2605.13394 [pdf, ps, other]: Title: Decoupled Azimuth Elevation AoA Estimation Exploiting Kronecker Separable Steering Matrices

Authors: Faizan A. Khattak, Ian K. Proudler, Stephan Weiss, Fazal-E Asim

Subjects: Signal Processing (eess.SP)

Uniform rectangular arrays (URA), structured non-uniform rectangular arrays (NURA), and parallelogram-shaped (UPgA and NUPgA) arrays admit steering vectors that can be expressed as the Kronecker product of azimuth and elevation steering vectors. Accordingly, the full steering matrix can be represented as the Khatri-Rao product of the corresponding azimuth and elevation steering matrices. This paper exploits this structure to develop an economical subspace decoupling framework for two-dimensional angle of arrival (AoA) estimation. The proposed method first extracts the joint signal subspace from the spatial covariance matrix. Then it applies a low-complexity decoupling scheme to recover the column spaces of the azimuth and elevation steering matrices. With the estimated decoupled subspaces, conventional one-dimensional algorithms such as MUSIC, root MUSIC, and ESPRIT can be applied independently along each dimension, followed by pairing through a two-dimensional spectral function. Monte Carlo simulations show that the proposed approach achieves higher accuracy than state-of-the-art methods, i.e., two-dimensional MUSIC, reduced-dimension MUSIC, and two-dimensional ESPRIT, for medium- and large-scale arrays while requiring fewer snapshots, which in turn improves spectral efficiency.
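The Kronecker and Khatri-Rao structure mentioned above can be checked numerically for a uniform rectangular array. In this sketch `mu` and `nu` are generic per-axis spatial frequencies (phase increments between adjacent elements); how they map to azimuth and elevation angles depends on the array geometry, which the abstract does not specify, so that mapping is left out. All function names are hypothetical.

```python
import cmath

def ula_steering(num_elements, spatial_freq):
    """Steering vector of a uniform linear array with a given phase increment."""
    return [cmath.exp(1j * spatial_freq * m) for m in range(num_elements)]

def kron(a, b):
    """Kronecker product of two vectors (index m * len(b) + n)."""
    return [x * y for x in a for y in b]

def ura_steering(M, N, mu, nu):
    """Direct URA steering vector, element (m, n) stacked at index m * N + n."""
    return [cmath.exp(1j * (mu * m + nu * n)) for m in range(M) for n in range(N)]

def khatri_rao(A, B):
    """Column-wise Kronecker product; A and B are lists of columns, one per source."""
    return [kron(a, b) for a, b in zip(A, B)]

M, N = 4, 3
sources = [(0.7, -0.4), (-1.1, 0.9)]  # assumed (mu, nu) pairs for two sources

A_first = [ula_steering(M, mu) for mu, _ in sources]   # per-axis steering matrices
A_second = [ula_steering(N, nu) for _, nu in sources]
A_full = khatri_rao(A_first, A_second)                 # full M*N x 2 steering matrix

# Largest deviation between the factored and the directly computed steering vectors.
max_err = max(
    abs(x - y)
    for col, (mu, nu) in zip(A_full, sources)
    for x, y in zip(col, ura_steering(M, N, mu, nu))
)
```

Because each column of the full matrix factors this way, the two small per-axis matrices carry all the angular information, which is what makes independent one-dimensional estimation along each axis possible.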
[34] arXiv:2605.13453 [pdf, ps, other]: Title: Learning a Contracting KKL-observer with Local Optimal Guarantees

Authors: Clara Lucía Galimberti, Johan Peralez, Daniele Astolfi, Vincent Andrieu, Madiha Nadri

Comments: Accepted to the 23rd IFAC World Congress 2026

Subjects: Systems and Control (eess.SY)

The Kazantzis-Kravaris-Luenberger (KKL) observer provides a general framework for nonlinear state estimation by immersing the system dynamics into stable linear or nonlinear latent dynamics. However, the performance of KKL observers relies heavily on the specific choice of these latent dynamics, which is often heuristic. This paper proposes a methodology to learn a KKL observer that combines global stability guarantees with local optimality. We derive a condition on the latent dynamics such that the observer locally mimics the behavior of a Minimum Energy Estimator (Mortensen observer). We then employ deep learning to approximate the KKL transformation and the latent dynamics, using neural network architectures that structurally enforce the contraction property. The proposed strategy is validated through numerical simulations on nonlinear benchmarks, demonstrating good performance in the presence of state and measurement noise.
[35] arXiv:2605.13502 [pdf, ps, other]: Title: A Multi-Modal Intelligent U2V Channel Model for 6G Sensing-Communication Integration

Authors: Shuo Wang, Zengrui Han, Lu Bai, Xiang Cheng

Subjects: Signal Processing (eess.SP)

This paper proposes a novel UAV-to-Vehicle (U2V) channel model for sixth-generation (6G) intelligent sensing-communication integration, based on three-dimensional (3D) scatterer prediction. To explore the mapping relationship between physical environment and electromagnetic space, a new high-fidelity mixed sensing-communication integration U2V simulation dataset under wide-lane scenarios with different vehicular traffic densities (VTDs) and UAV heights is constructed. Based on the constructed dataset, a novel 3D Scatterer Prediction and Distribution Estimation (3D-SPADE) algorithm is proposed, which leverages LiDAR point clouds to accurately predict the spatial distribution of scatterers. Furthermore, the clustering of scatterers and the subsequent classification into dynamic and static types are designed for highly dynamic U2V scenarios, reducing computational complexity and improving modeling accuracy. As LiDAR point clouds vary over time, dynamic and static clusters evolve via 3D-SPADE, enabling precise modeling of channel non-stationarity and consistency. Simulation results demonstrate that, in the wide-lane scenario with varying VTDs and UAV heights, the proposed 3D-SPADE consistently achieves high scatterer occupancy detection performance within the voxel grid. In particular, under favorable configurations, recall reaches 93.26%, and precision reaches 95.74%, highlighting the reliability of 3D-SPADE. Key channel statistical characteristics are simulated and analyzed. The simulated characteristics are highly consistent with ray-tracing results and agree with them more closely than those of the standardized model and the inconsistent model, validating the necessity of exploring the mapping relationship and the effectiveness of the proposed model.
[36] arXiv:2605.13516 [pdf, ps, other]: Title: Sensing-Assisted LoS/NLoS Identification in Dynamic UAV Positioning Systems

Authors: Huijuan Qiao, Lu Bai, Mingran Sun, Mengyuan Lu, Jiajing Chen, Xiang Cheng

Subjects: Signal Processing (eess.SP)

In this paper, a sensing-assisted non-line-of-sight (NLoS) identification method for dynamic uncrewed aerial vehicle (UAV) positioning is proposed for the first time. For urban UAV-to-ground scenarios, a new multi-modal sensing-communication integrated dataset is constructed to support line-of-sight (LoS)/NLoS identification, covering two typical urban scenarios and a wide range of flight altitudes. Based on the constructed dataset, a novel dual-input feature fusion network is proposed, which addresses the challenge of heterogeneous representations between RGB images and channel impulse response (CIR) data to enable the joint extraction and fusion of sensing and communication features for LoS/NLoS identification. Simulation results show that the identification accuracy can reach up to 97.69%, while achieving an improvement of at least 3.59% compared to traditional CIR-only and RGB-only methods. Moreover, strong few-shot generalization is observed, as the proposed method stabilizes and approaches full-sample performance with fewer than 200 target samples and exceeds traditional CIR-only and RGB-only methods with fewer than 100 target samples in all cross-scenario and cross-altitude experiments. Even under Gaussian noise with a variance of 0.35 applied to RGB images, the accuracy degradation remains approximately 0.5%. By utilizing the proposed LoS/NLoS identification method, the error of trilateration positioning can be reduced by approximately 70% in a crossroad scenario, verifying the utility of the proposed method.
[37] arXiv:2605.13524 [pdf, ps, other]: Title: Manifold-Aware Information Gain and Lower Bounds for Gaussian-Process Bandits on Riemannian Quotient Spaces

Authors: Yuriy Dorn, Changsheng Chen, Ning Xie

Comments: It will be submitted to IEEE Transactions on Information Theory

Subjects: Signal Processing (eess.SP)

We prove a regret lower bound for Gaussian-process bandits on a smooth compact Riemannian manifold $\M$ of dimension $d$ with intrinsic Mat\'ern-$\nu$ kernel ($\nu>d/2$) that exposes how the geometry of the arm space enters the constant. For any algorithm and time horizon $T$ exceeding an explicit threshold, the worst-case expected regret over the RKHS-ball $\|f\|_{\Hil_{k_\nu}}\!\le\!B$ satisfies \begin{multline*} \E[R_T(f)]\;\ge\;c_*(d,\nu)\,B^{d/(2\nu+d)}\,\sigma_n^{2\nu/(2\nu+d)} \\ \cdot\,\vol_g(\M)^{\nu/(2\nu+d)}\,T^{(\nu+d)/(2\nu+d)}(\log T)^{\nu/(2\nu+d)}. \end{multline*} The exponent matches the Vakili--Khezeli--Picheny upper bound \cite{vakili2021information}; the $\vol_g(\M)^{\nu/(2\nu+d)}$ factor is, to our knowledge, the first explicit volume-dependent geometric constant in a manifold GP-bandit lower bound. We extend the analysis in five directions: (i)~a companion Assouad-style proof gives a different lower bound with a strictly smaller $T$-exponent $(2\nu+3d)/(4(\nu+d))$ but with a polylog factor of the form $1/(\log\log T)^{(2\nu+d)/(4(\nu+d))}$, sharpening the $(\log T)^{\nu/(2\nu+d)}$ Fano polylog of Theorem~\ref{thm:main}; (ii)~we prove a $|G|^{1/2}$ upper bound on the regret of an extrinsic-kernel GP-UCB algorithm on a quotient space $\M=\Mt/G$, plus a bracketing theorem (Theorem~\ref{thm:gauge-bracket}); the precise constant is conjectured to take the modulated form $(1+(|G|-1)h(\rinj/\kappa))^{1/2}$ (Conjecture~\ref{conj:gauge-modulated}), validated numerically on $\SO(3)$; (iii)~we write the leading constant $c_*(d,\nu)$ out fully; (iv)~we extract a curvature dependence $1+O(K\eps_T^2)$ via Bishop--Gromov; (v)~we transfer the bound to the Bayesian regret framework via the Yang--Barron / Castillo et al.\ Bayesian-Fano transfer.
[38] arXiv:2605.13529 [pdf, ps, other]: Title: Decentralized Frequency-Domain Conditions for D-Stability with Application to DC Microgrids

Authors: Zelin Sun, Shanshan Jiang, Xiaoyu Peng, Xiang Zhu, Xiuqiang He, Hua Geng

Subjects: Systems and Control (eess.SY)

This paper proposes a decentralized method for regional pole placement, or $\mathcal{D}$-stability, in linearized networked systems. Existing LMI-based methods are hindered by confidentiality concerns regarding proprietary subsystem models and the absence of communication infrastructures. To overcome these barriers, we map the target region $\mathcal{D}$ of pole placement to an auxiliary left-half plane and introduce positive functions to handle the resulting complex-coefficient dynamics. We prove that $\mathcal{D}$-stability is guaranteed via local frequency-domain criteria without requiring shared subsystem models or inter-subsystem communication. This method is then tailored to DC microgrids, where a loop transformation is utilized to reallocate the burden of stability certification, deriving a broadcastable grid code for decentralized parameter synthesis. Numerical examples verify the efficacy of the proposed method.
[39] arXiv:2605.13580 [pdf, ps, other]: Title: Joint Segment Activation and Antenna Placement for Uplink SWAN Systems

Authors: Songnan Gu, Zhenqiao Cheng, Hao Jiang, Chongjun Ouyang, Yuanwei Liu, Arumugam Nallanathan

Comments: 5 pages

Subjects: Signal Processing (eess.SP)

This article analyzes the achievable sum-rate of multiuser uplink segmented waveguide-enabled pinching-antenna systems (SWANs). To unveil system-design insights, an upper bound on the achievable sum-rate is derived, based on which the existence of an optimal segment activation level is theoretically established. Motivated by this result, hybrid segment selection and aggregation (HSS/A) schemes are proposed to jointly optimize segment activation and pinching-antenna (PA) placement. Correspondingly, low-complexity greedy algorithms are developed for the considered optimization problem. Numerical results validate the theoretical analysis and demonstrate that the proposed HSS/A schemes outperform conventional full-segment aggregation.
[40] arXiv:2605.13661 [pdf, ps, other]: Title: Air-Sea Surface Modeling and Operating Link Range Evaluation for AUV-to-UAV Optical Wireless Communication Links

Authors: Ikenna Chinazaekpere Ijeh, Mohammad Ali Khalighi, Wasiu O. Popoola

Subjects: Signal Processing (eess.SP)

Air-sea surface interactions play a critical role in underwater-to-air optical wireless communication (OWC) links, particularly in vertical autonomous underwater vehicle (AUV) to unmanned aerial vehicle (UAV) scenarios, where the stochastic nature of the sea surface introduces optical distortions that impair link reliability. This work investigates the impact of air-sea surface roughness on AUV-to-UAV OWC systems using two experimentally validated models: the classical Cox-Munk and the Elfouhaily-Chapron-Katsaros-Vandemark (ECKV). A tractable analytical representation of the ECKV model is derived and validated against measured sea-state data. Using both analytical and Monte Carlo approaches, the link ergodic capacity is evaluated with particular emphasis on operating range, pointing errors, receiver field-of-view, and solar noise level, providing practical system design insights.
[41] arXiv:2605.13669 [pdf, ps, other]: Title: Bounded-Input True Proportional Navigation for Impact-Time Control

Authors: Lohitvel Gopikannan, Shashi Ranjan Kumar, Abhinav Sinha

Comments: Preprint; Accepted for presentation at the 15th Asian Control Conference, June 17th-21st, 2026, Indonesia

Subjects: Systems and Control (eess.SY); Robotics (cs.RO); Dynamical Systems (math.DS)

This paper proposes a nonlinear guidance strategy capable of intercepting a constant-velocity, non-maneuvering target while strictly satisfying the prescribed bounds on the control input (commanded acceleration). Unlike conventional strategies that estimate time-to-go using linearization or small-angle approximations, the proposed strategy employs true proportional-navigation guidance (TPNG) as a baseline, which utilizes an exact time-to-go formulation and is applicable over a wide range of target motions. In contrast to most existing strategies, which do not incorporate control input bounds into the guidance design, the proposed approach explicitly accounts for these limits by modeling the interceptor acceleration as a dynamic variable. Based on the sliding mode control technique, an effective guidance law that achieves time-constrained interception while accounting for bounded input is then derived. The performance of the proposed strategy is evaluated for various engagement scenarios.
[42] arXiv:2605.13720 [pdf, ps, other]: Title: An Underwater Dehazing Network with Implicit Transmission Estimation

Authors: Sahana Ray, Sanjay Ghosh

Comments: 5 pages, 2 figures

Subjects: Image and Video Processing (eess.IV)

Underwater images suffer from wavelength-dependent light absorption and scattering, which reduces visual quality. This phenomenon could limit the operational reliability of autonomous underwater vehicles, marine surveys, and offshore inspection systems. Purely classical methods often achieve suboptimal performance on real-world datasets, while purely data-driven methods lack physical interpretability. In this letter, we propose UDehaze-iT, a deep network for underwater image enhancement that estimates scene depth implicitly and derives per-channel transmission through the Beer-Lambert law with learnable attenuation coefficients. We estimate atmospheric light as a semi-classical per-channel scalar, and a zero-initialized residual refiner corrects remaining artefacts after dehazing. To effectively train our method, we apply a composite loss function consisting of five key terms: an L1 loss, a multi-scale patchwise DCT loss, a forward model reconstruction loss, and two regularization terms. With ~0.9M parameters, UDehaze-iT achieves competitive performance on UIEB and UFO-120 datasets.
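The Beer-Lambert step mentioned in this abstract can be illustrated for a single pixel. This is a minimal sketch, not the UDehaze-iT network: the forward model I_c = J_c t_c + A_c (1 - t_c) is the standard haze model and is an assumption here, as are the function names, the attenuation values, and the `t_min` clamp.

```python
import math

def transmission(depth_m, beta):
    """Per-channel transmission t_c = exp(-beta_c * d) (Beer-Lambert law)."""
    return [math.exp(-b * depth_m) for b in beta]

def degrade(clean_rgb, depth_m, beta, ambient_rgb):
    """Assumed forward model for one pixel: I_c = J_c * t_c + A_c * (1 - t_c)."""
    t = transmission(depth_m, beta)
    return [j * tc + a * (1.0 - tc) for j, tc, a in zip(clean_rgb, t, ambient_rgb)]

def dehaze(observed_rgb, depth_m, beta, ambient_rgb, t_min=0.05):
    """Invert the forward model; t_min guards against dividing by a tiny transmission."""
    t = [max(tc, t_min) for tc in transmission(depth_m, beta)]
    return [(i - a) / tc + a for i, tc, a in zip(observed_rgb, t, ambient_rgb)]

# Hypothetical values: red attenuates fastest in water.
beta = [0.6, 0.1, 0.05]
ambient = [0.1, 0.5, 0.6]
clean = [0.8, 0.4, 0.3]
hazy = degrade(clean, 2.0, beta, ambient)
restored = dehaze(hazy, 2.0, beta, ambient)
```

As long as every transmission stays above `t_min`, `restored` matches `clean` up to floating-point error; in the paper the depth and coefficients are learned rather than given.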
[43] arXiv:2605.13836 [pdf, ps, other]: Title: Reachable-Set Decomposition for Real-Time Aggregation of Multi-Zone HVAC Fleets

Authors: Jingguan Liu, Xiaomeng Ai, Cong Chen, Shaoze Li, Shichang Cui, Jiakun Fang, Jinyu Wen

Comments: 10 pages, 9 figures

Subjects: Systems and Control (eess.SY)

Aggregating building heating, ventilation, and air-conditioning (HVAC) fleets provides substantial real-time flexibility to power system operations. However, real-time aggregation of multi-zone HVAC fleets faces two key challenges: (i) strong coupling across zones and time makes flexibility characterization high-dimensional and computationally demanding, and (ii) the sequential revelation of temperature states and exogenous conditions requires that decisions made at each period preserve feasibility over the remaining horizon using only currently realized information. To address these challenges, this paper proposes a reachable-set decomposition framework comprising an offline decomposition stage and a real-time policy. In the offline stage, backward reachable sets are formulated to encode remaining-horizon feasibility into per-period state constraints, so that any state within the current reachable set is guaranteed to sustain feasible operation over the entire remaining horizon. A tailored inner approximation is then developed for tractable calculation in multi-zone-coupled HVAC settings. In the real-time stage, aggregate flexibility is computed efficiently via building-level parallel linear programs followed by closed-form Minkowski summation of power intervals, and any regulation signal within the reported flexibility interval admits a recursively feasible disaggregation. Case studies demonstrate the effectiveness of the proposed framework in aggregate flexibility characterization, disaggregation feasibility, and scalable computation.
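The closed-form Minkowski summation of power intervals mentioned above reduces to adding lower and upper bounds. The sketch below assumes each building already reports an interval (in the paper these come from per-building linear programs); the proportional split in `disaggregate` is only an illustrative stand-in, not the recursively feasible disaggregation of the paper, and all names are hypothetical.

```python
def minkowski_sum(intervals):
    """Minkowski sum of intervals [a_i, b_i] is [sum a_i, sum b_i]."""
    lower = sum(a for a, _ in intervals)
    upper = sum(b for _, b in intervals)
    return lower, upper

def disaggregate(signal, intervals):
    """Split an aggregate signal so each share stays in its own interval.

    Illustrative proportional rule (an assumption, not the paper's method).
    """
    lower, upper = minkowski_sum(intervals)
    if not lower <= signal <= upper:
        raise ValueError("signal lies outside the aggregate flexibility interval")
    if upper == lower:
        return [a for a, _ in intervals]
    frac = (signal - lower) / (upper - lower)
    return [a + frac * (b - a) for a, b in intervals]

# Hypothetical per-building power intervals in kW.
fleet = [(1.0, 3.0), (0.5, 2.0), (2.0, 2.5)]
aggregate = minkowski_sum(fleet)
shares = disaggregate(5.5, fleet)
```

Any signal inside the aggregate interval can be split this way because each share is a convex combination of its own bounds.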

Cross-lists for Thu, 14 May 26

[44] arXiv:2605.12506 (cross-list from cs.CV) [pdf, ps, other]: Title: Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture Detection

Authors: Abdul Basit, Saim Rehman, Muhammad Shafique

Comments: 7 pages, 11 figures, Accepted to DAC 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO); Image and Video Processing (eess.IV)

Realizing on-device ML-based gesture detection under tight real-time performance, energy and memory constraints is challenging, especially when considering mobile devices with varying battery-power levels. Existing EdgeAI deployments typically rely on a single fixed detector, limiting optimization opportunities. We present Scale-Gest, a novel run-time adaptive gesture detection framework that expands the detector space into a dense family of tiny-YOLO architectures. We introduce multiple novel device-calibrated ACE (Accuracy-Complexity-Energy) profiles by analyzing different model-resolution-stride operating points. A lightweight run-time controller selects an appropriate ACE mode under user-defined and battery constraints, while a motion-aware hand-gesture-tracking ROI gate crops the input for reduced complexity detection. To evaluate performance of our system in real-world car driving scenarios, we introduce a temporally-annotated Driver Simulated Gesture (DSG-18) dataset. Scale-Gest maintains event-level F1 while significantly reducing energy and latency compared to single-detector approaches. On a battery-powered laptop running gesture streams, our ACE controller reduces per-frame energy by 4x (from 6.9 mJ to 1.6 mJ) while maintaining high gesture-detection performance (event-level F1 = 0.8-0.9) and low mean latency (6 ms).
[45] arXiv:2605.12552 (cross-list from cs.NI) [pdf, ps, other]: Title: DQN-Driven Adaptive Neighbor Discovery for Directional Aerial Networks

Authors: Md Asif Ishrak Sarder, Murat Yuksel, Elizabeth Bentley

Comments: Accepted at IEEE ICC 2025. This is the author-accepted manuscript

Subjects: Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)

Directional antenna systems are gaining substantial traction for aerial networks due to their higher gain, extended transmission range, and enhanced security. However, the requirement of beam alignment makes the task of finding and reaching neighbors challenging, particularly in a mobile setting. For wireless networks, privacy concerns play an equally critical role. However, the problem of ensuring network-wide connectivity while maintaining limited exposure when probing around is still unexplored. We address this trade-off by proposing an adaptive transceiver selection protocol based on the Deep Q-Network (DQN) framework. Each node acts as an independent DQN agent and interacts with the environment to learn how to balance the trade-off. Since the directional nodes operate only based on local observations, we adopt a weighted mechanism that guides them in prioritizing either high reachability or privacy by adaptively tuning the probing patterns. Results show that the DQN framework surpasses the Random and Q-Learning baselines. Weights favoring discovery provide higher probing efficiency and reachability, while weights prioritizing privacy ensure limited exposure at the cost of low reachability, eventually attaining a higher objective value.
[46] arXiv:2605.12771 (cross-list from cs.RO) [pdf, ps, other]: Title: Adaptive Smooth Tchebycheff Attention for Multi-Objective Policy Optimization

Authors: Alejandro Murillo-Gonzalez, Mahmoud Ali, Lantao Liu

Comments: To appear in the Proceedings of Robotics: Science and Systems (RSS) 2026

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)

Multi-objective reinforcement learning in robotic domains requires balancing complex, non-convex trade-offs between conflicting objectives. While linear scalarization methods provide stability, they are theoretically incapable of recovering solutions within non-convex regions of the Pareto front. Conversely, static non-linear scalarizations (e.g., Tchebycheff) can theoretically access these regions but often suffer from severe gradient variance and optimization instability in deep RL. In this work, we propose an Adaptive Smooth Tchebycheff framework that resolves this tension by dynamically modulating the curvature of the optimization landscape. We introduce a novel conflict-driven controller that regulates the optimization smoothness based on real-time gradient interference. This allows the agent to anneal toward precise, non-convex scalarization when objectives align, while elastically reverting to stable, smooth approximations when destructive gradient conflicts emerge. We validate our approach on a challenging robotic stealth visual search task -- a proxy for monitoring of protected/fragile ecosystems -- where an agent must balance search, exposure/interference minimization and exploration speed. Extensive ablations confirm that our conflict-aware adaptation enables the robust discovery of Pareto-optimal policies in non-convex regions inaccessible to linear baselines and unstable for static non-linear methods.
Website: https://alejandromllo.github.io/research/pasta/
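The contrast between the hard Tchebycheff scalarization and a smooth approximation can be sketched as follows. This uses one common log-sum-exp smoothing with a fixed smoothing parameter `mu`; the abstract does not state the exact form used, and the conflict-driven controller that adapts the smoothness is not modeled. Function and argument names are hypothetical.

```python
import math

def tchebycheff(costs, weights, ideal):
    """Hard Tchebycheff scalarization: max_i w_i * (f_i - z_i)."""
    return max(w * (f - z) for f, w, z in zip(costs, weights, ideal))

def smooth_tchebycheff(costs, weights, ideal, mu):
    """Log-sum-exp smoothing: mu * log(sum_i exp(w_i * (f_i - z_i) / mu)).

    Subtracting the maximum before exponentiating avoids overflow.
    """
    scaled = [w * (f - z) / mu for f, w, z in zip(costs, weights, ideal)]
    m = max(scaled)
    return mu * (m + math.log(sum(math.exp(s - m) for s in scaled)))

costs, weights, ideal = [1.0, 3.0], [0.5, 0.5], [0.0, 0.0]
hard = tchebycheff(costs, weights, ideal)
nearly_hard = smooth_tchebycheff(costs, weights, ideal, mu=0.01)
very_smooth = smooth_tchebycheff(costs, weights, ideal, mu=1.0)
```

The smooth value always lies between the hard value and the hard value plus `mu * log(k)` for `k` objectives, so shrinking `mu` recovers the non-smooth scalarization while larger `mu` gives gentler gradients.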
[47] arXiv:2605.12785 (cross-list from cs.LG) [pdf, ps, other]: Title: Identifying the nonlinear string dynamics with port-Hamiltonian neural networks

Authors: Maximino Linares, Guillaume Doras, Thomas Hélie

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS)

Hybrid machine learning combines physical knowledge with data-driven models to enhance interpretability and performance. In this context, Port-Hamiltonian Systems (PHS), which generalize Hamiltonian mechanics to describe open, non-autonomous dynamical systems, have been successfully integrated with neural networks under the name Port-Hamiltonian Neural Networks (PHNNs). While the ability of PHNNs to identify Hamiltonian ordinary differential equation (ODE) systems has already been demonstrated, their application to learning Hamiltonian partial differential equation (PDE) systems remains largely unexplored. This limitation restricts their use in musical acoustics, where instruments are typically modeled as distributed parameter systems governed by PDEs. In this work, we demonstrate how to learn the nonlinear string dynamics from data in a physically-consistent framework through a PHNN extension to PDEs. By constructing structured neural network architectures based on PHS, we can recover both the Hamiltonian governing the string and the dissipation affecting it. This approach outperforms baseline, non-physics-informed methods in terms of both accuracy and interpretability. Numerical experiments using synthetic data demonstrate the ability of the proposed PHNN model to identify and emulate the nonlinear dynamics of the system.
[48] arXiv:2605.12794 (cross-list from cs.GT) [pdf, ps, other]: Title: Dynamic Transaction Scheduling and Pricing in the Ethereum Mempool

Authors: Fatemeh Fardno, S. Rasoul Etesami

Subjects: Computer Science and Game Theory (cs.GT); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI); Systems and Control (eess.SY)

The Ethereum blockchain utilizes the EIP-1559 algorithm to manage transaction inclusion and block assembly. However, EIP-1559 and much of the existing literature study this problem from a static perspective, focusing on price evolution without modelling transaction dynamics within the mempool. Motivated by this limitation, we study a dynamic transaction scheduling problem in which transactions with heterogeneous sizes and per-unit values arrive over time and remain in the mempool until scheduled. To capture the stochastic mempool evolution, we formulate the problem as a Markov Decision Process (MDP) whose state represents the mempool configuration and whose actions correspond to block prices. We first provide a primal-dual interpretation of the static EIP-1559 mechanism, showing that block prices arise naturally as dual variables of a social-welfare maximization problem. Building on this perspective, we extend the framework to the dynamic setting and formulate an objective that maximizes long-run discounted reward while incorporating holding costs and overshoot penalties. We then employ a Natural Policy Gradient (NPG) algorithm to compute the optimal policy. Our results show that dynamic pricing stabilizes the mempool while maximizing long-run discounted reward. In particular, as the overshoot penalty increases, the average scheduled transaction volume converges to the target block capacity, and the resulting NPG updates closely resemble the EIP-1559 price update rule. Finally, we study two special cases of the MDP formulation: homogeneous transactions and uniform arrivals. In the homogeneous setting, where the protocol directly controls scheduled volume, we show that the optimal policy has a threshold structure. We then propose a bang-bang pricing mechanism for uniform arrivals and derive a lower bound on the block capacity needed to ensure system stability.
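For reference, the EIP-1559 price update rule that the NPG updates are compared against can be written in a few lines. This is a simplified floating-point sketch: the deployed protocol uses integer arithmetic with additional rounding rules, and the function name and example values below are illustrative.

```python
def next_base_fee(base_fee, gas_used, gas_target, max_change=0.125):
    """Simplified EIP-1559 update.

    The fee moves in proportion to how far the block's gas usage deviates
    from the target, by at most 12.5% per block when usage is capped at
    twice the target.
    """
    deviation = (gas_used - gas_target) / gas_target
    return base_fee * (1.0 + max_change * deviation)

target = 15_000_000
full_block = next_base_fee(100.0, 2 * target, target)
empty_block = next_base_fee(100.0, 0, target)
on_target = next_base_fee(100.0, target, target)
```

A full block raises the fee to 112.5, an empty block lowers it to 87.5, and a block exactly at the target leaves it unchanged, which is the static price-feedback behaviour that the dynamic formulation generalizes.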
[49] arXiv:2605.12829 (cross-list from physics.soc-ph) [pdf, ps, other]: Title: Optimal excitation and measurement patterns for networks with tree topology

Authors: Eduardo Mapurunga, Alexandre Sanfelici Bazanella

Comments: Accepted for presentation at the 2026 IFAC World Congress, Busan, South Korea

Subjects: Physics and Society (physics.soc-ph); Systems and Control (eess.SY)

In this work we evaluate the excitation and measurement patterns (EMP) for networks with tree topology. We investigate guidelines for the selection of the minimal EMPs, i.e. those with the least number of excited and measured nodes combined, for which the accuracy obtained, in terms of the trace of the asymptotic covariance matrix, is optimal. We introduce the concept of partial information matrix as a means to systematically obtain the information matrix for any dynamic network. For a specific tree class, called cross, we show that the accuracy of a particular module depends on the magnitude of the parameters to be estimated. Furthermore, when all factors are equal, it is best to excite. We extend a topological condition for branches under which the accuracy of a particular module of the network is independent of the other parameters from the tree. We provide a numerical analysis showing that our guidelines could be used as a selection tool for minimal EMPs for tree networks.
[50] arXiv:2605.12974 (cross-list from cs.RO) [pdf, ps, other]: Title: Distributionally Robust Safety Under Arbitrary Uncertainties: A Safety Filtering Approach

Authors: Daniel M. Cherenson, Haejoon Lee, Taekyung Kim, Dimitra Panagou

Comments: 10 pages, 4 figures, submitted to IEEE Robotics and Automation Letters (RA-L)

Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

In this work, we study how to ensure probabilistic safety for nonlinear systems under distributional ambiguity. Our approach builds on a backup-based safety filtering framework that switches between a high-performance nominal policy and a certified backup policy to ensure safety. To handle arbitrary uncertainties from ambiguous distributions, i.e., where the distribution is not of specific structure and the true distribution is unknown, we adopt a distributionally robust (DR) formulation using Wasserstein ambiguity sets. Rather than solving a high-dimensional DR trajectory optimization problem online, we exploit the structure of backup-based safety filtering to reduce safety certification to a one-dimensional search over the switching time between nominal and backup policies. We then develop a sampling-based certification procedure with finite-sample guarantees, where empirical failure probabilities are compared against a Wasserstein-inflated threshold. We validate our method through simulations across three systems, from a Dubins vehicle to a high-speed racing car and a fighter jet, demonstrating the broad applicability and computational efficiency.
[51] arXiv:2605.13010 (cross-list from cs.CV) [pdf, ps, other]: Title: Amortized Guidance for Image Inpainting with Pretrained Diffusion Models

Authors: Yilie Huang, Xun Yu Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Optimization and Control (math.OC)

We study image inpainting with generative diffusion models. Existing methods typically either train dedicated task-specific models, or adapt a pretrained diffusion model separately for each masked image at deployment. We introduce a middle-ground model, termed Amortized Inpainting with Diffusion (AID), which keeps a pretrained diffusion backbone fixed, trains a small reusable guidance module offline, and then reuses it across masked images without per-instance optimization. We formulate it as a deterministic guidance problem with a supervised terminal objective. To make this problem learnable in high dimensions, we derive an auxiliary Gaussian formulation and prove that solving this randomized problem recovers the optimal deterministic guidance field. This bridge yields a principled continuous-time actor--critic algorithm for learning the guidance module in a fully data-driven manner. Empirically, on AFHQv2 and FFHQ under the pixel EDM pipeline and on ImageNet under the latent EDM2 pipeline, AID consistently improves the quality--speed trade-off over strong fixed-backbone and amortized inpainting baselines across multiple mask types, while adding less than one percent trainable overhead.
[52] arXiv:2605.13028 (cross-list from cs.RO) [pdf, ps, other]: Title: Local Conformal Calibration of Dynamics Uncertainty from Semantic Images

Authors: Luís Marques, Dmitry Berenson

Comments: 26 pages, 8 figures. Accepted to the 17th World Symposium on the Algorithmic Foundations of Robotics (WAFR) 2026

Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

We introduce Observation-aware Conformal Uncertainty Local-Calibration (OCULAR), a conformal prediction-based algorithm that uses perception information to provide uncertainty quantification guarantees for unseen test-time environments. While previous conformal approaches lack the ability to discriminate between state-action space regions leading to higher or lower model mismatch, and require environment-specific data, our method uses data collected from visually similar environments to provably calibrate a given linear Gaussian dynamics model of arbitrary fidelity. The prediction regions generated from OCULAR are guaranteed to contain the future system states with, at least, a user-set likelihood, despite both aleatoric and epistemic uncertainty -- i.e., uncertainty arising from both stochastic disturbances and lack of data. Our guarantees are non-asymptotic and distribution-free, not requiring strong assumptions about the unknown real system dynamics. Our calibration procedure enables distinguishing between observation-velocity-action inputs leading to higher and lower next-state-uncertainty, which is helpful for probabilistically-safe planning. We numerically validate our algorithm on a double-integrator system subject to random perturbations and significant model mismatch, using both a simplified sensor and a more realistic simulated camera. Our approach appropriately quantifies uncertainty both when in-distribution and out-of-distribution, being comparatively volume-efficient to baselines requiring environment-specific data.
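The non-asymptotic, distribution-free guarantee described above is characteristic of conformal prediction. The sketch below shows plain split conformal calibration only, assuming exchangeable calibration scores; it does not implement OCULAR's observation-aware local calibration, and the names are hypothetical.

```python
import math

def conformal_quantile(scores, alpha):
    """Split-conformal threshold from calibration scores.

    Returns the k-th smallest score with k = ceil((n + 1) * (1 - alpha)),
    or infinity when the calibration set is too small for the requested level.
    """
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf
    return sorted(scores)[k - 1]

def in_region(predicted, observed, radius):
    """Scalar prediction region: all states within `radius` of the prediction."""
    return abs(observed - predicted) <= radius

# Hypothetical calibration scores: absolute one-step prediction errors.
calibration_errors = [3, 1, 2, 5, 4, 9, 8, 7, 6]
radius = conformal_quantile(calibration_errors, alpha=0.5)
```

Under exchangeability, a fresh error falls within `radius` with probability at least `1 - alpha`; the local calibration in the paper additionally lets this radius vary with the observation, velocity, and action inputs.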
[53] arXiv:2605.13103 (cross-list from math.OC) [pdf, ps, other]: Title: Guaranteed cost structured control in infinite-horizon linear-quadratic cooperative differential games

Authors: Aniruddha Roy, Pavankumar Tallapragada

Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

In this paper, we consider infinite-horizon linear-quadratic cooperative differential games with an output feedback information structure. We first demonstrate that, under an output feedback information structure, computing Pareto optimal controls can be difficult even for simple low-dimensional differential games. To address this issue, this paper introduces the concept of feedback guaranteed cost structured control (GCSC). The feedback GCSC concept is inspired by suboptimal control. At a feedback GCSC, the total weighted team cost remains below a prescribed threshold while satisfying the structural constraints. We derive fundamental properties of the feedback GCSC and the admissible weight set, including their monotonicity properties. In particular, we show that if Pareto optimal controls exist, they belong to the class of feedback GCSCs. We also quantify the suboptimality of Pareto optimal controls (if they exist) and the proposed GCSC with respect to output feedback optimal control. Furthermore, we provide the conditions for verification and the synthesis of a feedback GCSC. Finally, we illustrate the effectiveness of the proposed approach through numerical examples, including a case study on tracking synchronization in a microgrid.
[54] arXiv:2605.13133 (cross-list from cs.LG) [pdf, ps, other]: Title: KAST-BAR: Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Modeling for Universal Neural Interpretation

Authors: Haoning Wang, Wenchao Yang, Shuai Shen, Yang Li

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)

While EEG foundation models have shown significant potential in universal neural decoding across tasks, their advancement remains constrained by the inadequate modeling of complex spatiotemporal topology, as well as the inherent modality gap between low-level physiological signals and high-level textual semantics. To address these challenges, we propose a Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Model (KAST-BAR), which dynamically aligns physiological representations derived from multi-level brain topology with an expert-level semantic space. Specifically, we design a Dual-Stream Hierarchical Attention (DSHA) encoder that accurately captures the brain's intrinsic non-Euclidean topology by modeling local temporal dynamics with global spatial contexts. On this basis, a Knowledge-Anchored Semantic Profiler (KASP) is proposed to synthesize physically-grounded and instance-level textual profiles, which subsequently drive a Semantic Text-Aware Refiner (STAR) to dynamically reconstruct EEG representations using Latent Expert Queries. By conducting large-scale pre-training on 21 diverse datasets to build a foundation model, KAST-BAR effectively integrates expert-level medical knowledge into EEG signal representations, consistently achieving superior performance across six downstream tasks. Our code is available at https://github.com/KAST-BAR/KAST-BAR
[55] arXiv:2605.13302 (cross-list from cs.LG) [pdf, ps, other]: Title: Safe Bayesian Optimization for Uncertain Correlations Matrices in Linear Models of Co-Regionalization

Authors: Jannis Lübsen, Annika Eichler

Comments: Accepted at IFAC WC26

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)

This paper extends safety guarantees for multi-task Bayesian optimization with uncertain correlation matrices from intrinsic co-regionalization models to linear models of co-regionalization. The latter allows for more flexible modeling of the inter-task correlations by composing multiple features. We derive uniform error bounds for vector-valued functions sampled from a Gaussian process with a linear model of co-regionalization kernel. Furthermore, we show the potential improvement of performance using linear models of co-regionalization in a numerical comparison on a safe multi-task Bayesian optimization benchmark.
[56] arXiv:2605.13315 (cross-list from cs.ET) [pdf, ps, other]: Title: Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven Validation

Authors: Johnson Zhou, Daniel Tanneberg, Forough Habibollahi, Alon Loeffler, Kiaran Lawson, Valentina Baccetti, Kwaku Dad Abu-Bonsrah, Candice Desouza, Finn Doensen, Bradley Watmuff, Daria Kornienko, Azin Azadi, Justin Leigh Bourke, Bernhard Sendhoff, Brett J. Kagan

Subjects: Emerging Technologies (cs.ET); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Systems and Control (eess.SY); Neurons and Cognition (q-bio.NC)

Biological neural networks (BNNs) have been established as a powerful and adaptive substrate that offers the potential for incredibly energy- and data-efficient information processing with distinct learning mechanisms. Yet a core challenge to utilizing BNNs for neurocomputation is determining the optimal encoding and decoding mechanisms between the traditional silicon computing interface and the living biology. Here, we propose an Embodied Neurocomputation framework as a systems-level approach to this multi-variable optimization encoding/decoding problem. We operationalize this approach through the first large-scale parameter optimization of encoding configurations for a BNN agent performing closed-loop navigation along an odor-style gradient in a simulated grid-world. Despite the relative simplicity of the task, the biological interactions gave rise to a massive multi-combinatorial search space for optimal parameters. By considering how the components of the system are interconnected and parameterized, we evaluated approximately 1,300 parameter combinations, over 4,000 hours of real-time agent-environment interactions, to identify 12 configurations that consistently demonstrated learning across multiple episodes. These configurations achieved significantly higher task performances than optimized silicon-based DQN agents under the same interaction budget. These findings represent an initial step toward robust and scalable goal-oriented learning using BNNs. Our framework establishes a foundation for applying task-driven neurocomputing and supports the development of field-wide benchmarks. In the long term, this work supports the development of hybrid bio-silicon architectures capable of efficient, adaptive and real-time computation, including the potential for robotic control applications.
[57] arXiv:2605.13587 (cross-list from stat.ML) [pdf, ps, other]: Title: Reframing preprocessing selection as model-internal calibration in near-infrared spectroscopy: A large-scale benchmark of operator-adaptive PLS and Ridge models

Authors: Gregory Beurier, Robin Reiter, Camille Noûs, Lauriane Rouan, Denis Cornet

Comments: 23 pages, 4 figures; includes supplementary material. Code: this https URL

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)

Near-infrared spectroscopy (NIRS) is rapid and non-destructive, but reliable calibration still depends heavily on spectral preprocessing. In routine practice, preprocessing is often selected by large external pipeline searches that are costly, unstable on small calibration sets, and difficult to audit. We introduce operator-adaptive calibration, a framework that moves linear preprocessing selection inside the calibration model. Candidate treatments are encoded as linear spectral operators, while nonlinear or sample-adaptive corrections such as SNV, MSC, and ASLS are handled as fold-local branches to prevent leakage.
We instantiate the framework for PLS and Ridge regression. For PLS, covariance identities enable fast NIPALS and SIMPLS variants while preserving original-wavelength coefficients. For Ridge, operator-adaptive kernels yield a dual formulation with recoverable original-space coefficients. The approach was evaluated on more than 50 heterogeneous NIRS datasets against conventional PLS, Ridge, CatBoost, and CNN baselines under documented search budgets.
Compact operator-adaptive PLS with ASLS branch preprocessing achieved a median RMSEP/PLS ratio of 0.960 with 42 wins on 57 datasets, while a deployable AOM-Ridge selector improved over tuned Ridge by a median 2.22% with 35 wins on 52 datasets. The proposed models reduce dependence on large preprocessing-HPO campaigns, produce traceable operator choices, retain interpretable coefficients, and fit in seconds for compact AOM-PLS. Operator-adaptive calibration therefore offers a practical route to faster, more robust, and more auditable NIRS method development.
[58] arXiv:2605.13713 (cross-list from cs.CV) [pdf, ps, other]: Title: Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based Optimization

Authors: Isabella Poles, Simon Arberet, Riqiang Gao, Martin Kraus, Marco D. Santambrogio, Florin C. Ghesu, Ali Kamen, Dorin Comaniciu

Comments: Early Accept at MICCAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Volumetric Modulated Arc Therapy (VMAT) is a cornerstone of modern radiation therapy, enabling highly conformal tumor irradiation and healthy-tissue sparing. Yet, its planning solves inverse and nested optimization for multi-leaf collimators, monitor units and dose parameters, while enforcing their consistency to ensure mechanical deliverability. Moreover, this process often requires repeated re-optimization when treatment configurations change, resulting in substantial planning time per patient. To address these problems, we present a diffusion-driven Learning-to-Optimize (L2O) method for end-to-end VMAT planning. A distribution-matching distilled diffusion model learns a clinically feasible manifold of fluence maps, enabling their one-shot generation. On top of this, an LSTM-based L2O module learns gradient update dynamics to swiftly refine fluence maps toward prescribed dose objectives during inference. Experimental results on clinical and public prostate cancer cohorts demonstrate improved planning efficiency, flexibility, and machine deliverability over currently available end-to-end VMAT planners.
[59] arXiv:2605.13748 (cross-list from cs.RO) [pdf, ps, other]: Title: TinySDP: Real Time Semidefinite Optimization for Certifiable and Agile Edge Robotics

Authors: Ishaan Mahajan, Jon Arrizabalaga, Andrea Grillo, Fausto Vega, James Anderson, Zachary Manchester, Brian Plancher

Comments: Accepted to Robotics: Science and Systems (RSS) 2026. 11 pages, 5 figures, 2 tables.

Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

Semidefinite programming (SDP) provides a principled framework for convex relaxations of nonconvex geometric constraints in motion planning, yet existing solvers are too computationally expensive for real-time control, particularly on resource-constrained embedded systems. To address this gap, we introduce TinySDP, the first semidefinite programming solver designed for embedded systems, enabling real-time model-predictive control (MPC) on microcontrollers for problems with nonconvex obstacle constraints. Our approach integrates positive-semidefinite cone projections into a cached-Riccati-based ADMM solver, leveraging computational structure for embedded tractability. We pair this solver with an a posteriori rank-1 certificate that converts relaxed solutions into explicit geometric guarantees at each timestep. On challenging benchmarks, e.g., cul-de-sac and dynamic obstacle avoidance scenarios that induce failures in local methods, TinySDP achieves collision-free navigation with up to 73% shorter paths than state-of-the-art baselines. We validate our approach on a Crazyflie quadrotor, demonstrating that semidefinite constraints can be enforced at real-time rates for agile embedded robotics.
[60] arXiv:2605.13751 (cross-list from cs.RO) [pdf, ps, other]: Title: Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous Vehicles

Authors: Yizhuo Xiao, Haotian Yan, Ying Wang, Zhongpan Zhu, Yuxin Zhang, Xintao Yan, Mustafa Suphi Erden, Cheng Wang

Subjects: Robotics (cs.RO); Software Engineering (cs.SE); Systems and Control (eess.SY)

Establishing trustworthy safety assurance for autonomous driving systems (ADSs) requires evidence that failures arise from avoidable system deficiencies rather than unavoidable traffic conflicts. Current adversarial simulation methods can efficiently expose collisions, but generally lack mechanisms to distinguish these fundamentally different failure modes. Here we present CARS (Context-Aware, Responsibility-attributed Scenario generation), a framework that integrates responsibility attribution directly into adversarial scenario generation. CARS combines context-aware adversary selection with a generative adversarial policy optimized in closed-loop simulation to construct collision scenarios that are both physically feasible and diagnostically attributable. Across benchmark datasets spanning heterogeneous national traffic environments, CARS consistently discovers feasible collision scenarios with high attribution rates under multiple regulation-prescribed careful and competent driver models. By coupling adversarial generation with normative responsibility assessment, CARS moves simulation testing beyond collision discovery toward the construction of interpretable, regulation-aligned safety evidence for scalable ADS validation.
[61] arXiv:2605.13822 (cross-list from cs.RO) [pdf, ps, other]: Title: Loiter UAV Reinsertion Guidance for Fixed-wing UAV Corridors

Authors: Pradeep J, Kedarisetty Siddhardha, Ashwini Ratnoo

Journal-ref: AIAA SCITECH 2026

Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

This paper considers fixed-wing unmanned aerial vehicle (UAV) corridors comprising a main lane, a circular loiter lane for managing traffic congestion, and transit lanes connecting the two. In particular, we address the problem of conflict-free reinsertion of UAVs from the loiter lane back into the main lane. The loiter lane contains a fixed number of equidistant virtual slots that UAVs can occupy. Reinsertion of loiter UAVs into the main lane becomes essential either due to reduced traffic in the main lane or due to a loiter UAV needing to reach its destination urgently. Given the total number of loiter slots, UAV speed limits, and the minimum safety distance, a guidance algorithm is developed to compute the required speed of a loiter UAV in the transit lane to ensure safe reinsertion. The proposed guidance and automation strategies are validated through numerical simulations.

Replacements for Thu, 14 May 26

[62] arXiv:2307.14072 (replaced) [pdf, ps, other]: Title: Negative Spin $\Delta_T$ noise Induced by Spin-Flip Scattering and Andreev Reflection

Authors: Sachiraj Mishra, Colin Benjamin

Comments: 14 pages, 6 figures, 2 tables, accepted for publication in Journal of Physics: Condensed Matter

Journal-ref: Journal of Physics: Condensed Matter (2026)

Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Signal Processing (eess.SP); Applied Physics (physics.app-ph); Quantum Physics (quant-ph)
[63] arXiv:2411.15913 (replaced) [pdf, ps, other]: Title: Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms

Authors: Heehwan Wang, Joonwoo Kwon, Sooyoung Kim, Jungwoo Seo, Shinjae Yoo, Yuewei Lin, Jiook Cha

Comments: Accepted by ICIP 2026

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[64] arXiv:2502.09592 (replaced) [pdf, ps, other]: Title: A Data-Driven Method for Microgrid System Identification: Physically Consistent Sparse Identification of Nonlinear Dynamics

Authors: Mohan Du, Xiaozhe Wang

Comments: 5 pages, 5 figures, 2025 IEEE Power & Energy Society General Meeting (PESGM), Accepted

Subjects: Systems and Control (eess.SY)
[65] arXiv:2502.20427 (replaced) [pdf, ps, other]: Title: DeePen: Penetration Testing for Audio Deepfake Detection

Authors: Nicolas Müller, Piotr Kawa, Adriana Stan, Thien-Phuc Doan, Souhwan Jung, Wei Herng Choong, Philip Sperl, Konstantin Böttinger

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[66] arXiv:2506.22117 (replaced) [pdf, ps, other]: Title: Safe Multi-Agent Navigation via Constrained HJB-Informed Learning

Authors: Fenglan Wang, Xinguo Shu, Lei He, Lin Zhao

Comments: Accepted by Robotics: Science and Systems (RSS 2026)

Subjects: Systems and Control (eess.SY)
[67] arXiv:2507.02385 (replaced) [pdf, ps, other]: Title: Parameter estimation of range-migrating targets using OTFS signals from LEO satellites

Authors: Tong Ding, Luca Venturino, Emanuele Grossi

Comments: submitted to IEEE journal for possible publication

Subjects: Signal Processing (eess.SP)
[68] arXiv:2508.11351 (replaced) [pdf, ps, other]: Title: Important Bit Prefix M-ary Quadrature Amplitude Modulation for Semantic Communications

Authors: Haonan Lu, Rui Meng, Xiaodong Xu, Yiming Liu, Ping Zhang, Dusit Niyato

Subjects: Signal Processing (eess.SP)
[69] arXiv:2508.14950 (replaced) [pdf, ps, other]: Title: Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI

Authors: Oliver Welin Odeback, Arivazhagan Geetha Balasubramanian, Jonas Schollenberger, Edward Ferdiand, Alistair A. Young, C. Alberto Figueroa, Susanne Schnell, Outi Tammisola, Ricardo Vinuesa, Tobias Granberg, Alexander Fyrdahl, David Marlevi

Comments: 26 pages, 10 figures

Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[70] arXiv:2508.20474 (replaced) [pdf, ps, other]: Title: Unifying Diarization, Separation, and ASR with Multi-Speaker Encoder

Authors: Muhammad Shakeel, Yui Sudo, Yifan Peng, Chyi-Jiunn Lin, Shinji Watanabe

Comments: Accepted to IEEE ASRU 2025

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[71] arXiv:2510.14244 (replaced) [pdf, ps, other]: Title: Reinforcement Learning for Unsupervised Domain Adaptation in Spatio-Temporal Echocardiography Segmentation

Authors: Arnaud Judge, Nicolas Duchateau, Thierry Judge, Roman A. Sandler, Joseph Z. Sokol, Christian Desrosiers, Olivier Bernard, Pierre-Marc Jodoin

Comments: 13 pages, accepted for publication in IEEE TMI

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2510.19471 (replaced) [pdf, ps, other]: Title: Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition

Authors: Yuu Jinnai

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[73] arXiv:2510.20152 (replaced) [pdf, ps, other]: Title: Soft Switching Expert Policies for Controlling Systems with Uncertain Parameters

Authors: Junya Ikemoto

Comments: 7 pages, 8 figures. Accepted to IFAC World Congress 2026

Subjects: Systems and Control (eess.SY)
[74] arXiv:2511.06257 (replaced) [pdf, ps, other]: Title: Fast Reconstruction of Motion-Corrupted Data with Mobile-GRAPPA: Motion and dB0 Inhomogeneity Correction Leveraging Efficient GRAPPA

Authors: Yimeng Lin, Nan Wang, Daniel Abraham, Daniel Polak, Xiaozhi Cao, Aizada Nurdinova, Stephen Cauley, Kawin Setsompop

Comments: 20 pages, 9 figures

Subjects: Signal Processing (eess.SP)
[75] arXiv:2511.06874 (replaced) [pdf, ps, other]: Title: Radio-Coverage-Aware Path Planning for Cooperative Autonomous Vehicles

Authors: Giuseppe Baruffa, Luca Rugini, Francesco Binucci, Fabrizio Frescura, Paolo Banelli, Renzo Perfetti

Comments: 11 pages, 19 figures

Subjects: Signal Processing (eess.SP); Systems and Control (eess.SY)
[76] arXiv:2511.08499 (replaced) [pdf, ps, other]: Title: Approaching Safety-Argumentation-by-Design: A Requirement-based Safety Argumentation Life Cycle for Automated Vehicles

Authors: Marvin Loba, Robert Graubohm, Niklas Braun, Nayel Fabian Salem, Andreas Dotzler, Marcus Nolte, Torben Stolte, Richard Schubert, Markus Maurer

Comments: Accepted to be published in 2026 IEEE 29th International Conference on Intelligent Transportation Systems (ITSC), Naples, Italy, September 15-18, 2026

Subjects: Systems and Control (eess.SY)
[77] arXiv:2511.19383 (replaced) [pdf, ps, other]: Title: A Hybrid Learning-to-Optimize Framework for Mixed-Integer Quadratic Programming

Authors: Viet-Anh Le, Mu Xie, Rahul Mangharam

Comments: final L4DC 2026

Subjects: Systems and Control (eess.SY)
[78] arXiv:2511.20383 (replaced) [pdf, ps, other]: Title: Accelerating Time-Optimal Trajectory Planning for Connected and Automated Vehicles with Graph Neural Networks

Authors: Viet-Anh Le, Andreas A. Malikopoulos

Comments: final IFAC WC 2026

Subjects: Systems and Control (eess.SY)
[79] arXiv:2512.06109 (replaced) [pdf, ps, other]: Title: Unifying Entropy Regularization in Optimal Control: From and Back to Classical Objectives via Iterated Soft Policies and Path Integral Solutions

Authors: Ajinkya Bhole, Mohammad Mahmoudi Filabadi, Guillaume Crevecoeur, Tom Lefebvre

Comments: refurbished introduction, added a few remarks, reduced size

Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[80] arXiv:2512.20211 (replaced) [pdf, ps, other]: Title: Aliasing-Free Neural Audio Synthesis

Authors: Yicheng Gu, Junan Zhang, Chaoren Wang, Jerry Li, Zhizheng Wu, Lauri Juvela

Comments: Accepted by TASLP

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[81] arXiv:2601.02053 (replaced) [pdf, ps, other]: Title: Ageing Monitoring for Commercial Microcontrollers Based on Timing Windows

Authors: Leandro Lanzieri, Jiri Kral, Goerschwin Fey, Holger Schlarb, Thomas C. Schmidt

Subjects: Hardware Architecture (cs.AR); Systems and Control (eess.SY)
[82] arXiv:2601.12695 (replaced) [pdf, ps, other]: Title: From Noise to Knowledge: System Identification with Systematic Polytope Construction via Cyclic Reformulation

Authors: Hiroshi Okajima, Shun Shirahama, Tatsunori Hayashi, Nobutomo Matsunaga

Subjects: Systems and Control (eess.SY)
[83] arXiv:2601.22792 (replaced) [pdf, ps, other]: Title: CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASR

Authors: Muhammad Shakeel, Yosuke Fukumoto, Chikara Maeda, Chyi-Jiunn Lin, Shinji Watanabe

Comments: Accepted to IEEE ICASSP 2026

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[84] arXiv:2602.01629 (replaced) [pdf, ps, other]: Title: AdaptNC: Adaptive Nonconformity Scores for Conformal Prediction under Distribution Shift

Authors: Renukanandan Tumu, Aditya Singh, Rahul Mangharam

Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[85] arXiv:2602.07029 (replaced) [pdf, ps, other]: Title: Guidestar-Free Adaptive Optics with Asymmetric Apertures

Authors: Weiyun Jiang, Haiyun Guo, Christopher A. Metzler, Ashok Veeraraghavan

Comments: Accepted to ACM Transactions on Graphics (TOG)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2602.16253 (replaced) [pdf, ps, other]: Title: How Much Does Machine Identity Matter in Anomalous Sound Detection at Test Time?

Authors: Kevin Wilkinghoff, Keisuke Imoto, Zheng-Hua Tan

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[87] arXiv:2603.02245 (replaced) [pdf, ps, other]: Title: LMU-Based Sequential Learning and Posterior Ensemble Fusion for Cross-Domain Infant Cry Classification

Authors: Niloofar Jazaeri, Hilmi R. Dajani, Marco Janeczek, Martin Bouchard

Comments: 7 pages, to appear in Proc. Int. Conf. IEEE Engineering in Medicine and Biology Society (EMBC 2026), Toronto, Canada, July 26-30 2026

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[88] arXiv:2603.09708 (replaced) [pdf, ps, other]: Title: Adapting a Text-to-Audio Model for Room Impulse Response Generation

Authors: Kirak Kim, Sungyoung Kim

Comments: 4 pages, 1 figure, submitted to IWAENC 2026

Subjects: Audio and Speech Processing (eess.AS)
[89] arXiv:2603.19910 (replaced) [pdf, ps, other]: Title: Learning Adaptive Parameter Policies for Nonlinear Bayesian Filtering

Authors: Ondrej Straka, Felipe Giraldo-Grueso, Renato Zanetti

Comments: Accepted for presentation at 29th International Conference on Information Fusion

Subjects: Systems and Control (eess.SY)
[90] arXiv:2603.22267 (replaced) [pdf, ps, other]: Title: TiCo: Time-Controllable Spoken Dialogue Model

Authors: Kai-Wei Chang, Wei-Chih Chen, En-Pei Hu, Hung-yi Lee, James Glass

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[91] arXiv:2604.13505 (replaced) [pdf, ps, other]: Title: Cascaded TD3-PID Hybrid Controller for Quadrotor Trajectory Tracking in Wind Disturbance Environments

Authors: Yukang Zhang, Shuqi Chai, Yuhang Zhang, Danlan Huang, Quanbo Ge

Subjects: Systems and Control (eess.SY)
[92] arXiv:2604.14842 (replaced) [pdf, ps, other]: Title: Simplification Ad Absurdum? Revisiting Gas Flow Modeling for Integrated Energy System Planning

Authors: Thomas Klatzer, Yannick Werner, Sonja Wogrin

Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[93] arXiv:2604.18263 (replaced) [pdf, ps, other]: Title: Passive RIS Is Not Silent: Revisiting Performance Limits Under Thermal Noise

Authors: Farjam Karim, Deepak Kumar, Prathapasinghe Dharmawansa, Nurul Huda Mahmood, Arthur Sousa de Sena, Matti Latva-aho

Subjects: Signal Processing (eess.SP)
[94] arXiv:2604.24610 (replaced) [pdf, ps, other]: Title: Matching-free Acquisition of Channels with Anisotropic Wavefronts

Authors: Heling Zhang, Shidong Zhou

Subjects: Signal Processing (eess.SP)
[95] arXiv:2605.07653 (replaced) [pdf, ps, other]: Title: Aquatic Neuromorphic Optical Flow

Authors: Pei Zhang, Yunkai Liang, Kaiqiang Wang

Comments: This work is under review.

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[96] arXiv:2605.11940 (replaced) [pdf, ps, other]: Title: Lane-Aware Graph Attention Network for Multi-Vehicle Trajectory Prediction in Expressway Merge Zones

Authors: Eni Solomon Laughter

Subjects: Systems and Control (eess.SY)
[97] arXiv:2605.12244 (replaced) [pdf, ps, other]: Title: Estimation Problems and the Modulating Function Method: The Algebra of Modulating Functions

Authors: Davi G. Accioli, Jerome Jouffroy

Comments: 13 pages, 6 figures

Subjects: Systems and Control (eess.SY); Rings and Algebras (math.RA)
