Electrical Engineering and Systems Science
New submissions
[ showing up to 2000 entries per page: fewer | more ]
New submissions for Thu, 14 May 26
- [1] arXiv:2605.12533 [pdf, ps, other]
-
Title: Investigation of Chaotic Behavior in Clapp OscillatorJournal-ref: IcETRAN-2018, Pali\'{c}, Serbia, 2018Subjects: Systems and Control (eess.SY); Chaotic Dynamics (nlin.CD)
In this paper we investigate the chaotic behavior of the class of oscillators denoted as Clapp oscillators. Clapp oscillator is a simple oscillator containing one transistor and a few reactive elements - inductors and capacitors. This oscilllator is chosen for its design simplicity and a good performance. Oscillator with chaotic behavior can be used to construct chaotic radar. For that matter, in this paper is investigated approach for construction of the chaotic Clapp oscillator, which can be further verified experimentally using microstrip technology.
- [2] arXiv:2605.12541 [pdf, ps, other]
-
Title: PG-LRF: Physiology-Guided Latent Rectified Flow for Electro-Hemodynamic PPG-to-ECG GenerationAuthors: Xiaoda Wang, Minxiao Wang, Kaiqiao Han, Defu Cao, Ching Chang, Yidan Shi, Runze Yan, Xiao Luo, Yan Liu, Xiao Hu, Yizhou Sun, Wei Wang, Carl YangSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Electrocardiography (ECG) is the clinical standard for cardiac assessment but requires dedicated hardware that does not scale to daily-life monitoring. Photoplethysmography (PPG) is ubiquitous in wearables but lacks ECG-specific diagnostic morphology and is corrupted by motion and sensor noise. PPG-to-ECG generation aims to bridge this gap by recovering electrical morphology and timing from peripheral pulse signals. However, existing methods largely rely on statistical alignment and data-driven generation. They fail to explicitly structure the latent space around physiology-aware electro-hemodynamic factors and lack constraints from forward physiological dynamics. To address these challenges, we propose PG-LRF, a physiology-guided latent rectified flow framework. PG-LRF introduces an electro-hemodynamic simulator that co-models ECG and PPG through shared cardiac phase dynamics. Guided by this simulator, a Physiology-Aware AutoEncoder learns a structured electro-hemodynamic latent space. Then we integrate this simulator guidance into a PPG-conditioned latent rectified flow, enforcing ECG-side morphology consistency and ECG-to-PPG forward hemodynamic consistency during generative transport. Experiments on the large-scale MC-MED dataset demonstrate that PG-LRF significantly improves PPG-to-ECG generation and downstream cardiovascular disease classification, proving its ability to generate ECGs that are both signal-faithful and physiologically plausible under the ECG-to-PPG hemodynamic pathway
- [3] arXiv:2605.12553 [pdf, ps, other]
-
Title: ChannelKAN: Multi-Scale Dual-Domain Channel Prediction via Hybrid CNN-KAN ArchitectureSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)
Accurate channel state information (CSI) prediction is essential for improving the reliability and spectral efficiency of massive MIMO-OFDM systems in high-mobility scenarios. Existing deep learning methods struggle to jointly capture short-term local variations and long-range nonlinear dependencies in CSI sequences. To address this challenge, we propose ChannelKAN, a hybrid CNN-KAN channel prediction model with multi-scale frequency domain information enhancement. The key insight is that CNNs and Kolmogorov-Arnold Networks (KANs) are naturally complementary: CNNs extract intra-time-step local spatial-frequency correlations, while KANs with learnable Chebyshev polynomial activations fit inter-time-step nonlinear temporal evolution in a holistic manner. Specifically, a dual-domain expansion module first generates complementary frequency-domain and delay-domain CSI representations. A multi-scale frequency information enhancement module then retains dominant spectral components at multiple scales to strengthen key features and suppress noise. Next, a CNN-KAN feature extraction module captures local correlations via cascaded convolutions and models long-range dependencies via Chebyshev KAN layers. Finally, a dual-domain fusion module adaptively integrates features from both branches to produce the prediction. Experiments on 3GPP-compliant QuaDRiGa datasets demonstrate that ChannelKAN outperforms RNN, LSTM, GRU, CNN, and Transformer baselines in normalized mean square error (NMSE), spectral efficiency (SE), and bit error rate (BER) across various velocities and signal-to-noise ratios. Ablation studies further confirm the effectiveness of each proposed module.
- [4] arXiv:2605.12557 [pdf, ps, other]
-
Title: Localization in OFDM Passive Distributed Antenna Systems with Pilots and Unknown Data Payloads: A Marginal Maximum Likelihood ApproachComments: 13 pages, 10 figures, submitted to IEEE Transactions on Radar SystemsSubjects: Signal Processing (eess.SP)
Integrated Sensing and Communications (ISAC) is emerging as a key paradigm for future Sixth-Generation (6G) networks, with communication-centric designs favored for their compatibility with existing standards. Communication signals contain both known deterministic pilot symbols and unknown random data payloads. Most localization approaches rely solely on pilots, discarding the position information contained in the data symbols, which constitute the majority of each transmitted frame. Alternatively, Decision-Directed (DD) approaches exploit data decisions, thereby inherently limiting positioning performance to that of the communication system. In this paper, we derive a Marginal Maximum Likelihood (MML) estimator that jointly leverages pilot and data payloads without requiring data decoding, enabling operation with high-order constellations and under challenging noise conditions. We consider an opportunistic scenario in which an Orthogonal Frequency-Division Multiplexing (OFDM) signal transmitted by a User Equipment (UE) is captured by a distributed receiver array. Through numerical simulations, we demonstrate that the proposed method achieves superior localization performance compared to existing approaches and consistently converges to the genie bound (where data symbols are assumed perfectly known) at a lower Signal-to-Noise Ratio (SNR) than any DD method. Furthermore, the proposed method remains robust to constellation size, unlike DD approaches, whose performance degrades with increasing modulation order. Finally, we provide a computational complexity analysis of the proposed method and the considered baselines, highlighting the impact of system parameters on their respective computational costs.
- [5] arXiv:2605.12560 [pdf, ps, other]
-
Title: Brain Tumor Classification in MRI Images: A Computationally Efficient Convolutional Neural NetworkJournal-ref: 2025 IEEE International Conference on Biomedical Engineering, Computer and Information Technology for Health (BECITHCON), pp. 633-638, 2025Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Improving patient outcomes depends on the prompt and accurate diagnosis of brain tumors, but manual MRI scan analysis is still time-consuming and unreliable. Although deep learning has shown promise, many of the models that are now in use are computationally intensive and have difficulty handling the intrinsic complexity and variety of different types of brain tumors. In this work, we propose a lightweight yet high-performing Convolutional Neural Network (CNN) for multi-class brain tumor classification, employing MRI images to target gliomas, meningiomas, pituitary tumors, and healthy (no tumor) instances. The model was rigorously evaluated on two publicly accessible datasets from Figshare and Kaggle. Leveraging efficient feature extraction and optimized training strategies, our CNN achieved classification accuracies of 99.03% and 99.28%, along with ROC scores of 99.88% and 99.94% on Dataset 1 and Dataset 2, respectively-all while utilizing significantly fewer parameters than popular pre-trained architectures. In contrast to cutting-edge models like DenseNet201, MobileNetV2, VGG19, Xception, InceptionV3, and ResNet50, our approach consistently demonstrated superior performance with reduced computational overhead. These findings highlight the potential of the proposed model as a practical and reliable diagnostic aid in clinical environments.
- [6] arXiv:2605.12562 [pdf, ps, other]
-
Title: Uncovering Latent Pathological Signatures in Pulmonary CT via Cross-Window Knowledge DistillationAuthors: Bo Peng, Wujian Xu, Kun Wang, Ximing Liao, Na Wang, Daqian Shi, Tian Li, Jing Gao, Johan Thygesen, Yingqun Ji, Honghan WuSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Multi-window CT imaging captures complementary pathological information across anatomical structures of differing densities, yet existing deep learning methods fuse representations only at later stages, missing cross-density interactions. We propose a cross-window knowledge distillation framework in which student encoders learn latent clinical priors from a teacher trained on the most informative window. Evaluated retrospectively on three cohorts - COPD-CT-DF (n=719), RSNA PE (n=1,433), and an in-house CTEPD dataset (n=161) - distillation improved per-window AUC by 10.1-16.5 percentage points on COPD-CT-DF (0.75-0.81 to 0.90-0.94; all P<0.001), with ensemble AUC reaching 0.9960. Similar gains were observed on RSNA PE (0.80-0.83 to 0.90-0.92) and CTEPD (AUC 0.7481 vs. 0.6264). Cross-window distillation internalises pathological signatures invisible to supervised approaches, offering a generalisable solution for multi-window pulmonary CT analysis.
- [7] arXiv:2605.12564 [pdf, ps, other]
-
Title: Multiport Antenna Q-factorSubjects: Signal Processing (eess.SP); Information Theory (cs.IT)
This article proposes an estimate of multiport antenna bandwidth based on a generalization of a single-port Q-factor. The explicit derivation is based on converting the stored energy matrix to its port equivalent and on the port parameters themselves. The work discusses the bandwidth dependencies on feeding and matching. Derived formulas are shown to utilize the total active reflection coefficient and allow for a single-frequency bandwidth evaluation. Examples comprising two different dipole arrays and electrically large patch antenna arrays validate the theory.
- [8] arXiv:2605.12566 [pdf, ps, other]
-
Title: On Privacy-Preserving Image Transmission in Low-Altitude Networks: A Swin Transformer-Based Framework with Federated LearningComments: 13 pages, 10 figures, 2 tablesSubjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
The rapid development of low-altitude economy has driven the proliferation of Unmanned Aerial Vehicle (UAV) applications, including logistics, inspection, and emergency response. However, transmitting high-volume image data from UAVs to ground stations faces significant challenges due to limited bandwidth and stringent privacy requirements. To address these issues, a Semantic Communication (SC) framework based on Federated Learning (FL) is proposed for efficient and privacy-preserving image transmission. A Swin Transformer-based Semantic Communication (STSC) architecture is designed to extract multi-scale semantic features under constrained bandwidth conditions. Dedicated communication and computing nodes are deployed on UAVs to enhance real-time coverage and flexibility. Meanwhile, a FL mechanism enables global model training across distributed devices without sharing raw data, thus preserving user privacy. Simulation experiments conducted on the CIFAR-10 dataset demonstrate that the proposed STSC framework achieves at least 5.7 dB improvement in Peak Signal-to-Noise Ratio (PSNR) compared to DeepJSCC baselines, while also showing superior convergence and generalization performance. The framework effectively integrates UAV-assisted deployment with SC and privacy protection, offering a practical solution for bandwidth-constrained image transmission in low-altitude networks.
- [9] arXiv:2605.12569 [pdf, ps, other]
-
Title: Active Sensing with Meta-Reinforcement Learning for Emitter Localization from RF ObservationsAuthors: M. Shamail J. Khan, Nisha L. Raichur, Lucas Heublein, Christian Wielenberg, Alexander Mattick, Tobias Feigl, Christopher Mutschler, Felix OttSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)
Global navigation satellite system (GNSS) interference poses a serious threat to reliable positioning, especially in indoor and multipath-rich environments where source localization is highly challenging. In this paper, we formulate GNSS interference localization as an active sensing problem and propose a reinforcement learning (RL) framework in which an agent sequentially explores the environment to infer the position of an emitter source from radio frequency (RF) observations acquired with a 2x2 patch antenna. The localization task is modeled as a partially observable decision process, since single-snapshot measurements are often ambiguous under multipath propagation and changing channel conditions. To address this, the proposed framework combines high-dimensional RF sensing with deep RL and recurrent policy learning. We investigate both value-based and policy-based approaches, namely Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO), and study their behavior under domain shift. The approach is evaluated on a simulated dataset generated with the Sionna ray-tracing module, which provides realistic propagation effects and diverse environment configurations. Experimental results show that the proposed method achieves a localization success rate of 80.1%, demonstrating the potential of RL for adaptive GNSS interference localization. Overall, the results highlight simulation-assisted training as a promising direction for robust interference localization in challenging propagation environments.
- [10] arXiv:2605.12575 [pdf, ps, other]
-
Title: Are Compact Rationales Free? Measuring Tile Selection Headroom in Frozen WSI-MILSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Whole-slide image (WSI) multiple instance learning (MIL) classifiers can achieve strong slide-level AUC while leaving the full-bag prediction opaque. Attention scores are widely reused as post-hoc explanations, but high attention can reflect aggregation preference rather than a compact, model-sufficient rationale. We study post-hoc rationale highlighting for frozen WSI-MIL: given a trained classifier, can its slide-level prediction be recovered from a compact, output-consistent tile subset without retraining the backbone? We instantiate this with Finding Optimal Contextual Instances (FOCI), a lightweight rationale-readout layer over a frozen MIL backbone. FOCI is trained with model-output sufficiency and exclusion objectives over keep/drop tile subsets, evaluated with an insertion-style Sequential Reveal Protocol (SRP) adapted to WSI-MIL, and summarized by the Selection Headroom Index (SHI). Across three WSI benchmarks and seven MIL backbones, FOCI reveals that compact rationales are selection-headroom dependent: transformer and multi-branch attention aggregators can admit compact rationales, near-minimal attention-pooling baselines enter a selection-saturation regime, and hard-selection backbones can conflict with an external readout. For TransMIL, relative to its documented CLS-proxy ranking, FOCI reduces the Minimum Sufficient K (MSK) tile count by 32-56% across benchmarks, while ACMIL+FOCI attains the highest mean SHI (+0.465). Deletion-based perturbation and selected-only downstream evaluation provide complementary checks. These results position FOCI as a model-level interpretability and audit layer: selected tiles are not claims of clinical or pathologist-level diagnostic sufficiency, but candidate rationales that offer a compact, reviewable view of when a frozen MIL prediction can be localized to a small output-consistent subset.
- [11] arXiv:2605.12578 [pdf, ps, other]
-
Title: Recurrent Transformer-Based Near- and Far-Field THz Wideband Channel Estimation for UM-MIMOAuthors: Dmitry Artemasov (1), Alexander Shmatok (1), Kirill Andreev (1), Alexey Frolov (1), Manjesh K. Hanawal (2), Nikola Zlatanov (3) ((1) Center for Next Generation Wireless and IoT, Skolkovo Institute of Science and Technology, Moscow, Russia, (2) Department of IEOR, Indian Institute of Technology Bombay, India, (3) Faculty of Computer and Engineering Sciences, Innopolis University, Innopolis, Russia)Comments: 15 pages, 15 figuresJournal-ref: IEEE Access, vol. 13, pp. 205396-205411, 2025Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG)
The integration of terahertz communications and ultra-massive multiple-input multiple-output (UM-MIMO) systems in 6G networks is motivated by their ability to enable unprecedented data rates, mitigate spectrum congestion, and enhance overall network performance. However, the enlarged antenna apertures and higher carrier frequencies in these systems increase the Rayleigh distance, causing users to span both the near-field and conventional far-field regions. Accurate spatial precoding thus requires exact channel estimation at the base station - a task made more challenging by the hybrid coexistence of near- and far-field effects and the limited number of digital chains available in hybrid beamforming architectures. In this paper, we propose a block recurrent transformer model to address this challenge. We demonstrate that a single transformer block equipped with state memory can be trained once and then iteratively applied for hybrid-field channel estimation. Furthermore, we train the model such that it generalizes to wireless channels with varying scatterer distances, different numbers of propagation paths, and wideband operation. Simulation results show that the proposed method achieves performance gains of approximately 5 dB and 7.5 dB in normalized mean squared error (NMSE) over state-of-the-art solutions in narrowband and wideband scenarios, respectively.
- [12] arXiv:2605.12728 [pdf, ps, other]
-
Title: Grid-Orch: An LLM-Powered Orchestrator for Distribution Grid Simulation and AnalyticsSubjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
The power distribution engineering workforce faces a projected shortage of up to 1.5 million engineers by 2030, creating urgent demand for more accessible analysis tools. This paper introduces Grid-Orch, a framework that bridges Large Language Models (LLMs) and power system simulation through the Model Context Protocol (MCP), enabling engineers to perform complex distribution analyses via natural language. Using OpenDSS as the reference implementation, Grid-Orch provides 36 domain-specific tools across eleven categories, covering power flow, voltage analysis, quasi-static time series (QSTS) simulation, and automated optimization. A provider-agnostic LLM layer supports both cloud-hosted (Gemini, Claude) and locally deployed (Ollama, llama-cpp) models, enabling air-gapped operation for security-sensitive utility environments. Three optimization skills, capacitor placement, voltage violation analysis, and overvoltage mitigation, extend the platform beyond single-tool queries to multi-step engineering workflows. Grid-Orch is delivered as an interactive web platform with chat-based interaction, a QSTS dashboard, and feeder topology visualization, and renders simulation results inline. Workflow demonstrations show that distribution analyses formerly requiring hours of scripting, such as distributed energy resource (DER) interconnection screening, complete in under two minutes through natural language, producing numerically identical results to direct OpenDSS scripting.
- [13] arXiv:2605.12753 [pdf, ps, other]
-
Title: Optimization in Sparse 2D to Dense 3D Weakly Supervised Learning: Application to Multi-Label Segmentation of Large ex vivo MRI DataAuthors: Paul Hoareau, Kuan Yi Wang, Brandon Bujak, Roy Sun, Govind Nair, Irene Cortese, Charidimos Tsagkas, Daniel Reich, Julien Cohen-AdadComments: 19 pages. Submitted to Machine Learning for Biomedical Imaging (MELBA). Code and models: this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
INTRODUCTION | Fully supervised 3D segmentation of high-resolution ex vivo MRI is limited by the prohibitive cost of volumetric annotation, forcing reliance on sparse 2D slices. Weakly supervised Sparse-to-Dense frameworks bridge this gap, but guidelines remain ambiguous regarding human-centric visual enhancements and transferring optimization strategies across dimensions. We analyze divergent regularization needs for multi-class segmentation of high-resolution ex vivo spinal cord MRI.
METHODS | We used 9.4T MRI of multiple sclerosis spinal cords (>104,000 slices) with sparse annotations (428 slices). A 2D Teacher trained on sparse slices generated dense pseudo-labels to train a 3D Student. We systematically evaluated the impact of human-centric preprocessing, spatial augmentation, and soft-label regularization on both architectures.
RESULTS | We identified a critical divergence in training dynamics. The 2D Teacher required strong spatial augmentation and soft-labeling to overcome data scarcity, improving White Matter Lesion Dice scores by >11 points. However, propagating these techniques to the 3D Student degraded its performance. Furthermore, human-centric preprocessing (e.g., CLAHE) disrupted global statistical cues, dropping Gray Matter Lesion Dice scores by ~25 points.
DISCUSSION | Our study highlights a perception divergence (human-centric contrast enhancement harms machine models) and a regularization conflict across dimensions. 3D architectures trained on dense pseudo-labels exhibit fundamentally different optimization landscapes than 2D counterparts and require distinct, conservative regularization. Code and models: https://github.com/ivadomed/model_seg_sc-gm-lesion_human_ms_exvivo_t2star. - [14] arXiv:2605.12779 [pdf, ps, other]
-
Title: Safe and Energy-Aware Decentralized PDE-Constrained Optimization-Based Control of Multi-UAVs for Persistent Wildfire SuppressionSubjects: Systems and Control (eess.SY)
This paper presents a safe and energy-aware optimization-based control framework for multi-UAV wildfire suppression under localization and motion uncertainties. We first develop a centralized density-based controller that couples UAV motion and water deployment in a wildfire-specific control Lyapunov function. This framework is then extended to a decentralized setting suitable for large-scale operations using only local information. The controllers use control barrier function constraints to enforce both danger zone avoidance and the ability to reach a charging region. Simulations and real quadcopter experiments demonstrate the controller's effectiveness in fire suppression while preserving safety and energy sufficiency over multiple charge cycles.
- [15] arXiv:2605.12806 [pdf, ps, other]
-
Title: Cross-Harmonic Ambiguity-Aligned Multiport Parameter Estimation for Time-Floquet RISAuthors: Philipp del HougneComments: 5 pages with 4 figuresSubjects: Signal Processing (eess.SP); Applied Physics (physics.app-ph)
A time-Floquet reconfigurable intelligent surface (TF-RIS) periodically modulates its elements within a signaling interval, enabling frequency conversion and additional degrees of freedom compared with a conventional RIS. Time-Floquet multiport-network theory (TF-MNT) provides a physics-consistent model for TF-RISs that accounts for inter-element coupling, but its practical use requires estimating the underlying parameters when the TF-RIS design and radio environment are (partially) unknown. In this Letter, we propose a segmented estimation approach for constructing an accurate proxy TF-MNT model from end-to-end measurements. First, with the TF-RIS operated as a conventional RIS, we estimate conventional proxy MNT parameters independently at each considered time-Floquet harmonic. Second, under periodic time modulation, we align the inherent ambiguities among the per-harmonic conventional proxy MNT parameters, considering three measurement setups with different access to phase and harmonic information. Based on full-wave numerical simulations, we quantify the impact of the number of measurements and the noise level on the proxy-model accuracy. Finally, we demonstrate the performance loss incurred without the proposed ambiguity alignment in a canonical harmonic backscatter communications scenario.
- [16] arXiv:2605.12971 [pdf, ps, other]
-
Title: Port-Hamiltonian Systems with Dissipation Potential: Modelling and Trajectory Tracking ControlComments: 7 pages, 2 figuresSubjects: Systems and Control (eess.SY)
Port-Hamiltonian systems (PHS) and interconnection and damping assignment passivity-based control (IDA-PBC) have achieved broad success in modelling and stabilisation of physical systems. However, the absence of a dedicated scalar potential for the momentum channel forces any modification of the momentum-dependent dynamics to proceed indirectly through the interconnection and damping matrices, rendering the matching partial differential equation (PDE) difficult to solve and complicating extensions to trajectory tracking. This paper proposes a port-Hamiltonian system with dissipation potential (PHS-DP), in which the damping matrix is replaced by scalar convex dissipation potentials, providing independent scalar objects for the momentum and auxiliary state channels and restoring the variational symmetry between stored and dissipated energy. Building on this framework, Dual Potential Shaping Control (DPSC) achieves trajectory tracking by sequentially shaping the potential energy and dissipation potentials without modifying the interconnection structure. Contraction of the closed-loop cascade is established via a hierarchical contraction argument, and the matching condition is satisfied automatically for any admissible choice of shaped potentials, requiring no PDE to be solved. In contrast to existing PDE-free energy shaping approaches, which achieve this by abandoning the port-Hamiltonian closed-loop structure and sacrificing physical interpretability, the proposed framework preserves the interconnection structure and retains a transparent energy-based interpretation at every stage of the design. Validation on a magnetic levitation system demonstrates tracking performance comparable to timed IDA-PBC with substantially reduced design complexity.
- [17] arXiv:2605.13005 [pdf, ps, other]
-
Title: Geometry-Aware Multi-Armed Bandits for Antenna Beam Selection on Spheres, Tori, $\SO(3)$, and Reconfigurable Intelligent SurfacesComments: Prepared to submit to IEEE Transactions on Wireless CommunicationsSubjects: Signal Processing (eess.SP)
Beam alignment in mmWave phased arrays and RIS-assisted links is a stochastic bandit under both short TTI budgets and Doppler-induced non-stationarity. The arm space is a Riemannian manifold: $\sphere^2$ for steering, $\torus^n$ for phase combining, $\SO(3)$ for panel orientation, or the discrete torus $(\mathbb Z_B)^M$ with up to $K\!\sim\!10^{90}$ configurations for $B$-level RIS ($B\!=\!2^b$, $b$ bits/element); the intrinsic Mat\'ern kernel of Borovitskiy et al.\ provides the base GP. We contribute two algorithmic pieces. \textbf{(C1)} A Kronecker-factorised intrinsic-product Mat\'ern kernel on $(\mathbb Z_B)^M$ evaluating in $O(M)$ table lookups, making GP-UCB tractable at $K\sim 10^{90}$ where the extrinsic alternative is infeasible. \textbf{(C2)} AdaptiveGP-v2, an online sliding-window controller that selects $W$ by per-sample marginal likelihood, with predictive-variance and drift $z$-score reset triggers and a post-reset $\beta$-boost. On a four-speed ($v\!\in\!\{0.02,0.08,0.12,0.20\}$~km/h), $20$-seed paired campaign at $T\!=\!3000$, AdaptiveGP-v2 is statistically indistinguishable from the hand-tuned fixed-window oracle at every speed (Holm--Bonferroni-corrected paired differences cross zero); the operational benefit is the absence of a deployment-time per-speed calibration step, not a mean-regret improvement. On four static 3GPP-style mmWave benchmarks, intrinsic-kernel GP-UCB reduces cumulative regret by $25$--$45\%$ vs.\ codebook UCB1/Thompson and by $10$--$33\%$ vs.\ Euclidean-ambient GP-UCB on the toroidal arm spaces; a wideband OFDM ablation on a $100$~MHz channel confirms the advantage persists under frequency-selective fading ($\sim\!32$~Mbps/UE at initial access vs.\ UCB1). A third-party-simulator sanity check on Sionna CDL is reported in Section~V.
- [18] arXiv:2605.13015 [pdf, ps, other]
-
Title: A General Bézier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease AnalysisComments: 33 pages, 6 figures; preprintSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
The geometry of the retinal vessel is a key biomarker of vascular diseases, yet clinical evidence remains primarily observational. Existing generative counterfactuals intervene only at the image-level disease label, failing to isolate explicit anatomical structure. To address this limitation, we propose the B\'ezier Tree Encoding Counterfactual Framework (BTECF). By abstracting vascular networks into interconnected cubic-B\'ezier segments, BTECF establishes a disease-agnostic representation in which structural topology is explicitly preserved and atomically perturbable. Coupling this encoding with a diffusion-based generator enables parameter-level do-interventions on explicit geometric axes (e.g., tortuosity, caliber) while preserving background fundus textures. We validate BTECF on diabetic retinopathy, together with independent cohorts for ischemic stroke and Alzheimer's disease. Isolated counterfactual interventions produce dose-responsive shifts in classifier predictions; a matched pixel-drop control attenuates this response by an order of magnitude or more, ruling out out-of-distribution generation artifacts. By enforcing causal isolation between vessel topology and pixel-level confounders, BTECF provides a unified generative paradigm for hypothesis verification across systemic diseases. To support reproducibility, the code will be publicly released upon acceptance.
- [19] arXiv:2605.13031 [pdf, ps, other]
-
Title: Relative Pose-Velocity Estimation Using Dual IMU Measurements and Relative Position SensingSubjects: Systems and Control (eess.SY); Robotics (cs.RO)
This paper addresses the problem of estimating the relative pose (position and orientation) and velocity of a vehicle with respect to a moving target, where both are equipped with Inertial Measurement Units (IMUs), assuming the availability of relative position or bearing measurements. The body-target relative dynamics are formulated on $\mathbf{SE}_2(3)$ and recast into a linear time-varying (LTV) model in the ambient space $\mathbb{R}^{15}$, on which a deterministic Riccati observer is designed. We analyze the uniform observability (UO) conditions required to guarantee global exponential convergence of the estimation error in the ambient space for both measurement cases. In the case of relative position measurements, UO requires only a persistence-of-excitation condition on the target acceleration, whereas for bearing measurements, additional conditions are required. Building on this, a nonlinear complementary filter on $\mathbf{SO}(3)$ is designed to provide a smooth estimate of the orientation component of the state with almost global asymptotic stability. Finally, simulation results are provided to validate the proposed solution.
- [20] arXiv:2605.13061 [pdf, ps, other]
-
Title: Revisiting Voltage and Synchronization Stability Analysis in Converter-Integrated Weak Grids: Insights from Non-Minimum-Phase ZerosAuthors: Fuyilong Ma, Lidong Zhang, Wangqianyun Tang, Waisheng Zheng, Huanhai Xin, Linbin Huang, Lennart HarneforsComments: 10 pagesSubjects: Systems and Control (eess.SY)
The increasing penetration of converter-interfaced generators (CIGs) intensifies concerns over small-signal voltage and synchronization stability. While existing theories treat these two stability issues distinctly, practical wisdom in contrast employs a unified and static metric, short-circuit ratio (SCR), to assess both in weak grids. This paper aims to bridge this theory-practice gap by introducing the insight of non-minimum phase (NMP) zeros. First, we demonstrate that the two stability issues in weak grids originate from NMP zeros in the grid Jacobian transfer matrix: a zero at the origin corresponds to voltage instability, while low-frequency zeros impose fundamental constraints on synchronization dynamics. The traditional SCR is proven to be a special case of our proposed novel stability metric, NMP-zero (NMP-Z) factor, evaluated at the rated operating point. This establishes the theoretical foundation for the empirical success of SCR. Building on this insight, we then develop a unified stability assessment method for multi-converter systems. The method retains the simplicity of SCR, requiring only the NMP-Z factor together with individual CIG dynamic models and enabling stability margin assessment under various operating points. Our work provides a simple yet theoretically rigorous framework for stability analysis in CIG-integrated weak grids, with all theoretical findings and the proposed method validated through detailed time-domain simulations.
- [21] arXiv:2605.13098 [pdf, ps, other]
-
Title: Impact of Terrestrial Blockage on the Coverage of Integrated Satellite-Terrestrial NetworksComments: 7 pages, 4 figures, to be published in proc. IEEE 103rd Vehicular Technology Conference(VTC2026-Spring)Subjects: Signal Processing (eess.SP)
The integration of non-terrestrial networks (NTNs) with terrestrial networks (TNs) is an important step toward ubiquitous connectivity in sixth-generation (6G). Despite growing interest, the geometric impact of urban blockages on an integrated satellite-terrestrial network (ISTN) has not been rigorously quantified. In this paper, we develop a stochastic geometry-based analytical framework that incorporates a Boolean blockage model to characterize the downlink coverage probability of the ISTN and to provide insights for blockage-aware system design. Our analysis reveals that blockages affect satellite links in two competing ways: while they attenuate desired signals, they can also act as spatial shields that suppress aggregate interference. Leveraging this observation, we analytically show that satellite-terrestrial integration can enhance coverage probability across diverse environments ranging from open areas to dense urban deployments, offering a resilient and mathematically tractable approach to maintaining connectivity under heterogeneous blockage conditions.
- [22] arXiv:2605.13120 [pdf, ps, other]
-
Title: D-Optimized Sampling Design for System IdentificationComments: 6 pages, 2 figures. Accepted for IFAC 2026. Final author versionSubjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
Traditional system identification with multisine inputs relies on uniform sampling and periodic excitation to preserve Fourier orthogonality and avoid spectral leakage, limiting its use in scenarios with irregular sampling or nonperiodic inputs. This work investigates continuous-time system identification under nonperiodic multisine excitation and nonuniform sampling. We develop a nonparametric frequency response function estimator suited to such conditions and design irregular sampling schemes that enhance the informativeness of measurements and reduce spectral leakage. The proposed sampling scheme improve the statistical accuracy of system identification in settings where periodic excitation is impractical.
- [23] arXiv:2605.13134 [pdf, ps, other]
-
Title: Security-Aware Planning and Control of Multi-Agent Systems with LTL TasksComments: 8 pages, 2 figures; This paper has been accepted at the IFAC World Congress 2026Subjects: Systems and Control (eess.SY)
This paper presents a secure-by-construction planning and control framework for multi-agent systems subject to linear temporal logic (LTL) specifications. The framework protects sensitive information from a passive intruder with partial observations of the agents' motion. Security in multi-agent coordination is captured by two notions that prevent the intruder from inferring whether a secret task has been executed and from identifying the agent responsible for its execution. The proposed framework incorporates the security constraints directly into the LTL synthesis procedure by constructing a secure finite transition system that removes all paths violating these constraints. Standard LTL synthesis is then applied to this secure abstraction to generate discrete plans, which are then refined into dynamically feasible continuous trajectories. This synthesis procedure provides formal guarantees that the resulting behavior of the multi-agent system satisfies both the global LTL specification and the security constraints. The effectiveness of the proposed framework is demonstrated through a two-drone case study.
- [24] arXiv:2605.13135 [pdf, ps, other]
-
Title: Subspace Pruning via Principal Vectors for Accurate Koopman-Based ApproximationsSubjects: Systems and Control (eess.SY)
The accuracy of Koopman operator approximations over
finite-dimensional spaces relies critically on their invariance
properties. These can be rigorously quantified via the principal
angles between a candidate subspace and its image under the Koopman
operator. This paper proposes a unified algebraic framework for
subspace pruning designed to systematically refine the invariance
error. We establish the geometric equivalence between
consistency-based methods and principal-vector pruning, and build on
this insight to introduce a hybrid strategy that balances between
multiple and single principal vector pruning for improved numerical
stability and scalability. We derive error bounds for the retention
of approximate and external eigenfunctions, demonstrating that the
multi-vector approach mitigates the numerical drift inherent to
sequential pruning. To ensure scalability, we develop an efficient
numerical update scheme based on rank-one modifications that reduces
the computational complexity of tracking principal angles by an
order of magnitude. Finally, we exploit the subspace obtained from
the pruning algorithms to build a lifted linear model for state
prediction that accounts for the trade-offs between improving
invariance and minimizing state reconstruction error. Simulations
demonstrate the effectiveness of our approach. - [25] arXiv:2605.13220 [pdf, ps, other]
-
Title: Real-time Gaussian Process based Approximate Model Predictive Trajectory Tracking Control for Autonomous VehiclesComments: accepted for the 24th European Control Conference (ECC) 2026Subjects: Systems and Control (eess.SY)
Applying model predictive control on embedded systems remains challenging due to the high computational cost of solving optimal control problems. To address this limitation, computationally efficient Gaussian process approximations of the implicit model predictive control law can be employed. However, for trajectory-tracking applications, the large amount of training data required for successful generalization across distinct reference trajectories poses a significant challenge. To improve data efficiency, we propose to transform the model into curvilinear coordinates around the reference trajectory. Secondly, we use a nominal feedforward component, allowing the Gaussian process to learn only the residual control input, making the approximation of a trajectory-tracking controller feasible. To underline the applicability of the approach, we deploy the controller on a Raspberry Pi in a small-scale vehicle and validate it experimentally. Compared to a model predictive control implementation using real-time iterations, the Gaussian process based approximation computes control inputs about five times faster while achieving similar closed-loop tracking performance.
- [26] arXiv:2605.13243 [pdf, ps, other]
-
Title: Spatial Competition for Low-Complexity Learned Image CompressionComments: Accepted at ICIP 2026Subjects: Image and Video Processing (eess.IV)
Autoencoder-based image codecs achieve state-of-the-art compression performance but often incur high computational complexity, particularly at decoding time. This work introduces a low-complexity learned image compression framework based on spatial competition between multiple specialized neural codecs. For each image region, the encoder selects the codec that best matches the local content according to a rate-distortion cost. A mode map is transmitted as side information to indicate the per-region codec selection. At decoding time, this mode map-based selection guides reconstruction while preserving the complexity of a single codec. This design enables per-image adaptation with low decoding complexity and fast encoding. On the CLIC 2020 dataset, our method achieves up to -14.5% rate reduction compared to a single codec and reaches HEVC-level performance with a decoding complexity of 1433 MACs per pixel.
- [27] arXiv:2605.13248 [pdf, ps, other]
-
Title: Compact Latent Manifold Translation: A Parameter-Efficient Foundation Model for Cross-Modal and Cross-Frequency Physiological Signal SynthesisAuthors: Bo Cui, Xiaowen Song, Yaowen Zhang, Shunzhe Zhang, B.J.F. van Beijnum, Monique Tabak, Ying WangSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)
The analysis of physiological time series, such as electrocardiograms (ECG) and photoplethysmograms (PPG), is persistently hindered by modality and frequency gaps stemming from heterogeneous recording devices. Existing foundation models typically rely on continuous latent spaces, which frequently suffer from severe modality entanglement, lack high-fidelity cross-frequency generative capacity, and impose high computational costs that prohibit edge-device deployment. In this paper, we propose Compact Latent Manifold Translation (CLMT), a highly parameter-efficient (0.09B) unified framework that bridges these gaps through a novel two-stage discrete translation paradigm. First, we introduce a Universal Tokenizer utilizing Hierarchical Residual Vector Quantization (RVQ) to decouple heterogeneous signals into isolated, well-structured discrete latent manifolds, effectively preventing inter-modality interference. Second, a Context-Prompted Latent Translator maps these discrete tokens across modalities by integrating static physiological priors, reframing complex signal synthesis as a pure latent sequence translation task. Extensive evaluations demonstrate that our 0.09B model significantly outperforms massive baselines. In cross-modal PPG-to-ECG synthesis, it resolves temporal phase drift and dramatically improves the clinical R-peak detection F1-score from 0.37 (baseline) to 0.83. Furthermore, in extreme cross-frequency super-resolution (25Hz to 100Hz), it successfully recovers high-frequency diagnostic landmarks, achieving an unprecedented Pearson correlation of 0.9956. By learning a universal discrete language for biological signals with a fraction of the computational footprint, our approach sets a new trajectory for edge-deployable, multi-modal medical foundation models.
- [28] arXiv:2605.13269 [pdf, ps, other]
-
Title: Submodular Multi-Agent Policy Learning for Online Distributed Task Allocation in Open Multi-Agent SystemsSubjects: Systems and Control (eess.SY)
This paper studies multi-agent reinforcement learning with submodular team utilities for online distributed task allocation. In this setting, each agent selects one action from a local categorical policy, so feasible joint actions form a partition matroid over agent-action pairs. Classical multilinear extensions use independent Bernoulli sampling and therefore do not match the categorical policies executed by decentralized agents. To address this mismatch, we introduce the Partition Multilinear Extension (PME), a continuous relaxation whose value equals the expected team utility under factorized categorical policies. We prove that submodular difference rewards provide unbiased PME marginal-gradient information and yield a stagewise score-function policy-gradient estimator. Based on this connection, we propose SubMAPG, a centralized-training decentralized-execution policy-gradient framework with masked categorical policies and submodular difference-reward training signals. For the associated PME marginal-space projected stochastic-gradient dynamics, we prove a stagewise 1/2-approximation guarantee and sublinear dynamic regret in slowly varying environments, measured by the path length of the optimal PME marginals. To handle open systems with time-varying agents and targets, we instantiate SubMAPG with graph neural network policies. Experiments on multi-robot coverage and multi-target tracking show that SubMAPG outperforms local greedy and shared-reward baselines and is competitive with centralized myopic greedy strategies.
- [29] arXiv:2605.13286 [pdf, ps, other]
-
Title: Implementing Fluid Antennas in the Beamspace: Performance Evaluation and Codebook DesignSubjects: Signal Processing (eess.SP)
Metasurface-based fluid antenna systems (FASs) have been recently proposed as an inexpensive, scalable and practical alternative implementation for the fluid-antenna concept. This work thoroughly evaluates the performance of metasurface-based FASs in the context of multi-user communications. We extend the state-of-the-art signal model of FASs to electronically-reconfigurable designs, explicitly including the antenna response in the equivalent channel and resulting correlation structure. A general codebook design procedure, accounting for practical aspects like reflections and radiation efficiency, is presented and used to design the different antenna configurations (regarded as FAS ports). Importantly, we show that, with proper design, metasurface-based FASs can significantly outperform conceptual ones. While state-of-the-art theoretical embodiments of FAS rely on spatial flexibility for constructive/destructive interference, metasurface-based FASs exploit interference cancellation through projection onto the interference null space. Numerical results show a remarkable improvement when the system is dominated by interference (i.e., the natural FASs operational regime), regardless of spatial propagation characteristics.
- [30] arXiv:2605.13309 [pdf, ps, other]
-
Title: SimART: A Unified and Open Real-world Multimodal Simulation Platform for 6G Integrated Sensing and CommunicationSubjects: Signal Processing (eess.SP)
Research on sixth-generation (6G) integrated sensing and communication (ISAC) increasingly depends on multimodal datasets. These datasets need to jointly characterize wireless propagation, onboard sensing, and platform mobility. Existing tools cover only part of these aspects. Robotics simulators model physics and perception but not site-specific channels, while ray tracing and link level tools lack vehicle dynamics and onboard sensors. Combining them manually leads to workflows that are fragile and hard to reproduce. Rather than introducing another standalone simulator, this article presents SimART. It integrates mature robotics, ray tracing, and wireless evaluation engines into a single reproducible pipeline. The key idea is a robot operating system (ROS) backbone that both synchronizes and organizes all multimodal streams. A shared clock, a common coordinate frame, and timestamped messages keep the streams aligned in time and space, and a single rosbag recording captures the full session into one reproducible file. This design decouples the sensing front end from the wireless back end, so that any ROS-compatible simulator can be plugged in while reusing the same back end across aerial, ground, indoor, and maritime ISAC settings. On top of this backbone, SimART contributes a scene construction pipeline that converts both OpenStreetMap extracts and user-defined layouts into spatially aligned visual and electromagnetic assets, and a channel knowledge map (CKM) generator that aggregates ray tracing and system level outputs into spatial priors for ISAC algorithms. A case study on vision and position aided beam prediction demonstrates the utility of the platform. The code is publicly available at https://github.com/guchuanv-alt/SimART.
- [31] arXiv:2605.13355 [pdf, ps, other]
-
Title: Impedance-Based VSC Unit Commitment with STATCOM Support under High IBG PenetrationComments: Electric Power Systems ResearchSubjects: Systems and Control (eess.SY)
The large-scale replacement of synchronous machines with inverter-based generation (IBG) introduces critical challenges to both voltage and frequency stability. This work builds on a mixed-integer second-order cone programming (MISOCP) framework that co-optimizes unit commitment (UC) model which embeds frequency-nadir constraints through synthetic inertia (SI) dispatch and an SOC voltage stability boundary for IBG buses. The formulation extends by modeling a STATCOM as a reactive-power decision variable in the same MISOCP model. A modified IEEE 30-bus system is used to assess three scheduling strategies: (i) baseline UC with SI only, (ii) voltage-stability-constrained (VSC) UC with SI, and (iii) the joint UC with SI and reactive power support from IBGs. The impact of incorporating a 30~MVAr STATCOM at a weak grid location near the IBG buses is investigated. Simulation results show that the proposed framework enhances voltage security, maintains frequency-nadir compliance, and reduces operating cost, while STATCOM integration further improves dispatch feasibility under high IBG.
- [32] arXiv:2605.13390 [pdf, ps, other]
-
Title: Sensitivity Quantification for Distribution System State EstimationComments: Submitted for peer reviewSubjects: Systems and Control (eess.SY)
Pseudo-measurements are the dominant source of uncertainty in distribution system state estimation (DSSE), yet their distributional assumptions are treated as fixed inputs by existing uncertainty quantification methods. This paper investigates whether the uncertainty bounds assumed by weighted least squares (WLS)-based DSSE are sensitive to these distributional assumptions, and whether this sensitivity is quantifiable using the Fisher Information Matrix (FIM). We propose a diagnostic framework that compares the true Cram\'er-Rao Bound (CRB) against the WLS-assumed CRB via a per-bus, per-scenario ratio, computed directly from the converged WLS solution. Pseudo-measurement distributions are varied across five types in 22 variants matched at equal spread to isolate shape effects from variance. Experiments on the CIGRE MV network across 100 operating scenarios yield three findings. First, heavy-tailed and skewed distributions show consistently that WLS systematically overstates its uncertainty bounds. Second, the degree of miscalibration varies across buses and operating scenarios, confirming that distributional sensitivity is not uniform. Third, the CRB ratio is structurally blind to mean-shift bias, exposing a fundamental limitation of variance-based uncertainty diagnostics. Together, these results confirm the hypothesis and show that the choice of pseudo-measurement distribution directly distorts the confidence limits under WLS-based assumptions, which must be explicitly accounted for in any uncertainty-aware DSSE method.
- [33] arXiv:2605.13394 [pdf, ps, other]
-
Title: Decoupled Azimuth Elevation AoA Estimation Exploiting Kronecker Separable Steering MatricesSubjects: Signal Processing (eess.SP)
Uniform rectangular arrays (URA), structured non-uniform rectangular arrays (NURA), and parallelogram shaped (UPgA and NUPgA) arrays admit steering vectors that can be expressed as the Kronecker product of azimuth and elevation steering vectors. Accordingly, the full steering matrix can be represented as the Khatri Rao product of the corresponding azimuth and elevation steering matrices. This paper exploits this structure to develop an economical subspace decoupling framework for two dimensional angle of arrival (AoA) estimation. The proposed method first extracts the joint signal subspace from the spatial covariance matrix. Then it applies a low complexity decoupling scheme to recover the column spaces of the azimuth and elevation steering matrices. With the estimated decoupled subspaces, conventional one dimensional algorithms such as MUSIC, root MUSIC, and ESPRIT can be applied independently along each dimension, followed by pairing through a two dimensional spectral function. Monte Carlo simulations show that the proposed approach achieves higher accuracy than state of the art methods, i.e., two dimensional MUSIC, reduced-dimension MUSIC, and two-dimensional ESPRIT, for medium- and large scale arrays while requiring fewer snapshots, consequently with improved spectral efficiency.
- [34] arXiv:2605.13453 [pdf, ps, other]
-
Title: Learning a Contracting KKL-observer with Local Optimal GuaranteesComments: Accepted to the 23rd IFAC World Congress 2026Subjects: Systems and Control (eess.SY)
The Kazantzis-Kravaris-Luenberger (KKL) observer provides a general framework for nonlinear state estimation by immersing the system dynamics into a stable linear or nonlinear latent dynamics. However, the performance of KKL observers relies heavily on the specific choice of these latent dynamics, which is often heuristic. This paper proposes a methodology to learn a KKL observer that combines global stability guarantees with local optimality. We derive a condition on the latent dynamics such that the observer locally mimics the behavior of a Minimum Energy Estimator (Mortensen observer). We then employ Deep Learning to approximate the KKL transformation and the latent dynamics, using neural network architectures that structurally enforce the contraction property. The proposed strategy is validated through numerical simulations on nonlinear benchmarks, demonstrating a good performance in the presence of state and measurement noise.
- [35] arXiv:2605.13502 [pdf, ps, other]
-
Title: A Multi-Modal Intelligent U2V Channel Model for 6G Sensing-Communication IntegrationSubjects: Signal Processing (eess.SP)
This paper proposes a novel UAV-to-Vehicle (U2V) channel model for sixth-generation (6G) intelligent sensing-communication integration, based on three-dimensional (3D) scatterer prediction. To explore the mapping relationship between physical environment and electromagnetic space, a new high-fidelity mixed sensing-communication integration U2V simulation dataset under wide-lane scenarios with different vehicular traffic densities (VTDs) and UAV heights is constructed. Based on the constructed dataset, a novel 3D Scatterer Prediction and Distribution Estimation (3D-SPADE) algorithm is proposed, which leverages LiDAR point clouds to accurately predict the spatial distribution of scatterers. Furthermore, the clustering of scatterers and the subsequent classification into dynamic and static types are meticulously designed for highly dynamic U2V scenarios, while reducing computational complexity and improving modeling accuracy. As LiDAR point clouds vary over time, dynamic and static clusters evolve via 3D-SPADE, enabling precise modeling of channel non-stationarity and consistency. Simulation results demonstrate that, in the wide-lane scenario with varying VTDs and UAV heights, the proposed 3D-SPADE consistently achieves high scatterer occupancy detection performance within the voxel grid. In particular, under favorable configurations, recall reaches 93.26%, and precision reaches 95.74%, highlighting the reliability of 3D-SPADE. Key channel statistical characteristics are simulated and analyzed. These characteristics from the simulation experiments are highly consistent with ray-tracing results and exhibit better agreement than with the standardized model and inconsistent model, validating the necessity of exploring the mapping relationship and the effectiveness of the proposed model.
- [36] arXiv:2605.13516 [pdf, ps, other]
-
Title: Sensing-Assisted LoS/NLoS Identification in Dynamic UAV Positioning SystemsSubjects: Signal Processing (eess.SP)
In this paper, a sensing-assisted non-line-of-sight (NLoS) identification method for dynamic uncrewed aerial vehicle (UAV) positioning is proposed for the first time. For urban UAV-to-ground scenarios, a new multi-modal sensing-communication integrated dataset is constructed to support line-of-sight (LoS)/NLoS identification, covering two typical urban scenarios and a wide range of flight altitudes. Based on the constructed dataset, a novel dual-input feature fusion network is proposed, which addresses the challenge of heterogeneous representations between RGB images and channel impulse response (CIR) data to enable the joint extraction and fusion of sensing and communication features for LoS/NLoS identification. Simulation results show that the identification accuracy can reach up to 97.69%, while achieving an improvement of at least 3.59% compared to traditional CIR-only and RGB-only methods. Moreover, strong few-shot generalization is observed, as the proposed method stabilizes and approaches full-sample performance with fewer than 200 target samples and exceeds traditional CIR-only and RGB-only methods with fewer than 100 target samples in all cross-scenario and cross-altitude experiments. Even under Gaussian noise with a variance of 0.35 applied to RGB images, the accuracy degradation remains approximately 0.5%. By utilizing the proposed LoS/NLoS identification method, the error of trilateration positioning can be reduced by approximately 70% in a crossroad scenario, verifying the utility of the proposed method.
- [37] arXiv:2605.13524 [pdf, ps, other]
-
Title: Manifold-Aware Information Gain and Lower Bounds for Gaussian-Process Bandits on Riemannian Quotient SpacesComments: It will be submitted to IEEE Transactions on Information TheorySubjects: Signal Processing (eess.SP)
We prove a regret lower bound for Gaussian-process bandits on a smooth compact Riemannian manifold $\M$ of dimension $d$ with intrinsic Mat\'ern-$\nu$ kernel ($\nu>d/2$) that exposes how the geometry of the arm space enters the constant. For any algorithm and time horizon $T$ exceeding an explicit threshold, the worst-case expected regret over the RKHS-ball $\|f\|_{\Hil_{k_\nu}}\!\le\!B$ satisfies \begin{multline*} \E[R_T(f)]\;\ge\;c_*(d,\nu)\,B^{d/(2\nu+d)}\,\sigma_n^{2\nu/(2\nu+d)} \\ \cdot\,\vol_g(\M)^{\nu/(2\nu+d)}\,T^{(\nu+d)/(2\nu+d)}(\log T)^{\nu/(2\nu+d)}. \end{multline*} The exponent matches the Vakili--Khezeli--Picheny upper bound \cite{vakili2021information}; the $\vol_g(\M)^{\nu/(2\nu+d)}$ factor is, to our knowledge, the first explicit volume-dependent geometric constant in a manifold GP-bandit lower bound. We extend the analysis in five directions: (i)~a companion Assouad-style proof gives a different lower bound with a strictly smaller $T$-exponent $(2\nu+3d)/(4(\nu+d))$ but with a polylog factor of the form $1/(\log\log T)^{(2\nu+d)/(4(\nu+d))}$, sharpening the $(\log T)^{\nu/(2\nu+d)}$ Fano polylog of Theorem~\ref{thm:main}; (ii)~we prove a $|G|^{1/2}$ upper bound on the regret of an extrinsic-kernel GP-UCB algorithm on a quotient space $\M=\Mt/G$, plus a bracketing theorem (Theorem~\ref{thm:gauge-bracket}); the precise constant is conjectured to take the modulated form $(1+(|G|-1)h(\rinj/\kappa))^{1/2}$ (Conjecture~\ref{conj:gauge-modulated}), validated numerically on $\SO(3)$; (iii)~we write the leading constant $c_*(d,\nu)$ out fully; (iv)~we extract a curvature dependence $1+O(K\eps_T^2)$ via Bishop--Gromov; (v)~we transfer the bound to the Bayesian regret framework via the Yang--Barron / Castillo et al.\ Bayesian-Fano transfer.
- [38] arXiv:2605.13529 [pdf, ps, other]
-
Title: Decentralized Frequency-Domain Conditions for D-Stability with Application to DC MicrogridsSubjects: Systems and Control (eess.SY)
This paper proposes a decentralized method for regional pole placement, or $\mathcal{D}$-stability, in linearized networked systems. Existing LMI-based methods are hindered by confidentiality concerns regarding proprietary subsystem models and the absence of communication infrastructures. To overcome these barriers, we map the target region $\mathcal{D}$ of pole placement to an auxiliary left-half plane and introduce positive functions to handle the resulting complex-coefficient dynamics. We prove that $\mathcal{D}$-stability is guaranteed via local frequency-domain criteria without requiring shared subsystem models or inter-subsystem communication. This method is then tailored to DC microgrids, where a loop transformation is utilized to reallocate the burden of stability certification, deriving a broadcastable grid code for decentralized parameter synthesis. Numerical examples verify the efficacy of the proposed method.
- [39] arXiv:2605.13580 [pdf, ps, other]
-
Title: Joint Segment Activation and Antenna Placement for Uplink SWAN SystemsComments: 5 pagesSubjects: Signal Processing (eess.SP)
This article analyzes the achievable sum-rate of multiuser uplink segmented waveguide-enabled pinching-antenna systems (SWANs). To unveil system-design insights, an upper bound on the achievable sum-rate is derived, based on which the existence of an optimal segment activation level is theoretically established. Motivated by this result, hybrid segment selection and aggregation (HSS/A) schemes are proposed to jointly optimize segment activation and pinching-antenna (PA) placement. Correspondingly, low-complexity greedy algorithms are developed for the considered optimization problem. Numerical results validate the theoretical analysis and demonstrate that the proposed HSS/A schemes outperform conventional full-segment aggregation.
- [40] arXiv:2605.13661 [pdf, ps, other]
-
Title: Air-Sea Surface Modeling and Operating Link Range Evaluation for AUV-to-UAV Optical Wireless Communication LinksSubjects: Signal Processing (eess.SP)
Air-sea surface interactions play a critical role in underwater-to-air optical wireless communication (OWC) links, particularly in vertical autonomous underwater vehicle (AUV) to unmanned aerial vehicle (UAV) scenarios, where the stochastic nature of the sea surface introduces optical distortions that impair link reliability. This work investigates the impact of air-sea surface roughness on AUV-to-UAV OWC systems using two experimentally validated models: the classical Cox-Munk and the Elfouhaily-Chapron-Katsaros-Vandemark (ECKV). A tractable analytical representation of the ECKV model is derived and validated against measured sea-state data. Using both analytical and Monte Carlo approaches, the link ergodic capacity is evaluated with particular emphasis on operating range, pointing errors, receiver field-of-view, and solar noise level, providing practical system design insights.
- [41] arXiv:2605.13669 [pdf, ps, other]
-
Title: Bounded-Input True Proportional Navigation for Impact-Time ControlComments: Preprint; Accepted for presentation at the 15th Asian Control Conference, June 17th-21st, 2026, IndonesiaSubjects: Systems and Control (eess.SY); Robotics (cs.RO); Dynamical Systems (math.DS)
This paper proposes a nonlinear guidance strategy capable of intercepting a constant-velocity, non-maneuvering target while strictly satisfying the prescribed bounds on the control input (commanded acceleration). Unlike conventional strategies that estimate time-to-go using linearization or small-angle approximations, the proposed strategy employs true proportional-navigation guidance (TPNG) as a baseline, which utilizes an exact time-to-go formulation and is applicable over a wide range of target motions. In contrast to most existing strategies, which do not incorporate control input bounds into the guidance design, the proposed approach explicitly accounts for these limits by modeling the interceptor acceleration as a dynamic variable. Based on the sliding mode control technique, an effective guidance law that achieves time-constrained interception while accounting for bounded input is then derived. The performance of the proposed strategy is evaluated for various engagement scenarios.
- [42] arXiv:2605.13720 [pdf, ps, other]
-
Title: An Underwater Dehazing Network with Implicit Transmission EstimationComments: 5 pages, 2 figuresSubjects: Image and Video Processing (eess.IV)
Underwater images suffer from wavelength-dependent light absorption and scattering, which reduces visual quality. This phenomenon could limit the operational reliability of autonomous underwater vehicles, marine surveys, and offshore inspection systems. Purely classical methods often achieve suboptimal performance in real-world datasets, while purely data-driven methods lack physical interpretability. In this letter, we propose UDehaze-iT, a deep network for underwater image enhancement that estimates scene depth implicitly and derives per-channel transmission through the Beer-Lambert law with learnable attenuation coefficients. We estimate atmospheric light as a semi-classical per-channel scalar, and a zero-initialized residual refiner corrects remaining artefacts after dehazing. To effectively train our method, we apply a composite loss function consisting of five key terms: a L1 loss, a multi-scale patchwise DCT loss, a forward model reconstruction loss, and two regularization terms. With ~0.9M parameters, UDehaze-iT achieves competitive performance on UIEB and UFO-120 datasets.
- [43] arXiv:2605.13836 [pdf, ps, other]
-
Title: Reachable-Set Decomposition for Real-Time Aggregation of Multi-Zone HVAC FleetsComments: 10 pages, 9 figuresSubjects: Systems and Control (eess.SY)
Aggregating building heating, ventilation, and air-conditioning (HVAC) fleets provides substantial real-time flexibility to power system operations. However, real-time aggregation of multi-zone HVAC fleets faces two key challenges: (i) strong coupling across zones and time makes flexibility characterization high-dimensional and computationally demanding, and (ii) the sequential revelation of temperature states and exogenous conditions requires that decisions made at each period preserve feasibility over the remaining horizon using only currently realized information. To address these challenges, this paper proposes a reachable-set decomposition framework comprising an offline decomposition stage and a real-time policy. In the offline stage, backward reachable sets are formulated to encode remaining-horizon feasibility into per-period state constraints, so that any state within the current reachable set is guaranteed to sustain feasible operation over the entire remaining horizon. A tailored inner approximation is then developed for tractable calculation in multi-zone-coupled HVAC settings. In the real-time stage, aggregate flexibility is computed efficiently via building-level parallel linear programs followed by closed-form Minkowski summation of power intervals, and any regulation signal within the reported flexibility interval admits a recursively feasible disaggregation. Case studies demonstrate the effectiveness of the proposed framework in aggregate flexibility characterization, disaggregation feasibility, and scalable computation.
Cross-lists for Thu, 14 May 26
- [44] arXiv:2605.12506 (cross-list from cs.CV) [pdf, ps, other]
-
Title: Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture DetectionComments: 7 pages, 11 figures, Accepted to DAC 2026Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO); Image and Video Processing (eess.IV)
Realizing on-device ML-based gesture detection under tight real-time performance, energy and memory constraints is challenging, especially when considering mobile devices with varying battery-power levels. Existing EdgeAI deployments typically rely on a single fixed detector, limiting optimization opportunities. We present Scale-Gest, a novel run-time adaptive gesture detection framework that expands the detector space into a dense family of tiny-YOLO architectures. We introduce multiple novel device-calibrated ACE (Accuracy-Complexity-Energy) profiles by analyzing different model-resolution-stride operating points. A lightweight run-time controller selects an appropriate ACE mode under user-defined and battery constraints, while a motion-aware hand-gesture-tracking ROI gate crops the input for reduced complexity detection. To evaluate performance of our system in real-world car driving scenarios, we introduce a temporally-annotated Driver Simulated Gesture (DSG-18) dataset. Scale-Gest maintains event-level F1 while significantly reducing energy and latency compared to single-detector approaches. On a battery-powered laptop running gesture streams, our ACE controller reduces per-frame energy by 4x (from 6.9 mJ to 1.6 mJ) while maintaining high gesture-detection performance (event-level F1 = 0.8-0.9) and low mean latency (6 ms).
- [45] arXiv:2605.12552 (cross-list from cs.NI) [pdf, ps, other]
-
Title: DQN-Driven Adaptive Neighbor Discovery for Directional Aerial NetworksComments: Accepted at IEEE ICC 2025. This is the author-accepted manuscriptSubjects: Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
Directional antenna systems are gaining substantial traction for aerial networks due to their higher gain, extended transmission range, and enhanced security. However, the requirement of beam alignment makes the task of finding and reaching neighbors challenging, particularly in a mobile setting. For wireless networks, privacy concerns play an equally critical role. However, the problem of ensuring network-wide connectivity while maintaining limited exposure when probing around is still unexplored. We address this trade-off by proposing an adaptive transceiver selection protocol based on the Deep Q-Network (DQN) framework. Each node acts as an independent DQN agent and interacts with the environment to learn how to balance the trade-off. Since the directional nodes operate only based on local observations, we adopt a weighted mechanism that guides them in prioritizing either high reachability or privacy by adaptively tuning the probing patterns. Results show that DQN framework surpasses the Random and Q-Learning baselines. Weights favoring discovery provide higher probing efficiency and reachability, while weights prioritizing privacy ensure limited exposure at the cost of low reachability, eventually attaining higher objective value.
- [46] arXiv:2605.12771 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Adaptive Smooth Tchebycheff Attention for Multi-Objective Policy OptimizationComments: To appear in the Proceedings of Robotics: Science and Systems (RSS) 2026Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
Multi-objective reinforcement learning in robotic domains requires balancing complex, non-convex trade-offs between conflicting objectives. While linear scalarization methods provide stability, they are theoretically incapable of recovering solutions within non-convex regions of the Pareto front. Conversely, static non-linear scalarizations (e.g., Tchebycheff) can theoretically access these regions but often suffer from severe gradient variance and optimization instability in deep RL. In this work, we propose an Adaptive Smooth Tchebycheff framework that resolves this tension by dynamically modulating the curvature of the optimization landscape. We introduce a novel conflict-driven controller that regulates the optimization smoothness based on real-time gradient interference. This allows the agent to anneal toward precise, non-convex scalarization when objectives align, while elastically reverting to stable, smooth approximations when destructive gradient conflicts emerge. We validate our approach on a challenging robotic stealth visual search task -- a proxy for monitoring of protected/fragile ecosystems -- where an agent must balance search, exposure/interference minimization and exploration speed. Extensive ablations confirm that our conflict-aware adaptation enables the robust discovery of Pareto-optimal policies in non-convex regions inaccessible to linear baselines and unstable for static non-linear methods.
Website: https://alejandromllo.github.io/research/pasta/ - [47] arXiv:2605.12785 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Identifying the nonlinear string dynamics with port-Hamiltonian neural networksSubjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS)
Hybrid machine learning combines physical knowledge with data-driven models to enhance interpretability and performance. In this context, Port-Hamiltonian Systems (PHS), which generalize Hamiltonian mechanics to describe open, non-autonomous dynamical systems, have been successfully integrated with neural networks under the name Port-Hamiltonian Neural Networks (PHNNs). While the ability of PHNNs to identify Hamiltonian ordinary differential equation (ODE) systems has already been demonstrated, their application to learning Hamiltonian partial differential equation (PDE) systems remains largely unexplored. This limitation restricts their use in musical acoustics, where instruments are typically modeled as distributed parameter systems governed by PDEs. In this work, we demonstrate how to learn the nonlinear string dynamics from data in a physically-consistent framework through a PHNN extension to PDEs. By constructing structured neural network architectures based on PHS, we can recover both the Hamiltonian governing the string and the dissipation affecting it. This approach outperforms baseline, non-physics-informed methods in terms of both accuracy and interpretability. Numerical experiments using synthetic data demonstrate the ability of the proposed PHNN model to identify and emulate the nonlinear dynamics of the system.
- [48] arXiv:2605.12794 (cross-list from cs.GT) [pdf, ps, other]
-
Title: Dynamic Transaction Scheduling and Pricing in the Ethereum MempoolSubjects: Computer Science and Game Theory (cs.GT); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI); Systems and Control (eess.SY)
The Ethereum blockchain utilizes the EIP-1559 algorithm to manage transaction inclusion and block assembly. However, EIP-1559 and much of the existing literature study this problem from a static perspective, focusing on price evolution without modelling transaction dynamics within the mempool. Motivated by this limitation, we study a dynamic transaction scheduling problem in which transactions with heterogeneous sizes and per-unit values arrive over time and remain in the mempool until scheduled. To capture the stochastic mempool evolution, we formulate the problem as a Markov Decision Process (MDP) whose state represents the mempool configuration and whose actions correspond to block prices. We first provide a primal-dual interpretation of the static EIP-1559 mechanism, showing that block prices arise naturally as dual variables of a social-welfare maximization problem. Building on this perspective, we extend the framework to the dynamic setting and formulate an objective that maximizes long-run discounted reward while incorporating holding costs and overshoot penalties. We then employ a Natural Policy Gradient (NPG) algorithm to compute the optimal policy. Our results show that dynamic pricing stabilizes the mempool while maximizing long-run discounted reward. In particular, as the overshoot penalty increases, the average scheduled transaction volume converges to the target block capacity, and the resulting NPG updates closely resemble the EIP-1559 price update rule. Finally, we study two special cases of the MDP formulation: homogeneous transactions and uniform arrivals. In the homogeneous setting, where the protocol directly controls scheduled volume, we show that the optimal policy has a threshold structure. We then propose a bang-bang pricing mechanism for uniform arrivals and derive a lower bound on the block capacity needed to ensure system stability.
- [49] arXiv:2605.12829 (cross-list from physics.soc-ph) [pdf, ps, other]
-
Title: Optimal excitation and measurement patterns for networks with tree topologyComments: Accepted for presentation at the 2026 IFAC World Congress, Busan, South KoreaSubjects: Physics and Society (physics.soc-ph); Systems and Control (eess.SY)
In this work we evaluate the excitation and measurement patterns (EMP) for networks with tree topology.
We investigate guidelines for the selection of the minimal EMPs, i.e. those with the least number of excited and measured nodes combined, for which
the accuracy obtained, in terms of the trace of the asymptotic covariance matrix, is optimal.
We introduce the concept of partial information matrix as a means to systematically obtain the information matrix for any dynamic network.
For a specific tree class, called cross, we show that the accuracy of a particular module depends on the magnitude of the parameters to be estimated.
Furthermore, when all factors are equal, it is best to excite. %we show that for small magnitudes of this parameter, it is best to excite.
We extend a topological condition for branches under which the accuracy of a particular module of the network is independent of the other parameters from the tree.
We provide a numerical analysis showing that our guidelines could be used as a selection tool for minimal EMPs for tree networks. - [50] arXiv:2605.12974 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Distributionally Robust Safety Under Arbitrary Uncertainties: A Safety Filtering ApproachComments: 10 pages, 4 figures, submitted to IEEE Robotics and Automation Letters (RA-L)Subjects: Robotics (cs.RO); Systems and Control (eess.SY)
In this work, we study how to ensure probabilistic safety for nonlinear systems under distributional ambiguity. Our approach builds on a backup-based safety filtering framework that switches between a high-performance nominal policy and a certified backup policy to ensure safety. To handle arbitrary uncertainties from ambiguous distributions, i.e., where the distribution is not of specific structure and the true distribution is unknown, we adopt a distributionally robust (DR) formulation using Wasserstein ambiguity sets. Rather than solving a high-dimensional DR trajectory optimization problem online, we exploit the structure of backup-based safety filtering to reduce safety certification to a one-dimensional search over the switching time between nominal and backup policies. We then develop a sampling-based certification procedure with finite-sample guarantees, where empirical failure probabilities are compared against a Wasserstein-inflated threshold. We validate our method through simulations across three systems, from a Dubins vehicle to a high-speed racing car and a fighter jet, demonstrating the broad applicability and computational efficiency.
- [51] arXiv:2605.13010 (cross-list from cs.CV) [pdf, ps, other]
-
Title: Amortized Guidance for Image Inpainting with Pretrained Diffusion ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Optimization and Control (math.OC)
We study image inpainting with generative diffusion models. Existing methods typically either train dedicated task-specific models, or adapt a pretrained diffusion model separately for each masked image at deployment. We introduce a middle-ground model, termed Amortized Inpainting with Diffusion (AID), which keeps a pretrained diffusion backbone fixed, trains a small reusable guidance module offline, and then reuses it across masked images without per-instance optimization. We formulate it as a deterministic guidance problem with a supervised terminal objective. To make this problem learnable in high dimensions, we derive an auxiliary Gaussian formulation and prove that solving this randomized problem recovers the optimal deterministic guidance field. This bridge yields a principled continuous-time actor--critic algorithm for learning the guidance module in a fully data-driven manner. Empirically, on AFHQv2 and FFHQ under the pixel EDM pipeline and on ImageNet under the latent EDM2 pipeline, AID consistently improves the quality--speed trade-off over strong fixed-backbone and amortized inpainting baselines across multiple mask types, while adding less than one percent trainable overhead.
- [52] arXiv:2605.13028 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Local Conformal Calibration of Dynamics Uncertainty from Semantic ImagesComments: 26 pages, 8 figures. Accepted to the 17th World Symposium on the Algorithmic Foundations of Robotics (WAFR) 2026Subjects: Robotics (cs.RO); Systems and Control (eess.SY)
We introduce Observation-aware Conformal Uncertainty Local-Calibration (OCULAR), a conformal prediction-based algorithm that uses perception information to provide uncertainty quantification guarantees for unseen test-time environments. While previous conformal approaches lack the ability to discriminate between state-action space regions leading to higher or lower model mismatch, and require environment-specific data, our method uses data collected from visually similar environments to provably calibrate a given linear Gaussian dynamics model of arbitrary fidelity. The prediction regions generated from OCULAR are guaranteed to contain the future system states with, at least, a user-set likelihood, despite both aleatoric and epistemic uncertainty -- i.e., uncertainty arising from both stochastic disturbances and lack of data. Our guarantees are non-asymptotic and distribution-free, not requiring strong assumptions about the unknown real system dynamics. Our calibration procedure enables distinguishing between observation-velocity-action inputs leading to higher and lower next-state-uncertainty, which is helpful for probabilistically-safe planning. We numerically validate our algorithm on a double-integrator system subject to random perturbations and significant model mismatch, using both a simplified sensor and a more realistic simulated camera. Our approach appropriately quantifies uncertainty both when in-distribution and out-of-distribution, being comparatively volume-efficient to baselines requiring environment-specific data.
- [53] arXiv:2605.13103 (cross-list from math.OC) [pdf, ps, other]
-
Title: Guaranteed cost structured control in infinite-horizon linear-quadratic cooperative differential gamesSubjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
In this paper, we consider infinite-horizon linear-quadratic cooperative differential games with output feedback information structure. We first demonstrate that, under output feedback information structure, computing Pareto optimal controls can be difficult even for simple low-dimensional differential games. To address this issue, this paper introduces the concept of feedback guaranteed cost structured control (GCSC). The feedback GCSC concept is inspired from suboptimal control. At a feedback GCSC, the total weighted team cost remains below a prescribed threshold while satisfying the structural constraints. We derive fundamental properties of the feedback GCSC and the admissible weight set, including their monotonicity properties. In particular, we show that if Pareto optimal controls exist, they belong to the class of feedback GCSCs. We also quantify the suboptimalty of Pareto optimal controls (if they exist) and the proposed GCSC with respect to output feedback optimal control. Furthermore, we provide the conditions for verification and the synthesis of a feedback GCSC. Finally, we illustrate the effectiveness of the proposed approach through numerical examples, including a case study on tracking synchronization in a microgrid.
- [54] arXiv:2605.13133 (cross-list from cs.LG) [pdf, ps, other]
-
Title: KAST-BAR: Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Modeling for Universal Neural InterpretationSubjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
While EEG foundation models have shown significant potential in universal neural decoding across tasks, their advancement remains constrained by the inadequacy modeling of complex spatiotemporal topology, as well as the inherent modality gap between low-level physiological signals and high-level textual semantics. To address these challenges, we propose a Knowledge-Anchored Semantically-Dynamic Topology Brain Autoregressive Model (KAST-BAR), which dynamically aligns physiological representations derived from multi-level brain topology with an expert-level semantic space. Specifically, we design a Dual-Stream Hierarchical Attention (DSHA) encoder that accurately captures the brain's intrinsic non-Euclidean topology by modeling local temporal dynamics with global spatial contexts. On this basis, a Knowledge-Anchored Semantic Profiler (KASP) is proposed to synthesize physically-grounded and instance-level textual profiles, which subsequently drive a Semantic Text-Aware Refiner (STAR) to dynamically reconstruct EEG representations using Latent Expert Queries. By conducting large-scale pre-training on 21 diverse datasets to build a foundation model, KAST-BAR effectively integrates expert-level medical knowledge into EEG signal representations, consistently achieving superior performance across six downstream tasks. Our code is available at https://github.com/KAST-BAR/KAST-BAR
- [55] arXiv:2605.13302 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Safe Bayesian Optimization for Uncertain Correlations Matrices in Linear Models of Co-RegionalizationComments: Accepted at IFAC WC26Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
This paper extends safety guarantees for multi-task Bayesian optimization with uncertain correlation matrices from intrinsic co-reginalization models to linear models of co-reginalization. The latter allows for more flexible modeling of the inter-task correlations by composing multiple features. We derive uniform error bounds for vector-valued functions sampled from a Gaussian process with a linear model of co-reginalization kernel. Furthermore, we show the potential improvement of performance using linear models of co-reginalization in a numerical comparison on a safe multi-task Bayesian optimization benchmark.
- [56] arXiv:2605.13315 (cross-list from cs.ET) [pdf, ps, other]
-
Title: Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven ValidationAuthors: Johnson Zhou, Daniel Tanneberg, Forough Habibollahi, Alon Loeffler, Kiaran Lawson, Valentina Baccetti, Kwaku Dad Abu-Bonsrah, Candice Desouza, Finn Doensen, Bradley Watmuff, Daria Kornienko, Azin Azadi, Justin Leigh Bourke, Bernhard Sendhoff, Brett J. KaganSubjects: Emerging Technologies (cs.ET); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Systems and Control (eess.SY); Neurons and Cognition (q-bio.NC)
Biological neural networks (BNNs) have been established as a powerful and adaptive substrate that offer the potential for incredibly energy and data efficient information processing with distinct learning mechanisms. Yet a core challenge to utilizing BNN for neurocomputation is determining the optimal encoding and decoding mechanisms between the traditional silicon computing interface and the living biology. Here, we propose an Embodied Neurocomputation framework as a systems-level approach to this multi-variable optimization encoding/decoding problem. We operationalize this approach through the first large-scale parameter optimization of encoding configurations for a BNN agent performing closed-loop navigation along an odor-style gradient in a simulated grid-world. Despite the relative simplicity of the task, the biological interactions gave rise to a massive multi-combinatorial search space for optimal parameters. By considering how the components of the system are interconnected and parameterized, we evaluated approximately 1,300 parameter combinations, over 4,000 hours of real-time agent-environment interactions, to identify 12 configurations that consistently demonstrated learning across multiple episodes. These configurations achieved significantly higher task performances than optimized silicon-based DQN agents under the same interaction budget. These findings represent an initial step toward robust and scalable goal-oriented learning using BNNs. Our framework establishes a foundation for applying task-driven neurocomputing and supports the development of field-wide benchmarks. In the long term, this work supports the development of hybrid bio-silicon architectures capable of efficient, adaptive and real-time computation, including the potential for robotic control applications.
- [57] arXiv:2605.13587 (cross-list from stat.ML) [pdf, ps, other]
-
Title: Reframing preprocessing selection as model-internal calibration in near-infrared spectroscopy: A large-scale benchmark of operator-adaptive PLS and Ridge modelsComments: 23 pages, 4 figures; includes supplementary material. Code: this https URLSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
Near-infrared spectroscopy (NIRS) is rapid and non-destructive, but reliable calibration still depends heavily on spectral preprocessing. In routine practice, preprocessing is often selected by large external pipeline searches that are costly, unstable on small calibration sets, and difficult to audit. We introduce operator-adaptive calibration, a framework that moves linear preprocessing selection inside the calibration model. Candidate treatments are encoded as linear spectral operators, while nonlinear or sample-adaptive corrections such as SNV, MSC, and ASLS are handled as fold-local branches to prevent leakage.
We instantiate the framework for PLS and Ridge regression. For PLS, covariance identities enable fast NIPALS and SIMPLS variants while preserving original-wavelength coefficients. For Ridge, operator-adaptive kernels yield a dual formulation with recoverable original-space coefficients. The approach was evaluated on more than 50 heterogeneous NIRS datasets against conventional PLS, Ridge, CatBoost, and CNN baselines under documented search budgets.
Compact operator-adaptive PLS with ASLS branch preprocessing achieved a median RMSEP/PLS ratio of 0.960 with 42 wins on 57 datasets, while a deployable AOM-Ridge selector improved over tuned Ridge by a median 2.22% with 35 wins on 52 datasets. The proposed models reduce dependence on large preprocessing-HPO campaigns, produce traceable operator choices, retain interpretable coefficients, and fit in seconds for compact AOM-PLS. Operator-adaptive calibration therefore offers a practical route to faster, more robust, and more auditable NIRS method development. - [58] arXiv:2605.13713 (cross-list from cs.CV) [pdf, ps, other]
-
Title: Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based OptimizationAuthors: Isabella Poles, Simon Arberet, Riqiang Gao, Martin Kraus, Marco D. Santambrogio, Florin C. Ghesu, Ali Kamen, Dorin ComaniciuComments: Early Accept at MICCAI 2026Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Volumetric Modulated Arc Therapy (VMAT) is a cornerstone of modern radiation therapy, enabling highly conformal tumor irradiation and healthy-tissue sparing. Yet, its planning solves inverse and nested optimization for multi-leaf collimators, monitor units and dose parameters, while enforcing their consistency to ensure mechanical deliverability. Nevertheless, this process often requires repeated re-optimization when treatment configurations change, resulting in substantial planning time per patient. To address these problems, we present a diffusion-driven Learning-to-Optimize (L2O) method for end-to-end VMAT planning. A distribution-matching distilled diffusion model learns a clinically feasible manifold of fluence maps, enabling their one-shot generation. On top of this, an LSTM-based L2O module learns gradient update dynamics to swiftly refine fluence maps toward prescribed dose objectives during inference. Experimental results on clinical and public prostate cancer cohorts demonstrate improved planning efficiency, flexibility, and machine deliverability over currently available end-to-end VMAT planners.
- [59] arXiv:2605.13748 (cross-list from cs.RO) [pdf, ps, other]
-
Title: TinySDP: Real Time Semidefinite Optimization for Certifiable and Agile Edge RoboticsAuthors: Ishaan Mahajan, Jon Arrizabalaga, Andrea Grillo, Fausto Vega, James Anderson, Zachary Manchester, Brian PlancherComments: Accepted to Robotics: Science and Systems (RSS) 2026. 11 pages, 5 figures, 2 tables. Project website: this https URLSubjects: Robotics (cs.RO); Systems and Control (eess.SY)
Semidefinite programming (SDP) provides a principled framework for convex relaxations of nonconvex geometric constraints in motion planning, yet existing solvers are too computationally expensive for real-time control, particularly on resource-constrained embedded systems. To address this gap, we introduce TinySDP, the first semidefinite programming solver designed for embedded systems, enabling real-time model-predictive control (MPC) on microcontrollers for problems with nonconvex obstacle constraints. Our approach integrates positive-semidefinite cone projections into a cached-Riccati-based ADMM solver, leveraging computational structure for embedded tractability. We pair this solver with an a posteriori rank-1 certificate that converts relaxed solutions into explicit geometric guarantees at each timestep. On challenging benchmarks, e.g., cul-de-sac and dynamic obstacle avoidance scenarios that induce failures in local methods, TinySDP achieves collision-free navigation with up to 73% shorter paths than state-of-the-art baselines. We validate our approach on a Crazyflie quadrotor, demonstrating that semidefinite constraints can be enforced at real-time rates for agile embedded robotics.
- [60] arXiv:2605.13751 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous VehiclesAuthors: Yizhuo Xiao, Haotian Yan, Ying Wang, Zhongpan Zhu, Yuxin Zhang, Xintao Yan, Mustafa Suphi Erden, Cheng WangSubjects: Robotics (cs.RO); Software Engineering (cs.SE); Systems and Control (eess.SY)
Establishing trustworthy safety assurance for autonomous driving systems (ADSs) requires evidence that failures arise from avoidable system deficiencies rather than unavoidable traffic conflicts. Current adversarial simulation methods can efficiently expose collisions, but generally lack mechanisms to distinguish these fundamentally different failure modes. Here we present CARS (Context-Aware, Responsibility-attributed Scenario generation), a framework that integrates responsibility attribution directly into adversarial scenario generation. CARS combines context-aware adversary selection with a generative adversarial policy optimized in closed-loop simulation to construct collision scenarios that are both physically feasible and diagnostically attributable. Across benchmark datasets spanning heterogeneous national traffic environments, CARS consistently discovers feasible collision scenarios with high attribution rates under multiple regulation-prescribed careful and competent driver models. By coupling adversarial generation with normative responsibility assessment, CARS moves simulation testing beyond collision discovery toward the construction of interpretable, regulation-aligned safety evidence for scalable ADS validation.
- [61] arXiv:2605.13822 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Loiter UAV Reinsertion Guidance for Fixed-wing UAV CorridorsJournal-ref: AIAA SCITECH 2026Subjects: Robotics (cs.RO); Systems and Control (eess.SY)
This paper considers fixed-wing unmanned aerial vehicle (UAV) corridors comprising a main lane, a circular loiter lane for managing traffic congestion, and transit lanes connecting the two. In particular, we address the problem of conflict-free reinsertion of UAVs from the loiter lane back into the main lane. The loiter lane contains a fixed number of equidistant virtual slots that UAVs can occupy. Reinsertion of loiter UAVs into the main lane becomes essential either due to reduced traffic in the main lane or due to a loiter UAV needing to reach its destination urgently. Given the total number of loiter slots, UAV speed limits, and the minimum safety distance, a guidance algorithm is developed to compute the required speed of a loiter UAV in the transit lane to ensure safe reinsertion. The proposed guidance and automation strategies are validated through numerical simulations.
Replacements for Thu, 14 May 26
- [62] arXiv:2307.14072 (replaced) [pdf, ps, other]
-
Title: Negative Spin $Δ_T$ noise Induced by Spin-Flip Scattering and Andreev ReflectionComments: 14 pages, 6 figures, 2 tables, accepted for publication in Journal of Physics: Condensed MatterJournal-ref: Journal of Physics: Condensed Matter (2026)Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Signal Processing (eess.SP); Applied Physics (physics.app-ph); Quantum Physics (quant-ph)
- [63] arXiv:2411.15913 (replaced) [pdf, ps, other]
-
Title: Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrogramsComments: Accepted by ICIP 2026Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [64] arXiv:2502.09592 (replaced) [pdf, ps, other]
-
Title: A Data-Driven Method for Microgrid System Identification: Physically Consistent Sparse Identification of Nonlinear DynamicsComments: 5 pages, 5 figures, 2025 IEEE Power & Energy Society General Meeting (PESGM), AcceptedSubjects: Systems and Control (eess.SY)
- [65] arXiv:2502.20427 (replaced) [pdf, ps, other]
-
Title: DeePen: Penetration Testing for Audio Deepfake DetectionAuthors: Nicolas Müller, Piotr Kawa, Adriana Stan, Thien-Phuc Doan, Souhwan Jung, Wei Herng Choong, Philip Sperl, Konstantin BöttingerSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [66] arXiv:2506.22117 (replaced) [pdf, ps, other]
-
Title: Safe Multi-Agent Navigation via Constrained HJB-Informed LearningComments: Accepted by Robotics: Science and Systems (RSS 2026)Subjects: Systems and Control (eess.SY)
- [67] arXiv:2507.02385 (replaced) [pdf, ps, other]
-
Title: Parameter estimation of range-migrating targets using OTFS signals from LEO satellitesComments: submitted to IEEE journal for possible publicationSubjects: Signal Processing (eess.SP)
- [68] arXiv:2508.11351 (replaced) [pdf, ps, other]
-
Title: Important Bit Prefix M-ary Quadrature Amplitude Modulation for Semantic CommunicationsSubjects: Signal Processing (eess.SP)
- [69] arXiv:2508.14950 (replaced) [pdf, ps, other]
-
Title: Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRIAuthors: Oliver Welin Odeback, Arivazhagan Geetha Balasubramanian, Jonas Schollenberger, Edward Ferdiand, Alistair A. Young, C. Alberto Figueroa, Susanne Schnell, Outi Tammisola, Ricardo Vinuesa, Tobias Granberg, Alexander Fyrdahl, David MarleviComments: 26 pages, 10 figuresSubjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
- [70] arXiv:2508.20474 (replaced) [pdf, ps, other]
-
Title: Unifying Diarization, Separation, and ASR with Multi-Speaker EncoderComments: Accepted to IEEE ASRU 2025Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
- [71] arXiv:2510.14244 (replaced) [pdf, ps, other]
-
Title: Reinforcement Learning for Unsupervised Domain Adaptation in Spatio-Temporal Echocardiography SegmentationAuthors: Arnaud Judge, Nicolas Duchateau, Thierry Judge, Roman A. Sandler, Joseph Z. Sokol, Christian Desrosiers, Olivier Bernard, Pierre-Marc JodoinComments: 13 pages, accepted for publication in IEEE TMISubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2510.19471 (replaced) [pdf, ps, other]
-
Title: Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech RecognitionAuthors: Yuu JinnaiSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [73] arXiv:2510.20152 (replaced) [pdf, ps, other]
-
Title: Soft Switching Expert Policies for Controlling Systems with Uncertain ParametersAuthors: Junya IkemotoComments: 7 pages, 8 figures. Accepted to IFAC World Congress 2026Subjects: Systems and Control (eess.SY)
- [74] arXiv:2511.06257 (replaced) [pdf, ps, other]
-
Title: Fast Reconstruction of Motion-Corrupted Data with Mobile-GRAPPA: Motion and dB0 Inhomogeneity Correction Leveraging Efficient GRAPPAAuthors: Yimeng Lin, Nan Wang, Daniel Abraham, Daniel Polak, Xiaozhi Cao, Aizada Nurdinova, Stephen Cauley, Kawin SetsompopComments: 20 pages, 9 figuresSubjects: Signal Processing (eess.SP)
- [75] arXiv:2511.06874 (replaced) [pdf, ps, other]
-
Title: Radio-Coverage-Aware Path Planning for Cooperative Autonomous VehiclesAuthors: Giuseppe Baruffa, Luca Rugini, Francesco Binucci, Fabrizio Frescura, Paolo Banelli, Renzo PerfettiComments: 11 pages, 19 figuresSubjects: Signal Processing (eess.SP); Systems and Control (eess.SY)
- [76] arXiv:2511.08499 (replaced) [pdf, ps, other]
-
Title: Approaching Safety-Argumentation-by-Design: A Requirement-based Safety Argumentation Life Cycle for Automated VehiclesAuthors: Marvin Loba, Robert Graubohm, Niklas Braun, Nayel Fabian Salem, Andreas Dotzler, Marcus Nolte, Torben Stolte, Richard Schubert, Markus MaurerComments: Accepted to be published in 2026 IEEE 29th International Conference on Intelligent Transportation Systems (ITSC)}, Naples, Italy, September 15-18, 2026Subjects: Systems and Control (eess.SY)
- [77] arXiv:2511.19383 (replaced) [pdf, ps, other]
-
Title: A Hybrid Learning-to-Optimize Framework for Mixed-Integer Quadratic ProgrammingComments: fianl L4DC 2026Subjects: Systems and Control (eess.SY)
- [78] arXiv:2511.20383 (replaced) [pdf, ps, other]
-
Title: Accelerating Time-Optimal Trajectory Planning for Connected and Automated Vehicles with Graph Neural NetworksComments: final IFAC WC 2026Subjects: Systems and Control (eess.SY)
- [79] arXiv:2512.06109 (replaced) [pdf, ps, other]
-
Title: Unifying Entropy Regularization in Optimal Control: From and Back to Classical Objectives via Iterated Soft Policies and Path Integral SolutionsComments: refurbished introduction, added a few remarks, reduced sizeSubjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
- [80] arXiv:2512.20211 (replaced) [pdf, ps, other]
-
Title: Aliasing-Free Neural Audio SynthesisComments: Accepted by TASLPSubjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
- [81] arXiv:2601.02053 (replaced) [pdf, ps, other]
-
Title: Ageing Monitoring for Commercial Microcontrollers Based on Timing WindowsSubjects: Hardware Architecture (cs.AR); Systems and Control (eess.SY)
- [82] arXiv:2601.12695 (replaced) [pdf, ps, other]
-
Title: From Noise to Knowledge: System Identification with Systematic Polytope Construction via Cyclic ReformulationSubjects: Systems and Control (eess.SY)
- [83] arXiv:2601.22792 (replaced) [pdf, ps, other]
-
Title: CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASRComments: Accepted to IEEE ICASSP 2026Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
- [84] arXiv:2602.01629 (replaced) [pdf, ps, other]
-
Title: AdaptNC: Adaptive Nonconformity Scores for Conformal Prediction under Distribution ShiftSubjects: Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
- [85] arXiv:2602.07029 (replaced) [pdf, ps, other]
-
Title: Guidestar-Free Adaptive Optics with Asymmetric AperturesComments: Accepted to ACM Transactions on Graphics (TOG)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [86] arXiv:2602.16253 (replaced) [pdf, ps, other]
-
Title: How Much Does Machine Identity Matter in Anomalous Sound Detection at Test Time?Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
- [87] arXiv:2603.02245 (replaced) [pdf, ps, other]
-
Title: LMU-Based Sequential Learning and Posterior Ensemble Fusion for Cross-Domain Infant Cry ClassificationComments: 7 pages, to appear in Proc. Int. Conf. IEEE Engineering in Medicine and Biology Society (EMBC 2026), Toronto, Canada, July 26-30 2026Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
- [88] arXiv:2603.09708 (replaced) [pdf, ps, other]
-
Title: Adapting a Text-to-Audio Model for Room Impulse Response GenerationComments: 4 pages, 1 figure, submitted to IWAENC 2026Subjects: Audio and Speech Processing (eess.AS)
- [89] arXiv:2603.19910 (replaced) [pdf, ps, other]
-
Title: Learning Adaptive Parameter Policies for Nonlinear Bayesian FilteringComments: Accepted for presentation at 29th International Conference on Information FusionSubjects: Systems and Control (eess.SY)
- [90] arXiv:2603.22267 (replaced) [pdf, ps, other]
-
Title: TiCo: Time-Controllable Spoken Dialogue ModelSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
- [91] arXiv:2604.13505 (replaced) [pdf, ps, other]
-
Title: Cascaded TD3-PID Hybrid Controller for Quadrotor Trajectory Tracking in Wind Disturbance EnvironmentsSubjects: Systems and Control (eess.SY)
- [92] arXiv:2604.14842 (replaced) [pdf, ps, other]
-
Title: Simplification Ad Absurdum? Revisiting Gas Flow Modeling for Integrated Energy System PlanningSubjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
- [93] arXiv:2604.18263 (replaced) [pdf, ps, other]
-
Title: Passive RIS Is Not Silent: Revisiting Performance Limits Under Thermal NoiseAuthors: Farjam Karim, Deepak Kumar, Prathapasinghe Dharmawansa, Nurul Huda Mahmood, Arthur Sousa de Sena, Matti-Latva-ahoSubjects: Signal Processing (eess.SP)
- [94] arXiv:2604.24610 (replaced) [pdf, ps, other]
-
Title: Matching-free Acquisition of Channels with Anisotropic WavefrontsSubjects: Signal Processing (eess.SP)
- [95] arXiv:2605.07653 (replaced) [pdf, ps, other]
-
Title: Aquatic Neuromorphic Optical FlowComments: This work is under review. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [96] arXiv:2605.11940 (replaced) [pdf, ps, other]
-
Title: Lane-Aware Graph Attention Network for Multi-Vehicle Trajectory Prediction in Expressway Merge ZonesAuthors: Eni Solomon LaughterSubjects: Systems and Control (eess.SY)
- [97] arXiv:2605.12244 (replaced) [pdf, ps, other]
-
Title: Estimation Problems and the Modulating Function Method: The Algebra of Modulating FunctionsComments: 13 pages, 6 figuresSubjects: Systems and Control (eess.SY); Rings and Algebras (math.RA)
[ showing up to 2000 entries per page: fewer | more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, eess, recent, 2605, contact, help (Access key information)