We gratefully acknowledge support from
the Simons Foundation and member institutions.

Electrical Engineering and Systems Science

New submissions

[ total of 88 entries: 1-88 ]
[ showing up to 2000 entries per page: fewer | more ]

New submissions for Fri, 27 Mar 26

[1]  arXiv:2603.24596 [pdf, ps, other]
Title: X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs
Comments: 5 pages
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

While the shift from cascaded dialogue systems to end-to-end (E2E) speech Large Language Models (LLMs) improves latency and paralinguistic modeling, E2E models often exhibit a significant performance degradation compared to their text-based counterparts. The standard Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) training methods fail to close this gap. To address this, we propose X-OPD, a novel Cross-Modal On-Policy Distillation framework designed to systematically align the capabilities of Speech LLMs to their text-based counterparts. X-OPD enables the Speech LLM to explore its own distribution via on-policy rollouts, where a text-based teacher model evaluates these trajectories and provides token-level feedback, effectively distilling teacher's capabilities into student's multi-modal representations. Extensive experiments across multiple benchmarks demonstrate that X-OPD significantly narrows the gap in complex tasks while preserving the model's inherent capabilities.

[2]  arXiv:2603.24599 [pdf, ps, other]
Title: A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications
Comments: 9 pages, 5 figures, accepted by IEEE Wireless Communications Magazine
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)

Stacked intelligent metasurfaces (SIMs) represent a breakthrough in wireless hardware by comprising multilayer, programmable metasurfaces capable of analog computing in the electromagnetic (EM) wave domain. By examining their architectural analogies, this article reveals a deeper connection between SIMs and artificial neural networks (ANNs). Leveraging this profound structural similarity, this work introduces a learnable SIM architecture and proposes a learnable SIM-based machine learning (ML) paradigm for sixth-generation (6G)-andbeyond systems. Then, we develop two SIM-empowered wireless signal processing schemes to effectively achieve multi-user signal separation and distinguish communication signals from jamming signals. The use cases highlight that the proposed SIM-enabled signal processing system can significantly enhance spectrum utilization efficiency and anti-jamming capability in a lightweight manner and pave the way for ultra-efficient and intelligent wireless infrastructures.

[3]  arXiv:2603.24601 [pdf, ps, other]
Title: FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition
Comments: Paper presented on: July 2025 Conference: XVII Simp\'osio Brasileiro de Automa\c{c}\~ao Inteligente (SBAI) At: S\~ao Jo\~ao del-Rei
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

The study explores a hybrid centralized-federated approach for Human Activity Recognition (HAR) using a Transformer-based architecture. With the increasing ubiquity of edge devices, such as smartphones and wearables, a significant amount of private data from wearable and inertial sensors is generated, facilitating discreet monitoring of human activities, including resting, sleeping, and walking. This research focuses on deploying HAR technologies using mobile sensor data and leveraging Federated Learning within the Flower framework to evaluate the training of a federated model derived from a centralized baseline. The experimental results demonstrate the effectiveness of the proposed hybrid approach in improving the accuracy and robustness of HAR models while preserving data privacy in a non-IID data scenario. The federated learning setup demonstrated comparable performance to centralized models, highlighting the potential of federated learning to strike a balance between data privacy and model performance in real-world applications.

[4]  arXiv:2603.24602 [pdf, ps, other]
Title: MuViS: Multimodal Virtual Sensing Benchmark
Comments: Submitted to European Signal Processing Conference (EUSIPCO) 2026
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)

Virtual sensing aims to infer hard-to-measure quantities from accessible measurements and is central to perception and control in physical systems. Despite rapid progress from first-principle and hybrid models to modern data-driven methods research remains siloed, leaving no established default approach that transfers across processes, modalities, and sensing configurations. We introduce MuViS, a domain-agnostic benchmarking suite for multimodal virtual sensing that consolidates diverse datasets into a unified interface for standardized preprocessing and evaluation. Using this framework, we benchmark established approaches spanning gradient-boosted decision trees and deep neural network (NN) architectures, and show that none of these provides a universal advantage, underscoring the need for generalizable virtual sensing architectures. MuViS is released as an open-source, extensible platform for reproducible comparison and future integration of new datasets and model classes.

[5]  arXiv:2603.24604 [pdf, ps, other]
Title: Analog Computing with Hybrid Couplers and Phase Shifters
Comments: Submitted to IEEE for publication
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Applied Physics (physics.app-ph)

Analog computing with microwave signals can enable exceptionally fast computations, potentially surpassing the limits of conventional digital computing. For example, by letting some input signals propagate through a linear microwave network and reading the corresponding output signals, we can instantly compute a matrix-vector product without any digital operations. In this paper, we investigate the computational capabilities of linear microwave networks made exclusively of two low-cost and fundamental components: hybrid couplers and phase shifters, which are both implementable in microstrip. We derive a sufficient and necessary condition characterizing the class of linear transformations that can be computed in the analog domain using these two components. Within this class, we identify three transformations of particular relevance to signal processing, namely the discrete Fourier transform (DFT), the Hadamard transform, and the Haar transform. For each of these, we provide a systematic design method to construct networks of hybrid couplers and phase shifters capable of computing the transformation for any size power of two. To validate our theoretical results, a hardware prototype was designed and fabricated, integrating hybrid couplers and phase shifters to implement the $4\times4$ DFT. A systematic calibration procedure was subsequently developed to characterize the prototype and compensate for fabrication errors. Measured results from the prototype demonstrate successful DFT computation in the analog domain, showing high correlation with theoretical expectations. By realizing an analog computer through standard microwave components, this work demonstrates a practical pathway toward low-latency, real-time analog signal processing.

[6]  arXiv:2603.24633 [pdf, ps, other]
Title: Coronary artery calcification assessment in National Lung Screening Trial CT images (DeepCAC2)
Subjects: Image and Video Processing (eess.IV)

Coronary artery calcification (CAC) is a strong predictor of cardiovascular risk but remains underutilized in clinical routine thoracic imaging due to the need for dedicated imaging protocols and manual annotation. We present DeepCAC2, a publicly available dataset containing automated CAC segmentations, coronary artery calcium scores, and derived risk categories generated from low-dose chest CT scans of the National Lung Screening Trial (NLST). Using a fully automated deep learning pipeline trained on expert-annotated cardiac CT data, we processed 127,776 CT scans from 26,228 individuals and generated standardized CAC segmentations and risk estimates for each acquisition. We already provide a public dashboard as a simple tool to visually inspect a random subset of 200 NLST patients of the dataset. The dataset will be released with DICOM-compatible segmentation objects and structured metadata to support reproducible downstream analysis. The deep learning pipeline will be made publicly available as a DICOM-compatible MHub.ai container. DeepCAC2 provides a transparent, large-scale, public, fully reproducible resource for research in cardiovascular risk assessment, opportunistic screening, and imaging biomarker development.

[7]  arXiv:2603.24785 [pdf, ps, other]
Title: Cyber-Physical System Design Space Exploration for Affordable Precision Agriculture
Comments: 2026 Design, Automation & Test in Europe Conference (DATE)
Subjects: Systems and Control (eess.SY); Emerging Technologies (cs.ET)

Precision agriculture promises higher yields and sustainability, but adoption is slowed by the high cost of cyber-physical systems (CPS) and the lack of systematic design methods. We present a cost-aware design space exploration (DSE) framework for multimodal drone-rover platforms to integrate budget, energy, sensing, payload, computation, and communication constraints. Using integer linear programming (ILP) with SAT-based verification, our approach trades off among cost, coverage, and payload while ensuring constraint compliance and a multitude of alternatives. We conduct case studies on smaller and larger-sized farms to show that our method consistently achieves full coverage within budget while maximizing payload efficiency, outperforming state-of-the-art CPS DSE approaches.

[8]  arXiv:2603.24810 [pdf, ps, other]
Title: Unified Diffusion Refinement for Multi-Channel Speech Enhancement and Separation
Comments: Paper in submission
Subjects: Audio and Speech Processing (eess.AS)

We propose Uni-ArrayDPS, a novel diffusion-based refinement framework for unified multi-channel speech enhancement and separation. Existing methods for multi-channel speech enhancement/separation are mostly discriminative and are highly effective at producing high-SNR outputs. However, they can still generate unnatural speech with non-linear distortions caused by the neural network and regression-based objectives. To address this issue, we propose Uni-ArrayDPS, which refines the outputs of any strong discriminative model using a speech diffusion prior. Uni-ArrayDPS is generative, array-agnostic, and training-free, and supports both enhancement and separation. Given a discriminative model's enhanced/separated speech, we use it, together with the noisy mixtures, to estimate the noise spatial covariance matrix (SCM). We then use this SCM to compute the likelihood required for diffusion posterior sampling of the clean speech source(s). Uni-ArrayDPS requires only a pre-trained clean-speech diffusion model as a prior and does not require additional training or fine-tuning, allowing it to generalize directly across tasks (enhancement/separation), microphone array geometries, and discriminative model backbones. Extensive experiments show that Uni-ArrayDPS consistently improves a wide range of discriminative models for both enhancement and separation tasks. We also report strong results on a real-world dataset. Audio demos are provided at \href{https://xzwy.github.io/Uni-ArrayDPS/}{https://xzwy.github.io/Uni-ArrayDPS/}.

[9]  arXiv:2603.24894 [pdf, ps, other]
Title: Active Calibration of Reachable Sets Using Approximate Pick-to-Learn
Comments: This paper has been submitted to the IEEE Control Systems Letters (L-CSS) jointly with the IEEE Conference on Decision and Control (CDC), with the addition of the crucial citation [3] and the code repo link
Subjects: Systems and Control (eess.SY)

Reachability computations that rely on learned or estimated models require calibration in order to uphold confidence about their guarantees. Calibration generally involves sampling scenarios inside the reachable set. However, producing reasonable probabilistic guarantees may require many samples, which can be costly. To remedy this, we propose that calibration of reachable sets be performed using active learning strategies. In order to produce a probabilistic guarantee on the active learning, we adapt the Pick-to-Learn algorithm, which produces generalization bounds for standard supervised learning, to the active learning setting. Our method, Approximate Pick-to-Learn, treats the process of choosing data samples as maximizing an approximate error function. We can then use conformal prediction to ensure that the approximate error is close to the true model error. We demonstrate our technique for a simulated drone racing example in which learning is used to provide an initial guess of the reachable tube. Our method requires fewer samples to calibrate the model and provides more accurate sets than the baselines. We simultaneously provide tight generalization bounds.

[10]  arXiv:2603.24954 [pdf, ps, other]
Title: On Performance of Fluid Antenna Relay (FAR)-Assisted AAV-NOMA Wireless Network
Subjects: Signal Processing (eess.SP)

In this paper, we investigate the performance of a fluid antenna relay (FAR)-assisted downlink communication system utilizing non-orthogonal multiple access (NOMA). The FAR, which integrates a fluid antenna system (FAS), is equipped on an autonomous aerial vehicle (AAV), and introduces extra degrees of freedom to improve the performance of the system. The transmission is divided into a first phase from the base station (BS) to the users and the FAR, and a second phase where the FAR forwards the signal using amplify-and-forward (AF) or decode-and-forward (DF) relaying to reduce the outage probability (OP) for the user maintaining weaker channel conditions. To analyze the OP performance of the weak user, Copula theory and the Gaussian copula function are employed to model the statistical distribution of the FAS channels. Analytical expressions for weak user's OP are derived for both the AF and the DF schemes. Simulation results validate the effectiveness of the proposed scheme, showing that it consistently outperforms benchmark schemes without the FAR. In addition, numerical simulations also demonstrate the values of the relaying scheme selection parameter under different FAR positions and communication outage thresholds.

[11]  arXiv:2603.24960 [pdf, ps, other]
Title: Near-field Beam Training under Multi-path Channels: A Hybrid Learning-and-Optimization Approach
Comments: Submitted to IEEE for possible publication
Subjects: Signal Processing (eess.SP)

For extremely large-scale arrays (XL-arrays), the discrete Fourier transform (DFT) codebook, conventionally used in the far-field, has recently been employed for near-field beam training. However, most existing methods rely on the line-of-sight (LoS) dominant channel assumption, which may suffer degraded communication performance when applied to the general multi-path scenario due to the more complex received signal power pattern at the user. To address this issue, we propose in this paper a new hybrid learning-and-optimization-based beam training method that first leverages deep learning (DL) to obtain coarse channel parameter estimates, and then refines them via a model-based optimization algorithm, hence achieving high-accuracy estimation with low computational complexity. Specifically, in the first stage, a tailored U-Net architecture is developed to learn the non-linear mapping from the received power pattern to coarse estimates of the angles and ranges of multi-path components. In particular, the inherent permutation ambiguity in multi-path parameter matching is effectively resolved by a permutation invariant training (PIT) strategy, while the unknown number of paths is estimated based on defined path existence logits. In the second stage, we further propose an efficient particle swarm optimization method to refine the angular and range parameters within a confined search region; in the meanwhile, a Gerchberg-Saxton algorithm is used to retrieve multi-path channel gains from the received power pattern. Last, numerical results demonstrate that the proposed hybrid design significantly outperforms various benchmarks in terms of parameter estimation accuracy and achievable rate, yet with low computational complexity.

[12]  arXiv:2603.24968 [pdf, ps, other]
Title: Subject-Specific Low-Field MRI Synthesis via a Neural Operator
Comments: 11 pages, 2 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)

Low-field (LF) magnetic resonance imaging (MRI) improves accessibility and reduces costs but generally has lower signal-to-noise ratios and degraded contrast compared to high field (HF) MRI, limiting its clinical utility. Simulating LF MRI from HF MRI enables virtual evaluation of novel imaging devices and development of LF algorithms. Existing low field simulators rely on noise injection and smoothing, which fail to capture the contrast degradation seen in LF acquisitions. To this end, we introduce an end-to-end LF-MRI synthesis framework that learns HF to LF image degradation directly from a small number of paired HF-LF MRIs. Specifically, we introduce a novel HF to LF coordinate-image decoupled neural operator (H2LO) to model the underlying degradation process, and tailor it to capture high-frequency noise textures and image structure. Experimental results in T1w and T2w MRI demonstrate that H2LO produces more faithful simulated low-field images than existing parameterized noise synthesis models and popular image-to-image translation models. Furthermore, it improves performance in downstream image enhancement tasks, showcasing its potential to enhance LF MRI diagnostic capabilities.

[13]  arXiv:2603.24990 [pdf, ps, other]
Title: From Global to Local: Hierarchical Probabilistic Verification for Reachability Learning
Comments: Submitted to the 65th IEEE Conference on Decision and Control (CDC 2026) and IEEE Control Systems Letters (L-CSS)
Subjects: Systems and Control (eess.SY)

Hamilton-Jacobi (HJ) reachability provides formal safety guarantees for nonlinear systems. However, it becomes computationally intractable in high-dimensional settings, motivating learning-based approximations that may introduce unsafe errors or overly optimistic safe sets. In this work, we propose a hierarchical probabilistic verification framework for reachability learning that bridges offline global certification and online local refinement. We first construct a coarse safe set using scenario optimization, providing an efficient global probabilistic certificate. We then introduce an online local refinement module that expands the certified safe set near its boundary by solving a sequence of convex programs, recovering regions excluded by the global verification. This refinement reduces conservatism while focusing computation on critical regions of the state space. We provide probabilistic safety guarantees for both the global and locally refined sets. Integrated with a switching mechanism between a learned reachability policy and a model-based controller, the proposed framework improves success rates in goal-reaching tasks with safety constraints, as demonstrated in simulation experiments of two drones racing to a goal with complex safety constraints.

[14]  arXiv:2603.25041 [pdf, ps, other]
Title: AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration
Comments: Submitted to Interspeech 2026
Subjects: Audio and Speech Processing (eess.AS)

Integrating Automatic Speech Recognition (ASR) into Speech Emotion Recognition (SER) enhances modeling by providing linguistic context. However, conventional feature fusion faces performance bottlenecks, and multi-task learning often suffers from optimization conflicts. While task vectors and model merging have addressed such conflicts in NLP and CV, their potential in speech tasks remains largely unexplored. In this work, we propose an Adaptive Layer-wise Task Vector Merging (AdaLTM) framework based on WavLM-Large. Instead of joint optimization, we extract task vectors from in-domain ASR and SER models fine-tuned on emotion datasets. These vectors are integrated into a frozen base model using layer-wise learnable coefficients. This strategy enables depth-aware balancing of linguistic and paralinguistic knowledge across transformer layers without gradient interference. Experiments on the MSP-Podcast demonstrate that the proposed approach effectively mitigates conflicts between ASR and SER.

[15]  arXiv:2603.25057 [pdf, ps, other]
Title: From Noisy Data to Hierarchical Control: A Model-Order-Reduction Framework
Subjects: Systems and Control (eess.SY)

This paper develops a direct data-driven framework for constructing reduced-order models (ROMs) of discrete-time linear dynamical systems with unknown dynamics and process disturbances. The proposed scheme enables controller synthesis on the ROM and its refinement to the original system by an interface function designed using noisy data. To achieve this, the notion of simulation functions (SFs) is employed to establish a formal relation between the original system and its ROM, yielding a quantitative bound on the mismatch between their output trajectories. To construct such relations and interface functions, we rely on data collected from the unknown system. In particular, using noise-corrupted input-state data gathered along a single trajectory of the system, and without identifying the original dynamics, we propose data-dependent conditions, cast as a semidefinite program, for the simultaneous construction of ROMs, SFs, and interface functions. Through a case study, we demonstrate that data-driven controller synthesis on the ROM, combined with controller refinement via the interface function, enables the enforcement of complex specifications beyond stability.

[16]  arXiv:2603.25161 [pdf, ps, other]
Title: Distributed Event-Triggered Consensus Control of Discrete-Time Linear Multi-Agent Systems under LQ Performance Constraints
Comments: 11 pages
Subjects: Systems and Control (eess.SY)

This paper proposes a distributed event-triggered control method that not only guarantees consensus of multi-agent systems but also satisfies a prescribed LQ performance constraint. Taking the standard distributed control scheme with all-time communication as a baseline, we consider the problem of designing an event-triggered communication rule such that the resulting LQ cost satisfies a performance constraint with respect to the baseline cost while consensus is achieved. For general linear agents over an undirected graph, we employ local state predictors and a local triggering condition based only on information available to each agent. We then derive a sufficient condition for the proposed method to satisfy the performance constraint and guarantee consensus. In addition, we develop a tractable parameter design method for selecting the triggering parameters offline. Numerical examples demonstrate the effectiveness of the proposed method.

[17]  arXiv:2603.25166 [pdf, ps, other]
Title: Efficient compressive sensing for machinery vibration signals
Authors: Imen Tounsi (UJM, LASPI), Fadi Karkafi, Mohammed El Badaoui (UJM, LASPI), François Guillet (UJM, LASPI)
Journal-ref: The International Conference on Acoustics and Vibration and Green Technologies ICAV-GreenTech'2025, Dec 2025, Sousse, Tunisia
Subjects: Signal Processing (eess.SP)

Mechanical vibration monitoring often requires high sampling rates and generates large data volumes, posing challenges for storage, transmission, and power efficiency. Compressive Sensing (CS) offers a promising approach to overcome these constraints by exploiting signal sparsity to enable sub-Nyquist acquisition and efficient reconstruction. This study presents a comprehensive comparative analysis of the key components of the CS framework: sparse basis, measurement matrix, and reconstruction algorithm for machinery vibration signals. In addition, a hardware-efficient measurement matrix, the Wang matrix, originally developed for image compression, is introduced and evaluated for the first time in this context. Experimental assessment using the HUMS2023 and the CETIM gearbox datasets demonstrates that this matrix achieves superior reconstruction quality, with higher SNR, compared to conventional Gaussian and Bernoulli matrices, especially at high compression ratios.

[18]  arXiv:2603.25167 [pdf, ps, other]
Title: Multi-Swing Transient Stability of Synchronous Generators and IBR Combined Generation Systems
Subjects: Systems and Control (eess.SY)

In traditional views, the build-up of accelerating energy during faults can cause the well-known first-swing angle instability in synchronous generators (SGs). Interestingly, this letter presents a new insight that the accumulation of decelerating energy due to the low voltage ride-through (LVRT) and recovery control of grid-following inverter-based resources (GFL-IBRs), might also result in transient angle instability in SGs. The transient energy accumulated during angle-decreasing swing transforms into the acceleration energy of the subsequent swing, hence such phenomena often manifest as multi-swing instability. Both theoretical analysis and simulation support these findings.

[19]  arXiv:2603.25192 [pdf, ps, other]
Title: Dominant Transient Stability of the Co-located PLL-Based Grid-Following Renewable Plant and Synchronous Condenser Systems
Subjects: Systems and Control (eess.SY)

Deploying synchronous condensers (SynCons) near grid-following renewable energy sources (GFLRs) is an effective and increasingly adopted strategy for grid support. However, the potential transient instability risks in such configurations remain an open research question. This study investigates the mechanism of dominant synchronization instability source transition upon SynCon integration and proposes a straightforward approach to enhance system stability by leveraging their interactive characteristics. Firstly, a dual-timescale decoupling model is established, partitioning the system into a fast subsystem representing phase-locked loop (PLL) dynamics and a slow subsystem characterizing SynCon rotor dynamics. The study then examines the influence of SynCons on the transient stability of nearby PLLs and their own inherent stability. The study shows that SynCon's voltage-source characteristics and its time-scale separation from PLL dynamics can significantly enhance the PLL's stability boundary and mitigate non-coherent coupling effects among multiple GFLRs. However, the dominant instability source shifts from the fast-time-scale PLL to the slow-time-scale SynCon after SynCon integration. Crucially, this paper demonstrates that the damping effect of PLL control can also be transferred from the fast to the slow time scale, allowing well-tuned PLL damping to suppress SynCon rotor acceleration. Consequently, by utilizing SynCon's inherent support capability and a simple PLL damping loop, the transient stability of the co-located system can be significantly enhanced. These conclusions are validated using a converter controller-based Hardware-in-the-Loop (CHIL) platform.

[20]  arXiv:2603.25211 [pdf, ps, other]
Title: On Port-Hamiltonian Formulation of HystereticEnergy Storage Elements: The Backlash Case
Subjects: Systems and Control (eess.SY)

This paper presents a port-Hamiltonian formulation of hysteretic energy storage elements. First, we revisit the passivity property of backlash-driven storage elements by presenting a family of storage functions associated to the dissipativity property of such elements. We explicitly derive the corresponding available storage and required supply functions `a la Willems [1], and show the interlacing property of the aforementioned family of storage functions sandwiched between the available storage and required supply functions. Second, using the proposed family of storage functions, we present a port-Hamiltonian formulation of hysteretic inductors as prototypical storage elements in port-Hamiltonian systems. In particular, we show how a Hamiltonian function can be chosen from the family of storage functions and how the hysteretic elements can be expressed as port-Hamiltonian system with feedthrough term, where the feedthrough term represents energy dissipation. Correspondingly, we illustrate its applicability in describing an RLC circuit (in parallel and in series) containing a hysteretic inductor element.

[21]  arXiv:2603.25238 [pdf, ps, other]
Title: Rate-Splitting Multiple Access with a SIC-Free Receiver: An Experimental Study
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT)

Most Rate-Splitting Multiple Access (RSMA) implementations rely on successive interference cancellation (SIC) at the receiver, whose performance is inherently limited by error propagation during common-stream decoding. This paper addresses this issue by developing a SIC-free RSMA receiver based on joint demapping (JD), which directly evaluates bit vectors over a composite constellation. Using a two-user Multiple-Input Single-Output (MISO) prototype, we conduct over-the-air measurements to systematically compare SIC and JD-based receivers. The results show that the proposed SIC-free receiver provides stronger reliability and better practicality over a wider operating range, with all observations being consistent with theoretical expectations.

[22]  arXiv:2603.25274 [pdf, ps, other]
Title: Feature Selection for Fault Prediction in Distribution Systems
Comments: Submitted to PSCC 2026
Subjects: Systems and Control (eess.SY)

While conventional power system protection isolates faulty components only after a fault has occurred, fault prediction approaches try to detect faults before they can cause significant damage. Although initial studies have demonstrated successful proofs of concept, development is hindered by scarce field data and ineffective feature selection. To address these limitations, this paper proposes a surrogate task that uses simulation data for feature selection. This task exhibits a strong correlation (r = 0.92) with real-world fault prediction performance. We generate a large dataset containing 20000 simulations with 34 event classes and diverse grid configurations. From 1556 candidate features, we identify 374 optimal features. A case study on three substations demonstrates the effectiveness of the selected features, achieving an F1-score of 0.80 and outperforming baseline approaches that use frequency-domain and wavelet-based features.

[23]  arXiv:2603.25287 [pdf, ps, other]
Title: Entire Period Transient Stability of Synchronous Generators Considering LVRT Switching of Nearby Renewable Energy Sources
Subjects: Systems and Control (eess.SY)

In scenarios where synchronous generators (SGs) and grid-following renewable energy sources (GFLR) are co-located, existing research, which mainly focuses on the first-swing stability of SGs, often overlooks ongoing dynamic interactions between GFLRs and SGs throughout the entire rotor swing period. To address this gap, this study first reveals that the angle oscillations of SG can cause periodic grid voltage fluctuations, potentially triggering low-voltage ride-through (LVRT) control switching of GFLR repeatedly. Then, the periodic energy changes of SGs under "circular" and "rectangular" LVRT limits are analyzed. The results indicate that circular limits are detrimental to SG's first-swing stability, while rectangular limits and their slow recovery strategies can lead to SG's multi-swing instability. Conservative stability criteria are also proposed for these phenomena. Furthermore, an additional controller based on feedback linearization is introduced to enhance the entire period transient stability of SG by adjusting the post-fault GFLR output current. Finally, the efficacy of the analysis is validated through electromagnetic transient simulations and controller hardware-in-the-loop (CHIL) tests.

[24]  arXiv:2603.25299 [pdf, ps, other]
Title: Joint Training Scattering Matrix Learning and Channel Estimation for Beyond-Diagonal Reconfigurable Intelligent Surfaces
Subjects: Signal Processing (eess.SP)

Beyond-diagonal reconfigurable intelligent surface (BD-RIS) generalizes the conventional diagonal RIS (D-RIS) by introducing tunable inter-element connections, offering enhanced wave manipulation capabilities. However, realizing the advantages of BD-RIS requires accurate channel state information (CSI), whose acquisition becomes significantly more challenging due to the increased number of channel coefficients, leading to prohibitively large pilot training overhead in BD-RIS-aided multi-user multiple-input multiple-output (MU-MIMO) systems. Existing studies reduce pilot overhead by exploiting the channel correlations induced by the Kronecker-product or multi-linear structure of BD-RIS-aided channels, which neglect the spatial correlation among antennas and the statistical correlation across RIS-user channels. In this paper, we propose a learning-based channel estimation framework, namely the joint training scattering matrix learning and channel estimation framework (JTSMLCEF), which jointly optimizes the BD-RIS training scattering matrix and estimates the cascaded channels in an end-to-end manner to achieve accurate channel estimation and reduce the pilot overhead. The proposed JTSMLCEF follows a two-phase channel estimation protocol to enable adaptive training scattering matrix optimization with a training scattering matrix optimizer (TSMO) and cascaded channel estimation with a dual-attention channel estimator (DACE). Specifically, the DACE is designed with intra-user and inter-user attention modules to capture the multi-dimensional correlations in multi-user cascaded channels. Simulation results demonstrate the superiority of JTSMLCEF. Compared with the current state-of-the-art method, it reduces the pilot overhead by $80\%$ while further reducing the normalized mean squared error (NMSE) by $82.6\%$ and $92.5\%$ in indoor and urban micro-cell (UMi) scenarios, respectively.

[25]  arXiv:2603.25332 [pdf, ps, other]
Title: DRL-Based Spectrum Sharing for RIS-Aided Local High-Quality Wireless Networks
Subjects: Systems and Control (eess.SY)

This paper investigates a smart spectrum-sharing framework for reconfigurable intelligent surface (RIS)-aided local high-quality wireless networks (LHQWNs) within a mobile network operator (MNO) ecosystem. Although RISs are often considered potentially harmful due to interference, this work shows that properly controlled RISs can enhance the quality of service (QoS). The proposed system enables temporary spectrum access for multiple vertical service providers (VSPs) by dynamically allocating radio resources according to traffic demand. The spectrum is divided into dedicated subchannels assigned to individual VSPs and reusable subchannels shared among multiple VSPs, while RIS is employed to improve propagation conditions. We formulate a multi-VSP utility maximization problem that jointly optimizes subchannel assignment, transmit power, and RIS phase configuration while accounting for spectrum access costs, RIS leasing costs, and QoS constraints. The resulting mixed-integer non-linear program (MINLP) is intractable using conventional optimization methods. To address this challenge, the problem is modeled as a Markov decision process (MDP) and solved using deep reinforcement learning (DRL). Specifically, deep deterministic policy gradient (DDPG) and soft actor-critic (SAC) algorithms are developed and compared. Simulation results show that SAC outperforms DDPG in convergence speed, stability, and achievable utility, reaching up to 96% of the exhaustive search benchmark and demonstrating the potential of RIS to improve overall utility in multi-VSP scenarios.

[26]  arXiv:2603.25384 [pdf, ps, other]
Title: Underdetermined Blind Source Separation via Weighted Simplex Shrinkage Regularization and Quantum Deep Image Prior
Comments: Published in: IEEE Transactions on Image Processing ( Volume: 35)
Journal-ref: IEEE Transactions on Image Processing, 2026
Subjects: Image and Video Processing (eess.IV)

As most optical satellites remotely acquire multispectral images (MSIs) with limited spatial resolution, multispectral unmixing (MU) becomes a critical signal processing technology for analyzing the pure material spectra for high-precision classification and identification. Unlike the widely investigated hyperspectral unmixing (HU) problem, MU is much more challenging as it corresponds to the underdetermined blind source separation (BSS) problem, where the number of sources is larger than the number of available multispectral bands. In this article, we transform MU into its overdetermined counterpart (i.e., HU) by inventing a radically new quantum deep image prior (QDIP), which relies on the virtual band-splitting task conducted on the observed MSI for generating the virtual hyperspectral image (HSI). Then, we perform HU on the virtual HSI to obtain the virtual hyperspectral sources. Though HU is overdetermined, it still suffers from the ill-posed issue, for which we employ the convex geometry structure of the HSI pixels to customize a weighted simplex shrinkage (WSS) regularizer to mitigate the ill-posedness. Finally, the virtual hyperspectral sources are spectrally downsampled to obtain the desired multispectral sources. The proposed geometry/quantum-empowered MU (GQ-$\mu$) algorithm can also effectively obtain the spatial abundance distribution map for each source, where the geometric WSS regularization is adaptively and automatically controlled based on the sparsity pattern of the abundance tensor. Simulation and real-world data experiments demonstrate the practicality of our unsupervised GQ-$\mu$ algorithm for the challenging MU task. Ablation study demonstrates the strength of QDIP, not achieved by classical DIP, and validates the mechanics-inspired WSS geometry regularizer.

[27]  arXiv:2603.25430 [pdf, ps, other]
Title: Four-Transistor Four-Diode (4T4D) Series/Parallel Chopper Module for Auto-Balancing STATCOM and Low Control and Development Complexity
Subjects: Systems and Control (eess.SY)

Static synchronous compensators (STATCOMs) manage reactive power compensation in modern power grids and have become essential for the integration of renewable energy sources such as wind farms. Cascaded H bridges have become the preferred topology for high-power STATCOMs, but balancing module capacitor voltages remains a persistent challenge. Conventional solutions equip every module with a voltage sensor -- a component that is costly, temperature-sensitive, and prone to aging-related failures. Recent parallel-capable module topologies can balance voltage through switched-capacitor operation. The latest developments reduced the sensor requirement from one per module to one per arm. However, these implementations require twice as many individual transistors compared to series-only topologies. We present a STATCOM solution based on the four-transistor four-diode (4T4D) series\,/\,parallel chopper cell. This topology achieves bidirectional parallelization with only four transistors per module -- exactly as many as a conventional full bridge. Furthermore, we propose a dual-loop control strategy that fully eliminates module voltage sensors by inferring voltage levels from the modulation index. This scheme also improves output quality by regulating the modulation depth. We validated our proposal through simulation and experiments. We built a prototype to interface the grid. The prototype further passed robustness tests with step change, current direction reversal, and grid disturbance. This work demonstrates the first modular STATCOM implementation that combines minimum transistor count with complete elimination of module voltage sensors.

[28]  arXiv:2603.25441 [pdf, ps, other]
Title: Language-Free Generative Editing from One Visual Example
Comments: Accepted at CVPR 2026
Subjects: Image and Video Processing (eess.IV)

Text-guided diffusion models have advanced image editing by enabling intuitive control through language. However, despite their strong capabilities, we surprisingly find that SOTA methods struggle with simple, everyday transformations such as rain or blur. We attribute this limitation to weak and inconsistent textual supervision during training, which leads to poor alignment between language and vision. Existing solutions often rely on extra finetuning or stronger text conditioning, but suffer from high data and computational requirements. We argue that diffusion-based editing capabilities aren't lost but merely hidden from text. The door to cost-efficient visual editing remains open, and the key lies in a vision-centric paradigm that perceives and reasons about visual change as humans do, beyond words. Inspired by this, we introduce Visual Diffusion Conditioning (VDC), a training-free framework that learns conditioning signals directly from visual examples for precise, language-free image editing. Given a paired example -one image with and one without the target effect- VDC derives a visual condition that captures the transformation and steers generation through a novel condition-steering mechanism. An accompanying inversion-correction step mitigates reconstruction errors during DDIM inversion, preserving fine detail and realism. Across diverse tasks, VDC outperforms both training-free and fully fine-tuned text-based editing methods. The code and models are open-sourced at https://omaralezaby.github.io/vdc/

[29]  arXiv:2603.25549 [pdf, ps, other]
Title: Multi-User Covert Communication in Spatially Heterogeneous Wireless Networks
Subjects: Signal Processing (eess.SP)

This paper investigates an uplink multi-user covert communication system with spatially distributed users. Unlike prior works that approximate channel statistics using averaged parameters and homogeneous assumptions, this study explicitly models each user's geometric position and corresponding user-to-Willie and user-to-Bob channel variances. This approach enables an accurate characterization of spatially heterogeneous covert environments. We mathematically prove that a generalized on-off power control scheme, which jointly accounts for both Bob's and Willie's channels, constitutes the optimal transmission strategy in heterogeneous user configurations. Leveraging the optimal strategy, we derive closed-form expressions for the minimum detection error probability and the minimum number of cooperative users required to satisfy a covert constraint. With the closed-form expressions, comprehensive theoretical analyses are conducted, which are validated by Monte-Carlo simulations. One important insight obtained from the analysis is that user spatial heterogeneity can enhance covert communication performance. Building on these findings, a piecewise search algorithm is proposed to achieve exact optimality with significantly reduced computational complexity. We demonstrate that optimization considering user's spatial heterogeneity achieves substantially improved covert communication performance than that based on the assumption of spatial homogeneity.

[30]  arXiv:2603.25566 [pdf, ps, other]
Title: A Mamba-based Perceptual Loss Function for Learning-based UGC Transcoding
Comments: 7 pages, 6 figures
Subjects: Image and Video Processing (eess.IV)

In user-generated content (UGC) transcoding, source videos typically suffer various degradations due to prior compression, editing, or suboptimal capture conditions. Consequently, existing video compression paradigms that solely optimize for fidelity relative to the reference become suboptimal, as they force the codec to replicate the inherent artifacts of the non-pristine source. To address this, we propose a novel perceptually inspired loss function for learning-based UGC video transcoding that redefines the role of the reference video, shifting it from a ground-truth pixel anchor to an informative contextual guide. Specifically, we train a lightweight neural quality model based on a Selective Structured State-Space Model (Mamba) optimized using a weakly-supervised Siamese ranking strategy. The proposed model is then integrated into the rate-distortion optimization (RDO) process of two neural video codecs (DCVC and HiNeRV) as a loss function, aiming to generate reconstructed content with improved perceptual quality. Our experiments demonstrate that this framework achieves substantial coding gains over both autoencoder and implicit neural representation-based baselines, with 8.46% and 12.89% BD-rate savings, respectively.

[31]  arXiv:2603.25574 [pdf, ps, other]
Title: Physics-informed structured learning of a class of recurrent neural networks with guaranteed properties
Subjects: Systems and Control (eess.SY)

This paper proposes a physics-informed learning framework for a class of recurrent neural networks tailored to large-scale and networked systems. The approach aims to learn control-oriented models that preserve the structural and stability properties of the plant. The learning algorithm is formulated as a convex optimisation problem, allowing the inclusion of linear matrix inequality constraints to enforce desired system features. Furthermore, when the plant exhibits structural modularity, the resulting optimisation problem can be parallelised, requiring communication only among neighbouring subsystems. Simulation results show the effectiveness of the proposed approach.

[32]  arXiv:2603.25576 [pdf, ps, other]
Title: Challenge-Response Authentication for LEO Satellite Channels: Exploiting Orbit-Specific Uniqueness
Subjects: Signal Processing (eess.SP)

The number of low Earth orbit (LEO) satellite constellations has grown rapidly in recent years, bringing a major change to global wireless communications. As LEO satellite links take on a growing role in critical services such as emergency communications, navigation, wide-area data collection, and military operations, keeping these links secure has become an important concern. In particular, verifying the identity of a satellite transmitter is now a basic requirement for protecting the services that rely on satellite access. In this article, we propose an active challenge-response authentication framework in which the verifier checks the satellite at randomly chosen times that are not known in advance, removing the fixed measurement window that existing passive methods expose to adversaries. The proposed framework uses the deterministic yet unpredictably sampled nature of orbital observables to establish a physics based root of trust for satellite identity authentication. This approach transforms satellite authentication from static feature matching into a spatiotemporal consistency verification problem inherently constrained by orbital dynamics, providing robust protection even against trajectory-aware spoofing attacks.

[33]  arXiv:2603.25593 [pdf, ps, other]
Title: Intelligent Reflection as a Service (IRaaS): System Architecture, Enabling Technologies, and Deployment Strategy
Authors: Wei Wang, Yutian Shen
Subjects: Signal Processing (eess.SP)

Reflecting intelligent surface (RIS) is a promising technology for 6G mobile communications. However, identifying the niche of RIS within the mobile networks is a challenging task. To mitigate the escalating system complexity of mobile networks, we propose the concept of Intelligent Reflection as a Service (IRaaS), and discuss its system architecture, enabling technologies, and deployment strategy, respectively. By leveraging technologies such as resource pooling, service based architecture (SBA), cloud infrastructure, and model-free signal processing, IRaaS empowers telecom operators to deliver on-demand intelligent reflection services without a radical update of current communication protocols. In addition, IRaaS brings a novel deployment strategy that creates new opportunities for the vendors of intelligent reflection service and balances the interests of both telecom operators and property owners. IRaaS is expected to speed up the rollout of RIS from both technical perspective and commercial perspective, fostering an authentic smart radio environment for future mobile communications.

[34]  arXiv:2603.25602 [pdf, ps, other]
Title: Parameter-interval estimation for cooperative reactive sputtering processes
Subjects: Systems and Control (eess.SY)

Reactive sputtering is a plasma-based technique to deposit a thin film on a substrate. This contribution presents a novel parameter-interval estimation method for a well-established model that describes the uncertain and nonlinear reactive sputtering process behaviour. Building on a proposed monotonicity-based model classification, the method guarantees that all parameterizations within the parameter interval yield output trajectories and static characteristics consistent with the enclosure induced by the parameter interval. Correctness and practical applicability of the new method are demonstrated by an experimental validation, which also reveals inherent structural limitations of the well-established process model for state-estimation tasks.

[35]  arXiv:2603.25621 [pdf, ps, other]
Title: A Ray-Based Characterization of Satellite-to-Urban Propagation
Subjects: Signal Processing (eess.SP)

The evolution toward 6G communication systems is expected to rely on integrated three-dimensional network architectures where terrestrial infrastructures coexist with non-terrestrial stations such as satellites, enabling ubiquitous connectivity and service continuity. In this context, accurate channel models for satellite-to-ground propagation in urban environments are essential, particularly for user equipment located at street level where obstruction and multipath effects are significant. This work investigates satellite-to-urban propagation through deterministic ray-tracing simulations. Three representative urban layouts are considered, namely dense urban, urban, and suburban. Multiple use cases are investigated, including handheld devices, vehicular terminals, and fixed rooftop receivers operating across several frequency bands. The analysis focuses on the relative importance of competing propagation mechanisms and on two key channel parameters, namely the Rician K-factor and the delay spread, which are relevant for the calibration of channel models to be used in link- and system-level simulations. Results highlight the strong - and in some cases unconventional - dependence of channel dispersion and fading characteristics on satellite elevation, antenna placement, and urban morphology.

[36]  arXiv:2603.25645 [pdf, ps, other]
Title: Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
Comments: preprint
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)

Early screening via colonoscopy is critical for colon cancer prevention, yet developing robust AI systems for this domain is hindered by the lack of densely annotated, long-sequence video datasets. Existing datasets predominantly focus on single-class polyp detection and lack the rich spatial, temporal, and linguistic annotations required to evaluate modern Multimodal Large Language Models (MLLMs). To address this critical gap, we introduce Colon-Bench, generated via a novel multi-stage agentic workflow. Our pipeline seamlessly integrates temporal proposals, bounding-box tracking, AI-driven visual confirmation, and human-in-the-loop review to scalably annotate full-procedure videos. The resulting verified benchmark is unprecedented in scope, encompassing 528 videos, 14 distinct lesion categories (including polyps, ulcers, and bleeding), over 300,000 bounding boxes, 213,000 segmentation masks, and 133,000 words of clinical descriptions. We utilize Colon-Bench to rigorously evaluate state-of-the-art MLLMs across lesion classification, Open-Vocabulary Video Object Segmentation (OV-VOS), and video Visual Question Answering (VQA). The MLLM results demonstrate surprisingly high localization performance in medical domains compared to SAM-3. Finally, we analyze common VQA errors from MLLMs to introduce a novel "colon-skill" prompting strategy, improving zero-shot MLLM performance by up to 9.7% across most MLLMs. The dataset and the code are available at https://abdullahamdi.com/colon-bench .

Cross-lists for Fri, 27 Mar 26

[37]  arXiv:2603.22225 (cross-list from cs.CL) [pdf, ps, other]
Title: Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson's Disease
Comments: Submitted to Interspeech 2026
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)

The limited availability of dysarthric speech data makes cross-lingual detection an important but challenging problem. A key difficulty is that speech representations often encode language-dependent structure that can confound dysarthria detection. We propose a representation-level language shift (LS) that aligns source-language self-supervised speech representations with the target-language distribution using centroid-based vector adaptation estimated from healthy-control speech. We evaluate the approach on oral DDK recordings from Parkinson's disease speech datasets in Czech, German, and Spanish under both cross-lingual and multilingual settings. LS substantially improves sensitivity and F1 in cross-lingual settings, while yielding smaller but consistent gains in multilingual settings. Representation analysis further shows that LS reduces language identity in the embedding space, supporting the interpretation that LS removes language-dependent structure.

[38]  arXiv:2603.24598 (cross-list from math.OC) [pdf, ps, other]
Title: Response-Aware Risk-Constrained Control Barrier Function With Application to Vehicles
Authors: Qijun Liao, Jue Yang
Comments: 22 pages, 20 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)

This paper proposes a unified control framework based on Response-Aware Risk-Constrained Control Barrier Function for dynamic safety boundary control of vehicles. Addressing the problem of physical model parameter mismatch, the framework constructs an uncertainty propagation model that fuses nominal dynamics priors with direct vehicle body responses. Utilizing simplified single-track dynamics to provide a baseline direction for control gradients and covering model deviations through statistical analysis of body response signals, the framework eliminates the dependence on accurate online estimation of road surface adhesion coefficients. By introducing Conditional Value at Risk (CVaR) theory, the framework reformulates traditional deterministic safety constraints into probabilistic constraints on the tail risk of barrier function derivatives. Combined with a Bayesian online learning mechanism based on inverse Wishart priors, it identifies environmental noise covariance in real-time, adaptively tuning safety margins to reduce performance loss under prior parameter mismatch. Finally, based on Control Lyapunov Function (CLF), a unified Second-Order Cone Programming (SOCP) controller is constructed. Theoretical analysis establishes convergence of Sequential Convex Programming to local Karush-Kuhn-Tucker points and provides per-step probabilistic safety bounds. High-fidelity dynamics simulations demonstrate that under extreme conditions, the method not only eliminates the output divergence phenomenon of traditional methods but also achieves Pareto improvement in both safety and tracking performance. For the chosen risk level, the per-step safety violation probability is theoretically bounded by approximately 2%, validated through high-fidelity simulations showing zero boundary violations across all tested scenarios.

[39]  arXiv:2603.24600 (cross-list from math.OC) [pdf, ps, other]
Title: Period-aware asymptotic gain with application to a periodically forced synchronization circuit
Comments: Accepted for the 2026 American Control Conference (ACC)
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

The classical asymptotic gain (AG) is a concept known from the input-to-state stability theory. Given a uniform input bound, AG estimates the asymptotic bound of the output. Sometimes, however, more information is known about the input than just a bound. In this paper we consider the case of a periodic input. Under the assumption that the system converges to a periodic solution, we introduce a new gain, called period-aware asymptotic gain (PAG), which employs periodicity to enable a sharper asymptotic estimation of the output. Since the PAG can distinguish between short-period ("high-frequency") and long-period ("low-frequency") signals, it is able to rigorously quantify such properties as bandwidth, resonant behavior, and high-frequency damping. We discuss how the PAG can be computed and illustrate it with a numerical example from the field of power electronics.

[40]  arXiv:2603.24620 (cross-list from cs.NI) [pdf, ps, other]
Title: Scalable Air-to-Ground Wireless Channel Modeling Using Environmental Context and Generative Diffusion
Authors: Jingyi Tian, Lin Cai
Comments: 11 pages, 13 figures
Subjects: Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)

The fast motion of Low Earth Orbit (LEO) satellites causes the propagation channel to vary rapidly, and its behavior is strongly shaped by the surrounding environment, especially at low elevation angles where signals are highly susceptible to terrain blockage and other environmental effects. Existing studies mostly rely on assumed statistical channel distributions and therefore ignore the influence of the actual geographic environment. In this paper, we propose an environment-aware channel modeling method for air-to-ground wireless links. We leverage real environmental data, including digital elevation models (DEMs) and land cover information, together with ray tracing (RT) to determine whether a link is line-of-sight (LOS) or non-line-of-sight (NLOS) and to identify possible reflection paths of the signal. The resulting obstruction and reflection profiles are then combined with models of diffraction loss, vegetation absorption, and atmospheric attenuation to quantitatively characterize channel behavior in realistic geographic environments. Since RT is computationally intensive, we use RT-generated samples and environmental features to train a scalable diffusion model that can efficiently predict channel performance for arbitrary satellite and ground terminal positions, thereby supporting real-time decision-making. In the experiments, we validate the proposed model with measurement data from both cellular and LEO satellite links, demonstrating its effectiveness in realistic environments.

[41]  arXiv:2603.24629 (cross-list from cs.SE) [pdf, ps, other]
Title: Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models
Comments: 27 pages, 14 figures, 8 tables
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Systems and Control (eess.SY)

Converting process sketches into executable simulation models remains a major bottleneck in process systems engineering, requiring substantial manual effort and simulator-specific expertise. Recent advances in generative AI have improved both engineering-diagram interpretation and LLM-assisted flowsheet generation, but these remain largely disconnected: diagram-understanding methods often stop at extracted graphs, while text-to-simulation workflows assume structured inputs rather than raw visual artifacts. To bridge this gap, we present an end-to-end multi-agent large language model system that converts process diagrams directly into executable Aspen HYSYS flowsheets. The framework decomposes the task into three coordinated layers: diagram parsing and interpretation, simulation model synthesis, and multi-level validation. Specialized agents handle visual interpretation, graph-based intermediate representation construction, code generation for the HYSYS COM interface, execution, and structural verification. We evaluate the framework on four chemical engineering case studies of increasing complexity, from a simple desalting process to an industrial aromatic production flowsheet with multiple recycle loops. The system produces executable HYSYS models in all cases, achieving complete structural fidelity on the two simpler cases and strong performance on the more complex ones, with connection consistency above 0.93 and stream consistency above 0.96. These results demonstrate a viable end-to-end sketch-to-simulation workflow while highlighting remaining challenges in dense recycle structures, implicit diagram semantics, and simulator-interface constraints.

[42]  arXiv:2603.24651 (cross-list from cs.CL) [pdf, ps, other]
Title: When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews
Comments: Accepted to LREC 2026 Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Automatic depression detection from doctor-patient conversations has gained momentum thanks to the availability of public corpora and advances in language modeling. However, interpretability remains limited: strong performance is often reported without revealing what drives predictions. We analyze three datasets: ANDROIDS, DAIC-WOZ, E-DAIC and identify a systematic bias from interviewer prompts in semi-structured interviews. Models trained on interviewer turns exploit fixed prompts and positions to distinguish depressed from control subjects, often achieving high classification scores without using participant language. Restricting models to participant utterances distributes decision evidence more broadly and reflects genuine linguistic cues. While semi-structured protocols ensure consistency, including interviewer prompts inflates performance by leveraging script artifacts. Our results highlight a cross-dataset, architecture-agnostic bias and emphasize the need for analyses that localize decision evidence by time and speaker to ensure models learn from participants' language.

[43]  arXiv:2603.24703 (cross-list from cs.SE) [pdf, ps, other]
Title: IndustriConnect: MCP Adapters and Mock-First Evaluation for AI-Assisted Industrial Operations
Subjects: Software Engineering (cs.SE); Robotics (cs.RO); Systems and Control (eess.SY)

AI assistants can decompose multi-step workflows, but they do not natively speak industrial protocols such as Modbus, MQTT/Sparkplug B, or OPC UA, so this paper presents INDUSTRICONNECT, a prototype suite of Model Context Protocol (MCP) adapters that expose industrial operations as schema-discoverable AI tools while preserving protocol-specific connectivity and safety controls; the system uses a common response envelope and a mock-first workflow so adapter behavior can be exercised locally before connecting to plant equipment, and a deterministic benchmark covering normal, fault-injected, stress, and recovery scenarios evaluates the flagship adapters, comprising 870 runs (480 normal, 210 fault-injected, 120 stress, 60 recovery trials) and 2820 tool calls across 7 fault scenarios and 12 stress scenarios, where the normal suite achieved full success, the fault suite confirmed structured error handling with adapter-level uint16 range validation, the stress suite identified concurrency boundaries, and same-session recovery after endpoint restart is demonstrated for all three protocols, with results providing evidence spanning adapter correctness, concurrency behavior, and structured error handling for AI-assisted industrial operations.

[44]  arXiv:2603.24714 (cross-list from cs.LG) [pdf, ps, other]
Title: Can an Actor-Critic Optimization Framework Improve Analog Design Optimization?
Comments: 7 pages, 5 figures
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)

Analog design often slows down because even small changes to device sizes or biases require expensive simulation cycles, and high-quality solutions typically occupy only a narrow part of a very large search space. While existing optimizers reduce some of this burden, they largely operate without the kind of judgment designers use when deciding where to search next. This paper presents an actor-critic optimization framework (ACOF) for analog sizing that brings that form of guidance into the loop. Rather than treating optimization as a purely black-box search problem, ACOF separates the roles of proposal and evaluation: an actor suggests promising regions of the design space, while a critic reviews those choices, enforces design legality, and redirects the search when progress is hampered. This structure preserves compatibility with standard simulator-based flows while making the search process more deliberate, stable, and interpretable. Across our test circuits, ACOF improves the top-10 figure of merit by an average of 38.9% over the strongest competing baseline and reduces regret by an average of 24.7%, with peak gains of 70.5% in FoM and 42.2% lower regret on individual circuits. By combining iterative reasoning with simulation-driven search, the framework offers a more transparent path toward automated analog sizing across challenging design spaces.

[45]  arXiv:2603.24733 (cross-list from cs.CV) [pdf, ps, other]
Title: OpenCap Monocular: 3D Human Kinematics and Musculoskeletal Dynamics from a Single Smartphone Video
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)

Quantifying human movement (kinematics) and musculoskeletal forces (kinetics) at scale, such as estimating quadriceps force during a sit-to-stand movement, could transform prediction, treatment, and monitoring of mobility-related conditions. However, quantifying kinematics and kinetics traditionally requires costly, time-intensive analysis in specialized laboratories, limiting clinical translation. Scalable, accurate tools for biomechanical assessment are needed. We introduce OpenCap Monocular, an algorithm that estimates 3D skeletal kinematics and kinetics from a single smartphone video. The method refines 3D human pose estimates from a monocular pose estimation model (WHAM) via optimization, computes kinematics of a biomechanically constrained skeletal model, and estimates kinetics via physics-based simulation and machine learning. We validated OpenCap Monocular against marker-based motion capture and force plate data for walking, squatting, and sit-to-stand tasks. OpenCap Monocular achieved low kinematic error (4.8{\deg} mean absolute error for rotational degrees of freedom; 3.4 cm for pelvis translations), outperforming a regression-only computer vision baseline by 48% in rotational accuracy (p = 0.036) and 69% in translational accuracy (p < 0.001). OpenCap Monocular also estimated ground reaction forces during walking with accuracy comparable to, or better than, our prior two-camera OpenCap system. We demonstrate that the algorithm estimates important kinetic outcomes with clinically meaningful accuracy in applications related to frailty and knee osteoarthritis, including estimating knee extension moment during sit-to-stand transitions and knee adduction moment during walking. OpenCap Monocular is deployed via a smartphone app, web app, and secure cloud computing (https://opencap.ai), enabling free, accessible single-smartphone biomechanical assessments.

[46]  arXiv:2603.24795 (cross-list from math.OC) [pdf, ps, other]
Title: Structure, Analysis, and Synthesis of First-Order Algorithms
Comments: 72 pages, 27 figures, 6 Tables
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

Optimization algorithms can be interpreted through the lens of dynamical systems as the interconnection of linear systems and a set of subgradient nonlinearities. This dynamical systems formulation allows for the analysis and synthesis of optimization algorithms by solving robust control problems. In this work, we use the celebrated internal model principle in control theory to structurally factorize convergent composite optimization algorithms into suitable network-dependent internal models and core subcontrollers. As the key benefit, we reveal that this permits us to synthesize optimization algorithms even if information is transmitted over networks featuring dynamical phenomena such as time delays, channel memory, or crosstalk. Design of these algorithms is achieved under bisection in the exponential convergence rate either through a nonconvex local search or by alternation of convex semidefinite programs. We demonstrate factorization of existing optimization algorithms and the automated synthesis of new optimization algorithms in the networked setting.

[47]  arXiv:2603.24959 (cross-list from cs.RO) [pdf, ps, other]
Title: Wireless bioelectronics for untethered biohybrid robots
Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

Biohybrid robots integrate living tissues with engineered artificial structures to achieve organism-inspired actuation and behavior. A persistent challenge is delivering stimulation and control signals without relying on tethered wiring or bulky hardware immersed in cell-culture media. Wireless bioelectronics addresses this limitation by enabling the remote transfer of control signals, typically via radio-frequency magnetic fields, to locally stimulate muscle tissues at tissue-electrode interfaces. In parallel, wireless optoelectronics enables remote control of optogenetically modified, muscle-based robots by embedding light emitters that initiate muscle actuation through light-gated ion channels. Further advances incorporate neuromuscular junctions, leveraging biological signal transduction to enable selective control of multiple actuators through wireless frequency- and time-division multiplexing. This perspective article summarizes recent advances in control strategies for biohybrid robots, namely, wireless electrical stimulation, wireless optical stimulation, and neuromuscular integration. Then this describes cross-cutting design principles and highlights a future direction, namely, co-integration of neural organoid-bioelectronics toward autonomous, closed-loop biohybrid robots.

[48]  arXiv:2603.25139 (cross-list from cs.RO) [pdf, ps, other]
Title: Dissimilarity-Based Persistent Coverage Control of Multi-Robot Systems for Improving Solar Irradiance Prediction Accuracy in Solar Thermal Power Plants
Comments: 8 pages, 6 figures, 5 tables
Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

Accurate forecasting of future solar irradiance is essential for the effective control of solar thermal power plants. Although various kriging-based methods have been proposed to address the prediction problem, these methods typically do not provide an appropriate sampling strategy to dynamically position mobile sensors for optimizing prediction accuracy in real time, which is critical for achieving accurate forecasts with a minimal number of sensors. This paper introduces a dissimilarity map derived from a kriging model and proposes a persistent coverage control algorithm that effectively guides agents toward regions where additional observations are required to improve prediction performance. By means of experiments using mobile robots, the proposed approach was shown to obtain more accurate predictions than the considered baselines under various emulated irradiance fields.

[49]  arXiv:2603.25216 (cross-list from cs.NI) [pdf, ps, other]
Title: A Wireless World Model for AI-Native 6G Networks
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)

Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approaches struggle to generalize across dynamic environments because they lack an intrinsic understanding of electromagnetic wave propagation. We introduce the Wireless World Model (WWM), a multi-modal foundation framework predicting the spatiotemporal evolution of wireless channels by internalizing the causal relationship between 3D geometry and signal dynamics. Pre-trained on a massive ray-traced multi-modal dataset, WWM overcomes the data authenticity gap, further validated under real-world measurement data. Using a joint-embedding predictive architecture with a multi-modal mixture-of-experts Transformer, WWM fuses channel state information, 3D point clouds, and user trajectories into a unified representation. Across the five key downstream tasks supported by WWM, it achieves remarkable performance in seen environments, unseen generalization scenarios, and real-world measurements, consistently outperforming SOTA uni-modal foundation models and task-specific models. This paves the way for physics-aware 6G intelligence that adapts to the physical world.

[50]  arXiv:2603.25259 (cross-list from cs.RO) [pdf, ps, other]
Title: A Minimum-Energy Control Approach for Redundant Mobile Manipulators in Physical Human-Robot Interaction Applications
Subjects: Robotics (cs.RO); Systems and Control (eess.SY)

Research on mobile manipulation systems that physically interact with humans has expanded rapidly in recent years, opening the way to tasks which could not be performed using fixed-base manipulators. Within this context, developing suitable control methodologies is essential since mobile manipulators introduce additional degrees of freedom, making the design of control approaches more challenging and more prone to performance optimization. This paper proposes a control approach for a mobile manipulator, composed of a mobile base equipped with a robotic arm mounted on the top, with the objective of minimizing the overall kinetic energy stored in the whole-body mobile manipulator in physical human-robot interaction applications. The approach is experimentally tested with reference to a peg-in-hole task, and the results demonstrate that the proposed approach reduces the overall kinetic energy stored in the whole-body robotic system and improves the system performance compared with the benchmark method.

[51]  arXiv:2603.25276 (cross-list from math.DS) [pdf, ps, other]
Title: Global Stability Analysis of the Age-Structured Chemostat With Substrate Dynamics
Comments: 46 pages
Subjects: Dynamical Systems (math.DS); Systems and Control (eess.SY); Optimization and Control (math.OC); Populations and Evolution (q-bio.PE)

In this paper we study the stability properties of the equilibrium point for an age-structured chemostat model with renewal boundary condition and coupled substrate dynamics under constant dilution rate. This is a complex infinite-dimensional feedback system. It has two feedback loops, both nonlinear. A positive static loop due to reproduction at the age-zero boundary of the PDE, counteracted and dominated by a negative dynamic loop with the substrate dynamics. The derivation of explicit sufficient conditions that guarantee global stability estimates is carried out by using an appropriate Lyapunov functional. The constructed Lyapunov functional guarantees global exponential decay estimates and uniform global asymptotic stability with respect to a measure related to the Lyapunov functional. From a biological perspective, stability arises because reproduction is constrained by substrate availability, while dilution, mortality, and substrate depletion suppress transient increases in biomass before age-structure effects can amplify them. The obtained results are applied to a chemostat model from the literature, where the derived stability condition is compared with existing results that are based on (necessarily local) linearization methods.

[52]  arXiv:2603.25288 (cross-list from cs.IT) [pdf, ps, other]
Title: CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning
Comments: 14 pages, 9 figures
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Signal Processing (eess.SP)

Low-altitude communications can promote the integration of aerial and terrestrial wireless resources, expand network coverage, and enhance transmission quality, thereby empowering the development of sixth-generation (6G) mobile communications. As an enabler for low-altitude transmission, 3D channel fingerprints (3D-CF), also referred to as the 3D radio map or 3D channel knowledge map, are expected to enhance the understanding of communication environments and assist in the acquisition of channel state information (CSI), thereby avoiding repeated estimations and reducing computational complexity. In this paper, we propose a modularized multimodal framework to construct 3D-CF. Specifically, we first establish the 3D-CF model as a collection of CSI-tuples based on Rician fading channels, with each tuple comprising the low-altitude vehicle's (LAV) positions and its corresponding statistical CSI. In consideration of the heterogeneous structures of different prior data, we formulate the 3D-CF construction problem as a multimodal regression task, where the target channel information in the CSI-tuple can be estimated directly by its corresponding LAV positions, together with communication measurements and geographic environment maps. Then, a high-efficiency multimodal framework is proposed accordingly, which includes a correlation-based multimodal fusion (Corr-MMF) module, a multimodal representation (MMR) module, and a CSI regression (CSI-R) module. Numerical results show that our proposed framework can efficiently construct 3D-CF and achieve at least 27.5% higher accuracy than the state-of-the-art algorithms under different communication scenarios, demonstrating its competitive performance and excellent generalization ability. We also analyze the computational complexity and illustrate its superiority in terms of the inference time.

[53]  arXiv:2603.25308 (cross-list from physics.flu-dyn) [pdf, ps, other]
Title: Real-time control of multiphase processes with learned operators
Subjects: Fluid Dynamics (physics.flu-dyn); Systems and Control (eess.SY)

Multiphase flows frequently occur naturally and in manufactured devices. Controlling such phenomena is extremely challenging due to the strongly non-linear dynamics, rapid phase transitions, and the limited spatial and temporal resolution of available sensors, which can lead to significant inaccuracies in predicting and managing these flows. In most cases, numerical models are the only way to access high spatial and temporal resolution data to an extent that allows for fine control. While embedding numerical models in control algorithms could enable fine control of multiphase processes, the significant computational burden currently limits their practical application. This work proposes a surrogate-assisted model predictive control (MPC) framework for regulating multiphase processes using learned operators. A Fourier Neural Operator (FNO) is trained to forecast the spatiotemporal evolution of a phase-indicator field (the volume fraction) over a finite horizon from a short history of recent states and a candidate actuation signal. The neural operator surrogate is then iteratively called during the optimisation process to identify the optimal control variable. To illustrate the approach, we solve an optimal control problem (OCP) on a two-phase Eulerian bubble column. Here, the controller tracks piecewise-constant liquid level setpoints by adjusting the gas flow rate introduced into the system. The results we obtained indicate that field-level forecasting with FNOs are well suited for closed-loop optimization since they have relatively low evaluation cost. The latter provide a practical route toward MPC for fast multiphase unit operations and a foundation for future extensions to partial observability and physics-informed operator learning.

[54]  arXiv:2603.25351 (cross-list from cs.CV) [pdf, ps, other]
Title: Image Rotation Angle Estimation: Comparing Circular-Aware Methods
Comments: 7 pages, 3 figures, 2 tables. Under review at Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Automatic image rotation estimation is a key preprocessing step in many vision pipelines. This task is challenging because angles have circular topology, creating boundary discontinuities that hinder standard regression methods. We present a comprehensive study of five circular-aware methods for global orientation estimation: direct angle regression with circular loss, classification via angular binning, unit-vector regression, phase-shifting coder, and circular Gaussian distribution. Using transfer learning from ImageNet-pretrained models, we systematically evaluate these methods across sixteen modern architectures by adapting their output heads for rotation-specific predictions. Our results show that probabilistic methods, particularly the circular Gaussian distribution, are the most robust across architectures, while classification achieves the best accuracy on well-matched backbones but suffers training instabilities on others. The best configuration (classification with EfficientViT-B3) achieves a mean absolute error (MAE) of 1.23{\deg} (mean across five independent runs) on the DRC-D dataset, while the circular Gaussian distribution with MambaOut Base achieves a virtually identical 1.24{\deg} with greater robustness across backbones. Training and evaluating our top-performing method-architecture combinations on COCO 2014, the best configuration reaches 3.71{\deg} MAE, improving substantially over prior work, with further improvement to 2.84{\deg} on the larger COCO 2017 dataset.

[55]  arXiv:2603.25510 (cross-list from cs.CV) [pdf, ps, other]
Title: Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)

The use of hyperspectral imaging (HSI) in autonomous driving (AD), while promising, faces many challenges related to the specifics and requirements of this application domain. On the one hand, non-controlled and variable lighting conditions, the wide depth-of-field ranges, and dynamic scenes with fast-moving objects. On the other hand, the requirements for real-time operation and the limited computational resources of embedded platforms. The combination of these factors determines both the criteria for selecting appropriate HSI technologies and the development of custom vision algorithms that leverage the spectral and spatial information obtained from the sensors. In this article, we analyse several techniques explored in the research of HSI-based vision systems with application to AD, using as an example results obtained from experiments using data from the most recent version of the HSI-Drive dataset.

[56]  arXiv:2603.25559 (cross-list from cs.IT) [pdf, ps, other]
Title: Rotatable Antenna-Empowered Wireless Networks: A Tutorial
Comments: The first tutorial on rotatable antenna (RA)-empowered wireless networks, 34 pages, 20 figures
Subjects: Information Theory (cs.IT); Emerging Technologies (cs.ET); Signal Processing (eess.SP)

Non-fixed flexible antenna architectures, such as fluid antenna system (FAS), movable antenna (MA), and pinching antenna, have garnered significant interest in recent years. Among them, rotatable antenna (RA) has emerged as a promising technology for enhancing wireless communication and sensing performance through flexible antenna orientation/boresight rotation. By enabling mechanical or electronic boresight adjustment without altering physical antenna positions, RA introduces additional spatial degrees of freedom (DoFs) beyond conventional beamforming. In this paper, we provide a comprehensive tutorial on the fundamentals, architectures, and applications of RA-empowered wireless networks. Specifically, we begin by reviewing the historical evolution of RA-related technologies and clarifying the distinctive role of RA among flexible antenna architectures. Then, we establish a unified mathematical framework for RA-enabled systems, including general antenna/array rotation models, as well as channel models that cover near- and far-field propagation characteristics, wideband frequency selectivity, and polarization effects. Building upon this foundation, we investigate antenna/array rotation optimization in representative communication and sensing scenarios. Furthermore, we examine RA channel estimation/acquisition strategies encompassing orientation scheduling mechanisms and signal processing methods that exploit multi-view channel observations. Beyond theoretical modeling and algorithmic design, we discuss practical RA configurations and deployment strategies. We also present recent RA prototypes and experimental results that validate the practical performance gains enabled by antenna rotation. Finally, we highlight promising extensions of RA to emerging wireless paradigms and outline open challenges to inspire future research.

Replacements for Fri, 27 Mar 26

[57]  arXiv:2006.09534 (replaced) [pdf, ps, other]
Title: Discriminative reconstruction via simultaneous dense and sparse coding
Comments: 27 pages. Made changes to improve the clarity and presentation of the paper
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[58]  arXiv:2401.12546 (replaced) [src]
Title: On Building Myopic MPC Policies using Supervised Learning
Comments: Updated version available as arXiv:2508.05804
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
[59]  arXiv:2412.06327 (replaced) [pdf, ps, other]
Title: Control of Human-Induced Seismicity in Underground Reservoirs Governed by a Nonlinear 3D PDE-ODE System
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[60]  arXiv:2502.05228 (replaced) [pdf, ps, other]
Title: Physics-Informed Evolution: An Evolutionary Framework for Solving Quantum Control Problems Involving the Schrödinger Equation
Comments: 17 pages, 4 figures
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[61]  arXiv:2503.10900 (replaced) [pdf, ps, other]
Title: Planning Future Microgrids with Second-Life Batteries: A Degradation-Aware Iterative Optimization Framework
Subjects: Systems and Control (eess.SY)
[62]  arXiv:2505.16662 (replaced) [pdf, ps, other]
Title: Joint Magnetometer-IMU Calibration via Maximum A Posteriori Estimation
Comments: Latest version
Subjects: Robotics (cs.RO); Signal Processing (eess.SP)
[63]  arXiv:2507.09855 (replaced) [pdf, ps, other]
Title: Optimal Satellite Constellation Configuration Design: A Collection of Mixed Integer Linear Programs
Comments: 42 pages, Journal of Spacecraft and Rockets (Published)
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
[64]  arXiv:2507.14237 (replaced) [pdf, other]
Title: U-DREAM: Unsupervised Dereverberation guided by a Reverberation Model
Authors: Louis Bahrman (IDS, S2A), Marius Rodrigues (IDS, S2A), Mathieu Fontaine (IDS, S2A), Gaël Richard (IDS, S2A)
Journal-ref: IEEE Transactions on Audio, Speech and Language Processing, 2026, 34, pp.1552-1563
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[65]  arXiv:2508.00307 (replaced) [pdf, ps, other]
Title: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD); Signal Processing (eess.SP)
[66]  arXiv:2508.03983 (replaced) [pdf, ps, other]
Title: MiDashengLM: Efficient Audio Understanding with General Audio Captions
Comments: Added ACAVCaps reference (ICASSP 2026)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[67]  arXiv:2508.14185 (replaced) [pdf, ps, other]
Title: Lightweight Tracking Control for Computationally Constrained Aerial Systems with the Newton-Raphson Method
Subjects: Robotics (cs.RO); Systems and Control (eess.SY); Optimization and Control (math.OC)
[68]  arXiv:2509.00956 (replaced) [pdf, ps, other]
Title: On the Global Optimality of Linear Policies for Sinkhorn Distributionally Robust Linear Quadratic Control
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[69]  arXiv:2509.01388 (replaced) [pdf, ps, other]
Title: End-to-End Low-Level Neural Control of an Industrial-Grade 6D Magnetic Levitation System
Comments: 8 pages, 7 figures, 2 tables
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[70]  arXiv:2509.15917 (replaced) [pdf, ps, other]
Title: An MPC framework for efficient navigation of mobile robots in cluttered environments
Comments: - Code available at: this https URL - Supplementary video: this https URL
Subjects: Robotics (cs.RO); Systems and Control (eess.SY); Optimization and Control (math.OC)
[71]  arXiv:2510.04900 (replaced) [pdf, ps, other]
Title: Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models
Comments: Number of pages: 13 Number of figures: 16 Number of Tables: 1
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[72]  arXiv:2510.10215 (replaced) [pdf, ps, other]
Title: Bounds of Validity for Bifurcations of Equilibria in a Class of Networked Dynamical Systems
Comments: This manuscript has been accepted to the 2026 American Control Conference taking place in New Orleans, Louisiana, in May 2026
Subjects: Systems and Control (eess.SY)
[73]  arXiv:2510.18474 (replaced) [pdf, ps, other]
Title: Designing trajectories in the Earth-Moon system: a Levenberg-Marquardt approach
Comments: Preprint submitted to Acta Astronautica
Subjects: Optimization and Control (math.OC); Earth and Planetary Astrophysics (astro-ph.EP); Systems and Control (eess.SY)
[74]  arXiv:2510.25562 (replaced) [pdf, ps, other]
Title: Deep Reinforcement Learning-Based Cooperative Rate Splitting for Satellite-to-Underground Communication Networks
Comments: 6 pages, 3 figures, 1 table, and submitted to IEEE TVT
Subjects: Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP); Systems and Control (eess.SY)
[75]  arXiv:2511.06832 (replaced) [pdf, ps, other]
Title: Learning stabilising policies for constrained nonlinear systems
Comments: 3 figures
Subjects: Systems and Control (eess.SY)
[76]  arXiv:2511.07711 (replaced) [pdf, ps, other]
Title: Geometric Conditions for Lossless Convexification in Linear Optimal Control with Discrete-Valued Inputs
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
[77]  arXiv:2511.08499 (replaced) [pdf, ps, other]
Title: Approaching Safety-Argumentation-by-Design: A Requirement-based Safety Argumentation Life Cycle for Automated Vehicles
Subjects: Systems and Control (eess.SY)
[78]  arXiv:2511.22839 (replaced) [pdf, ps, other]
Title: Can industrial overcapacity enable seasonal flexibility in electricity use? A case study of aluminum smelting in China
Comments: Submitted to Nature Energy
Subjects: Physics and Society (physics.soc-ph); General Economics (econ.GN); Systems and Control (eess.SY)
[79]  arXiv:2512.00435 (replaced) [pdf, ps, other]
Title: Rotatable Antenna-array-enhanced Direction-sensing for Low-altitude Communication Network: Method and Performance
Comments: 10 pages, 16 figures
Subjects: Signal Processing (eess.SP)
[80]  arXiv:2512.00462 (replaced) [pdf, ps, other]
Title: Distributionally Robust Acceleration Control Barrier Filter for Efficient UAV Obstacle Avoidance
Comments: This work has been accepted for publication in IEEE RA-L
Subjects: Systems and Control (eess.SY)
[81]  arXiv:2512.06617 (replaced) [pdf, ps, other]
Title: Teaching large language models to see in radar: aspect-distributed prototypes for few-shot HRRP ATR
Comments: This paper is a preprint of a paper submitted to the IET International Radar Conference (IRC 2025) and is subject to Institution of Engineering and Technology Copyright. If accepted, the copy of recordwill be available at IET Digital Library
Subjects: Signal Processing (eess.SP)
[82]  arXiv:2512.18356 (replaced) [pdf, ps, other]
Title: Robust H2/H-infinity control under stochastic requirements: minimizing conditional value-at-risk instead of worst-case performance
Comments: Authors version. Published version (IEEE Control systems letters) available at: this https URL
Journal-ref: E. Kassarian, F. Sanfedino, D. Alazard and A. Marrazza, "Robust H-infinity control under stochastic requirements: minimizing conditional value-at-risk instead of worst-case performance," in IEEE Control Systems Letters, 2026
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[83]  arXiv:2601.03944 (replaced) [pdf, ps, other]
Title: ASVspoof 5: Evaluation of Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
Comments: This work has been submitted to the IEEE TASLP for possible publication
Subjects: Signal Processing (eess.SP); Sound (cs.SD)
[84]  arXiv:2602.23765 (replaced) [pdf, ps, other]
Title: DashengTokenizer: One layer is enough for unified audio understanding and generation
Comments: Added ACAVCaps reference
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[85]  arXiv:2603.00141 (replaced) [pdf, ps, other]
Title: From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
Comments: Accepted to the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[86]  arXiv:2603.17499 (replaced) [pdf, ps, other]
Title: A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awarenes
Subjects: Systems and Control (eess.SY); Signal Processing (eess.SP)
[87]  arXiv:2603.18865 (replaced) [pdf, ps, other]
Title: RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[88]  arXiv:2603.22445 (replaced) [pdf, ps, other]
Title: Finite-time Convergent Control Barrier Functions with Feasibility Guarantees
Subjects: Systems and Control (eess.SY)
[ total of 88 entries: 1-88 ]
[ showing up to 2000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, recent, 2603, contact, help  (Access key information)