| 引用本文: |
-
叶子绿,许魁,周涛,等.可移动阵元STAR-RIS辅助抗干扰传输:基于DDPG算法的双尺度优化[J].电讯技术,2026,66(6): - . [点击复制]
- YE Zilü,XU Kui,ZHOU Tao,et al.Movable Element STAR-RIS Assisted Anti-jamming Transmission: Two-timescale Optimization via DDPG Algorithm[J].,2026,66(6): - . [点击复制]
|
|
| 摘要: |
| 针对可移动阵元同时透射和反射可重构智能表面(Movable Elements Based Simultaneous Transmitting and Reflecting Reconfigurable Intelligent Surface,ME-STAR-RIS)辅助抗干扰系统中信道估计开销巨大的问题,提出一种基于深度确定性策略梯度(Deep Deterministic Policy Gradient,DDPG)算法的双尺度协同优化抗干扰传输方法。首先利用统计信道状态信息(Channel State Information,CSI)优化长时阵元位置,再基于优化后的阵元位置估计瞬时CSI,进而优化短时波束成形。为解决高维连续状态空间以及阵元位置、相移系数等连续动作空间带来的优化难题,引入DDPG算法实现动态策略学习。仿真结果表明,所以方法相较于瞬时CSI联合优化方案虽存在约15 b/s/Hz的性能损失,但显著降低了信道估计开销。 |
| 关键词: 可重构智能表面 ME-STAR-RIS 抗干扰传输 统计信道状态信息 双尺度优化 深度确定性策略梯度算法 |
| DOI:10.20079/j.issn.1001-893x.251012004 |
|
| 基金项目:国家自然科学基金面上项目(62471488,61671472) |
|
| Movable Element STAR-RIS Assisted Anti-jamming Transmission: Two-timescale Optimization via DDPG Algorithm |
| YE Zilü,XU Kui,ZHOU Tao,ZENG Mingcong,ZHANG Beihua |
| (College of Communication Engineering,Army Engineering University of PLA,Nanjing 210007,China) |
| Abstract: |
| For the problem of excessive channel estimation overhead in the anti-jamming system assisted by movable elements based simultaneous transmitting and reflecting reconfigurable intelligent surface (ME-STAR-RIS),a two-timescale collaborative optimization method for anti-jamming transmission is proposed by leveraging the deep deterministic policy gradient (DDPG) algorithm.First,long-term element positions are optimized using statistical channel state information (CSI),then the instantaneous CSI is estimated based on the optimized element positions,which further enables the optimization of short-term beamforming.To address the optimization challenges posed by high-dimensional continuous state spaces and continuous action spaces (such as element positions and phase shift coefficients),the DDPG algorithm is introduced to achieve dynamic policy learning.Simulation results demonstrate that although this method incurs a performance loss of approximately 1.5 b/s/Hz compared with the joint optimization scheme based on instantaneous CSI,it significantly reduces the channel estimation overhead. |
| Key words: IRS ME-STAR-RIS anti-jamming transmitting statistical CSI two-timescale optimization DDPG algorithm |