TY - GEN
T1 - Multi-objective reinforcement learning-based deep neural networks for cognitive space communications
AU - Ferreira, Paulo Victor R.
AU - Paffenroth, Randy
AU - Wyglinski, Alexander M.
AU - Hackett, Timothy M.
AU - Bilén, Sven G.
AU - Reinhart, Richard C.
AU - Mortensen, Dale J.
N1 - Funding Information:
This work was partially supported by: NASA John H. Glenn Research Center, grant number NNC14AA01A; NASA Space Technology Research Fellowship, grant number NNX15AQ41H; and CAPES Science without Borders scholarship, grant number BEX 18701/12-4.
Publisher Copyright:
© 2017 IEEE.
PY - 2017/8/3
Y1 - 2017/8/3
N2 - Future communication subsystems of space exploration missions can potentially benefit from software-defined radios (SDRs) controlled by machine learning algorithms. In this paper, we propose a novel hybrid radio resource allocation management control algorithm that integrates multi-objective reinforcement learning and deep artificial neural networks. The objective is to efficiently manage communications system resources by monitoring performance functions with common dependent variables that result in conflicting goals. The uncertainty in the performance of thousands of different possible combinations of radio parameters makes the trade-off between exploration and exploitation in reinforcement learning (RL) much more challenging for future critical space-based missions. Thus, the system should spend as little time as possible on exploring actions, and whenever it explores an action, it should perform at acceptable levels most of the time. The proposed approach enables on-line learning by interactions with the environment and restricts poor resource allocation performance through 'virtual environment exploration'. Improvements in the multi-objective performance can be achieved via transmitter parameter adaptation on a packet-basis, with poorly predicted performance promptly resulting in rejected decisions. Simulations presented in this work considered the DVB-S2 standard adaptive transmitter parameters and additional ones expected to be present in future adaptive radio systems. Performance results are provided by analysis of the proposed hybrid algorithm when operating across a satellite communication channel from Earth to GEO orbit during clear sky conditions. The proposed approach constitutes part of the core cognitive engine proof-of-concept to be delivered to the NASA Glenn Research Center SCaN Testbed located on-board the International Space Station.
AB - Future communication subsystems of space exploration missions can potentially benefit from software-defined radios (SDRs) controlled by machine learning algorithms. In this paper, we propose a novel hybrid radio resource allocation management control algorithm that integrates multi-objective reinforcement learning and deep artificial neural networks. The objective is to efficiently manage communications system resources by monitoring performance functions with common dependent variables that result in conflicting goals. The uncertainty in the performance of thousands of different possible combinations of radio parameters makes the trade-off between exploration and exploitation in reinforcement learning (RL) much more challenging for future critical space-based missions. Thus, the system should spend as little time as possible on exploring actions, and whenever it explores an action, it should perform at acceptable levels most of the time. The proposed approach enables on-line learning by interactions with the environment and restricts poor resource allocation performance through 'virtual environment exploration'. Improvements in the multi-objective performance can be achieved via transmitter parameter adaptation on a packet-basis, with poorly predicted performance promptly resulting in rejected decisions. Simulations presented in this work considered the DVB-S2 standard adaptive transmitter parameters and additional ones expected to be present in future adaptive radio systems. Performance results are provided by analysis of the proposed hybrid algorithm when operating across a satellite communication channel from Earth to GEO orbit during clear sky conditions. The proposed approach constitutes part of the core cognitive engine proof-of-concept to be delivered to the NASA Glenn Research Center SCaN Testbed located on-board the International Space Station.
UR - http://www.scopus.com/inward/record.url?scp=85030265790&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85030265790&partnerID=8YFLogxK
U2 - 10.1109/CCAAW.2017.8001880
DO - 10.1109/CCAAW.2017.8001880
M3 - Conference contribution
AN - SCOPUS:85030265790
T3 - 2017 Cognitive Communications for Aerospace Applications Workshop, CCAA 2017
BT - 2017 Cognitive Communications for Aerospace Applications Workshop, CCAA 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 Cognitive Communications for Aerospace Applications Workshop, CCAA 2017
Y2 - 27 June 2017 through 28 June 2017
ER -