PhD Defense: Machine Learning-based Efficient Resource Scheduling for Future Wireless Communication Networks

We’re happy to welcome you to the PhD defence of Yuan Yaxiong (SigCom group) on 13 May 2022 at 11:00.

The event will take place digitally on WebEx. Click here to join.

Please be informed that Webex events are not accessible on Linux OS.

Members of the defense committee:

Prof. Dr. Björn Ottersten, University of Luxembourg, Chairman
Prof. Dr Lei Lei, Xi’an Jiaotong University, China, Deputy Chairman
Prof. Dr. Symeon Chatzinotas, University of Luxembourg, Supervisor
Prof. Dr. Arumugam Nallanathan, Queen Mary University of London, United Kingdom, Member
Prof. Dr. Osvaldo Simeone, King’s London College London, United Kingdom, member

Abstract:

The next-generation mobile communication system, e.g., 6G communication system, is envisioned to support unprecedented performance requirements such as exponentially increased data requests, heterogeneous service demands, and massive connectivity. When these challenging tasks meet the scarcity of wireless resources, efﬁcient resource management becomes increasingly important. Conventionally, optimization algorithms, either optimal or suboptimal, are the major approaches in the toolbox for solving resource allocation problems. However, the efﬁciency of these iterative optimization algorithms can signiﬁcantly degrade when the problems become large or difﬁcult, e.g., non-convex or combinatorial optimization problems. Over the past few years, machine learning (ML), as an emerging approach in the toolbox, is widely investigated to accelerate the decision-making process. Since applying ML-based approaches to solve complex resource management problems is in its early-stage study, many open issues and challenges need to be solved towards maturity and practical applications. This dissertation aims at enriching this line of studies. The motivation and objective of this dissertation lie in investigating and providing answers to the following research questions: 1) How to overcome the shortcomings of extensively adopted end-to-end learning in addressing resource management problems, and which types of features are suited to be learned if supervised learning is applied? 2) What are the limitations and beneﬁts when widely-used deep reinforcement learning (DRL) approaches are used to address constrained and combinatorial optimization problems in wireless networks, and are any tailored solutions to overcome the inherent drawbacks? 3) How to make ML-based approaches timely adapt to dynamic and complex wireless environments? 4) How to enlarge the performance gains when the paradigm shifts from centralized learning to distributed learning? The main contributions are organized by the following four research works.

Firstly, from a supervised-learning perspective, we address common issues, e.g., unsatisfactory pre- diction performance and resultant infeasible solutions, when end-to-end learning approaches apply to solve resource scheduling problems. Based on the analysis of optimal results, we design suited-to-learn features for a class of resource scheduling problems and develop combined learning-and-optimization approaches to enable time-efﬁcient and energy-efﬁcient resource scheduling in multi-antenna systems. The original optimization problems are mixed-integer programming problems with high-dimensional decision vectors. The optimal solution requires exponential complexity due to the inherent difﬁculties of the problems. Towards an efﬁcient and competitive solution, we apply a fully-connected deep neural network (DNN) and convolutional neural network (CNN) to learn the designed features. The predicted information can effectively reduce the large search space and accelerate the optimization process. Compare to the conventional optimization and pure ML algorithms, the proposed method achieves a good trade-off between quality and complexity.

Secondly, we address typical issues when DRL is adopted to deal with combinatorial and non-convex scheduling problems. The original problem is to provide energy-saving solutions via resource scheduling in energy-constrained networks. An optimal algorithm and a golden section search suboptimal approach are developed to serve as ofﬂine benchmarks. For online operations, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm. Compared to supervised learning, DRL is suitable for dynamic environments and capable of making decisions based on the current state without an ofﬂine training phase. However, for the speciﬁc constrained scheduling problem, conventional DRL may not be able to handle two major issues of exponentially-increased action space and infeasible actions. The proposed AC-DSOS is developed to overcome these drawbacks. In simulations, AC-DSOS is able to provide feasible solutions and save more energy compared to the conventional DRL algorithms. Compared to the ofﬂine benchmarks, AC-DSOS reduces the computational time from second-level to millisecond-level.

Thirdly, the dissertation pays more attention to the performance of the ML-based approaches in highly dynamic and complex environments. Most of the ML models are trained by the collected data or the observed environments. They may not be able to timely respond to the large variations of environments, such as dramatically ﬂuctuated channel states or bursty data demands. In this work, we develop ML-based approaches in a time-varying satellite-terrestrial network, and address two practical issues. The ﬁrst is how to efﬁciently schedule resources to serve the massive number of connected users, such that more data and users can be delivered/served. The second is how to make the algorithmic solution more resilient in adapting to the time-varying wireless environments. We propose an enhanced meta-critic learning (EMCL) algorithm, combining a DRL model with a meta-learning technique, where the meta-learning can acquire meta-knowledge from different tasks and fast adapt to the new task. The results demonstrate EMCL’s effectiveness and fast-response capabilities in over-loaded systems and in adapting to dynamic environments compare to previous actor-critic and meta-learning methods.

Fourthly, the dissertation focuses on reducing the energy consumption for federated learning (FL), in mobile edge computing. The power supply and computation capabilities are typically limited in edge devices, then energy becomes a critical issue in FL. We propose a joint energy-saving scheme (JESS) to jointly reduce computational and transmission energy. In the ﬁrst part of JESS, we introduce sparsity and adopt sparse or binary neural networks (SNN or BNN) as the learning model to complete the local training tasks at the devices. Compared to fully-connected DNN, the computational operations can be signiﬁcantly reduced, thus enabling less energy consumption and fewer transmitted data to the central node. In the second part, we develop an efﬁcient scheduling scheme to minimize the overall transmission energy by optimizing wireless resources and learning parameters. We develop an enhanced FL algorithm in JESS, i.e., non-smoothness and constraints – stochastic gradient descent, to handle the non-smoothness and constraints of SNN and BNN, and provide guarantees for convergence.

Finally, we conclude the thesis with the main ﬁndings and insights on future research directions.

Partager ce contenu