Q-Learning-Based Dynamic Spectrum Access in Cognitive Industrial Internet of Things

Li, Feng; Lam, Kwok-Yan; Sheng, Zhengguo; Zhang, Xinggan; Zhao, Kanglian; Wang, Li

doi:10.1007/s11036-018-1109-9

Open access
Published: 11 September 2018

Volume 23, pages 1636–1644, (2018)

Mobile Networks and Applications

Feng Li^1,2,
Kwok-Yan Lam²,
Zhengguo Sheng ORCID: orcid.org/0000-0003-2143-4003³,
Xinggan Zhang⁴,
Kanglian Zhao⁴ &
…
Li Wang¹

Abstract

In recent years, the Industrial Internet of Things (IIoT) has attracted growing attention from both academia and industry. Meanwhile, when traditional wireless sensor networks are applied to complex industrial fields with high requirements for real-time operation and robustness, how to design an efficient and practical cross-layer transmission mechanism needs to be fully investigated. In this paper, we propose a Q-learning-based dynamic spectrum access method for IIoT, introducing a cognitive self-learning solution to the difficulty of distributed and ordered self-access for unlicensed terminals. We first devise a simplified MAC access protocol for unlicensed users to use a single available channel. Then, a Q-learning-based multi-channel access scheme is proposed for unlicensed users migrating to other lower cells. The channel with the highest Q value is selected. Every mobile terminal stores and updates its own channel list because of the distributed network mode and imperfect sensing ability. Numerical results are provided to evaluate the performance of our proposed method on dynamic spectrum access in IIoT. Our proposed method outperforms the traditional simplified access methods without self-learning capability in channel usage rate and conflict probability.

1 Introduction

In the context of Industry 4.0, the Industrial Internet of Things (IIoT) provides a new driving force for the development of highly efficient, low-energy, flexible and smart factories by introducing sensing capability, cloud computing, intelligent robotics and wireless sensor networks into the modern industrial environment [1,2,3,4]. An inevitable tendency is emerging toward global mobile networks that combine artificial intelligence, automation, warehousing systems and production facilities in the shape of Cyber-Physical Systems as well as cognitive IIoT [5,6,7].

In the process of continuously sensing the industrial field, exchanging control information, self-learning and adapting to dynamic networks, and deciding and performing transmission strategies, plenty of challenges need to be addressed. Many techniques, including intelligent algorithms, deep learning and cognitive radio, have been applied to enhance the robustness, accuracy and efficiency of the IIoT [8,9,10,11]. In particular, many technical solutions adopted in wireless sensor networks can be adapted to the IIoT environment after being revised according to the corresponding new characteristics of IIoT [12,13,14,15,16,17]. In [12], a wireless-sensor-network-based safe navigation scheme for micro flying robots in the IIoT was proposed to detect static and dynamic obstacles in indoor environments. In [13], a three-stage multi-view stacking ensemble machine learning model based on hierarchical time series feature extraction methods was designed to resolve the anomaly detection problem in IIoT. In [14], a multi-level DDoS mitigation framework was devised to defend against DDoS attacks in IIoT, which includes the edge computing level, fog computing level and cloud computing level. In [15], the authors developed an IIoT-based solution to ensure a real-time connection between products and assembly lines. The proposed dynamic cycle time setting method considers the varying complexity of the product on the basis of the real-time information offered by sensor nodes and indoor positioning systems.

Due to the high requirements for reliability and robustness of measurement systems in the industrial field, the network structure of common wireless sensor networks should be refined to fit IIoT [18,19,20]. Too many tiers in IIoT increase the complexity of protocol management and hardware design, yet too few tiers restrict the network's flexibility and application areas. Besides, with the increase of network nodes deployed in the IIoT, how to improve network efficiency and system capacity still needs to be deeply investigated. In [21], the authors presented a three-factor user authentication protocol for wireless sensor networks to overcome the weaknesses of other traditional protocols. The proposed protocol is robust and energy efficient for IoT applications. In [22], to solve the security challenges, the authors explored consortium blockchain technology to build a secure energy trading system denoted as energy blockchain. Besides, a credit-based payment scheme was proposed to support fast and frequent energy trading. In [23], the authors proposed a practical authorization framework on annotated metadata for securing IIoT objects. The method supports multi-dimensional and large data processing with a flexible and efficient authorization model to meet new security requirements for IIoT. In [24], the study designed a resilient section selection mechanism of power fingerprinting applied to device load recognition, so as to determine the transmission time and select the power fingerprinting section to be resiliently transferred. Furthermore, in [25], the authors used an energy-efficient architecture for IIoT, which involves a sense entities domain where huge amounts of energy are consumed by a tremendous number of nodes. Besides, many techniques applied in other networks have been referenced for solving the relevant problems in IIoT [21, 26,27,28,29,30,31,32].

In this paper, we propose a Q-learning-based dynamic spectrum access strategy for IIoT to improve spectrum efficiency and reduce access conflicts. In IIoT, with the increase of system nodes and network complexity, devising an efficient MAC protocol and network structure adapted to the new characteristics of IIoT becomes significant. We consider a multitiered heterogeneous IIoT with lower mesh networks in which a number of small cells perform spectrum sharing to enhance spectrum efficiency. Furthermore, we assume that the spectrum sensing ability of the sensor nodes is not perfect and that all lower-tier nodes operate in distributed mode, so the self-learning function should be fully exploited to dynamically access the shared channels. We first design a self-learning-based MAC protocol for the lower-tier sensor users in IIoT. Then, for the case where many unlicensed users compete to access a limited number of channels, a Q-learning-based spectrum access method is proposed. In the process of channel selection, unlicensed users choose the channel with the highest Q value by using Q-learning.

The main contribution of this paper can be highlighted as follows.

A Q-learning-based self-learning method is introduced to dynamic spectrum access in IIoT, taking into account the complex multitiered structure of the industrial network field.
A distributed dynamic spectrum access strategy is proposed in this paper to decrease conflict probability in mesh-network-based IIoT.
Numerical results are provided in this paper to verify the performance of our proposal. Comparison tests are performed to present the conflict probability and channel usage.

The remainder of this paper is organized as follows. We introduce the system model for dynamic spectrum access in IIoT in Section 2. Section 3 gives the details of our Q-learning-based method. Numerical results are supplied to analyze the performance of the spectrum access strategy in Section 4. Finally, we conclude this paper in Section 5.

2 System model

According to the characteristics of the Industrial Internet of Things (IIoT), we adopt a two-tier architecture in this paper, as shown in Fig. 1. Based on the specific situation of the control objects in the industrial field and the relations among different kinds of industrial devices, the wireless sensor nodes installed in these devices need to be appropriately arranged. In this case, we consider that the sensor nodes in the lower tier form several mesh networks and the cluster heads in the upper tier construct mesh networks or star topology networks. In the lower-tier mesh networks, sensor nodes within one mesh network communicate with the corresponding cluster head in single-hop mode, and the cluster heads connect with each other under the wireless local area network (WLAN) protocol. Generally, sensor nodes communicating with other nodes within the same mesh network do not need to send any information to the corresponding cluster head. The periodic data transmission of detection tasks in the industrial field is assumed to occur mainly within the same cluster. When transmission across different mesh networks is required, the cluster head relays the signal through the upper-tier WLAN protocol to another cluster.

Since most of the transmission tasks involved in the industrial field focus on data acquisition and exchange within each cluster, we suppose the sensor nodes in one cluster constitute a small cell. However, as the number of cells grows, a spectrum sharing mechanism is required to save bandwidth and improve spectrum efficiency. Thus, sensor cells located far apart can share the same band to perform spectrum reuse. When sensor nodes in the IIoT environment move across various lower cells, a proper dynamic spectrum access scheme should be devised to avoid severe inter-cell interference.

In the process of designing a dynamic spectrum access strategy in this heterogeneous IIoT, the following three characteristics should be taken into account.

1. Limited spectrum sensing ability of sensor nodes, which means a node cannot accurately identify whether the current spectrum occupancy is caused by licensed devices within the same cell or by other devices migrating from adjacent cells.
2. The sensor nodes in the lower tier form a mesh network, which means they work in a distributed mode.
3. The sensor nodes cannot gather all the required information from a central controller due to the distributed communication mode.

In this situation, we exploit the intelligence of sensor nodes in IIoT and, based on the Q-Learning algorithm and a cognitive MAC protocol with memory, propose a distributed multi-channel dynamic spectrum access strategy.

For dynamic spectrum access in the heterogeneous mesh networks of IIoT, due to the distributed structure, we assume that the spectrum sensing capability of the sensor nodes is not perfect and that the nodes cannot obtain all the essential information about other nodes from a cluster head acting as a central controller. Thus, we assume the sensor nodes are intelligent devices with self-learning ability that adapt to the dynamic spectrum environment and select a proper channel to access.

In our system model, we treat a node working in its own cell as a licensed user and a node migrating to another mesh cell as an unlicensed user. Hence, the unlicensed users should dynamically access the spectrum in this IIoT. During this process, the strategy of licensed users is that whenever they have a packet to transmit, they initiate their transmission immediately without any consideration of the unlicensed sensor nodes.

The spectrum access strategy of unlicensed terminals is to adopt a slot-memorized MAC protocol which assigns a transmission probability to every potential status of a slot. Thus, we have the function f : y_s → [0,1], where y_s denotes the status set, which can be expressed as y_s = {idle, busy, success, failure}. An unlicensed user whose status was y ∈ y_s in the previous slot transmits data with probability f(y) in the present slot.

We take the non-invasive protocol and fairness definition into account when designing the MAC protocol.

Non-invasive protocol: If f(busy) = 0, then the spectrum access is non-invasive. If an unlicensed terminal complies with the non-invasive protocol, it waits in the slot which follows a busy slot. Hence, under the non-invasive protocol, licensed users that have succeeded in setting up a transmission will not be disturbed by unlicensed users.

Fairness: Define a fairness level 𝜃 ∈ (0,1].

Suppose no licensed transmission is present. Once an unlicensed user succeeds in spectrum access, the probability of successful transmission for this user in the next slot can be denoted as

$$ p_{success}=f(success)(1-f(busy))^{N-1} $$

(1)

where p_success is the probability of successful transmission, f(success) denotes the transmission probability following a successful slot, f(busy) denotes the transmission probability following a busy slot and N denotes the slot number.

Then, the average number of consecutive transmissions for the unlicensed user is

$$ n_{success}= 1/[1-f(success)(1-f(busy))^{N-1}] $$

(2)

When licensed users do not have data to transmit, the average number of consecutive transmissions of an unlicensed user can be written as 1/𝜃, so we define the fairness level 𝜃 as

$$ \theta= 1-f(success)(1-f(busy))^{N-1} $$

(3)

As the fairness level decreases, an unlicensed user has the opportunity to occupy the current channel for longer after a successful transmission, which makes other unlicensed users wait longer to access this channel.
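As a quick numerical check of Eqs. 1 to 3, the following Python sketch evaluates them for given protocol parameters. The function names are illustrative assumptions, and `n` is simply passed through as the N of the equations; reading the exponent N − 1 as the number of other contending unlicensed users is an interpretation, not something the text states.

```python
def p_success(f_success: float, f_busy: float, n: int) -> float:
    """Eq. 1: probability that the same user succeeds again in the next slot."""
    return f_success * (1.0 - f_busy) ** (n - 1)


def fairness_level(f_success: float, f_busy: float, n: int) -> float:
    """Eq. 3: fairness level theta = 1 - p_success."""
    return 1.0 - p_success(f_success, f_busy, n)


def mean_run_length(f_success: float, f_busy: float, n: int) -> float:
    """Eq. 2: average number of consecutive transmissions, 1 / theta.

    Raises ZeroDivisionError when theta == 0, which lies outside the
    admissible range (0, 1] of the fairness level.
    """
    return 1.0 / fairness_level(f_success, f_busy, n)
```

For instance, with f(success) = 0.9 and f(busy) = 0 the fairness level is about 0.1, so a user that has just succeeded keeps the channel for about ten slots on average.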

The cognitive MAC protocol with memory function focuses on the non-invasive mode, which gives priority to the licensed users. The fairness level of a spectrum access protocol is expressed by Eq. 3. Combining the definition of the non-invasive protocol, f(busy) = 0, with Eq. 3, we have

$$ f(success)= 1-\theta $$

(4)
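The step from Eq. 3 to Eq. 4 is a direct substitution of the non-invasive condition f(busy) = 0, written out here for completeness:

$$\begin{array}{@{}rcl@{}} \theta&=&1-f(success)(1-f(busy))^{N-1}\\ &=&1-f(success)(1-0)^{N-1}\\ &=&1-f(success) \end{array} $$

Rearranging the last line gives f(success) = 1 − 𝜃.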

The other factors f(idle) and f(failure) can be denoted by q and r, respectively. The MAC protocol with memory function in fairness level 𝜃 can be depicted as

$$\begin{array}{@{}rcl@{}} f(idle)&=&q, f(busy)= 0\\ f(success)&=&1-\theta, f(failure)=r \end{array} $$

(5)
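The protocol of Eq. 5 can be sketched in Python as a lookup table from the previous slot's status to a transmission probability. This is a minimal illustration under stated assumptions: the names `make_protocol` and `should_transmit` and the use of a seeded `random.Random` are hypothetical choices, not part of the original protocol description.

```python
import random


def make_protocol(theta: float, q: float, r: float) -> dict:
    """Build the transmission-probability table f of Eq. 5."""
    if not 0.0 < theta <= 1.0:
        raise ValueError("theta must lie in (0, 1]")
    for name, p in (("q", q), ("r", r)):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1]")
    # f(busy) = 0 is the non-invasive condition; f(success) = 1 - theta is Eq. 4.
    return {"idle": q, "busy": 0.0, "success": 1.0 - theta, "failure": r}


def should_transmit(protocol: dict, previous_status: str, rng: random.Random) -> bool:
    """Transmit in the current slot with probability f(previous_status)."""
    return rng.random() < protocol[previous_status]
```

Because `rng.random()` returns a value in [0, 1), a user whose previous status was busy never transmits under this table, which is exactly the non-invasive behaviour.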

3 Q-learning-based dynamic spectrum access

The cognitive MAC protocol with memory function can overcome the limitation of spectrum sensing in the physical layer and increase the channel utilization efficiency. However, when the numbers of channels and users increase significantly, how to perform a more efficient spectrum access scheme in IIoT based on the MAC protocol and the users' self-learning ability still needs to be solved.

To describe the dynamic spectrum access in IIoT in detail, we propose a Q-Learning-based access algorithm whose mechanism is presented in Fig. 2.

In this paper, our proposal focuses on the multi-channel environment and fully takes the sensor nodes' intelligent characteristics into account by using a Q-Learning-based method. Then, combined with the MAC protocol with memory function, sensor nodes can dynamically access the idle channels in IIoT in spectrum sharing mode.

It should be noted that unlicensed users (the users migrating to other lower cells) need a way to avoid the frequent appearance of licensed users (the users working in their own cells). In this work, we introduce a new index to judge whether licensed users have emerged. This index is the number of BUSY statuses observed by unlicensed user i at channel j. We use b_ij to denote the index.

At the beginning of every slot, unlicensed users first analyze all the currently available channels and judge whether the selected channel's BUSY number exceeds the given threshold th_B. The threshold th_B provides a tolerance level by which the unlicensed users decide how many slots they should wait before evading when licensed users emerge. The analytical process is as follows.

If the number of BUSY statuses does not exceed the threshold th_B, the unlicensed user considers that there is no licensed user on the current channel, and spectrum access is allowed for the unlicensed user. Meanwhile, if the previous slot's status is BUSY, the sensor node updates the BUSY number of this channel as b_ij = b_ij + 1.

If the current channel's BUSY number exceeds the threshold th_B, the user determines that a licensed user is very likely to be present on this channel. The unlicensed user should then evade to avoid severe interference and enter the process of channel selection again.
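The two cases above amount to a small per-slot check on the counter b_ij. The following Python sketch illustrates it; the function name `check_channel` and its return convention are assumptions made for illustration only.

```python
from typing import Tuple


def check_channel(busy_count: int, previous_status: str, th_b: int) -> Tuple[bool, int]:
    """Return (stay_on_channel, updated_busy_count) for one user and one channel."""
    if busy_count > th_b:
        # A licensed user is very likely present: evade and reselect a channel.
        return False, busy_count
    if previous_status == "busy":
        # Access is still allowed, but the BUSY slot is counted: b_ij = b_ij + 1.
        busy_count += 1
    return True, busy_count
```

With th_B = 3, for example, a user whose counter has reached 4 leaves the channel, while a user at 3 may stay.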

The algorithm consists of two main parts: channel selection and channel access. Our spectrum selection process is based on the Q-Learning algorithm, and unlicensed users receive the largest delayed award from the optimal channel selection strategy. The principle of our channel selection strategy is that the unlicensed users improve their channel efficiency by learning which channels have given them the most successful accesses.

Each unlicensed user computes its Q value from its own success experience and then predicts the award through the Q value. The Q value denotes the value of a status or action. The Q function Q(s,a) represents the award received by the unlicensed user at status s with action a. The Q value is updated by

$$\begin{array}{@{}rcl@{}} Q_{i}(s,a_{j})[t + 1]&=&Q_{i}(s,a_{j})[t]+\alpha[r(i,j)[t]\\ &&+\gamma V(s)[t + 1]-Q_{i}(s,a_{j})[t]] \end{array} $$

(6)

where Q_i(s,a_j) denotes the Q value function attained when user i adopts the corresponding action (selecting channel j). Here a_j ∈ A, where A denotes the set of actions by which unlicensed users affect the spectrum environment. For the available channels to be selected by unlicensed users, a_j denotes that the user chooses channel j; r(i,j) denotes the award function in the environment after unlicensed user i selects channel j; γ (0 ≤ γ ≤ 1) is the discount factor representing the importance of the anticipated future award relative to the current award; α (0 ≤ α ≤ 1) is the learning rate; and V(s)[t + 1] is the estimated value of the next status, which can be expressed as

$$ V(s)[t + 1]=\max\limits_{b\in A} (Q(s[t + 1],b)) $$

(7)
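A minimal Python sketch of the update in Eqs. 6 and 7 follows. It assumes the single-status setting defined later in this section, so the maximum in Eq. 7 is taken over one row of Q values held by user i; the function name `q_update` and the list representation are illustrative choices rather than the authors' implementation.

```python
from typing import List


def q_update(q_row: List[float], j: int, award: float,
             alpha: float, gamma: float) -> float:
    """Apply Eq. 6 in place; q_row[k] stores Q_i(s, a_k) for user i."""
    v_next = max(q_row)  # Eq. 7: best Q value over all actions b in A
    q_row[j] += alpha * (award + gamma * v_next - q_row[j])
    return q_row[j]
```

Starting from all-zero Q values with α = 0.5 and γ = 0.9, one success on channel 0 raises its Q value to 0.5 and a second success raises it to 0.975, while the Q values of untouched channels stay at zero.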

The action selection strategy obeys the following rule

$$ \pi^{*}(s)=\arg\max\limits_{a\in A} Q(s,a) $$

(8)

Then, we give the definition of status set S, action set A and award function R.

Status set S: S = {s}, where the single status s denotes that the unlicensed users are learning environment information and attempting to access a channel.

Action set A: Optional action set A = {a₁,a₂,⋯ ,a_M}. Choosing action a_i means the unlicensed user selects channel i(i ∈{1,2,⋯ ,M}) as the channel to access.

Award function R: The award value should reflect the learning objective of the proposed algorithm. The target of this paper is to select the channel with the most successful experience; therefore the award function R depends on whether the unlicensed user succeeds in accessing the current channel.

Then, when user i accesses channel j, the award function r(i,j) can be obtained as

$$ r(i,j)=\left\{ \begin{array}{ll} 1, & \text{succeed in accessing;} \\ 0, & \text{wait due to BUSY status;} \\ -1, & \text{unsuccessful.} \end{array} \right. $$

(9)
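The award of Eq. 9 and the greedy selection rule of Eq. 8 (pick the action with the largest Q value) can be sketched together in Python. The dictionary keys and function names are assumptions for illustration; an idle slot has no entry because Eq. 9 does not assign it an award.

```python
from typing import List

# Eq. 9: award for each outcome of an access attempt.
AWARD = {"success": 1.0, "busy": 0.0, "failure": -1.0}


def award(outcome: str) -> float:
    """Return r(i, j) for the outcome observed by user i on channel j."""
    return AWARD[outcome]


def greedy_channel(q_row: List[float]) -> int:
    """Eq. 8: index of the channel with the largest Q value.

    Ties are broken in favour of the lowest channel index, which is an
    arbitrary choice made here; the text does not specify tie-breaking.
    """
    return max(range(len(q_row)), key=q_row.__getitem__)
```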

For the access process, the slot structure of unlicensed users is given in Fig. 3. In one slot, the operation of an unlicensed user is as follows: judge whether it needs to reselect a channel; if it does, proceed to the access decision and judge whether to send data according to our strategy. If the decision is to transmit data, then perform the transmission and collect ACK feedback for continuous sensing in every following slot.

After unlicensed users decide to access the current channel through self-learning, the transmission is started with probability f(y) based on the previous slot's channel status y ∈ y_s, where y_s = {idle, busy, success, failure}.

During the data transmission, users acknowledge whether the transmission is successful by checking ACK feedback information. If receiving the ACK feedback, we consider the transmission is successful. If not, it is a failure. Users will judge whether the channel can be accessed in every time slot by spectrum sensing.

When the information collection is completed, unlicensed users will analyze the channel’s status as follows.

Idle: The sensing result shows that no user is accessing the channel and no data is being transmitted.
Busy: The sensing result shows that there is a user accessing current channel yet no data is being transmitted.
Success: User is transmitting data and can receive ACK feedback.
Failure: User is transmitting data yet cannot receive ACK feedback.
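The four statuses can be expressed as a small classification function. This Python sketch assumes three observations per slot, whose names are hypothetical: whether this user transmitted, whether sensing found the channel occupied, and whether an ACK arrived. It also reads "no data is being transmitted" in the Busy case as referring to the observing user, which is an interpretation of the text.

```python
def classify_slot(transmitted: bool, channel_occupied: bool, ack_received: bool) -> str:
    """Map one slot's observations to idle, busy, success or failure."""
    if transmitted:
        # The user sent data: the ACK feedback decides the outcome.
        return "success" if ack_received else "failure"
    # The user stayed silent: the sensing result decides the outcome.
    return "busy" if channel_occupied else "idle"
```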

On the other hand, once a licensed user transmits successfully, its transmission in the following slots will not be disturbed by unlicensed users. Therefore, in an 'on' cycle, conflicts emerge only before the first successful transmission of the licensed user. The average number of conflicts suffered by a licensed user in an 'on' cycle is denoted by T_col, which is independent of its service time length T_poc. If q = 0, which means f(idle) = 0, then only idle slots emerge in the 'off' cycle and we have T_col = 0. Otherwise, if q > 0 and r = 1, then T_col = +∞ when unlicensed users do not evade in case of conflict.

The main routine of our Q-Learning-based spectrum access can be given in Fig. 4.

The detailed process can be presented as follows.

1. Initialization: Initialize every user's Q values and other parameters.
2. Selecting channel: An unlicensed user chooses a channel uniformly at random as the one ready to access, so that the average number of unlicensed users accessing each channel is uniform.
3. Channel analysis: Judge whether the BUSY number of the user at the current channel exceeds the given threshold. If it does, go to Step 4; otherwise go to Step 5.
4. Channel selection: The user chooses the channel whose Q value is highest and whose BUSY number is less than the given threshold, then goes to Step 6.
5. Channel access: Perform channel access according to the scheme described above.
6. Update BUSY number: If the last step is Step 4, reset the channel's previous BUSY number. If the last step is Step 5, increment the BUSY number.
7. Analyzing channel status: Analyze the current user's channel status.
8. Update parameter: According to Eq. 6, update the Q value.
9. End of slot: At the end of this slot, if the status of the simulation is END, then stop; otherwise go to Step 3.
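The steps above can be sketched as a toy Python loop for a single unlicensed user. Several parts are assumptions rather than the authors' implementation: `occupancy[c]` is an invented per-slot probability that a licensed user occupies channel c, the reset in Step 6 is read as clearing the counter of the channel being left, Step 4 accepts channels whose counter does not exceed the threshold, no transmission is attempted in a reselection slot, and idle slots trigger no Q update because Eq. 9 assigns them no award.

```python
import random
from typing import List

AWARD = {"success": 1.0, "busy": 0.0, "failure": -1.0}  # Eq. 9


def run_user(occupancy: List[float], slots: int, th_b: int = 3,
             theta: float = 0.2, q_idle: float = 0.5, r_fail: float = 0.5,
             alpha: float = 0.1, gamma: float = 0.9, seed: int = 0) -> List[float]:
    """Toy run of Steps 1 to 9 for one unlicensed user; returns its Q values."""
    m = len(occupancy)
    if m < 2:
        raise ValueError("need at least two channels to reselect")
    rng = random.Random(seed)
    f = {"idle": q_idle, "busy": 0.0, "success": 1.0 - theta, "failure": r_fail}
    q = [0.0] * m              # Step 1: initialise Q values
    busy = [0] * m             # BUSY counters b_ij of this user
    ch = rng.randrange(m)      # Step 2: uniformly random first channel
    status = "idle"
    for _ in range(slots):     # Step 9: repeat until the run ends
        reselected = busy[ch] > th_b              # Step 3: channel analysis
        if reselected:
            # Step 4: highest-Q channel among those within the threshold
            candidates = ([c for c in range(m) if busy[c] <= th_b]
                          or [c for c in range(m) if c != ch])
            new_ch = max(candidates, key=q.__getitem__)
            busy[ch] = 0                          # Step 6: reset the old counter
            ch, status = new_ch, "idle"
        elif status == "busy":
            busy[ch] += 1                         # Step 6: count the BUSY slot
        occupied = rng.random() < occupancy[ch]   # assumed licensed activity
        # Step 5: access with probability f(previous status)
        transmit = (not reselected) and rng.random() < f[status]
        # Step 7: analyse the channel status of this slot
        if transmit:
            status = "failure" if occupied else "success"
        else:
            status = "busy" if occupied else "idle"
        # Step 8: Q update of Eq. 6 with V from Eq. 7
        if status in AWARD:
            q[ch] += alpha * (AWARD[status] + gamma * max(q) - q[ch])
    return q
```

A call such as `run_user([0.9, 0.1], slots=500)` returns the learned Q values for a heavily used and a lightly used channel; since awards lie in [−1, 1], every Q value stays within ±1/(1 − γ).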

4 Numerical results

In this section, we carry out simulation tests on the Matlab platform to verify the performance of our dynamic spectrum access method in terms of channel usage rate and conflict probability under various parameters. In the tests, we consider 50 sensor nodes randomly distributed in the 8 small mesh cells of the industrial field. Each cell has its own cluster head. 20 licensed channels are allocated to the mesh cells. If the sensor nodes are located in their own cell, they can access the channels as licensed users. Otherwise, when they migrate to other cells and wish to use another cluster head, they serve as unlicensed users.

In Figs. 5 and 6, we give the channel usage rate for different slot numbers and for different values of th_B, the threshold number of slots that unlicensed users should wait. When th_B is fixed, the slot number is set to 5000. As shown in Fig. 5, with the increase of slot number, the channel usage rate becomes steady but not convergent. A higher slot threshold means more waiting time for unlicensed users. Too long a waiting time leads to a relatively low channel usage rate and a decrease of system transmission capacity. The unlicensed users need to ensure there are no licensed users present on the target channel within the given threshold time. If the channel is still idle for th_B slots, unlicensed users will access the channel.

Besides, we give comparison results in Figs. 7 and 8 to present the performance of our proposed method. In Figs. 7 and 8, the Q-learning method refers to our proposal. SDSA denotes the simplified dynamic spectrum access scheme, which enables the memory function for each sensor node. When the unlicensed users wish to access a channel, they recall and update their channel list to ascertain the situation. However, they do not have the self-learning ability. Aloha denotes that the unlicensed users use the Aloha protocol to communicate with each other before they begin to access a channel. All the methods mentioned above assume that the sensor nodes do not have perfect sensing capability and are organized in distributed mode in IIoT. The figures show that our scheme has a steady channel usage rate that outperforms the traditional SDSA and Aloha methods. Even though our proposal's complexity is relatively high, it can be easily realized, especially with the rapid development of mobile computing and cognitive science.

5 Conclusions

In this paper, we propose a Q-learning-based dynamic spectrum access method in IIoT that takes into account the characteristics of heterogeneous wireless sensor networks to enhance spectrum efficiency and reduce access conflict probability. The main contribution of this work is that we introduce a self-learning method to address the situation where the sensor nodes' sensing ability is imperfect and a distributed network mode is applied. Specifically, we devised a self-learning-based MAC protocol to assist unlicensed users in accessing spectrum in IIoT when only a single idle channel is available. Besides, for the case of accessing multiple channels simultaneously, we propose a Q-learning-based access algorithm in which the unlicensed users select the idle channels with the highest Q value through self-learning. The specific algorithm routine has been given. Numerical results show that the proposed algorithm achieves better channel access performance than the traditional simplified self-access protocol and the Aloha method.

References

Wan J, Tang S, Hua Q et al Context-aware cloud robotics for material handling in cognitive industrial internet of things, IEEE Internet of Things Journal, to appear. https://doi.org/10.1109/JIOT.2017.2728722
Article Google Scholar
Shu Z, Wan J, Zhang D et al (2016) Cloud-integrated cyber-physical systems for complex industrial applications. Mobile Netw Appl 21(5):865–878
Article Google Scholar
Zhang D, He Z, Qian Y et al (2016) Revisiting unknown RFID tag identification in large-scale internet of things. IEEE Wirel Commun 23(5):24–29
Article Google Scholar
Li X, Li D, Wan J et al (2017) A review of industrial wireless networks in the context of Industry 4.0. Wirel Netw 23(1):23–41
Article Google Scholar
Gao Q, Zhu G, Lin S et al (2016) Robust QoS-aware cross-layer design of adaptive modulation transmission on OFDM systems in high-speed railway. IEEE Access PP(99):1–1
Article Google Scholar
Mainetti L et al (2011) Evolution of wireless sensor networks towards the internet of things: A survey. In: IEEE international conference on software, telecommunications and computer networks, pp 15–17
Lin Y, Yang J, Lv Z et al (2015) A self-assessment stereo capture model applicable to the internet of things. Sensors 15(8):20925–20944
Article Google Scholar
Zhu J, Song Y, Jiang D et al A new deep-Q-Learning-Based transmission scheduling mechanism for the cognitive internet of things, IEEE Internet of Things Journal, to appear. https://doi.org/10.1109/JIOT.2017.2759728
Article Google Scholar
Wu Q, Ding G, Xu Y et al (2014) Cognitive internet of things: a new paradigm beyong connection. IEEE Internet Things J 1(2):129–143
Article Google Scholar
Perera C, Zaslavsky A, Christen P, Georgakopoulos D (2014) Context aware computing for the internet of things: A survey. IEEE Commun Surveys Tuts 16(1):414–454
Article Google Scholar
Vlacheas P et al (2013) Enabling smart cities through a cognitive management framework for the internet of things. IEEE Commun Mag 51(6):102–111
Article Google Scholar
Li H, Savkin AV Wireless sensor network based navigation of micro flying robots in the industrial internet of things. IEEE Transactions on Industrial Informatics, to appear. https://doi.org/10.1109/TII.2018.2825225
Article Google Scholar
Ouyang Z, Sun X, Chen J et al (2018) Multi-view stacking ensemble for power consumption anomaly detection in the context of industrial internet of things. IEEE Access 6:9623–9631
Article Google Scholar
Yan Q, Huang W, Luo X et al (2018) A multi-level DDoS mitigation framework for the industrial internet of things. IEEE Commun Mag 56(2):30–36
Article Google Scholar
Ruppert T, Abonyi J (2018) Industrial internet of things based cycle time control of assembly lines. In: IEEE international conference on future of technologies, pp 1–4
Zhu C, Rodrigues J, Leung VC M et al (2018) Trust-based communication for the industril internet of things. IEEE Commun Mag 56(2):16–22
Article Google Scholar
Cui H, Deng RH, Liu J K et al Server-aided attribute-based signature with revocation for resource-constrained industrial internet of things devices. IEEE Transactions on Industrial Informattics, to appear. https://doi.org/10.1109/TII.2018.2813304
Article Google Scholar
Yan H, Zhang Y, Pang Z, Xu LD (2014) Superframe planning and access latency of slotted MAC for industrial WSN in IoT environment. IEEE Trans Ind Inf 10(2):1242–1251
Article Google Scholar
Iqbal Z, Kim K, Lee HN (2017) A cooperative wireless sensor network for indoor industrial monitoring. IEEE Trans Ind Inf 13(2):482–491
Article Google Scholar
Wang F, Wang K, Lai S, Phang SK, Chen BM, Lee TH (2014) An efficient UAV navigation solution for confined but partially known indoor environments. In: 11th IEEE International Conference on Control Automation (ICCA), Taichung, Taiwan, pp 1351– 1356
Li X, Peng J, Niu J et al A robust and energy efficient authentication protocol for industrial internet of things, IEEE Internet of Things Journal, to appear. https://doi.org/10.1109/JIOT.2017.2787800
Article Google Scholar
Li Z, Kang J, Yu R et al Consortium blockchain for secure energy trading in industrial internet of things, IEEE Transactions on Industrial Informatics, to appear. https://doi.org/10.1109/TII.2017.2786307
Chen G, Ng WS (2017) An efficient authorization framework for securing industrial internet of things. IEEE TENCON, pp 1–5
Lai CF, Chen SY, Hwang RH A resilient power fingerprinting selection mechanism of device load recognition for trusted industrial internet of things. IEEE Transactions on Industrial Informatics, to appear. https://doi.org/10.1109/TII.2017.2766885
Article Google Scholar
Wang K, Wang Y, Sun Y et al (2016) Green industrial internet of things architecture: an energy-efficient perspective. IEEE Commun Mag 54(12):48–54
Article Google Scholar
Chen M, Qian Y, Hao Y, Li Y, Song J (2018) Data-driven computing and caching in 5G networks architecture and delay analysis. IEEE Wirel Commun 25(1):70–75
Article Google Scholar
Chen M, Li W, Hao Y, Qian Y, Humar I (2018) Edge cognitive computing based smart healthcare system, Future Generation Computing System. https://doi.org/10.1016/j.future.2018.03.054
Article Google Scholar
Chen M, Tian Y, Fortino G, Zhang J, Humar I (2018) Cognitive internet of vehicles. Comput Commun 120:58–70
Article Google Scholar
Chen M, Herrera F, Hwang K (2018) Cognitive computing: Architecture, technologies and intelligent applications. IEEE Access 6:19774–19783
Article Google Scholar
Chen M, Hao Y (2018) Task offloading for mobile edge computing in software defined ultra-dense network. IEEE J Sel Areas Commun 36(3):1–11
Zhao N, Yu FR, Sun H, Li M (2016) Adaptive power allocation schemes for spectrum sharing in interference-alignment-based cognitive radio networks. IEEE Trans Veh Technol 65(5):3700–3714
Li X, Zhao N, Sun Y, Yu FR (2016) Interference alignment based on antenna selection with imperfect channel state information in cognitive radio networks. IEEE Trans Veh Technol 65(7):5497–5511

Acknowledgments

This work was partly supported by the ROSE Lab and SPIRIT. This research was carried out at the Rapid-Rich Object Search (ROSE) Lab at the Nanyang Technological University, Singapore. The ROSE Lab is supported by the National Research Foundation, Singapore, and the Infocomm Media Development Authority, Singapore.

Author information

Authors and Affiliations

College of Information Engineering, Zhejiang University of Technology, Hangzhou, 310023, China
Feng Li & Li Wang
School of Computer Science and Engineering, Nanyang Technological University, Nanyang, 639798, Singapore
Feng Li & Kwok-Yan Lam
Department of Engineering and Design, University of Sussex, Brighton, BN1 9RH, UK
Zhengguo Sheng
School of Electronic Science and Engineering, Nanjing University, Nanjing, 210093, China
Xinggan Zhang & Kanglian Zhao

Authors

Feng Li
Kwok-Yan Lam
Zhengguo Sheng
Xinggan Zhang
Kanglian Zhao
Li Wang

Corresponding author

Correspondence to Zhengguo Sheng.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Li, F., Lam, KY., Sheng, Z. et al. Q-Learning-Based Dynamic Spectrum Access in Cognitive Industrial Internet of Things. Mobile Netw Appl 23, 1636–1644 (2018). https://doi.org/10.1007/s11036-018-1109-9

Published: 11 September 2018
Issue Date: December 2018
DOI: https://doi.org/10.1007/s11036-018-1109-9
