%0 Journal Article
%A HUANG Tian-Xing
%A LI Li
%A LI Yang-Tiao
%A LIN Zi-Han
%A QUAN Yi-Xuan
%A ZHENG Jia-Li
%T RFID indoor positioning based on semi-supervised actor-critic co-training
%D 2020
%R 10.19682/j.cnki.1005-8885.2020.0030
%J Journal of China Universities of Posts and Telecommunications
%P 69-81
%V 27
%N 5
%X Large-scale radio frequency identification (RFID) indoor positioning systems cover a relatively large area, have little labeled data and much unlabeled data, and are easily affected by multipath and white noise. An RFID positioning algorithm based on semi-supervised actor-critic co-training (SACC) is proposed to solve this problem. Positioning is regarded as a Markov decision process. First, the actor-critic is combined with random actions and selects the best unlabeled received signal strength indicator (RSSI) data through semi-supervised co-training. Second, the actor and the critic are updated using the Kronecker-factored approximate curvature (K-FAC) natural gradient. Finally, the target position is obtained by co-locating with the labeled RSSI data and the selected unlabeled RSSI data. The proposed method significantly reduces the cost of indoor positioning by decreasing the amount of labeled data required. Meanwhile, as the number of positioning targets increases, the actor can quickly select unlabeled RSSI data and update the location model. Experiments show that, compared with other RFID indoor positioning algorithms such as twin delayed deep deterministic policy gradient (TD3), deep deterministic policy gradient (DDPG), and actor-critic using Kronecker-factored trust region (ACKTR), the proposed method decreases the average positioning error by 50.226%, 41.916%, and 25.004%, respectively, and improves positioning stability by 23.430%, 28.518%, and 38.631%, respectively.
%U https://jcupt.bupt.edu.cn/EN/10.19682/j.cnki.1005-8885.2020.0030