Research on downlink channel state information prediction technique for 5G system based on deep neural network

With the development of society, the increasing user demand has catalyzed the creation of the fifth generation mobile communication system (5G), and also brought great challenges to 5G. Massive multiple-input multiple-output (MIMO) technology has received widespread attention for its advantages of high system capacity and high spectral efficiency, etc. After more than 10 years of research, massive MIMO technology has been successfully applied in 5G mobile communication systems [1–3]. By continuously increasing the antenna array size, the ultra-large-scale MIMO technology will be one of the key technologies to further improve the network performance of the 6th generation (6G) mobile communication system in the future. However, the ability of the base station to obtain accurate downlink channel state information (CS1) is a prerequisite for the massive MIMO technique to realize the system gain [4–6]. Currently, massive MIMO systems usually work in time-division duplex (TDD) communication mode, by utilizing the reciprocity of uplink and downlink channels, the downlink CSI can be obtained directly from the CSI estimation of the uplink, which avoids the huge downlink CSI acquisition overhead. Compared with TDD communication mode, frequency division duplex (FDD) communication mode has the advantages of high transmission rate, continuous communication, and applicable to high-speed mobile scenarios [7–9]. At the same time, a large number of existing communication systems still use FDD communication mode. In order to utilize the inherent advantages of FDD communication mode and reduce the resource waste and performance loss caused by changing communication modes, FDD massive MIMO system has recently received extensive attention and research [10–11]. In FDD mode, the uplink and downlink channels do not have strict reciprocity, and the downlink CSI obtained by the base station requires downlink CS1 estimation and feedback from the user. Since the guide frequency overhead of downlink CSI estimation and the link overhead of feedback are proportional to the number of antennas at the base station end, one of the great challenges facing FDD massive MIMO systems is the huge guide frequency and feedback overhead of downlink CSI acquisition [12–14]. Currently, FDD systems usually use codebook-based and compression-aware CSI feedback techniques. However, with the increasing size of antenna arrays at the base station end, the design of the codebook is very difficult, making the codebook-based CSI feedback technique no longer suitable for FDD massive MIMO systems. The high dimensional CSI matrix also makes the CSI feedback technique based on compressed sensing have high computational complexity and low feedback accuracy [15-17]. In recent years, deep neural networks have been applied in various fields and have shown excellent performance, such as parts defect detection and recommendation systems. A large number of works have been done to effectively use deep neural networks to solve the problem of downlink channel state information prediction for wireless communications. It can be seen that deep neural networks have a powerful ability to learn and process high-dimensional data, which can provide a potential solution to the problem of obtaining high-dimensional downlink CSI for FDD massive MIMO systems [18–22].

In order to solve the problem that the base station cannot obtain CSI in time due to the influence of upstream and downstream delay and the time required for signal processing, this paper proposes a deep learning-based downstream channel state information prediction method for 5G systems by utilizing massive MIMO technology and deep learning with CSI feedback. After that, the channel model of massive MIMO system is utilized to prove the effectiveness of CSI feedback technique of data-driven scheme. Finally, the superiority of the scheme in channel estimation performance is verified by simulation experiments and CSI amplitude feedback.

2

Method

2.1

Massive MIMO Technology

2.1.1

Massive MIMO channels

Consider a simple single-user MIMO [23–24] (SU-MIMO) system where the base station is equipped with N_t transmit antenna and the user is equipped with N_r receive antennas. The channel matrix H ∈ C^N_r×^N_t of a massive MIMO channel is defined as: 1 $H = [\begin{matrix} h_{1, 1} & h_{1, 2} & \dots & h_{1, N_{t}} \\ h_{2, 1} & h_{2, 2} & \dots & h_{2, N_{t}} \\ ⋮ & ⋮ & ⋮ \\ h_{N_{r}, 1} & h_{N_{r}, 2} & \dots & h_{N_{r}, N_{t}} \end{matrix}]$ h_i,j denotes the channel gain coefficient between the i nd antenna at the base station and the j rd antenna at the user.

Let the transmitted signal from the antenna array at the base station be x =[x₁,x₂,⋯,x_{N_t}]^{^T}. The transmitted signal arrives at the receiving end through an additive Gaussian white noise (AWGN) [25] channel. The signal y ∈ C^N_r^×1 received by the user can be expressed as: 2 $y = H x + ω$

where y = [y₁, y₂,⋯,y_{N_t}]^T, y_i denote the received signals at the user’s i rd antenna port; ω ∈ C^N_r^×1 denotes the additive Gaussian white noise of the wireless channel that satisfies E{ωω*} = N₀I_{N_r}, (·)* denotes the conjugate transpose of the matrix, and N₀ is the power spectral density of the noise.

2.1.2

Downlink MIMO channel tolerance analysis

Massive MIMO technology dramatically increases the channel capacity of a wireless communication system by concentrating the energy of radio waves into a smaller spatial area using a massive antenna array. The channel capacity of a communication system is defined as the maximum mutual information of the transmitted and received signals, so the channel capacity of a massive MIMO system can be expressed as: 3 $C = \max_{f (x)} l (x; y)$

The mutual information of transmit vector x and receive vector y is defined as: 4 $I (x; y) = H (y) - H (y ∣ x)$

Since emission vector x and noise vector ω are statistically independent, according to Eq. (2): 5 $I (x; y) = H (y) - H (ω)$

According to the power-limited maximum entropy theorem, the receiving vector y is maximized when H(y) obeys a zero-mean cyclic symmetric Gaussian distribution, and then the transmitting vector x must also obey a zero-mean cyclic pairwise Gaussian distribution. Let the covariance matrices of x and y be K_x and K_y, respectively. Under the above assumptions, H(y) and H (ω), respectively: 6 $H (y) = \log_{2} \det (π e K_{y})$ 7 $H (z) = \log_{2} \det (π e N_{0} I_{N_{r}})$ where N₀ denotes the noise power. Thus, the mutual information of transmit vector x and receive vector y can be expressed as: 8 $I (x; y) = \log_{2} \det (I_{N_{τ}} + \frac{H K_{x} H^{*}}{N_{0}})$

Limit the maximum transmit energy of the antenna array to P_t. From the above derivation, the MIMO channel capacity is obtained: 9 $C = \max_{T r (K_{x}) \leq P_{t}} \log_{2} \det (I_{N_{r}} + \frac{H K_{x} H^{*}}{N_{0}})$

When the base station does not hold CSI, the transmit power is divided equally among the antennas, and the channel capacity of the MIMO system at this time is: 10 $C^{w i t h o u t - C S I} = \log_{2} \det (I_{N_{r}} + \frac{H H^{*}}{N_{0}})$

The above derivation shows that the MIMO system can utilize space-division multiplexing to improve the spectral efficiency when there is no base station that does not have knowledge of the CSI, which is called open-loop MIMO. The channel capacity mentioned above can be achieved by using the hierarchical space-time coding (BLAST) proposed by Bell Labs. When the CSI is known to the base station, the MIMO at this point is called closed-loop MIMO. Using singular value decomposition (SVD), the original channel can be decomposed into n_min parallel subchannels, n_min = min(N_t,_Nτ), as shown in the following equation: 11 $H = U Λ V^{*}$ where U ∈ C^N_τ^×^N_τ and V ∈ C^N_τ×N_τ denote the left singular and right singular matrices, respectively, both of which are You matrices. Λ ∈ C^N_r×N_t is the diagonal matrix and the elements on the diagonal are the singular values of the channel matrix H. Matrix H has a total of n_min non-zero singular values and is denoted as {λ_i},i = 1…n_min. Let the power assigned to the subchannels by the base station be {P},i = 1…n_min, then the signal-to-noise ratio of each subchannel is: 12 ${SNR}_{i} = \frac{P_{i} λ_{i}^{2}}{N_{0}} i = 1, 2, \dots, n_{\min}$

Thus, according to Shannon’s Second Theorem, the system and capacity expressed in Eq. (9) can again be written as: 13 $C_{S V D} = \sum_{i = 1}^{n_{\min}} \log (1 + \frac{P_{i} λ_{i}^{2}}{N_{0}})$

The allocation of power according to the water-flooding algorithm (WF) allows the system and capacity of the above equation to be maximized. The optimal power allocation is expressed as: 14 $P_{i}^{*} = {(μ - \frac{N_{0}}{λ_{i}^{2}})}^{+}$ where (·)* is an identifier only and does not denote a conjugate transpose; μ is a constant, and the value of μ is required to satisfy the constraints on the total power, i.e.,; $\sum_{i = 1}^{n_{\min}} P_{i}^{*} = P_{t}$ and (x)⁺ is defined as: 15 $(^{x) +} = {\begin{array}{l} x, & x \geq 0 \\ 0, & x < 0 \end{array}$

From Eq. (14), it can be seen that the principle of the water injection algorithm is to allocate more transmit power to the subchannels with good channel quality.

The analytic expression shows that the sum rate of the MIMOBC channel is a K -dimensional capacity region under the total power constraint and consists of the set of reachable rates of K users, denoted as: 16 $R (π, Q_{1}, Q_{2}, \dots, Q_{K}) = {[R_{π (1)}, R_{π (2)}, \dots, R_{π (K)}]}^{T}$ where π = {π(1),π(2)…,π(K)} denotes an ordered arrangement for K users; {Q₁,Q₂,…,Q_k} denotes the covariance matrix of each user’s transmit signal, which is required to satisfy the total transmit power limit of the base station $Σ_{k = 1}^{K} Tr (Q_{k}) \leq P_{t r}$ . Based on the dirty paper coding (DPC) principle, the reachable rate of user π (k) is given and denoted as: 17 $R_{π (k)} = \log_{2} \frac{| N_{0} I + H_{π (k)} (\sum_{j \geq k} Q_{π (j)}) H_{π (k)}^{H} / N_{0} |}{| N_{0} I + H_{π (k)} (\sum_{j > k} Q_{π (j)}) H_{π (k)}^{H} / N_{0} |} k = 1 \dots K$

Thus, the capacity of the MIMOBC channel can be obtained by optimizing the ordering of the users and the covariance matrix of the transmitted signals as shown in the following equation: 18 $C^{B C} = \max_{π, Q_{1}, \dots Q_{K}} \sum_{k = 1}^{K} R_{π (k)}$ 19 $s . t . \sum_{k = 1}^{K} T r (Q_{k}) \leq P_{t r}$

It can be seen that obtaining accurate downlink CSI is extremely important for massive MIMO systems.

2.2

Deep Neural Networks and CSI Feedback

2.2.1

Neurons and how they work

Neuron is the most basic unit in deep learning and all neural networks are neurons combined in different ways. The basic structure of a neuron is shown in Figure 1. A complete neuron consists of a linear model and an activation function. For the linear model, it is assumed that the input sample X is a N -dimensional column vector, W is the weight parameter matrix of the sample, the number of rows of the matrix is the same as that of X, and the number of columns can be adjusted according to the need. The column parameters in W characterize the importance of each element in X, and b represents the bias term. The input sample X can be linearly calculated to obtain the linear relationship between each feature in sample X and the output.

For the activation function f (·), it is generally a nonlinear function, which is usually located after the linear model of neurons, and serves two purposes: one is to provide nonlinear factors for the network, and the other is to map the function according to different task objectives. Common activation functions are Sigmoid, linear rectifier function (ReLU), tanh, etc., of which the mathematical formula of ReLU is as follows: 20 $Re L U (x) = \max (0, x)$

ReLU rectifies all negative values in input x to 0 and leaves the rest of the values unchanged, and is commonly used in the field of image processing, located after convolutional computation. The next subsection describes the CSI feedback framework based on deep learning.

2.2.2

Common Neural Network Structures

Neurons can be combined in different ways to form different network structures. Common neural network structures are described below. 1)

Convolutional Neural Network

The structure of the convolutional neural network is shown in Fig. 2. The blue part indicates a 4 × 4 single-channel input feature map, whose dashed part is the padding value (Padding), which is generally zero-padding, so the input size of the convolutional layer is 5 × 5, which is denoted as C_in : the gray part indicates a 3 × 3 single-channel convolutional kernel, which is denoted as W_kcmd ; the green part indicates a 4 × 4 single-channel output feature map, which is denoted as C_ost. The computation rule is: each element in C_ost is obtained by multiplying and summing the corresponding elements of W_kret and C_m, and W_kermel is shifted one element at a time during the computation. This process is called convolutional computation and is represented by the following equation: 21 $C_{o u t} = W_{k a r a d} * C_{i n}$ where * denotes the convolution calculation. When there are more than one channel in W_kσxl and C_in, the calculation process is the same, only the convolution calculation is performed for each channel in turn, and the number of channels in its output C_out is the same as the number of channels in W_kσmcl.

2)

Fully connected neural network

Similarly, the structure of the fully connected network and its nature is shown in Fig. 3. In the figure, ⊗ represents the matrix multiplication, and the yellow part represents the input feature map matrix, denoted as C_in The green part represents the parameter matrix of the fully connected layer, denoted as W_R : The blue part represents the output feature map matrix, denoted as C_out. It can be seen from the above figure that the essence of the fully connected layer is the computation of matrix multiplication between C_in and W_Fc, which is a linear transformation, which can be expressed by the following equation: 22 $C_{i n} \otimes W_{F C} = C_{o u t}$

2.2.3

Deep learning based CSI feedback

The CSI feedback method for designing massive MIMO systems using deep learning techniques does not require precise analysis of channel data features and clear modeling. Therefore, it can be applied to a variety of channel scenarios and greatly simplifies the complexity and difficulty of signal processing. The deep learning-based CSI feedback method is shown in Fig. 4. In the offline training phase, the CSI matrix data of the massive MIMO system needs to be collected in advance and fed into the network for learning and training, so that the network parameters can fit the corresponding channel data features. In the online prediction phase, using the already trained network model for CSI reconstruction can greatly reduce the time complexity and design difficulty of the system. Depending on the focus and direction of problem solving, the neural network model may have a large gap.

The classification results of CSI feedback methods based on existing deep learning are shown in Figure 5. At present, for the CSI feedback methods based on white coding, the use of encoder + decoder structure of the implementation of the form of more, and the reconstruction accuracy is higher: a single decoder of the network, although the complexity of the network is lower, but the reconstruction accuracy is lower, and can not meet the actual needs: for the CSI feedback methods of adaptive quantizer, its main purpose is to he characterized CSI with lower quantization bit number, in order to reduce the feedback process required for the parametric number: the depth-expanded CSI feedback method focuses on the modification of certain high-performance compressed-aware reconstruction algorithms, using neural networks to replace all or part of the parameters in the algorithms in order to improve the reconstruction accuracy.

2.3

Channel Modeling for Massive MIMO Systems

2.3.1

Existence proof of uplink/downlink channel mapping relationship

Assuming that in an FDD massive MIMO system, there is M ≫ 1 antenna at the base station (using a uniform antenna array), 1 receiving antenna at the user equipment, and P transmission paths between the base station and the user equipment, the channel function can be expressed as follows: 23 $h (f) = \sum_{p = 1}^{p} α_{p} e^{- j 2 π f τ_{p} + j ϕ_{p}} a (θ_{p})$ where f is the carrier frequency, and α_p, ϕ_p, τ_p, θ_p are the fading factor, phase offset, time delay, and angle of arrival direction for the p rd path, respectively.

The fading factor α_p is determined by the following variables: the distance between the base station and the user equipment, the transmission antenna and reception antenna gains, the carrier frequency, and the scattering environment. Phase ϕ_p depends on the scattering material and the angle of incidence of the wave on the scatterer. The time delay τ_p depends on the distance the signal travels along the p path.

a(θ_p) is an array vector and can be expressed as: 24 $a (θ_{p}) = [^{1, e^{- j χ \sin θ_{p}}, \dots e^{- j χ (M - 1) \sin θ_{p}}] T}$ where χ = 2πdf / c, d is the antenna spacing and c is the speed of light. The DOA observed by the base station is confined to a certain area, i.e: 25 $θ_{p} \in [θ - Δ θ / 2, θ + Δ θ / 2]$

Define h(f_U) and h(f_D) as the uplink channel function and the downlink channel function from the user equipment to the base station, where f_U and f_D are the uplink frequency and the downlink frequency, respectively. For FDD systems, h(f_D) cannot be simply obtained from h(f_U). However, there is an intrinsic connection between the uplink CSI and the downlink CSI because the downlink and the uplink are located in the same propagation environment with the same physical path, and the spatial propagation characteristics of the wireless channel are basically unchanged within a certain bandwidth range.

Next, this paper will follow the definition of an uplink to downlink mapping function and prove its existence. From Eq. (23), the channel function h(f) is completely determined by parameters such as α_p, ϕ_p, τ_p, P, Δθ, θ while α_p, ϕ_p, τ_p, P,Δθ it depends on the average DOAθ and the distance between the base station and the user equipment D and other communication environment factors. Based on this, the following location-channel mapping is given: 26 $Φ_{f} : {(D, θ)} \to {h (f)}$ where {(D,θ)} and {h(f)} are the domain of definition and the domain of values of mapping Φ_f, respectively.

The inverse mapping of definition Φ_f is: 27 $Φ_{f}^{- 1} : {h (f)} \to {(D, θ)}$

The probability that the inverse mapping $Φ_{f}^{- 1}$ holds in a practical massive MIMO communication system is high and approaches 100% as the number of base station antennas increases.

The following mapping relation can be obtained from Eqs. (26) and (27): 28 $Φ_{f_{0}} : {(D, θ)} \to {h (f_{D})}$ 29 $Φ_{f_{c}}^{- 1} : {h (f_{U})} \to {(D, θ)}$

The following mapping relationship exists for upstream and downstream links introduced from the above equation: $Ψ_{U \to D} = Φ_{f_{D}} \circ Φ_{f_{C}}^{- 1} : {\begin{matrix} h (f_{U}) \end{matrix}} \to {\begin{matrix} h (f_{D}) \end{matrix}}$ where $Φ_{f_{0}} \circ Φ_{f_{0}}^{- 1}$ denotes the composite mapping of Φ_f0 and $Φ_{f_{0}^{- 1}}$ .

2.3.2

Data-driven CSI-based forecasting process

The effectiveness of the data-driven scheme will be demonstrated mathematically below. Since Φ_fn and $Φ_{f_{n}}^{- 1}$ are continuous mappings and the composition of a continuous mapping is still a continuous mapping, it is always a continuous function for any x ∈ h(f_U), Ψ_U→D(x). According to the literature, if Ψ_U → D(x) satisfies the continuous function condition, for a given small error ε > 0, there always exists a sufficiently large positive constant N that satisfies: 30 $\sup_{x \in H} ‖ O_{N} (x, w) - Ψ_{U \to D} (x) ‖ \leq ε, H = {h (f_{U})}$ where O_N(x, w) is the output of a feed-forward neural network, and x, w, N denotes the input data, model weights, and number of hidden layer neurons, respectively. We can train a neural network to predict the downlink CSI based on the uplink CSI, which significantly reduces the overhead required for the process of obtaining CSI in massive MIMO systems.

2.3.3

High-dimensional CSI matrix construction and preprocessing

In order to facilitate modeling and simulation, this paper only considers a simple FDD massive MIMO system, which uses orthogonal frequency division multiplexing (OFDM) [26–27] modulation mode. Assuming that there are N_T transmitting antennas at the base station end and N_R receiving antennas at the user equipment end, and that there are N_S subcarriers in this communication system, and that the data is collected once every N_i time slots, the signal received on the ith subcarrier can be expressed as: 31 $y_{i} = h_{i} x_{i} + n_{i}, i \in [1, N_{s}]$ where y_i ∈ C is the signal at the receiving end, x_i ∈ C is the transmit data vector, and n_i ∈ C is the additive Gaussian white noise in the channel. h_i ∈ C^{N_i×N_k×N_i} is the channel matrix between the N_T transmitting antenna and the N_R receiving antenna on the i th subcarrier, and h_i the complex values in the matrix represent the fading gain and phase shift of the subcarriers.

The channel matrix corresponding to the CSI is the set of information for each subcarrier, then the CSI matrix H ∈ C^{N_τ×N_k×N_i×N_j} can be expressed as: 32 $H = [^{h_{1}, h_{2}, h_{3}, \dots h_{N_{s}}] H}$ where (·)^H denotes the conjugate transpose.

2.4

Three-dimensional convolutional neural network model

2.4.1

CsiNet model

CsiNet is the first multilayer neural network designed for the problem of downlink CSI prediction in FDD massive MIMO environments. The CsiNet network is implemented by a convolutional neural network structure and is capable of processing channel state information from multiple antennas and outputting high quality channel estimation results.

LeakyReLU is a variant of modified linear unit (ReLU). The mathematical expression of LeakyReLU is shown below: 33 $f (x) = {\begin{array}{l} x, & x \geq 0 \\ α x, & x < 0 \end{array}$

The development of CsiNet network provides a new solution to the downlink CSI feedback problem in massive MIMO systems. Simulation results of CsiNet show that CsiNet network exhibits better prediction performance at all compression rates compared to compressed sensing techniques.

2.4.2

3D-CsiNet modeling

Inspired by the CsiNet decoder network model, this paper proposes a network model for predicting the downlink CSI based on the uplink CSI, which is called the three-dimensional convolutional neural network model [28] (3D-CsiNet), and the overall network structure of 3D-CsiNet is shown in Fig. 6. The design of the convolutional layer, residual network and other parts of this network will be described separately below. 1)

Three-dimensional convolution

In convolutional neural networks, 2D convolution refers to performing convolution operation on 2D data, which is done by sliding a small matrix called convolution kernel over an image or matrix and multiplying the convolution kernel with the localized region of the input, and finally adding all the results to get the output feature map. 3D convolution, on the other hand, refers to the convolution operation on 3D data, which is similar to 2D convolution, but adds a dimension (depth) to the convolution kernel by sliding a 3D convolution kernel over the 3D data and multiplying the convolution kernel and the local region of the input, and finally adding all the results to get the output feature sequence.

2)

Feature Extraction

In a convolutional neural network, the sensory field is the range of perception of a neuron in a particular layer for a location in the input data. In convolutional neural networks, the size of the perceptual field of each layer is related to the size of the convolutional kernel contained in that layer, and the size of the perceptual field gradually increases with the increase in the number of network layers. The size of the receptive field directly affects the recognition and classification ability of the neural network, so it is very important to choose the size of the receptive field reasonably in the network design and tuning.

3)

Residual network

CsiNet solves the gradient vanishing problem of convolutional model by adding residual network. In this paper, the residual network is used in the 3D-CsiNet network model. The residual network utilizes the error of the loss function to train the parameters of this residual block. The residual block consists of two convolutional layers, where the first convolutional layer performs feature extraction on the input data, the second convolutional layer further processes the features, and then the outputs of these two convolutional layers are summed up and nonlinearly transformed by the activation function.

3

Results and Discussion

3.1

Simulation results and analysis

3.1.1

Comparison of channel predicted and true values in walking scenario

The true value is the result of placing the SRS in all time slots with perfect channel estimation; the observed value is the result obtained by interpolating all time slots after the actual channel estimation of the SRS under the actual transmission period of the SRS. The predicted values are the results predicted by inputting the observations into the network as a training set. In this experiment the uplink SNR is set to 25 dB and the SRS period is 6 slots. The comparison of the predicted and real values of the channel in the walking scenario is shown in Fig. 7. From the figure it can be seen that both the observed and predicted values are very close to the true values, which indicates the accuracy of the channel estimation and the fact that the proposed MIMO-based 3D-CsiNet model is very favorable for channel prediction.

3.1.2

Comparison of channel predicted and true values in cycling scenarios

The results of the comparison between the predicted and true values of the channel in the cycling scenario are shown in Fig. 8. As can be seen from the figure, as the Doppler shift increases (f_d=250Hz), the channel changes faster over time and the error in channel prediction gradually increases. Part of the reason is that the error of the observed channel estimation increases with the increase of the Doppler shift, i.e., there is an error in the dataset itself, and part of the reason is that it is more difficult for the deep learning network to capture the change rule of the channel as it changes at a faster rate, which leads to an increase in the prediction error.

3.1.3

3D-CsiNet model prediction performance

The results of the comparison between the prediction performance of the 3D-CsiNet-based model prediction and the traditional second-order AR model prediction are shown in Fig. 9. The simulation sets the SRS period to 4 time slots, the Doppler shift to 250 Hz, and the prediction time length to 28. It can be seen that the estimation error of the 3D-CSINET-based prediction algorithm is lower than that of the second-order AR algorithm, and the MSE value of the 3D-CSINET algorithm is reduced by 59.70% compared to the second-order AR algorithm at the signal-to-noise ratio of 35 dB.

3.1.4

Channel prediction error for different SRS transmission periods

This paper also analyzes the effect of different SRS transmission periods on the channel prediction results, which are simulated with SRS periods of 2, 4, 8, 16, and 32 time slots respectively. The channel prediction errors for different SRS transmission periods are shown in Fig. 10. The results show the MSE comparison between the predicted and unpredicted schemes for five SRS cycles at a fixed Doppler shift of 250 Hz and an SNR of 30 dB, with a selected prediction time length of 24. It can be seen from the figure that the predicted scheme outperforms the unpredicted scheme for each SRS cycle, and the advantage increases with the increase of the SRS cycle from SRS cycle of 16 time slots. The proposed prediction method alone improves the MSE performance by 54.29% compared to the non-prediction scheme.

3.2

CSI amplitude feedback performance results and analysis

3.2.1

Bitstream Generation

Two typical bitstream generation methods, namely quantization and binarization, are first evaluated and compared. The simulation scenario is set up as follows: the base station is equipped with 256 ULA antennas, and the user is a single antenna. In this paper, the number of bits per dimension (BPD) is used to characterize the feedback prediction effect. The NMSE performance of CSI amplitude feedback under different bitstream generation methods is shown in Fig. 11. The results show that the best feedback performance is achieved when the number of quantization bits is set to 4, while the total number of feedback bits is fixed. Consequently, the number of quantization bits in the quantization layer in this section is set to 4. To some extent, binarization can be regarded as a special case of quantization, where the quantization bits are set to 1. This special 1-bit quantization (i.e., binarization) is far superior to the ordinary 1-bit quantization. At the same time, binarization operates better than 4-bit quantization when the feedback bits are extremely limited, and far worse than 4-bit quantization in other cases.

3.2.2

CSI Phase Feedback Performance

This paper is dedicated to evaluating the performance of two different CSI phase feedback mechanisms proposed, namely MDPF-1 and MDPF-2, where the most important difference between the two is whether statistical or instantaneous CSI amplitude information is introduced into the phase feedback NN. Unlike the compression and feedback of CSI amplitude, the compression of CSI phase is dependent on CSI amplitude. Therefore, when evaluating the feedback accuracy of the CSI phase, the NMSE or MSE between the original CSI phase and the reconstructed CSI phase is not directly computed, but rather the NMSE between the original complex CSI and the reconstructed complex CSI is computed, and the CSI amplitude is assumed to be perfect because of the dedication to the evaluation of the feedback accuracy of the CSI phase at this point.

Assuming that the base station side is equipped with 272 antennas and the user side is equipped with a single antenna, and the number of channel paths is set to 5. The performance of NMSE of different CSI phase feedback mechanisms is shown in Fig. 12. Where the raw feedback indicates that the CSI phase is fed back directly with the self-encoder and the loss function is the most primitive MSE function. When the BPD is very low, the NMSE of the primitive phase feedback method is > 0 dB, which means that very little useful information is fed back, comparable to noise. This is mainly because it is not known which information is important, so we just try to provide feedback on all the phase information, but this requires a large number of feedback bits. When the BPD is 0.5-0.6, the performance gains of MDPF-1 and MDPF-2 are 10.41-10.53 dB and 10.03-10.36 dB, respectively, compared to the original phase feedback method.In addition, utilizing the instantaneous CSI amplitude information, MDPF -2 outperforms MDPF-1 that utilizes statistical CSI magnitude information, and the performance gain is especially evident when the BPD is low. As the BPD increases, the performance gap between the two slowly becomes smaller. This is due to the fact that the feedback bits are sufficient for all the phase information required by both MDPF-1 and MDPF-2. In order to exploit the correlation in CSI amplitude between neighboring users, CSI phase and amplitude are fed back separately, which leads to the problem of bit allocation for CSI phase and amplitude feedbacks, and an unoptimized allocation strategy leads to a significant degradation of CSI feedback accuracy. Therefore, the optimal bit allocation strategy can be found by extensive simulation, similar to exhaustive search.

4

Conclusion

In this paper, we first analyze the massive MIMO system model and channel characteristics in detail, and explore the CSI feedback method based on compressed sensing technology; after that, we describe the working principle of the CSI feedback method based on deep learning, and construct a 3D convolutional neural network model (3D-CsiNet) under 5G system. The primary conclusions are as follows: 1)

Simulation results show that the 3D-CsiNet model proposed in this paper has higher prediction accuracy and generalization ability compared with the traditional AR model. Compared with the existing representative CSI feedback algorithms, the proposed 3D-CsiNet model has higher CSI reconstruction accuracy and lower number of model parameters in 3D-CsiNet, thus verifying the advantages of the proposed algorithm.

2)

The feedback performance is best when the total number of feedback bits is set to 4. The binarization operation outperforms 4-bit quantization when the feedback bits are extremely limited, but is much worse than 4-bit quantization in other cases. The original phase feedback method has an NMSE > 0 dB, and the feedback has very little useful information and is comparable to noise. The performance gains are 10.41-10.53 dB and 10.03-10.36 dB for MDPF-1 and MDPF-2, respectively, when the BPD of MDPF-1 and MDPF-2 is 0.5-0.6. Therefore, the optimal bit allocation strategy can be found through extensive simulation.

Funding:

This research was sponsored by the Beijing Nova Program (No.20240484645).

Idioma:: Inglés

Calendario de la edición:: 1 veces al año
Temas de la revista:: Ciencias de la vida, Ciencias de la vida, otros, Matemáticas, Matemáticas aplicadas, Matemáticas generales, Física, Física, otros

RSS Feed de revista

Research on downlink channel state information prediction technique for 5G system based on deep neural network

Jinhui Chen

Chenxi Zhu

Zhan Xu

Publicado en línea: 19 mar 2025

Recibido: 11 nov 2024

Aceptado: 09 feb 2025

DOI: https://doi.org/10.2478/amns-2025-0471

Palabras claveCSI, Deep neural network, 3D-CsiNet model, Orthogonal frequency division multiplexing

© 2025 Jinhui Chen et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Palabras clave
CSI, Deep neural network, 3D-CsiNet model, Orthogonal frequency division multiplexing