Otwarty dostęp

Numerical simulation and optimization method of sports teaching and training based on embedded wireless communication network

  
27 lut 2025

Zacytuj
Pobierz okładkę

Introduction

As Internet of Things (IoT) technology advances, the application of embedded wireless communication networks is expanding across various sectors, particularly in physical education training, where they demonstrate considerable potential [8]. Traditional training methods often depend on the coach’s experience and the athlete’s perception, leading to challenges in accurately monitoring physical conditions and training results [17]. The emergence of sensor technology, coupled with advancements in wireless communication and artificial intelligence, has enabled the optimization of training programs and improved effectiveness through enhanced real-time monitoring, comprehensive data analysis, and feedback. Additionally, the integration of deep and machine learning methods has significantly increased the efficiency and intelligence of handling extensive training data [19]. The numerical simulation and optimization approach for physical education training based on embedded networks, as discussed in this paper, aims to provide more scientific, timely, and personalized training guidance [9], ultimately improving athletic performance and mitigating injury risks, which is of great research and practical significance.

In recent years, the advancement of embedded wireless communication networks has led to the growing use of deep learning and machine learning models in sports training data analysis, yielding impressive results. Convolutional Neural Networks (CNNs) [25], renowned for their strong feature extraction capabilities, especially in image and video data processing, have become widely utilized in this domain. For instance, M Hao et al. [5] for 3D object detection demonstrates the potential of CNNs in complex data fusion. However, showcasing CNNs’ potential in complex data fusion. However, CNNs face challenges with time-series data, as they often fail to effectively capture dynamic temporal information. To overcome this, Recurrent Neural Networks (RNNs) [11] and their advanced form, Long Short-Term Memory (LSTM) networks [23], were introduced, incorporating recurrent and gating mechanisms to manage time-series data and long-term dependencies. Despite their strengths, RNNs and LSTMs come with high computational demands and prolonged training times when dealing with large datasets, limiting their practical application. Additionally, Zhou et al. [47] highlighted the effectiveness of deep learning in sustainable urban development with their automated garbage recognition model based on ResNet-50 and weakly supervised CNNs, a method that can also be adapted for sports training data classification and recognition.

Beyond CNNs and RNNs, traditional machine learning models such as Support Vector Machines (SVMs) and Random Forests (RFs) also hold significant importance in sports training data analysis. For instance, H Zhang et al. [42] demonstrated SVM’s effectiveness in 3D point cloud classification and segmentation with the PointGT method, particularly in small sample scenarios. However, SVMs encounter challenges with computational costs and model complexity when applied to large datasets. In contrast, Random Forests improve model stability and accuracy by combining multiple decision trees, making them robust against overfitting, especially with high-dimensional, nonlinear data. X Ning et al. [16] highlighted the broad applications of deep learning through a differentiable rendering-based approach for zero-shot 3D shape understanding. Similarly, Zou et al. [48] introduced a GAN-based bidirectional AC-DC converter, which can be adapted to enhance energy management in embedded wireless communication systems, supporting real-time data processing. Moreover, Peng et al. [20] developed an automatic news generation and fact-checking system that exemplifies deep learning’s potential in natural language processing and data validation, which could be extended to sports training feedback systems, enhancing analysis accuracy and practicality. In the context of sports training, choosing the right model to balance accuracy, computational demands, and interpretability is essential for researchers.

The motivation for this study stems from the limitations of current physical education training methods in real-time monitoring, data transmission, and intelligent analysis, especially under high-intensity and complex conditions, where these limitations make it challenging to accurately assess and optimize training outcomes. To address these issues, we propose a numerical simulation and optimization method for physical education training based on embedded wireless communication networks. This method involves real-time data collection from athletes and efficient analysis and feedback to optimize training programs, enhance training effectiveness, and reduce injury risks. To overcome the limitations of existing methods, this study introduces a deep learning model that combines Atrous Spatial Pyramid Pooling (ASPP) [22] with LSTM networks [43] to more effectively process and analyze sports training data.

First, embedded wireless sensors are used to collect real-time data from athletes, including heart rate, acceleration, and speed, which are transmitted instantly to a central processing system via a wireless communication network. In the data preprocessing stage, noise reduction and normalization are performed to ensure data accuracy and consistency. The ASPP module is then applied to extract spatial features from the preprocessed data. ASPP captures spatial information at multiple scales, allowing the model to effectively handle variations in scale and capture complex movement patterns without increasing the number of parameters or computational load. Subsequently, the LSTM network captures temporal patterns in the data, enabling more accurate predictions of athletes’ states and training outcomes.

By continuously optimizing the combined ASPP and LSTM model, we enhance the model’s prediction accuracy and real-time performance. The system ultimately analyzes the results processed by the model, generates personalized training recommendations, and transmits feedback instantly to athletes and coaches via the embedded wireless communication network. This feedback system not only provides real-time training guidance but also dynamically adjusts training plans based on the progress of training, thereby achieving personalized and refined sports training.

The contributions of this study are as follows:

Innovatively applying embedded wireless communication networks to the real-time collection and transmission of sports training data, addressing issues of data latency and transmission instability present in traditional training methods.

The proposed deep learning model combining ASPP and LSTM effectively captures multi-scale spatial features and temporal patterns in sports training data, providing a powerful tool for real-time assessment and optimization of training outcomes.

Through real-time data analysis and personalized feedback systems, this method dynamically adjusts training plans based on athletes’ progress and real-time status, improving training specificity and effectiveness while reducing injury risks.

Related Work
Specific Applications of IoT in Physical Education Training

In physical education training, IoT devices such as wearable sensors[24], smartwatches[6], and smart clothing[38] are widely used to monitor athletes’ physiological parameters and movement data in real time, including heart rate, respiratory rate, body temperature, acceleration, and posture. [27]For example, smartwatches can record data such as heart rate and calories burned, which can then be uploaded to the cloud for analysis. Smart clothing, equipped with embedded sensors, can capture muscle activity, posture, and respiratory rate, enabling coaches to monitor athletes’ physical conditions in real time.

The data collected through IoT technology can be used to analyze athletes’ performance, identify shortcomings in their movements, and provide optimization suggestions[32]. For instance, smart shoes used in football training can measure data such as running speed, distance, and step frequency, and transmit this information to a coach’s tablet or smartphone via wireless communication modules. Coaches can instantly access performance data and adjust training plans accordingly. Additionally, AI algorithms can further analyze these data to provide personalized training recommendations[4].

The use of IoT technology in physical education training presents several advantages. IoT devices enable real-time collection and transmission of exercise data, allowing for immediate feedback and timely adjustments to training plans. The integration of sensors and data analysis enhances the precision of data collection, capturing subtle changes often missed by traditional methods. Furthermore, IoT can offer personalized training suggestions by analyzing individual athlete data, improving outcomes and reducing injury risks [29]. Another major advantage is data-driven decision-making[10]; the extensive data collected by IoT devices can be analyzed using AI and machine learning models, leading to more informed decisions. Additionally, IoT technology facilitates remote monitoring and guidance, enabling coaches to provide support regardless of athletes’ locations or training schedules, which is especially valuable.

However, the application of IoT technology in physical education training also faces challenges. Firstly, the transmission of large amounts of personal physiological data over wireless networks may pose risks of data breaches or unauthorized access, making privacy protection a pressing issue. Secondly, overreliance on IoT devices for training may cause athletes to neglect their own perception and experience, which are crucial in training. Furthermore, device malfunctions or data errors could lead to incorrect training plans. Cost is another challenge; high-quality IoT devices and the necessary technical support often come with a high price, which could pose economic pressure on some small- to medium-sized sports organizations or individual athletes. Additionally, as the dimensions and volume of collected data increase, efficiently processing and analyzing these data becomes a significant challenge, potentially requiring higher computational resources and specialized technical support.

Recent research has continued to innovate in the application of IoT technology. For example, intelligent training systems that combine Augmented Reality (AR) [28] with IoT are emerging. Recent studies have shown that such systems can provide athletes with an immersive training experience. AR glasses display real-time data feedback, allowing athletes to see their physiological data and movement metrics during training and make immediate adjustments. Moreover, IoT-based injury prediction models are under development, analyzing historical training data and real-time monitoring data to predict potential future injuries and provide corresponding warnings. For example, machine learning models can analyze data from knee pressure sensors to predict the risk of knee joint injuries. Additionally, multi-modal sensor fusion training monitoring systems have gained widespread attention. Researchers are exploring methods to integrate data from multiple sensors (e.g., heart rate sensors, accelerometers, electromyography) and using deep learning models to comprehensively analyze these multi-modal data, thus providing a more holistic assessment of an athlete’s status, including fatigue levels, exercise intensity, and recovery conditions. These recent research advances highlight the broad prospects of IoT technology in physical education training and point the way for future research and applications.

Applications of Deep Learning in Sports Data Analysis

The use of deep learning in sports data analysis has rapidly emerged as a forefront area in modern sports science research. By leveraging sophisticated algorithms, these technologies empower coaches, trainers, and athletes to make more data-driven and informed decisions by automatically identifying and extracting crucial features and patterns from extensive and often complex sports datasets. For instance, CNNs [44] are widely employed in image and video analysis, facilitating tasks such as action recognition and posture estimation. On the other hand, Recurrent Neural Networks (RNNs) and their advanced versions, such as Long Short-Term Memory (LSTM) networks, are particularly adept at handling sequential data, including time-series data and sensor-based information. These applications are especially beneficial in sports where continuous monitoring and real-time feedback are critical to enhancing performance and preventing injuries.

In recent years, the development of specialized systems, such as the language processing-based automatic news generation and fact-checking system proposed by Peng et al. [20], has further highlighted the vast potential of deep learning technologies beyond traditional sports analysis, extending their utility into areas like sports reporting and data verification. Moreover, the integration of traditional machine learning algorithms, such as Random Forests and Support Vector Machines (SVMs), continues to offer robust and reliable solutions for classification and regression tasks within the sports data context, complementing the strengths of deep learning approaches.

However, despite the significant advancements, several challenges persist in the application of deep learning to sports data. Sports datasets often exhibit high dimensionality and intricate temporal dependencies, making data preprocessing and model training more complicated. Additionally, deep learning models generally require substantial amounts of labeled data, which can be particularly difficult to acquire in certain niche sports domains where data availability is limited. The process of annotating large-scale datasets is often labor-intensive and time-consuming, further complicating the application of these models. Training these models also demands significant computational resources, and when working with large-scale datasets, the prolonged training times may not align with the needs of real-time applications or quick decision-making. For example, the model proposed by Hengmin Zhang et al. [41] is heavily reliant on computational resources, limiting its effectiveness in real-time data processing, particularly in fast-paced sports environments where decisions need to be made quickly.

Moreover, the challenge of interpretability in deep learning models remains a significant barrier. Coaches and athletes often require clear and understandable insights to make effective use of the predictions generated by these models. Therefore, as deep learning continues to advance, there is a growing need to develop techniques that improve the transparency and interpretability of these models, making their outputs more actionable for practitioners in the sports field.

In summary, the application of deep learning and machine learning technologies in sports data analysis has shown great potential, particularly in action recognition, posture estimation, and performance analysis. While these technologies offer significant advantages in handling complex data, automating feature extraction, and providing personalized recommendations, challenges remain in data labeling, high computational resource requirements, and the interpretability of models. As research progresses, more and more improvements are being proposed, such as using transfer learning to address small sample problems or employing model compression and optimization techniques to reduce computational complexity. Additionally, improving model interpretability to make training results more understandable and applicable is becoming an important direction for future research. In conclusion, the future of deep learning and machine learning in sports data analysis is promising, and ongoing research will further promote the application of these technologies in practical sports training and competition.

Design and Optimization of Intelligent Sports Training Systems

IoT advanced systems harness real-time monitoring, comprehensive data analysis, and feedback mechanisms to deliver personalized training programs, thereby improving the overall effectiveness of training outcomes. For example, Wang and Park [30] developed an intelligent training system specifically designed to improve college students’ mental health and physical fitness through targeted exercise regimens. Typically, these systems incorporate a wide array of sensors, wearable devices, and sophisticated data analysis platforms, which work in concert to collect and process real-time physiological and movement data from athletes. The data are then analyzed using AI algorithms to generate specific and actionable training recommendations tailored to individual athletes.

Despite the numerous advantages that intelligent sports training systems offer, their design and implementation are not without challenges. One significant hurdle is the effective integration of multi-source data to ensure the accuracy and consistency of the information provided by these systems. For example, Wei et al. [34] introduced an AI-based training system that leveraged big data to optimize training, but the system faced considerable challenges due to limitations in data integration and processing capabilities. These challenges ultimately hindered the system’s scalability, leading to its retraction from broader application. Moreover, the necessity for real-time data processing adds another layer of complexity, as it requires the system to function seamlessly without disrupting the athletes’ training routines. This demands a careful balance between generalization and personalization to deliver training programs that cater to athletes with varying skill levels and needs.

Furthermore, the application of intelligent sports training systems in physical education offers a multitude of advantages. Firstly, these systems are capable of monitoring athletes’ training status in real-time by collecting vast amounts of physiological and movement data through sensors and wearable devices. This allows both coaches and athletes to receive feedback promptly, enabling them to make necessary adjustments to training plans with immediate effect. Luo et al. [15]conducted a study on an intelligent sports training system based on Triboelectric Nanogenerators (TENGs), which demonstrated significant improvements in energy harvesting and motion monitoring efficiency through innovative technological applications. This study underscores the potential of integrating intelligent sports devices into training systems. However, these advanced systems also present certain drawbacks. The high costs associated with their design and implementation, particularly concerning the setup of hardware devices and data processing platforms, can be prohibitive. Additionally, the complexity of these systems often increases the difficulty of maintenance and operation, necessitating additional training for non-technical personnel to use the systems effectively. This may further complicate the widespread adoption of such systems in various sports training environments.

In summary, intelligent sports training systems, by incorporating advanced technologies such as AI, big data, and IoT, offer athletes scientific and personalized training programs that significantly enhance training effectiveness and efficiency. Nevertheless, challenges remain in areas such as data integration, real-time performance, cost management, and user-friendliness. As these systems continue to evolve, it is crucial to address these challenges to fully realize their potential in transforming sports training and physical education. Ongoing research and development efforts are essential to overcoming these obstacles, with a particular focus on improving the scalability and accessibility of intelligent training systems while ensuring they remain cost-effective and easy to use for a broad range of users.

Method

This paper proposes a method that combines ASPP and LSTM networks for analyzing and optimizing sports training data. The method aims to address the limitations of current sports training methods in real-time monitoring, data transmission, and intelligent analysis, particularly in high-intensity and complex environments. By using embedded wireless sensors to collect athletes’ motion data in real time, and combining ASPP for multi-scale feature extraction with LSTM to capture temporal patterns, this method enables precise assessment and optimization of training outcomes, providing personalized training recommendations and reducing the risk of sports injuries. See Figure 1 for the architecture of sports training system.

Figure 1.

Physical training system architecture.

ASPP Model

In this study, we employ embedded wireless sensors to capture real-time motion data from athletes, including metrics such as heart rate, acceleration, and speed. After transmission to a central processing system via a wireless network, the data undergoes preprocessing before being fed into the ASPP module for extracting multi-scale features. ASPP leverages dilated convolution filters with varying sampling rates to process data across multiple scales, enabling the capture of spatial information at different resolutions. This effectively enlarges the receptive field of the neural network, allowing it to gather richer contextual details without escalating computational load.

The core of the ASPP module is the dilated convolution, mathematically expressed as: z[i]=j=1Kx[i+sj]u[j]$$z\left[ i \right] = \mathop \sum \nolimits_{j = 1}^K x\left[ {i + s \cdot j} \right] \cdot u\left[ j \right]$$

where z[i] denotes the convolution output, x[i] is the input feature map, u[j] is the convolution filter, and s represents the dilation factor, with j marking the filter index.

ASPP achieves feature extraction across different scales by varying the dilation rate s. Suppose the ASPP module comprises M dilated convolution layers, each with dilation rates s1, s2, …, sM, then the convolution output is expressed as: z(m)[i]=j=1Kx[i+smj]u(m)[j],m=1,2,,M$${z^{\left( m \right)}}\left[ i \right] = \mathop \sum \nolimits_{j = 1}^K x\left[ {i + {s_m} \cdot j} \right] \cdot {u^{\left( m \right)}}\left[ j \right],m = 1,2, \ldots ,M$$

Here, z(m)[i] is the output of the m-th dilated layer, and u(m)[j] is the corresponding filter.

Finally, the ASPP module aggregates the outputs of the various dilated layers, typically through concatenation: zASPP[i]=concat(z(1)[i],z(2)[i],,z(M)[i])$${z_{{\text{ASPP}}}}\left[ i \right] = {\text{concat}}\left( {{z^{\left( 1 \right)}}\left[ i \right],{z^{\left( 2 \right)}}\left[ i \right], \ldots ,{z^{\left( M \right)}}\left[ i \right]} \right)$$

where concat(·) indicates concatenation along the channel dimension, yielding the ASPP module’s final output.

In these formulas, z[i] is the output feature, x[i] is the input map, u[j] represents filter weights, s is the dilation rate, and M is the number of dilated layers. By aggregating the outputs from different scales, the ASPP module enhances the network’s capability to recognize intricate motion patterns, forming a robust basis for LSTM-based temporal analysis.

The ASPP module captures contextual information at multiple scales within the input features, thus enhancing the recognition of complex motion patterns. This provides a solid foundation for the subsequent use of LSTM for timing analysis.

LSTM model

LSTM is a recurrent neural network tailored for processing sequential data, effectively capturing long-term dependencies through its distinct architecture. In this research, spatial features derived from the ASPP module are input into the LSTM network to manage temporal relationships within motion data. The LSTM, utilizing memory and forget mechanisms, accurately reflects variations in an athlete’s condition across different training stages (see Figure 2 for the LSTM structure following convolution).

The core of an LSTM consists of three gates: forget, input, and output gates. These gates collaborate to regulate information flow within the cell state. The operations are mathematically defined as follows: fn=σ(Wf[hn1,xn]+bf)$${f_n} = {\rm \sigma }\left( {{W_f} \cdot \left[ {{h_{n - 1}},{x_n}} \right] + {b_f}} \right)$$ in=σ(Wi[hn1,xn]+bi)$${i_n} = {\rm \sigma }\left( {{W_i} \cdot \left[ {{h_{n - 1}},{x_n}} \right] + {b_i}} \right)$$ Sn˜=tanh(WS[hn1,xn]+bS)$$\tilde {{S_n}} = \tanh \left( {{W_S} \cdot \left[ {{h_{n - 1}},{x_n}} \right] + {b_S}} \right)$$ Sn=fnSn1+inSn˜$${S_n} = {f_n} \cdot {S_{n - 1}} + {i_n} \cdot \tilde {{S_n}}$$ on=σ(Wo[hn1,xn]+bo)$${o_n} = {\rm \sigma }\left( {{W_o} \cdot \left[ {{h_{n - 1}},{x_n}} \right] + {b_o}} \right)$$ hn=ontanh(Sn)$${h_n} = {o_n} \cdot \tanh \left( {{S_n}} \right)$$

In these equations, fn is the output of the forget gate, in is the output of the input gate, Sn˜$$\tilde {{S_n}}$$ represents the candidate state, Sn is the current cell state, on is the output of the output gate, and hn is the hidden state at time step n. The matrices Wf, Wi, WS, Wo correspond to the weights linked with the gating units, while bf, bi, bS, bo are the associated bias terms. The function σ is typically the sigmoid, and tanh denotes the hyperbolic tangent.

Figure 2.

Multi-group LSTM network structure after convolution.

These equations explain how the LSTM controls the retention of information from the previous state Sn−1 through the forget gate fn, updates the current state Sn via the input gate in and candidate state Sn˜$$\tilde {{S_n}}$$, and generates the hidden state hn at time step n using the output gate on and the updated state Sn. These mechanisms enable the LSTM to effectively manage long-term dependencies, making it ideal for analyzing shifts in an athlete’s state across various training phases.

In these formulas, xn represents the input features, typically derived from the ASPP module. The variables hn−1 and hn are the hidden states from the previous and current time steps, respectively. Similarly, Sn−1 and Sn refer to the cell states from the previous and current steps. The parameters Wf, Wi, WS, Wo, and biases bf, bi, bS, bo are the weight matrices and bias vectors optimized during training.

Model Optimization

In the model optimization phase, the Gradient Descent (GD) algorithm is widely utilized for iterative updates of model parameters, aimed at minimizing the loss function. The integrated ASPP and LSTM model undergoes several training cycles and parameter tuning, ultimately achieving high-precision predictions for motion data. GD operates by computing the gradient of the loss function concerning model parameters and adjusting these parameters in the direction opposite to the gradient, steering the model toward optimal performance.

The fundamental formula for Gradient Descent is: ϕn+1=ϕnηϕL(ϕn)$${\phi _{n + 1}} = {\phi _n} - {\rm \eta} \cdot {\nabla _\phi }L\left( {{\phi _n}} \right)$$

where ϕn represents the model parameters at iteration n, η is the learning rate, and ∇ϕLn) denotes the gradient of the loss function L(ϕ) with respect to ϕ.

The gradient is calculated as: ϕL(ϕ)=L(ϕ)ϕ$${\nabla _\phi }L\left( \phi \right) = \frac{{\partial L\left( \phi \right)}}{{\partial \phi }}$$

where L(ϕ)ϕ$$\frac{{\partial L\left( \phi \right)}}{{\partial \phi }}$$ is the partial derivative of the loss function.

During each iteration, the parameter update formula can be further expanded as: ϕn+1=ϕnη1pj=1pϕLj(ϕn)$${\phi _{n + 1}} = {\phi _n} - {\rm \eta} \cdot \frac{1}{p}\mathop \sum \nolimits_{j = 1}^p {\nabla _\phi }{L_j}\left( {{\phi _n}} \right)$$

where p denotes the number of training samples, and ∇ϕLjn) is the gradient for the j th sample. Iteratively, GD refines the model parameters, reducing the loss function and enhancing the model’s accuracy.

In these equations, ϕn signifies the model parameters at iteration n; η is the learning rate, determining the update magnitude; ∇ϕLn) directs the update process; and p is the count of training samples. By appropriately selecting η and performing multiple iterations, the model progressively converges to optimal parameters, thus improving prediction accuracy on motion data.

Experimental
Datasets

The experiments employ two publicly available datasets: PAMAP2 [26] and MHEALTH [1].

The PAMAP2 dataset includes physiological and motion data from nine participants, collected using three IMUs and a heart rate monitor, capturing metrics like heart rate, acceleration, angular velocity, magnetic field, and temperature. This dataset is ideal for real-time monitoring and analysis in physical training, particularly when integrated with deep learning models for time-series prediction.

The MHEALTH dataset comprises physiological and motion data from ten volunteers engaged in various activities. Each volunteer is equipped with three IMUs placed on the chest, right wrist, and left ankle. The data encompass activities such as jogging, walking, and cycling, with detailed metrics including acceleration, angular velocity, and heart rate. These data are valuable for time-series prediction using LSTM and multi-scale spatial feature extraction via the ASPP module.

Each dataset is split into training, validation, and test sets in a 60%, 20%, and 20% ratio, respectively.

Evaluation Criteria

Throughout the training process, the model undergoes numerous iterations using the previously mentioned optimizer, loss function, batch size, and learning rate configurations. These iterations ensure that the model adapts and fine-tunes its performance progressively. During training, the model undergoes multiple iterations using the previously mentioned optimizer, loss function, batch size, and learning rate settings.

The Maximum F-measure Fβ is calculated as: Fβ=(1+β2)×Precision×Recallβ2×Precision+Recall$${F_\beta } = \frac{{\left( {1 + {\beta ^2}} \right) \times {\text{Precision}} \times {\text{Recall}}}}{{{\beta ^2} \times {\text{Precision}} + {\text{Recall}}}}$$

Here, the parameter β adjusts the balance between precision and recall, typically set to 0.5 to emphasize precision. This metric provides insights into the trade-off between these two important aspects of model accuracy.

The Mean Absolute Error (MAE) can be computed as: MAE=1H×Wi=1Hj=1W|P(i,j)G(i,j)|$$\text{MAE}=\frac{1}{H\times W}\sum\nolimits_{i=1}^{H}{\sum\nolimits_{j=1}^{W}{\ \left| \mathbf{P}\left( i,j \right)-\mathbf{G}\left( i,j \right) \right|}}$$

where H and W are the image height and width. P(i, j) is the predicted saliency score at position (i, j), normalized to [0,1], and G(i, j) is the ground truth saliency score, also normalized to [0,1]. while G(i, j) represents the corresponding ground truth saliency score, also normalized to the same range. MAE provides a straightforward measure of the average prediction error across the image.

The Weighted F-measure Fωβ$$F_{\rm \omega}^\beta$$ is calculated as: Fβω=(1+β2)×Precisionω×Recallωβ2×Precisionω+Recallω$$F_\beta ^\omega = \frac{{\left( {1 + {\beta ^2}} \right) \times {\text{Precisio}}{{\text{n}}^\omega } \times {\text{Recal}}{{\text{l}}^\omega }}}{{{\beta ^2} \times {\text{Precisio}}{{\text{n}}^\omega } + {\text{Recal}}{{\text{l}}^\omega }}}$$

Here, Precisionω and Recallω represent the weighted precision and recall, taking into account the importance of each pixel. The parameter β is similarly used to adjust the relative importance of precision and recall, typically set to 0.3.

The Structure Similarity Measure Sm can be computed as Sm=αSo+(1α)Sr$${S_m} = \alpha {S_o} + \left( {1 - \alpha } \right){S_r}$$

where So measures object-aware structure similarity between the predicted saliency map and the ground truth at the object level, and Sr measures region-aware similarity at the region level. The parameter α balances the importance of So and Sr, typically set to 0.5.

The Enhanced Alignment Measure Em can be calculated as: Em=1H×Wi=1Hj=1WE(P(i,j),G(i,j))$${{E}_{m}}=\frac{1}{H\times W}\sum\nolimits_{i=1}^{H}{\sum\nolimits_{j=1}^{W}}\ \ E\left( \mathbf{P}\left( i,j \right),\mathbf{G}\left( i,j \right) \right)$$

where E(P(i, j), G(i, j)) evaluates the alignment between the predicted saliency P and the ground truth saliency map G at pixel (i, j), and H and W representing the image dimensions.

Result

In Table 1, we present a comparative analysis of the performance of several deep learning models applied to the PAMAP2 and MHEALTH datasets, assessed using four key metrics: Maximum F-measure Fβ, Mean Absolute Error (MAE), Weighted F-measure Fωβ$$F_{\rm \omega}^\beta$$, and Structure Similarity Measure Sm. Specifically, the Fβ score provides an evaluation of the balance between precision and recall, while the MAE offers a measure of the average deviation between predicted values and ground truth. Additionally, the Fωβ$$F_{\rm \omega}^\beta$$ metric places emphasis on precision and recall after applying a weighting factor, and Sm focuses on evaluating structural similarity.

The experimental findings decisively show that our proposed ASPP+LSTM model outperforms all other models across both datasets, achieving superior performance on every metric considered. For instance, when tested on the PAMAP2 dataset, our model recorded an impressive Fβ score of 0.740 and an MAE of 0.178. In comparison, alternative models such as AFNet and TSPOANet yielded Fβ scores of 0.721 and 0.716, with corresponding MAE values of 0.184 and 0.185, respectively. These results underscore the effectiveness of our model, particularly its enhanced ability to capture and leverage multi-scale spatial features along with complex temporal patterns, thereby leading to highly accurate predictions of training outcomes.

Moreover, as depicted visually in Figure 3, these outcomes further illustrate the distinct advantages our ASPP+LSTM model holds across all evaluation metrics. The figure clearly demonstrates how our model consistently surpasses the other methods evaluated on the PAMAP2 and MHEALTH datasets, highlighting its remarkable capability and reliability in real-world applications.

Compares our model with other deep learning mainstream methods in terms of Fβ()$${F_\beta }\left( \downarrow \right)$$, MAE()$$MAE\left( \downarrow \right)$$, Fβω()$$F_\beta ^\omega \left( \uparrow \right)$$, and  Sm()$$\;{S_m}\left( \uparrow \right)$$ on two datasets. The best result for each column is highlighted in bold.

Method PAMAP2 MHEALTH
Fβ MAE Fβω$$F_\beta ^\omega$$ Sm Fβ MAE Fβω$$F_\beta ^\omega$$ Sm
AFNet [3] 0.721 0.184 0.526 0.636 0.815 0.114 0.612 0.708
DSS [7] 0.683 0.197 0.489 0.608 0.782 0.127 0.598 0.681
HRSOD [39] 0.692 0.193 0.505 0.617 0.795 0.122 0.605 0.690
FCSOD [40] 0.701 0.189 0.513 0.625 0.804 0.119 0.610 0.700
PA-KRN [36] 0.712 0.186 0.520 0.632 0.810 0.116 0.615 0.705
TSPOANe t[14] 0.716 0.185 0.523 0.634 0.813 0.115 0.618 0.707
Our 0.740 0.178 0.540 0.649 0.822 0.110 0.625 0.715
Figure 3.

Multiple performance metrics comparison of different models on PAMAP2 and MHEALTH datasets.

In Table 2, we compared the performance of various advanced models on the PAMAP2 and MHEALTH datasets using the Enhanced Alignment Measure Em. The Em metric evaluates the alignment between predictions and ground truth at both the global and local pixel levels, reflecting the model’s ability to capture detail and maintain overall consistency. The results show that our ASPP+LSTM model achieved the highest Em scores on both datasets, reaching 0.869 and 0.865, significantly outperforming other methods. For example, on PAMAP2, PiCA and PoolNet had Em scores of 0.754 and 0.761, while on MHEALTH, DUCRF, though close, still fell short with an Em of 0.821. These results indicate that our proposed model exhibits strong generalization capabilities across different datasets, maintaining superior alignment performance in various environments.

Our model is compared with 17 state-of-the-art methods in terms of  Em()$$\;{E_m}\left( \uparrow \right)$$ on 2 datasets.

Method PAMAP2 MHEALTH Method PAMAP2 MHEALTH
Em Em
AFNet [3] 0.632 0.471 CPD [35] 0.788 0.715
DSS [7] 0.624 0.586 BASNet [21] 0.763 0.728
HRSOD [39] 0.682 0.524 GCPANet [2] 0.722 0.762
FCSOD [40] 0.642 0.623 LDF [33] 0.749 0.725
PA-KRN [36] 0.628 0.608 ITSD [46] 0.792 0.781
TSPOANet [14] 0.692 0.611 MINet [18] 0.814 0.744
BRN [31] 0.715 0.644 GateNet [45] 0.826 0.791
PiCA [13] 0.754 0.672 DUCRF [37] 0.851 0.821
PoolNet [12] 0.761 0.701 Our 0.869 0.865

To validate the contributions of the ASPP and LSTM modules to the model’s performance, we designed an ablation study that included the following configurations:

M1: Removed both ASPP and LSTM modules, using only the base CNN model for feature extraction and prediction through fully connected layers. M2: Removed the ASPP module, using only LSTM for time-series prediction. M3: Replaced the ASPP module with CNN and performed predictions. M4: Removed the LSTM module, using only the ASPP module for spatial feature extraction and prediction through fully connected layers. M5: Complete ASPP+LSTM model.

Each model configuration was trained and evaluated, with results recorded for the test set on the metrics Fβ, MAE, Fωβ$$F_\omega ^\beta$$, Sm, and Em. The ablation study results were analyzed to clarify the contributions of each module to the final performance.

Influence of the main components of ASPP+LSTM.

Method Components PAMAP2 MHEALTH
Fβ MAE Fβω$$F_\beta ^\omega$$ Sm Em Fβ MAE Fβω$$F_\beta ^\omega$$ Sm Em
M1 CNN 0.701 0.147 0.514 0.612 0.601 0.821 0.101 0.605 0.708 0.413
M2 LSTM 0.689 0.151 0.507 0.605 0.589 0.812 0.105 0.598 0.700 0.405
M3 CNN+LSTM 0.710 0.144 0.520 0.616 0.609 0.825 0.099 0.610 0.710 0.418
M4 ASPP 0.703 0.146 0.516 0.614 0.603 0.822 0.100 0.607 0.709 0.414
M5 ASPP+LSTM 0.723 0.140 0.530 0.625 0.617 0.834 0.096 0.618 0.715 0.425

In Table 3, we assessed the contributions of the ASPP and LSTM modules to the model’s performance through an ablation study. The experiments were conducted on the PAMAP2 and MHEALTH datasets, using key metrics such as Maximum F-measure Fβ, Mean Absolute Error (MAE), Weighted F-measure Fωβ$$F_\omega ^\beta$$, Structure Similarity Measure Sm, and Enhanced Alignment Measure Em. These metrics respectively evaluate the model’s precision, error, weighted balance, structural similarity, and alignment. The results from five different model configurations show that the full ASPP+LSTM model (M5) consistently performed the best across all metrics. For instance, on the PAMAP2 dataset, the model achieved an Fβ of 0.723 and an MAE of 0.140, significantly outperforming the CNN-only M1 configuration (Fβ of 0.701 and MAE of 0.147). This highlights the critical role of the ASPP module in multi-scale spatial feature extraction and the significant enhancement of temporal prediction accuracy by the LSTM module.

Conclusions

This study aims to address the limitations of current sports training methods in real-time monitoring, data transmission, and intelligent analysis, particularly in high-intensity and complex environments. To this end, we propose a numerical simulation and optimization method for physical education training based on an embedded wireless communication network. Our approach combines ASPP with LSTM networks to effectively process and analyze multi-scale spatial features and temporal data from athletes. In our experiments, we used the PAMAP2 and MHEALTH datasets, conducting both comparative and ablation studies. The results demonstrate that our model excels across all key metrics, particularly in terms of maximum F-measure, mean absolute error, weighted F-measure, and structure similarity measure, proving the effectiveness and advantages of the proposed method.

Despite the strong performance of the ASPP+LSTM model in our experiments, some limitations remain. Firstly, the model’s computational complexity is relatively high, which can result in significant time costs for training and inference, especially when dealing with large datasets. This could impact real-time applications. Future research could explore model compression and optimization techniques, such as quantization and pruning, to reduce computational complexity. Secondly, while the model performed well on the two datasets used, its generalization to other types of sports data or different athletic activities has not yet been validated. Future work could expand the diversity of datasets to test the model’s applicability in more complex scenarios.

In conclusion, this paper presents an effective numerical simulation and optimization method for sports training by integrating ASPP and LSTM models. The key contributions of this study include: 1) the first application of the combined ASPP and LSTM approach for multi-scale spatial feature extraction and time-series analysis in sports training data, significantly improving the model’s prediction accuracy and real-time performance; 2) the use of ablation studies to clarify the specific contributions of each module to overall model performance, validating the method’s effectiveness. Not only did our method achieve excellent results in experiments, but it also provides valuable insights for the design and optimization of future intelligent sports training systems, contributing to the advancement of scientific and personalized sports training.

Język:
Angielski
Częstotliwość wydawania:
1 razy w roku
Dziedziny czasopisma:
Nauki biologiczne, Nauki biologiczne, inne, Matematyka, Matematyka stosowana, Matematyka ogólna, Fizyka, Fizyka, inne