Ground investigations play a crucial role in the process of designing and constructing safe and durable structures such as buildings, bridges, roads, and other infrastructure elements. Knowledge of the ground conditions allows for the assessment of the interaction between the foundations and the soil, enabling the designer to adopt the optimal structural solution. Knowledge of the ground conditions is also necessary during the construction phase of the object, as it enables the prediction and prevention of adverse phenomena that may occur in the soil, such as excessive settling, landslides, or liquefaction. Preventing these phenomena ensures the safety and efficiency of the work being carried out.
Soil is a complex engineering material that has been formed by a combination of various geological, environmental, and physicochemical processes [5]. As a result of these processes, the soil exhibits anisotropic, nonhomogeneous geotechnical properties, which additionally change with depths and over time. Complexity of the soil structure makes recognition of ground conditions a complicated task. Obtaining information about the soil is carried out by conducting tests in both field and laboratory. Currently, one of the most commonly performed field tests is the Cone Penetration Test (CPTU). Geotechnical experiments usually measure physical quantities that are possibly well correlated with the values of geotechnical soil parameters needed in the design process. This is also the case with the CPTU test. Here, the measured physical quantities are: resistance on the cone, the value of the pore pressure in the cone filter and the friction on the side of the sleeve located above the cone. These data depend not only on the type of soil, but also on the location of tested points at which the cone registers the data during its movement into the ground. There are numerous techniques that allow to correlate the values of these measured quantities with the values of geotechnical parameters, such as oedometric modules, cohesion, angle of internal friction, and many others needed in the design of the foundation, but they provide underfitted results or the ones that are fitted only locally. Correlations can be established through approximations made using mathematical functions and another method involving graphical and tabular interpretations. The article proposes an approximation method using artificial neural networks (ANNs), which has the advantage of incorporating all measurements recorded by the CPTU probe, as well as other additional information such as soil type. ANN functionalities are well suited to solve the problem of establishing a complex relationship of many variables, which cannot be written in a simple way. In this article, using an ANN, based on the measurement data obtained as a result of the use of CPTU, we determine the type of soil and its state characterized by the liquidity index
The ANN we adopt here is known as the Long Short-Term Memory (LSTM) neural network (for more about LSTM, see [10], [11], [19]). It is an ANN that is qualitatively different from that usually used in geotechnical applications layered networks trained using the error back propagation method (multilayer perceptrons [MLPs]). Instead of the input and output layers in MLP, associated with the input data and their image at the output of the network, in the LSTM network, an input data stream is distinguished, which, after applying one of the deep learning algorithms, is transformed into an output data stream. Each element of the output stream depends both on the value of the corresponding element of the input stream, as well as on the values of parameters in its narrow or wide neighborhood. The input sequence here is a sequence of pressure values on the measuring cone and friction on the measuring sleeve at each of the successive cone penetration depths. The output stream is the value of the liquidity index and the number assigned to the type of soil in this point of the ground. The LSTM network is usually used for automatic text translation. In our approach, a well-trained LSTM network “translates” the sequence of triples of measured values into the sequences interpreted as the value of the liquidity index and the type of soil at corresponding depths.
The paper is organized as follows: in the second section, LSTM network is shortly characterized; in the third section, we describe the process of experimental data acquisition; in the fourth section, the data preparation procedure is explained. The fifth section presents the application of the LSTM network to calculate the liquidity index and to determine the type of soil, based on the CPTU probe tests. The work ends with conclusions summarizing the presented method of interpretation of measurement data.
ANN is a computational tool that is part of a field of computer science called machine learning. ANN works in an algorithmic way on the database which contains a description of a physical phenomenon at hand. The database splits into two sets: input data and output data, related to the input data by the relationships usually not known. The result of an ANN's activity is a multivariate approximation function that approximates this unknown relation and is able to predict new output for any newly entered input. It is to be highlighted that ANN itself discovers the dependencies between the input and output data during the learning process (shaping of the internal parameters – weights and biases) in training. ANNs with the structure of an MLP have been used in geotechnics for more than 30 years. A review of various applications of MLP in geotechnics can be found in [1]. The success of this neural network is based on the fact that it is a very good approximator of a function or an operator. The best approximation of the values of this functional relation can be found by training based on examples of approximate functional dependence (see [2]). In [4], CPTU test parameters with auxiliary data were used to detect engineering parameters, for example, overconsolidation ratio (OCR),
Unfortunately, a classical ANN MPL cannot be used for interpretation of continuous series of measured data coming from CPTU or similar devices, since it cannot interpret sequences of the data with variable length, for which the order of the data is important. Moreover, for interpretation of the current measurement point, the input data far from it should be less important than the data taken from its close neighborhood.
One of the solutions that is not susceptible to the above limitations is the LSTM network, first presented in [10]. LSTM is a recurrent ANN characterized by the ability to find patterns in the input sequence of data, commonly used for analysis of long sequences of data, in particular, in automatic translation of one sequence of data into another data sequence (sequence2sequence problems). The most common application of LSTM ANN is translating of a string of words in one language into a string of words in another language. LSTM simply replaces the words in one language with the words that appeared in the given context in the training pattern – a set of examples of correct translations furnished in the training process. In the application proposed in this paper, the first string (the input string) is the sequence of pairs of data registered by CPTU: resistance pressure under the cone and friction on the sleeve on each consecutive depth. The output sequence (result of the “translation”) is the number identifying the soil type and the value of liquidity index
The activity of the LSTM network is assured by three internal subnets of simple structure of sigmoid layers. The first subnetwork has to decide what information we are going to throw away from the current cell state. This sigmoid layer is called the “forget gate layer.” The next sigmoid layer has to decide what new information we are going to store in the cell state. This sigmoid layer, called the “input gate layer,” decides which values will be updated. Finally, we need to decide what we are going to output. This output will be based on the current cell state, which is influenced by the short and long memory states of the previous elements of the sequence. The third sigmoid layer decides which parts of the cell state we are going to output. This process is corrected by comparison of the current output value with the needed target value in the process of training. The weights and biases of the sigmoid layers are changed according to this error by an assumed minimization procedure.
Architecture of LSTM type networks, significance, and selection of hyperparameters such as batch size, learning rate, type of neurons activation functions, number of neurons and layers, and number of epochs are described in [15] and [19]. The interested reader will find there some other references for works with varying levels of detail, theoretical or related to different practical applications of LSTM. Detailed analysis of the structure and actions of LSTM can also be found in many internet pages, for example, [11]. To our knowledge, this kind of ANN has not been applied yet in geotechnical problems.
The advantage of LSTM is that it takes into account the time factor. This means that the network, when performing computational operations on current data, takes into account the state from the previous step. The network thus understands the data as a sequence, where order matters.
In LSTM networks, an important advantage is that the problem of vanishing and exponential gradients has been eliminated. The problem is that during backpropagation, that is, in the network learning process, fixed weights between neurons may become practically insensitive to change or their values may start to grow exponentially. The expiration of network weight changes results in the fact that a given computational step will be forgotten and will not affect the later one (this applies to longer-input sequences). To overcome this problem, LSTM networks use a gating system called long-term memory that has been described roughly above in this section.
Static CPTU probing involves pushing a cone into the ground (Figure 1) under the influence of machine pressure at a constant speed (approximately 2 cm/s). During the test, information about the resistance on the cone
The widespread use of CPTU testing in geotechnics is due to the advantages that this method provides compared to other field tests. Thanks to the mechanized system of cone penetration and automatic data recording, CPTU probing has high accuracy of measured data and is practically immune to errors arising from the operator's work. The test is conducted
For the purposes of the presented research, the CPTU test was associated with execution of control geological drillings (control geological drillings are often used in practice, in parallel with the CPTU test). A geologist distinguishes layers of soil and recognizes the type of soil in each of them. This field investigation was completed by laboratory tests for Atterberg limits, moisture content, sieve and oedometer analysis.
The liquidity index is calculated based on laboratory testing of cohesive soil samples taken from the boreholes. It is a dimensionless parameter computed according to the well-known equation (1):
On the designated area, eight CPTU tests were carried out with an average distance of 50 m between them. The depth of sounding ranged from 5.4 to 17.5 m. At a distance of approximately 2 m from each sounding, a geotechnical borehole was drilled to determine the layers of the soil and to obtain samples for laboratory testing. For each layer, 5–13 samples were taken and their liquidity index was tested. The assumed
As shown in Figure 2e, the liquidity index graph is constant within the designated geotechnical layer. This is insufficient to build a neural network and inconsistent with engineering intuition. The liquidity index has varying values within the layer. To obtain appropriate data, several assumptions had to be made.
The proposed correlations between the liquidity index and the probing results, including the normative correlation [6], are a function of the cone resistance and the type of soil
The second assumption involves the statistical data from laboratory elaboration of CPTU results. The average coefficient of variation for determining the liquidity index for all layers was
The average coefficient of variation for cone resistance
The assumption was made that within the layer, the liquidity index changes proportionally to the coefficient of variation of a single measurement of the cone resistance
The above relationship was applied to the results of all eight field research points. Finally, the liquidity index charts were transformed, as seen in the example shown in Figure 3.
The transformation procedure was performed for all research points.
As a tool for establishing the correlation between
Division of soil types into categories.
Made ground | Mg | 1 |
Fine sand | FSa | 2 |
Medium sand | MSa | 3 |
Glacial till | siSa, clSa, sasiCl | 4 |
Settled deposits | clSi, saSi, Si, | 5 |
Settled deposits with gravel | grsaSi, grclSi | 6 |
Gravel | Gr | 7 |
The results of the eight research points were combined while maintaining the order of CPTU measurements according to the depth of the test. The designed network consists of 5297 sets of input and output data. Of these, 4500 initial sets were chosen for network training and the remaining sets were used for testing the network. The network was trained until the mean squared error of predicted values relative to the output data reached the lowest value. Mean Square Error (MSE) of the result obtained was 0.00156.
Figure 4 shows the results of the designed LSTM network. The green dashed line represents the beginning of the data used for testing the network. The red dashed line represents the introduced output data - the liquidity index, and the blue dashed line represents the results of the trained network. Small oscillations of predicted values are natural because there are often interlayers or admixtures of other soils that differ from the parameters of the determined layer in the soil.
Below, we present a preliminary comparison of the prediction obtained using LSTM ANN with the one resulting from correlation proposed by the standard document PN-B-04452:2002. The blue line on the graph represents the laboratory results. It can be seen that the prediction results are better than those proposed in the standard document PN-B-04452:2002 given by the equation
The prediction accuracy of designed LSTM is similar for the remaining profiles. It is presented in Figure 4.
The basic parameters
The designed LSTM network consists of 8060 input and output data sets. For network training, 6200 initial sets were selected, while the remaining sets were used for testing the network. The data were entered chronologically to borehole profiles. Figure 6 shows the results of the designed LSTM network. The mean squared error of predicted values relative to the output data reached MSE = 0.0104.
Network has difficulty in recognizing some layers, but it eventually assigns a similar type of soil, as, for example, in the third borehole, where gravels at a depth of 5.7 m were classified as medium sands. In some points, the network assigns a non-integer number, but rounding these values results in a well-predicted layer. Minor oscillations in the predicted values may result from naturally occurring small geological differences within the layers.
Two classical LSTM recurrent ANNs have been created to interpret the results of CPTU. The first one has been trained to establish the correlations between the recognized type of cohesive soil, measured pressure at the cone, and friction on the sleeve for consecutive depths (
We believe that the choice of the LSTM type network is appropriate for the interpretation of measurement data strings obtained as a result of probing with the CPTU probe, because during the training phase, the network takes into account the “context” of each single measurement. Therefore, the results of the interpretation show the correct division of the ground area into layers with similar properties. The use of the LSTM type neural network, qualitatively different from the MLP type ANN, is an element of novelty in the application of ANNs in geotechnics. We observe that the quality of correlation modeled in the LSTM network is good. The final results perform well when considering the database consisting of the parameters of eight CPTU tests and laboratory test results. The accuracy of the network can be improved by increasing the set of input and output data. The LSTM network trained with results of a representative number of surveys will allow for a quick interpretation of the survey results, obtained immediately after completion of the probing.
The use of a transformation equation (Equation 3) for the output data of the LSTM network (liquidity index), which is intended to attempt to reproduce the natural variability of a geotechnical parameter in the soil, resulted in an improvement in the effectiveness of the network by 20% compared to the same output data, but without their transformation (constant liquidity index in the layer).
It should be mentioned that, apart from the selection of input data, the adopted network architecture also has a significant impact on the quality of network prediction. Because this is a complicated IT issue, it was discussed only superficially in the article.
We are sure that the idea of application of LSTM type ANN is promising in solving many other engineering problems in geotechnics.