Accesso libero

Artificial Neural Network and Regression Models to Evaluate Rheological Properties of Selected Brazilian Honeys

INFORMAZIONI SU QUESTO ARTICOLO

Cita

INTRODUCTION

Honey is a fluid food and therefore knowledge of its rheological properties is important in the production processes. This enables better quality control and proper sizing of processing units, since it enables performance evaluation and selection of equipment for product handling, packaging and processing and avoids unnecessary expenditures of beekeepers and the industry (Yanniotis, Skaltsi, & Karaburnioti, 2006).

In general, honey has a Newtonian behavior, which means the viscosity (η) of the system is independent of the shear rate applied (Dobre et al., 2012). However, due to the presence of sugar crystals, proteins and other colloids, honey showed a pseudoplastic non-Newtonian behavior in studies by Ahmed et al. (2007), Chen et al. (2009) and Karasu et al. (2015). Moreover, honeys with their viscoelastic properties behave like liquids characteristic of viscous fluids (Ahmed et al., 2007; Chen et al., 2009; Dobre et al., 2012), because in the small amplitude oscillatory shear (SAOS) measurements the G″ (loss modulus) of honey has a much greater magnitude than that of G′ (storage modulus). The temperature sweep tests in SAOS measurements are highly sensitive in determining structural changes through heating or by temperature fluctuations to which the material may be subjected (Chen et al., 2009; Karasu et al., 2015). Meanwhile, the frequency sweep test shows how the viscoelastic behavior of the material changes with the application rate of the strain amplitude (or tension), where the G” (loss modulus) of honey is highly dependent on the angular frequency (Dobre et al., 2012; Kayacier, Yüksel, & Karaman, 2014).

Normally, the relationships between physico-chemical and rheological properties are considered complex nonlinear systems. Furthermore, the investigation of many of these properties is time-consuming and demands significant laboratory resources (Herrmann et al., 2013). In this context, artificial neural networks (ANN) have the potential to be applied in the estimation and prediction of these properties in foods based on simple and low-cost measures. This is possible due to their ability to learn and generalize the behavior of any complex and non-linear process (Zuluaga-Domínguez, Nieto-Veloza, & Quicazán-de-Cuenca, 2017).

ANNs are computational models inspired by the functioning of the human brain and based on the mechanisms of memorization and learning. This tool can learn the relationship between inputs and outputs based on training data. It is therefore widely used for modeling, prediction and classification since it permits the identification of relationships present in limited, incomplete, noisy, dynamic and nonlinear data sources (Mohanraj, Jayaraj, & Muraleedharan, 2012).

The combined effect of temperature, moisture content and strain rate on viscosity of honeys from Jordan (Al-Mahasneh, Rababah, & Ma’Abreh, 2013) and honeys from Iran (Ramzi et al., 2015) was evaluated using ANNs with showed good results. The ANN was less accurate than the Adaptive Neuro - Fuzzy Inference System (ANFIS) in modeling the complex viscosity (η*) of Turkish honeys dependent on the variety, temperature and angular frequency (Kayacier, Yüksel, & Karaman, 2014). The influence of temperature, frequency and water content on the viscoelastic parameters of honeys from Spain was also evaluated using ANNs and the ANFIS.

In this case, a Multilayer Perceptron (MLP) ANN presented the best predictive ability of all viscoelastic properties (Oroian, 2015). In all studies, the temperature range and number of honey samples were relatively limited considering the wide variety of honeys available and the range of temperatures that can be employed during processing. Furthermore, there are no reports in literature on the use of ANNs for estimating viscoelastic properties by means of temperature scans during heating and cooling.

Thus, the potential use of ANNs to predict the rheological properties of honeys from low-cost measures using larger temperature and water content ranges will expand the application of the obtained network. The present study therefore sought to develop different ANN models to predict viscosity from steady shear measurements, G′, G″ and η* from temperature scans during heating, G′, G″ and η* from temperature scans during cooling, where both models are a function of water content and temperature and finally G′, G″ and η* from the frequency sweeps, in function of the water content, temperature and angular frequency. Furthermore, models 1, 2 and 3 were compared to exponential models and model 4 to multiple linear regression models.

MATERIAL AND METHODS
Rheological data collection

The data were obtained by Silva et al. (2016) and Silva, Torres Filho & Resende (2018), who analyzed twenty-two samples of monofloral honey Assa-peixe (Vernonia polysphaera) (3), Cipó-uva (Serjania lethalis) (5), Eucalyptus (Eucalyptus spp.) (7), and Orange blossom (Citrus sinensis) (7) from the southeast and eighteen samples of multifloral honey from the southeast (8), south (3), northeast (3) and central-west (4) regions of Brazil.

Artificial neural network modeling

In this study, a supervised multilayer perceptron feedforward artificial neural network (MLP ANN) was used for the development of four independent neural models for the rheological parameter of honeys. Model 1 was developed based on data of η determined in steady shear rheological measurements. Models 2 and 3 were developed from data of G′, G″ and η* measured in oscillatory tests of temperature scans during heating and cooling, respectively. Model 4 was developed from data of G′, G″ and η* measured in oscillatory tests of frequency scans. For models 1, 2 and 3 the input layers were composed of the variables temperature and water content, while for model 4 the input layers were composed of the variables temperature, water content and angular frequency. The input nodes and the neurons in the output layer for each model are shown in Tab. 1 as the statistical indexes of the inputs and outputs of training and testing data in this study.

Statistical Indexes of Input and Output data in the training and test process of Multilayer Perceptron Feedforward Neural Network

ModelTraining dataTest dataTotal data
MeanSTDMinMaxMeanSTDMinMax
1(*)InputsWC (%)15.781.0214.2318.8615.781.0314.2318.86320
T (°C)30.9015.8810.0060.0032.3117.8210.0060.00
Outputsη (Pa.s)24.9236.910.37225.2727.1738.730.23177.99
2(*)InputsWC (%)15.821.0314.2318.8615.750.9914.2318.86800
T (°C)39.2821.423.6774.5739.2221.833.6174.48
OutputsG′ (Pa)1.273.560.0036.671.182.760.0020.82
G″ (Pa)299.97744.950.926174.37308.74676.121.434571.52
η* (Pa.s)47.74118.560.15982.7049.14107.610.23727.59
3(*)InputsWC (%)15.801.0314.2318.8615.730.9914.2318.86800
T (°C)35.9621.350.5571.4335.9521.710.5771.42
G′ (Pa)28.2970.770.00772.1322.8548.720.00354.15
OutputsG″(Pa)493.781169.190.838455.12530.611282.021.028398.52
η* (Pa.s)78.93186.420.131346.1484.66204.160.161336.82
4(**)WC (%)15.781.0214.2318.8615.791.0214.2318.864160
InputsT (°C)31.0816.2410.0060.0031.7616.6710.0060.00
F (Hz)2.422.950.1010.002.342.890.1010.00
G′ (Pa)2.428.290.00233.522.297.140.00151.11
OutputsG″ (Pa)407.861081.780.1816404.51405.201052.590.239751.40
η* (Pa.s)27.7440.980.28268.3226.9439.960.28266.10

WC: water content; T: temperature; η: steady shear viscosity; G′: Storage moduli; G″: loss moduli; η*: complex viscosity; F: Frequency.

Reference:

Silva et al (2016) and

Silva et al. (2018).

For the training phase 75% of the data was used and 25% was used in the test phase of the MLP (Faria et al., 2015). The data was normalized to fall within the range of the sigmoidal activation function and the hyperbolic tangent activation function (Khanlari et al., 2012; Xi et al., 2013).

The learning algorithm used was backpropagation with momentum which permits the correction of synaptic weights during training based on the difference (error) between the desired value and that calculated by the network (Abbasi-tarighat, Shahbazi, & Niknam, 2013), and allows that the convergence of weights results in an overall minimum rather than a local minimum of the error (Goktepe et al., 2008).

For analysis of the different network topologies, the activation function (hyperbolic tangent and sigmoidal) and the number of neurons in the hidden layer (3, 6, 9, 12 and 15) were varied, and the learning rate to 0.2 and momentum equal to 0.7 were fixed. After obtaining the best topology, different learning rates (0.2, 0.4, 0.6) and momentum values were tested (0.3, 0.5, 0.7, 0.9) (Llave, Hagiwara, & Sakiyama, 2012).

The study for the determination of the optimal number of iterations for network training was performed through error convergence analysis (RMSE - Equation 1) for a minimum value of the test data (Phimolsiripol, Siripatrawan, & Cleland, 2011). Definition of the optimal number of iterations used during training is an essential criterion to avoid overfitting (Linder & Po, 2003). Selection of the optimal network for each model was determined by a trial and error technique, evaluating different scenarios and configurations and opting for topologies that had lower error values (RMSE). In addition to this statistical criterion, the correlation coefficient (r) was used as a performance measure of the analyzed networks.

The neural network structures were generated using the 2.92 version of Neuroph Neural Network freeware.

Regression Models

In order to accurately evaluate the predicting technique, the combined effect of water content and temperature on the rheological parameters of the honeys, the same data subsets (training set) for the models 1, 2 and 3, shown in Tab. 1, were used to adjust the following exponential model (Oroian 2012) (Eq.1): y=aexp(bwcT)y = aex{p^{(- bw - cT)}} where y is the predicted response (η (Pa.s), η* (Pa.s), G′ (Pa) or G″ (Pa)), w is the water content (%), T is temperature (°C), and a, b, and c are empirical constants of the models.

A nonlinear regression technique that incorporates the Levenberg-Marquardt method to solve nonlinear regression was used.

The training data for model 4 was further analyzed using multiple regressions and a second order polynomial model expressed in Eq.2: y=β0+i=13βiXi+i=13βiiXi2+i=12j=23βijXiXj+βijkXiXjXk{\rm{y}} = {\beta _0} + \sum\limits_{i = 1}^3 {{\beta _{{\rm{i}}{{\rm{X}}_{\rm{i}}}}}} + \sum\limits_{i = 1}^3 {{\beta _{{\rm{ii}}}}{\rm{X}}_{\rm{i}}^2} + \sum\limits_{i = 1}^2 {\sum\limits_{j = 2}^3 {{\beta _{{\rm{ij}}}}{{\rm{X}}_{\rm{i}}}{{\rm{X}}_{\rm{j}}} + {\beta _{{\rm{ijk}}}}{{\rm{X}}_{\rm{i}}}{{\rm{X}}_{\rm{j}}}{{\rm{X}}_{\rm{k}}}}} where y is the predicted response (η* (Pa.s), G′ (Pa) or G″ (Pa)), β0 is the intercept coefficient, βi is the linear coefficient, βii is the quadratic coefficient, βij and βijk are interaction coefficients, Xi is the water content (%), Xj is the temperature (°C) and Xk is the frequency (Hz).

A one-factor analysis of variance (ANOVA) was carried out to assess this model. The significances of the coefficients were examined using the t-test.

Performance of the exponential and multiple regression models was evaluated by means of the RMSE and r statistical parameters.

This statistical analysis was performed using version 9.1 of the SAS (Statistical Analysis System) software.

RESULTS
Performance of the ANNs and regression models

The optimal number of iterations was obtained by means of the error convergence analysis (Linder & Po, 2003; Phimolsiripol, Siripatrawan, & Cleland, 2011) (Fig. 1). The activation function that presented the best results for all models was sigmoidal. This function remains finite even with x values close to ± ∞, which results in more efficient training of the network (Razmi-Rad et al., 2007).

Fig. 1

RMSE of training (solid line) and test (dotted line) sets versus number of iterations for optimum MLP ANN: a) model 1; b) model 2; c) model 3; d) model 4.

The optimal number of iterations is equal to 100 in training the network with 2-12-1 neurons in its layers, referring to model 1 (Fig. 1-a). From this point a considerable increase in the RMSE test was observed, characterizing the overfitting phenomenon. The network with 2-9-3 neurons in its layers, referring to model 2, showed the lowest test RMSE value when completing 1000 iterations (Fig. 1-b), while for the network 2-3-3, referring to model 3, 2500 iterations were required (Fig. 1-c). In both cases a slight increase in the test RMSE was observed after the convergence point. Model 4, whose architecture consisted of 3-9-3 neurons in its layers, required only ten iterations to reach the minimum test RMSE value (Fig. 1-d)

After selecting the best activation function and number of neurons in the hidden layer, all model were evaluated with regards to the influence of the learning rate and momentum on their performance. In all models the learning rate value for the best performance was equal to 0.2. This result is in concordance with that reported in literature, considering that low-learning rate values prevent large fluctuations in error and consequent instability problems (Al-Mahasneh, Rababah, & Ma’Abreh, 2013; Rai, Majumdar, & Dasgupta, 2005; Rossi et al., 2014). In relation to the term momentum, models 1 and 4 performed better for the value of 0.3, while models 2 and 3 performed better at the value of 0.7.

For model 1 (2-12-1 architecture), 500 iterations were required for training. This MLP-ANN produced good results for predicting the viscosity of honey as a function of water content and temperature. The RMSE of the training phase of this model was equal to 0.0359, while the RMSE of the test phase was equal to 0.0430. Comparing the data calculated by the network and that obtained experimentally, they showed high correlation coefficients for both training and test data, equal to 0.9760 and 0.9681, respectively.

The exponential model for assessing the combined effect of water content and temperature on viscosity showed higher accuracy than the MLP ANN of model 1 due to the higher coefficient of determination and lower RMSE for the training data, equal to 0.9981 and 0.0101, respectively (Tab. 2). With respect to the test data, both models showed the same performance (Tab. 2) and indicated a reduction in their prediction quality when unknown data was used.

RMSE and correlation coefficient (r) of models 1, 2 and 3 variables from the development and test process of a nonlinear exponential and of models 1, 2, 3 and 4 from the best ANNs models

ModelEstimated variableEmpirical constants1Exponential Model (Training data)Exponential Model (Test data)ANN (Training)ANN (Test)
ABCRMSErRMSErRMSErRMSEr
1η (Pa.s)0.7992.0366.4150.01010.99810.04330.97000.03590.97600.04300.9681
G′ (Pa)0.8053.30214.6960.03430.93560.02900.92930.03380.93980.02610.9390
2G″ (Pa)0.8812.54910.9530.02820.97250.02720.96880.02960.97040.02520.9731
η* (Pa.s)0.8812.54910.9530.02820.97250.02720.96880.02950.97050.02520.9731
G′ (Pa)0.4892.6288.5420.06590.69480.05210.70550.06750.69690.04860.6629
3G″ (Pa)0.9111.68412.5490.03130.97420.03200.97790.03080.97580.03260.9794
η* (Pa.s)0.9131.68712.5260.03140.97420.03200.97790.03090.97590.03260.9794
G′ (Pa)-------0.02600.72440.01580.7301
4G″ (Pa)-------0.01950.96040.01760.9581
η* (Pa.s)-------0.04200.96360.04060.9647

Empirical constants of the exponential model using the training data.

In relation to models 2 and 3, the error convergence point was reached after 500 and 2500 iterations, respectively. Model 2 (2-9-3 architecture) showed good results in the determination of the rheological properties G′, G″ and η* from the water content and temperature in the temperature sweep curves by means of SAOS measurements during heating. This model presented a RMSE test value equal to 0.0255, while that of training was 0.0310. When comparing each predicted variable separately in this model with the desired experimental values, the G′ property was observed to have the highest RMSE training values, equal to 0.0338, and test values equal to 0.0261 (Tab. 2), as well as the lowest correlation coefficient values, equal to 0.9398 in training and 0.9390 in testing. The properties G″ and η* showed RMSE and training correlation coefficient values equal to 0.0296 and 0.9704, and 0.0295 and 0.9705, respectively, and test values of 0.0252 and 0.9731 (Tab. 2).

Model 3 (2-3-3 architecture) also showed good results in the determination of the rheological properties G″ and η* from the water content and temperature in the temperature sweep curves by means of SAOS measurements during cooling. However, its statistical parameters were inferior to model 2, where the training RMSE is equal to 0.0464 and that of testing is equal to 0.0387. For each variable separately, the RMSE (Tab. 2) and correlation coefficient were, respectively, 0.0675 and 0.6969 in training, and 0.0486 and 0.6629 in testing for G′, were 0.0308 and 0.9758 in training and 0.0326 and 0.9794 in testing for G″, and finally were 0.0309 and 0.9759 in training and 0.0326 and 0.9794 in testing for η*.

Regarding the combined effect of temperature and water content on the viscoelastic properties (G′, G″ and η*) during temperature scanning, in both heating and cooling, the exponential model showed the same performance as models 2 and 3 with the MLP ANNs, due to the similarity of the obtained statistical parameters (Tab. 2).

Model 4 (3-9-3) presented a training RMSE value equal to 0.0306, while that of testing was 0.0270 after fifty iterations. The RMSE values (Tab. 2) and correlation coefficients in training were equal to 0.0260 and 0.7244 for G′, 0.0195 and 0.9604 for G″ and 0.0420 and 0.9636 for η*, respectively. In the test, for G′ they were equal to 0.0158 and 0.7301, for G″ were equal to 0.0176 and 0.9581 and for η* were equal to 0.0407 and 0.9647, respectively. These values of the statistical parameters show a good quality prediction of the properties G″ and η* in function of frequency, water content and temperature by means of the SAOS measurements. Second order multiple regression models were determined to estimate the rheological properties G′, G″ and η* of model 4 (Tab. 3). The prediction quality, based on the correlation coefficient and RMSE parameters, were much lower than the MLP ANN of model 4 (Tab. 2).

RMSE and correlation coefficient (r) of model 4 variables from the development and test process of a multiple-second order polynomial regression

VariableModel Coefficient1Training dataTest data
β0β1β2β3β12β13β23β123β11β22β33RMSErRMSEr
G′ (Pa)-0.01−0.040.10−0.02−0.17−0.170.26-0.050.060.02950.61890.01700.6932
G″ (Pa)0.03−0.03−0.200.38-−0.40−0.510.560.040.20-0.03620.85030.02930.8539
η* (Pa.s)0.49−0.53−1.29-0.52---0.160.81-0.06800.89640.06400.9018

Coefficients of the multiple regression model using the training data of model 4.

( - ) p-value > 0.05 obtained by ANOVA.

DISCUSSION

The ANNs and exponential nonlinear regression models developed in this study presented good qualities in predicting the viscosity from steady shear rheological measurements (model 1), as well as the properties G′, G″ and η* in temperature scans during heating (model 2) and G″ and η* in cooling temperature scans (model 3) from SAOS measurements. This was possible even when using data unknown by the network or not used in determining the model parameters. In predicting the volume of eggs using a machine vision technique, the authors considered that a mathematical model in literature showed better performance than the ANN (Soltani, Omid, & Alimardani, 2014), which corroborates with the present study. In contrast, other studies have shown that some mathematical models are less accurate than the ANN (Çelekli et al., 2012).

When using the MLP-ANN, similar correlation coefficient values were obtained in the study for predicting the viscosity of fruit juices clarified in the function of temperature and concentration (Rai, Majumdar, & Dasgupta, 2005) as well as the rheological properties: K (consistency index), n (flow behavior index) and τ0 (initial yield stress) as a function of composition (Herrmann et al., 2013). In predicting the viscosity of wild honeys from Jordan as a function of temperature, water content and strain rate, coefficients of determination equal to 0.970 and 0.962 were calculated for the training and test data, respectively, for a network with an architecture of 3-10-1 (Al-Mahasneh, Rababah, & Ma’Abreh, 2013). On the other hand, in a study performed with honey from Iran including the prediction of viscosity in function of temperature, water content and strain rate, a correlation coefficient of 0.997 was obtained using an ANN with a genetic algorithm such as the learning algorithm (Ramzi et al., 2015).

The quality of the lower prediction observed for G′ in models 2 and 3 indicates that its relationships with the input variables are very complex and nonlinear. The honeys presented liquid-like behavior, since G″ is much larger than G′ in the entire temperature range studied. This is due to a weak interaction without formation of networks between the honey molecules (Ahmed et al., 2007). Thus, Nayik, Dar, & Nanda (2015) and Silva et al. (2016) studied the effect of temperature on rheological properties of honey only in terms of G″, because it is a more informative parameter for this type of material. These temperature sweep curves are very important in the study of the effect of temperature fluctuations on the quality of honey observed by hysteresis between the heating and cooling curves (Chen et al., 2009; Karasu et al., 2015; Silva et al., 2016), thereby a proper prediction of the parameters G″ and η* in the function of water content and temperature represents an important practical application.

The property G′ showed a low correlation coefficient value despite the lower RMSE, considering that as noted by other authors the temperature and water content do not greatly influence this property (Nayik, Dar, & Nanda, 2015). When Oroian, (2015) studyied the influence of temperature, frequency and water content on the viscoelastic properties of Spanish honeys with ANN and ANFIS, the MLP ANN defined for the parameter G′ also showed the worst prediction results, with coefficients of determination equal to 0.994 in training, 0.188 in the cross-test and 0.914 in testing. An excellent prediction quality was obtained for the parameters G″ and η*, with coefficients of determination greater than 0.993, higher than those of the present study. This improved prediction is associated with greater homogeneity of the data, since the water content range used was from 16.04 to 17.82% and temperature from 5 to 40°C, more stringent than those of the present study. A better quality of the statistical parameters was also observed in the study when predicting the complex viscosity of honeys from Turkey (Kayacier, Yüksel, & Karaman, 2014), with an R2 greater than 0.985. Similarly, the database used presented great homogeneity, considering that the water content ranged from 19.1 to 19.4% and temperature range used was from 10 to 20°C. Dieulot & Skurtys (2013) noted a lack of accuracy when predicting the mechanical behavior of starch films using the complete database. However, after the authors divided this database into three more homogeneous classes via a cluster analysis, prediction of the mechanical properties of each class was more than adequate with higher R2 values and lower error values.

Regarding determination of the properties G′, G″ and η* in frequency sweeps from the SAOS measurements (model 4), the ANN model presented higher accuracy compared to the second-order polynomial multiple regression model. This demonstrated the effectiveness of the MLP ANN of model 4 when used in non-linear systems with complex relationships between input and output variables.

As in this study, several authors, have compared ANNs with multiple linear regression models and observed the superiority of ANNs for the prediction of the desired parameters due to the non-linearity and complex relationships of the systems, thus indicating their use (Ajala & Betiku, 2015; Dahmoune et al., 2015; Rahman et al., 2012).

Regarding the determined statistical parameters, with the exception of the parameter G′ of models 3 and 4, all models were properly trained due to the low RSME value. They also presented good generalization capacity, considering that they provided coherent answers with a low error from data that was not used in the training phase. All properties were determined from simple measurements over wide ranges of temperature and water content values, which represent an important practical application for processing honey and honey-based products, since that these determinations require the use of high-cost equipment and are essential in engineering calculations, quality control and shelf life of the products.

eISSN:
2299-4831
Lingua:
Inglese
Frequenza di pubblicazione:
2 volte all'anno
Argomenti della rivista:
Life Sciences, Zoology, other