The aim of this study was to determine the explanatory capacity of the Spanish Triathlon Federation’s talent identification tests in relation to performance in competition in subsequent years. We used an exploratory longitudinal study design to establish the relationship between talent identification tests completed by 247 triathletes (97 women and 150 men) aged from 14 to 19 years and the results they obtained over the years in competition. The battery of tests included freestyle swimming (100 and 1000 m) and running (400 and 1000 m). The results indicate that the explanatory capacity of these tests for split places in competition in the corresponding discipline was highest in the 1000-m swimming test, with a value of 0.34 for the adjusted coefficient of determination (R2a) (p ≤ 0.001), followed by the 1000-m running and 100-m swimming tests, where the highest R2a values were 0.26 and 0.19, respectively. No significant model was found for the 400-m running test. It was concluded that the explanatory capacity of the tests analysed for predicting performance in the discipline in competition was low. However, it was higher for the swimming and running tests of longer distance.
Key words
- talent identification
- sports performance
- triathlon performance
- endurance sports
- youth sports
- exercise tests
Triathlon is a sport that combines the disciplines of swimming, cycling and running, performed consecutively in that order and timed continuously from start to finish without stopping the clock. Between disciplines there is a period known as transition, in which triathletes change their clothing and equipment for the next segment. There are various competition formats; those used in the main world circuit (World Triathlon Series) and in the Olympic Games are sprint (750 m of swimming, 20 km of cycling, 5 km of running) and standard (1500 m, 40 km, 10 km).
As with other endurance sports such as long-distance running and cycling, success in triathlon is related to physiological characteristics of athletes, especially considering high levels in maximum values of oxygen uptake (VO2max) and peak power output in cycling (Wpeak). Similar values to other endurance athletes are also typically observed in submaximal physiological variables, in measurements of the ventilatory threshold (VT) and lactate threshold (LT), although it is difficult to make comparisons due to the multiple definitions that exist (Suriano and Bishop, 2010). Other physiological factors, less studied yet seemingly related to performance, are lactic acid buffering capacity (Baldari et al., 2007) and economy of movement (Millet et al., 2011).
Moreover, some anthropometric variables have been shown to be related to success in competition, such as a low body fat percentage (related to overall performance in competition) and long limbs (arms are related to performance in swimming and legs to performance in running and cycling) (Ackland et al., 1998; Landers et al., 2013). Mechanical factors, such as technical capacity in each discipline (Bentley et al., 2002), and tactical factors, such as competition strategy (Bentley et al., 2002; Hausswirth et al., 2010; Vleck et al., 2008), will also have influence on triathlon performance. In order to analyse these factors, it is important to consider fatigue from the previous disciplines, since behaviour will be different if analysed in isolation (Ofoghi et al., 2016).
The popularisation of triathlon in recent years has led several sports institutions to start investing significant resources in the talent identification process, in order to find young males and females with the potential to become elite athletes in adulthood (Swann et al., 2015). Some institutions, such as the Spanish Triathlon Federation (FETRI), use these programmes to provide the most talented athletes with appropriate training (including greater monitoring through national team training camps and competitions and even scholarships at high performance centres) to enable them to join the Spanish elite team, ensuring high competitiveness in international events (Federación Española de Triatlón, 2008). To achieve this goal it is critically important to evaluate the validity of the models applied and the success rates of the programmes carried out (Vaeyens et al., 2008).
However, there is a lack of scientific research on talent identification processes in sports in general (Johnston et al., 2018), and especially in triathlon. Bottoni et al. (2011) tried to identify factors to be considered in talent identification programmes, suggesting that they should be based not only on measuring physical parameters, but should also take account of other variables. Cuba-Dorado et al. (2015) analysed the talent identification tests developed by the Spanish Triathlon Federation (FETRI), which has one of the highest ITU country rankings worldwide. They found that the tests had very limited explanatory capacity for overall performance in competition in the year the tests were conducted.
This suggests that the tests carried out to choose the most promising Spanish triathletes do not predict their performance. Nevertheless, it seems necessary to check these findings in a longitudinal study, which can help avoid biases in talent identification, reducing the risk of prematurely attributing causality to the results (Johnston et al., 2018). The objective of this study, therefore, was to determine the explanatory capacity of talent identification tests carried out by the FETRI in relation to performance in competition in subsequent years in the discipline corresponding to each test.
We obtained the data for all triathletes aged from 14 to 19 years between 2009 and 2015 who underwent the talent identification tests developed by the FETRI. The study included triathletes who obtained a minimum of 8 points on the FETRI’s scale (explained in the investigation design) in at least 3 of the 4 tests, showing a good level of performance. 247 triathletes met this criterion: 150 men (60.7%) and 97 women (39.3%). Many of these triathletes subsequently participated in the Spanish Championships in their categories in the year when the tests were carried out and in the following years. In the year of the tests (Year 1), 200 of triathletes tested took part in the Spanish Championships; the next year (Year 2), the figure was 167, followed by 119 in Year 3, 71 in Year 4, 40 in Year 5, 29 in Year 6, 14 in Year 7, and 11 in Year 8, seven years after the tests were completed. The research protocol followed the principles of the Helsinki Declaration on biomedical research involving human subjects (64th World Medical Assembly, 2013). The study was approved by the Ethics Committee of the University of Vigo.
An exploratory longitudinal study design with an associative strategy was used to establish the relationship between the variables obtained in the Spanish Triathlon Federation’s tests and the split results obtained over the years in the various segments of the Spanish Triathlon Championships. We performed an independent analysis for each of the FETRI tests: S100 (100 m of freestyle swimming), S1000 (1000 m of freestyle swimming), R400 (400 m of running) and R1000 (1000 m of running). Swimming tests were conducted in a 25-m pool and running tests on a 400-m athletics track.
Predictive variables were used in each of these analyses:
Points: points obtained in each test according to the FETRI’s scale. This scale establishes a score according to the time reached in each test, taking into account age and sex of the triathlete. The minimum score is 0 points and the maximum is 10 for triathletes aged 18 or 19, 11 for those aged 16 or 17 and 12 for those aged 15.
Time: time of the test, measured in seconds.
Best Proportion of All (All): time percentage for the triathlete who achieved the best time in the test relative to all participants of the same age and sex between 2007 and 2016.
Best Proportion of 4 Results (4R): time percentage for the triathlete who achieved the best time in the test relative to all participants of the same age and sex between 2007 and 2016, taking into account only athletes who scored 8 or more points in at least three out of the four talent identification tests.
We used the place percentage obtained by the triathlete in the discipline corresponding to the test analysed, relative to the number of participants in the Spanish Triathlon Championship, as a dependent or response variable. In this percentage a low number indicates a result in the highest places, whereas 100 signifies that the athlete finished in the last place. We analysed this response variable over the following eight years, taking Year 1 (SCYear1) as the year in which the tests were carried out, Year 2 (SCYear2) as the next year, and so on. The championships in the youth, junior, under-23 and elite categories were also considered (since the last two categories include triathletes competing in the same competition, the result that was more positive for the triathlete was selected).
We collected the data for FETRI’s tests and the Spanish Championship results from 2009, 2010, 2011, 2012, 2013, 2014, and 2015, verifying that triathletes fulfilled the criterion of a minimum of 8 points in at least three tests. Once the sample had been selected, we collected the triathletes’ results in the Spanish Championships in their categories from the year of the test and the following years. The competition split place percentage (relative to the number of participants) was calculated for the discipline corresponding to each test.
In order to verify that the data were reliable (Gr) and generalisable (Ga) a variance component analysis was implemented, using the least squares and maximum likelihood estimation procedures (
In all generalisability analysis models the residual error coincided in the least-squares (VARCOMP Type I) and maximum likelihood (GLM General Linear Model) estimation procedures, as it was confirmed that the sample distribution was normal, linear and homoscedastic (Searle et al., 1992). The precision models of the S100, S1000 and R1000 tests showed high levels of Gr (between 0.99 and 1) and Ga (between 0.93 and 0.99), S1000 being the highest value that the two coefficients could take. However, R400 delivered no result (no comma) with any of the estimation procedures used. The degree of independence was calculated using the Durbin-Watson (D-W) test, in which it could be assumed that a value located between 1.5 and 2.5 showed independence. The values of all the models lay within this range, as can be observed in Tables 1, 2 and 3.
Summary of linear regression models for the 1000 m running test
All and Points | 0.270 | 0.262 | 0.520 | 18.58 | .000 | 1.94 | |
4R, Time and All | 0.128 | 0.111 | 0.358 | 21.18 | .000 | 2.18 | |
4R, Time and Points | 0.228 | 0.206 | 0.477 | 19.63 | .000 | 1.99 | |
Points | 0.109 | 0.094 | 0.330 | 20.39 | .009 | 1.85 | |
No significant model was found | |||||||
Summary of linear regression models for the 1000 m swimming test
4R and Points | 0.345 | 0.339 | 0.588 | 10.91 | .000 | 1.91 | |
4R and Points | 0.348 | 0.340 | 0.590 | 12.44 | .000 | 2.18 | |
All, 4R and Points | 0.199 | 0.179 | 0.447 | 14.52 | .000 | 1.87 | |
Time | 0.094 | 0.081 | 0.307 | 14.62 | .009 | 1.86 | |
No significant model was found | |||||||
S
4R and Time | 0.108 | 0.099 | 0.328 | 12.74 | .000 | 1.79 | |
All and Time | 0.128 | 0.117 | 0.358 | 14.39 | .000 | 2.01 | |
4R | 0.051 | 0.043 | 0.225 | 15.68 | .014 | 1.99 | |
No significant model was found | |||||||
4R and Time | 0.237 | 0.195 | 0.486 | 14.68 | .007 | 2.18 | |
No significant model was found | |||||||
The results of the significant model with the independent variables that showed the greatest explanatory capacity in each of the years analysed are presented below. Those for the R1000 test are shown in Table 1, those for the S1000 test in Table 2 and those for the S100 test in Table 3. No significant model was found for the R400 test.
In Table 1 we can see that the greatest explanatory capacity of the 1000-m running test for split places in running was obtained in the year of the tests. The value of the adjusted coefficient of determination (R2a) (
Figure 1
S

In subsequent years this explanatory capacity decreased significantly.
In Table 2 we can see that the greatest explanatory capacity of the 1000-m swimming test for split places in swimming was obtained one year after the tests. The value of R2a (
Figure 2
Scatter plot (SCYear 2 vs. Predicted SCYear 2) for the 1000 m swimming test with a linear trend, the 95% confidential interval for the linear trend

In subsequent years this explanatory capacity decreased significantly.
In Table 3 we can see that the greatest explanatory capacity of the 100-m swimming test for split places in swimming was obtained four years after the tests. The value of R2a (
Figure 3
Scatter plot (SCYear 5 vs. Predicted SCYear 5) for the 100 m swimming test with a linear trend, the 95% confidential interval for the linear trend

Unlike the previous tests, no significant model was found in the third year after this test.
The main finding of this study is that the explanatory capacity of the tests used by the FETRI in their talent identification process for split performance in triathlon competitions is low (1000 m running test: 26.2%, 1000 m swimming test: 34%, 100 m swimming test: 19.5%).
The results also indicate that the explanatory capacity is related to the length of the test (the most explanatory test was S1000, followed by R1000 and S100, whereas the explanatory capacity of R400 was zero). These results are in line with those obtained by Cuba-Dorado et al. (2015), who analysed the explanatory capacity of the same tests for the overall results in triathlon competitions in the year of the tests.
Explanatory capacity seems to be related to energy demands: the 1000-m swimming test, which has the greatest explanatory capacity, is considered a long-term effort, in which the aerobic pathway is the most used. The 1000-m running test, which has the second highest explanatory capacity, constitutes a medium-term effort, with a balanced use of the aerobic and anaerobic pathways. The other tests, i.e., 100 m swimming and 400 m running, are regarded as short-term efforts with predominance of the anaerobic over the aerobic pathway (Capelli et al., 1998; Gastin, 2001; Spencer and Gastin, 2001).
A reasonable explanation of these findings could be a relationship between predictive capacity and the predominant pathway in the test, which is related, in turn, to physiological performance factors associated with endurance sports. Thus, the S1000 test would be associated with VO2max, VT2, lactic acid buffering capacity and economy of movement, and the R1000 test would be associated with VO2max and also with the capacity to withstand concentrations of hydrogen ions in muscle and blood. On the other hand, to our knowledge the test with greater predominance of the anaerobic pathway does not seem to be related to performance factors in triathlon that have been studied (Baldari et al., 2007; Millet et al., 2011; Suriano and Bishop, 2010). However, these hypotheses still need to be tested, and this would allow us to suggest that a closer relationship with the performance factors of triathlon will help increase the explanatory capacity of talent identification tests.
In relation to the tests with predominance of the anaerobic pathway, greater explanatory capacity was observed in the S100 test compared to R400. Despite the fact that these tests are of very similar duration (62.0 ± 4.1 s for S100 and 62.5 ± 5.7 s for R400), the greater explanatory capacity of the swimming test may be related to the influence of speed over the first metres on triathlon performance (Vleck et al., 2008). Another possible explanation for the low predictive capacity of the talent identification tests in relation to performance factors in triathlon is that these tests are exclusively focused on the physiological component, contrary to the proposals of Abbot and Collins (2004), and do not take account of other factors with a higher influence on triathlon performance, such as anthropometric (Ackland et al., 1998; Landers et al., 2013), psycho-social (Ruiz-Tendero and Martin, 2012), mechanical (Bentley et al., 2002; Ofoghi et al., 2016) and tactical factors (Hausswirth et al., 2010; Ofoghi et al., 2016). Nor do they consider one of the distinctive features of triathlon compared to other long-distance cyclical sports: the influence of the preceding discipline on performance in the following segment (Bentley et al., 2007; Delextrat, 2005; Millet and Vleck, 2000).
The explanatory capacity of FETRI’s tests for predicting performance in the discipline in competition is low. It varies substantially according to the type of the test and the number of years between the test and the competition. However, it seems to be related to the performance factors involved in triathlon.
Coaches and sports managers should reconsider the suitability of the 100 m and 1000 m swimming and 400 m and 1000 m running talent identification tests for elite triathlon performance. It seems necessary to investigate a new model for the talent identification programme, with tests more appropriate to the performance factors of the sport, so as to identify and select triathletes in a more precise way. This would contribute to the better use of the resources spent on these programmes. Moreover, such programmes can substantially determine the development of female and male athletes, by providing facilities for the most outstanding ones, and conversely they can discourage those who do not stand out and lead them to abandon the sport, and it is therefore even more necessary to reconsider the relevance of the identification tests used.
Figure 1

Figure 2

Figure 3

Summary of linear regression models for the 100 m swimming test
4R and Time | 0.108 | 0.099 | 0.328 | 12.74 | .000 | 1.79 | |
All and Time | 0.128 | 0.117 | 0.358 | 14.39 | .000 | 2.01 | |
4R | 0.051 | 0.043 | 0.225 | 15.68 | .014 | 1.99 | |
No significant model was found | |||||||
4R and Time | 0.237 | 0.195 | 0.486 | 14.68 | .007 | 2.18 | |
No significant model was found | |||||||
Summary of linear regression models for the 1000 m swimming test
4R and Points | 0.345 | 0.339 | 0.588 | 10.91 | .000 | 1.91 | |
4R and Points | 0.348 | 0.340 | 0.590 | 12.44 | .000 | 2.18 | |
All, 4R and Points | 0.199 | 0.179 | 0.447 | 14.52 | .000 | 1.87 | |
Time | 0.094 | 0.081 | 0.307 | 14.62 | .009 | 1.86 | |
No significant model was found | |||||||
Summary of linear regression models for the 1000 m running test
All and Points | 0.270 | 0.262 | 0.520 | 18.58 | .000 | 1.94 | |
4R, Time and All | 0.128 | 0.111 | 0.358 | 21.18 | .000 | 2.18 | |
4R, Time and Points | 0.228 | 0.206 | 0.477 | 19.63 | .000 | 1.99 | |
Points | 0.109 | 0.094 | 0.330 | 20.39 | .009 | 1.85 | |
No significant model was found | |||||||
Inter-Limb Muscle Property Differences in Junior Tennis Players The Pre-Exhaustion Method Does Not Increase Muscle Activity in Target Muscle During Strength Training in Untrained Individuals Neuromuscular Impact of Acute Hypertrophic Resistance Loading With and Without Blood-Flow Restriction Validity and Reliability of the Smart Groin Trainer for Measuring Hip Adduction Strength The Repeated Curve Sprint Test Appears to be an Appropriate Tool for Estimating Anaerobic Fitness in Young Trained Male Futsal Players Training with a Heavy Puck Elicits a Higher Increase of Shooting Speed Than Unloaded Training in Midget Ice Hockey Players Accuracy of Predicting One-Repetition Maximum from Submaximal Velocity in the Barbell Back Squat and Bench Press The Importance of Posture And Body Composition for the Stability and Selected Motor Abilities of Professional Handball Players Influence of Tactical Behaviour on Running Performance in the Three Most Successful Soccer Teams During the Competitive Season of the Spanish First Division Post-Activation Performance Enhancement in Sprinters: Effects of Hard Versus Sand Surfaces Cognitive Factors in Elite Handball: Do Players’ Positions Determine their Cognitive Processes? Effects of Postactivation Performance Enhancement on the Vertical Jump in High-Level Volleyball Athletes On-Court Change of Direction Test: An Effective Approach to Assess COD Performance in Badminton Players Movement Patterns of Polish National Paralympic Team Wheelchair Fencers with Regard to Muscle Activity and Co-Activation Time The Creation of Goal-Scoring Opportunities at the 2019 FIFA Women’s World Cup Assessing the Sprint Force-Velocity Profile in International Football Players with Cerebral Palsy: Validity, Reliability and Sport Class’ Profiles Handcycling Training in Men with Spinal Cord Injury Increases Tolerance to High Intensity Exercise Hand Grip Strength vs. Locomotor Efficiency in Sitting Volleyball Players Evidence‐Based Recovery in Soccer – Low‐Effort Approaches for Practitioners Anaerobic Variables as Specific Determinants of Functional Classification in Wheelchair Basketball Addition of in‐Play Cooling Breaks During Intermittent Exercise while Wearing Lacrosse Uniforms in the Heat Attenuates Increases in Rectal Temperature The Effects of Preferred Music and Its Timing on Performance, Pacing, and Psychophysiological Responses During the 6‐min Test Prevalence of Dehydration and the Relationship with Fluid Intake and Self‐Assessment of Hydration Status in Czech First League Soccer Players A Study Comparing Gait and Lower Limb Muscle Activity During Aquatic Treadmill Running With Different Water Depth and Land Treadmill Running Changes in Heart Rate Variability and Post‐Exercise Blood Pressure from Manipulating Rest Intervals Between Sets of Resistance Training