DNA barcoding with the COI mitochondrial gene has been widely applied for species identification and biodiversity assessment (Avó
DNA barcoding of nematodes with the COI mitochondrial gene is increasingly applied in studies of nematode taxonomy (Derycke
A large-scale survey of nematode diversity across diverse ecosystems was conducted previously (Neher
Metadata associated with each site were obtained from the United States Department of Agriculture’s Natural Resource Conservation Service geospatial database (USDA-NRCS, 2017) and assembled into a matrix that included the annual average minimum and maximum temperature and precipitation over 40 years (1981 to 2010). These three continuous variables were categorized by grouping them in increments of 5 years. Associated land cover and elevation for each individual nematode were obtained and viewed using ArcMap software (ESRI, 2011). Associated ecoregion name and biome for each site were identified and recorded into the matrix dataset (Ricketts
Twenty-four standard morphological measurements previously recorded were used in this study of
Summary of morphological statistics by HG of
N | 3 | 13 | 9 | 14 | 12 | 16 | 12 |
R | 99 ± 2.6 | 94 ± 4.3 | 94 ± 3.9 | 101 ± 4.7 | 95 ± 3.4 | 98 ± 7.2 | 100 ± 3.9 |
Rv | 7 ± 0.6 | 7 ± 0.9 | 6 ± 1.1 | 7 ± 0.9 | 7 ± 0.8 | 7 ± 1.2 | 7 ± 0.9 |
Rex | 29 ± 2.1 | 27 ± 1.6 | 28 ± 1.9 | 27 ± 1.8 | 26 ± 1.6 | 27 ± 17 | 27 ± 1.9 |
Body annule width (µm) | 6.4 ± 0.7 | 7.0 ± 1.0 | 7.3 ± 0.7 | 6.0 ± 0.7 | 6.8 ± 0.7 | 6.1 ± 0.6 | 7 ± 0.7 |
Length (µm) | 598.2 ± 43.2 | 634.6 ± 83.2 | 621.6 ± 50.5 | 587.2 ± 63.0 | 604.8 ± 65.9 | 579.9 ± 49.3 | 642 ± 44.7 |
Stylet length (µm) | 83.7 ± 2.1 | 87.6 ± 5.7 | 89.6 ± 3.8 | 72.6 ± 3.0 | 74.9 ± 3.6 | 73.3 ± 4.9 | 71 ± 1.4 |
Stylet knob width (µm) | 13.3 ± 0.6 | 13.5 ± 0.9 | 13.3 ± 1.1 | 12.0 ± 0.7 | 12.3 ± 1.4 | 11.6 ± 1.3 | 14 ± 0.6 |
VUL | 554.2 ± 41.3 | 594.2 ± 79.4 | 581.8 ± 52.3 | 549.7 ± 59.7 | 565.3 ± 61.0 | 539.4 ± 46.5 | 597 ± 40.1 |
MBW | 52.7 ± 1.5 | 51.8 ± 4.8 | 55.8 ± 9.1 | 45.6 ± 6.7 | 45.6 ± 5.1 | 47.2 ± 5.6 | 52 ± 3.7 |
VBW | 40.3 ± 2.5 | 38.0 ± 3.5 | 40.3 ± 5.9 | 35.0 ± 4.2 | 35.3 ± 4.5 | 36.7 ± 4.5 | 40 ± 4.6 |
ESO | 145.7 ± 1.2 | 159.1 ± 13.7 | 159.6 ± 6.0 | 136.2 ± 7.0 | 141.0 ± 6.6 | 142.4 ± 6.1 | 146 ± 8.9 |
V | 92.6 ± 0.8 | 93.6 ± 1.0 | 93.5 ± 1.2 | 93.6 ± 1.2 | 93.5 ± 0.6 | 93.0 ± 0.7 | 93 ± 1.2 |
HG, haplotype group.
Forward and reverse COI sequences were edited using CodonCode Aligner software version 4.2 (CodonCode Corporation, Centerville, MA; codoncode.com/aligner/), with manual adjustment. Sequences were aligned using ClustalW in MEGA6 (Tamura
Three variations of the original dataset were created for use in our analyses. The original dataset included all
Three types of phylogenetic trees were constructed, with
Each species delimitation method utilized the original dataset, with the exception of Birky’s
Statistical parsimony analysis was conducted on individual HG using the program TCS 1.21 (Clement
Under the assumption that two HG can diverge morphologically over time, the association of genetic structure was compared with the morphological variation described above, using a discriminant function analysis conducted in JMP®, Version 7 (SAS Institute Inc., Cary, NC, 1989–2007). The discriminant function analysis uses an algorithm that classifies cases into previously determined groups and derives a model that best discriminates groups, maximizing intragroup variation relative to inter-group variation (Friedman, 1989; McClure
The association between geographical distance within and between HG and the intra-genetic distance of the respective HG was assessed using the Mantel test (Mantel, 1967) in R software version 1.1.383 (R Core Team, 2018) using the mantel() function in the R package “ecodist” 2.0.1 (Goslee and Urban, 2007). This analysis assumes that a consequence of dispersal limited by geographic distance, isolation by distance, is correlated (positively or negatively) with the organisms’ genetic distance. This relationship can be positive, genetically similar, and geographically close together, or genetically dissimilar and geographically far apart. It can also be negative and genetically similar, but geographically far apart, or genetically dissimilar but geographically close together. The detection of a correlation indicates an effect of dispersal and/or migration probabilities within the individual HG. A pairwise distance matrix of genetic similarity between individual sequences within each HG was created using the JC69 substitution model on the dataset without singletons and without nematodes collected outside the United States. The JC69 assumes equal base pair frequencies and equal mutation rates. Geographic distance between individuals was also calculated in a pairwise fashion, using latitudinal and longitudinal coordinates and calculated using the geodesic() function in the R package “geosphere” 1.5–7 (Hijmans, 2017). This function calculates the shortest distance between two points following an ellipsoid (Hijmans, 2015). To determine whether there was increasing genetic similarity with decreasing distance, the two matrices were tested for structure along a spatial gradient with 10,000 permutations, 500-bootstrap iterations, a 0.90-resampling level, and a Pearson correlation coefficient with a 0.95-confidence level. The power and false-positive rates were estimated based on the significance threshold value of α = 0.05, the Mantel coefficient, MantleR (
To assess for associations between population structure and environmental parameters, a distance-based redundancy analysis (dbRDA) was applied (Legendre and Anderson, 1999). This analysis does not require groups to be defined a priori and relies on a forward–backward selection process to identify the model of environmental parameters that best predicts the genetic variation. This method detects linear relationships based on similarities and dissimilarities, generated by constrained ordination on a distance matrix representing the response variable, to describe the relative contribution of multiple independent explanatory variables (McClure
The strongest association was assigned by the visualization within the bi-plots. Principal components are indicated by an axis, with each variable or HG indicated a priori and projected onto the axis (Jongman, 1995). The strength of a variable in association with the HG’s genetic score is conveyed by the length of the bi-plot rays and the relative distance of the ray in relation to the HG’s location on the bi-plot figure. This plot indicates the presence of a correlation between HG and the bi-plot axes. The eigenvalues for these axes also indicate the importance of the subsequent variables associated with the axes, as well as other HG, in order to explain the relationship within the data matrices (Fiscus and Neher, 2002).
A discriminant analysis of principal components was implemented using the R package “adegenet” 2.1.0 to assess the per-sample posterior group assignment probability (Jombart
HG were defined according to the strongest environmental variables, and those containing fewer than five individuals were compared using an analysis of molecular variance (AMOVA) in the R package “pegas” version 0.10 (Excoffier
A total of 132 specimens were identified as
Locality and GenBank accession information for
N3 | Timmas Farm Ecological Forest Reserve, Cass County, NE, USA | Central Tall Grasslands | N | KJ787901 |
N173 | Middle Loup River, Hooker County, NE, USA | Nebraska Sand Hills Mixed Grasslands | N | KJ787902 |
N583 | Wakulla Springs State Park, Wakulla County, FL, USA | Southeastern Conifer Forests | N | KJ787880 |
N584 | Ichetucknee Springs State Park, Columbia County, FL, USA | Southeastern Conifer Forests | N | KJ787885 |
N588 | Ichetucknee Springs State Park, Columbia County, FL, USA | Southeastern Conifer Forests | N | KJ787886 |
N607 | Schluckebier Prairie State Natural Area, Sauk County, WI, USA | Upper Midwest forest-savanna Transition Zone | N | KJ787881 |
N724 | Chimney Creek, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KJ787906 |
N728 | Pickens County, SC, USA | Southeastern Mixed Forests | Y | KJ787873 |
N729 | Pickens County, SC, USA | Southeastern Mixed Forests | Y | KJ787874 |
N730 | Pickens County, SC, USA | Southeastern Mixed Forests | Y | KJ787875 |
N733 | Ozark National Forest, AR, USA | Central US Hardwood Forests | N | KJ787882 |
N735 | Ozark National Forest, AR, USA | Central US Hardwood Forests | N | KJ787883 |
N736 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Mixed Forests | Y | KJ787887 |
N746 | Crawford Bay, British Columbia, Canada | Cascade Mountains Leeward Forests | N | KJ787907 |
N747 | Crawford Bay, British Columbia, Canada | Cascade Mountains Leeward Forests | N | KJ787908 |
N944 | Timmas Farm Ecological Forest Reserve, Cass County, NE, USA | Central Tall Grasslands | N | KJ787903 |
N945 | Timmas Farm Ecological Forest Reserve, Cass County, NE, USA | Central Tall Grasslands | N | KJ787904 |
N947 | Timmas Farm Ecological Forest Reserve, Cass County, NE, USA | Central Tall Grasslands | N | KJ787905 |
N999 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KJ787896 |
N1024 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KJ787897 |
N1025 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KJ787898 |
N1028 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KJ787899 |
N1072 | Nine-Mile Prairie, Lancaster County, NE, USA | Central Tall Grasslands | N | KJ787913 |
N1073 | Nine-Mile Prairie, Lancaster County, NE, USA | Central Tall Grasslands | N | KJ787914 |
N1215 | Long Branch Stream Valley Park, Fairfax County, VA, USA | Southeastern Mixed Forests | N | KJ787900 |
N1216 | Long Branch Stream Valley Park, Fairfax County, VA, USA | Southeastern Mixed Forests | N | MN711175 |
N1217 | Spring Green Prairie Preserve, Sauk County, WI, USA | Upper Midwest forest-savanna Transition Zone | N | KJ787884 |
N1267 | George Washington Memorial Parkway, Fairfax County, VA, USA | Southeastern Mixed Forests | N | KJ787909 |
N1276 | George Washington Memorial Parkway, Fairfax County, VA, USA | Southeastern Mixed Forests | N | KJ787910 |
N1294 | Musser Fruit Research Farm, Oconee County, SC, USA | Southeastern Mixed Forests | Y | KJ787876 |
N1297 | Musser Fruit Research Farm, Oconee County, SC, USA | Southeastern Mixed Forests | Y | KJ787877 |
N1298 | Musser Fruit Research Farm, Oconee County, SC, USA | Southeastern Mixed Forests | Y | KJ787878 |
N1327 | Fresno County, CA, USA | California Central Valley | Y | KU236636 |
N1346 | George Washington Memorial Parkway, Fairfax County, VA, USA | Southeastern Mixed Forests | N | KY574831 |
N1361 | George Washington Memorial Parkway, Fairfax County, VA, USA | Southeastern Mixed Forests | N | KY574643 |
N1368 | Nine-Mile Prairie, Lancaster County, NE, USA | Central tall grasslands | N | KJ787915 |
N1375 | Nine-Mile Prairie, Lancaster County, NE, USA | Central tall grasslands | N | KJ787916 |
N1397 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | KY574644 |
N1451 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | KJ788063 |
N2262 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | KY574645 |
N2269 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | MN711176 |
N2528 | Nine-Mile Prairie, Lancaster County, NE, USA | Central tall grasslands | N | KY574650 |
N2557 | Autauga County, AL, USA | Southeastern Mixed Forests | N | KY574633 |
N2558 | Autauga County, AL, USA | Southeastern Mixed Forests | Y | MN711177 |
N2577 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | KY574624 |
N2604 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | MN711178 |
N2610 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | MN711179 |
N2611 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | MN711180 |
N2615 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | KY574634 |
N2618 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | KY574635 |
N2619 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | MN711181 |
N2622 | Chilton County, AL, USA | Southeastern Mixed Forests | Y | KY574623 |
N2694 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MF770909 |
N2719 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711182 |
N2720 | Albright Grove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711183 |
N2727 | Purchase Knob, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711184 |
N2728 | Purchase Knob, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711185 |
N2842 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711186 |
N2844 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711187 |
N2849 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | KY574651 |
N2850 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | KY574652 |
N2851 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | MN711188 |
N2853 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | MN711189 |
N2855 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711190 |
N2857 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711191 |
N2858 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711192 |
N2863 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | KY574653 |
N2864 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | MN711193 |
N2869 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KY574646 |
N2873 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711194 |
N2874 | Double Springs Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711195 |
N2891 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | KY574654 |
N2892 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | KY574655 |
N2893 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | MN711196 |
N2896 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | N | MN711197 |
N2933 | West Point, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KY574647 |
N2945 | West Point, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711198 |
N2998 | Cades Cove, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711199 |
N3008 | Goshen Prong, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KY574648 |
N3013 | Trillium Gap, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711200 |
N3078 | Gifford Woods State Park, VT, USA | New England Acadian Forests | N | KY574639 |
N3084 | Hayden Prairie State Preserve, Howard County, IA, USA | Central tall grasslands | N | MN711201 |
N3112 | Hayden Prairie State Preserve, Howard County, IA, USA | Central tall grasslands | N | MN711202 |
N3124 | Hayden Prairie State Preserve, Howard County, IA, USA | Central tall grasslands | N | KY574656 |
N3153 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | KY574649 |
N3154 | George Washington Memorial Parkway, VA, USA | Southeastern Mixed Forests | N | MN711203 |
N3208 | Ichetucknee Springs State Park, FL, USA | Southeastern Conifer forests | N | KY574625 |
N3246 | Oconaluftee, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711204 |
N3253 | Oconaluftee, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711205 |
N3316 | Twin Creeks, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KY574640 |
N3320 | Twin Creeks, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | KY574832 |
N3342 | Torreya State Park, FL, USA | Southeastern Conifer forests | N | KY574636 |
N3361 | Raspberry Island, Apostle Islands National Lakeshore, WI, USA | Western Great Lakes Forest | N | KY574641 |
N3371 | Raspberry Island, Apostle Islands National Lakeshore, WI, USA | Western Great Lakes Forest | N | MN711206 |
N3372 | Raspberry Island, Apostle Islands National Lakeshore, WI, USA | Western Great Lakes Forest | N | MN711207 |
N3373 | Raspberry Island, Apostle Islands National Lakeshore, WI, USA | Western Great Lakes Forest | N | MN711208 |
N3374 | Raspberry Island, Apostle Islands National Lakeshore, WI, USA | Western Great Lakes Forest | N | MN711209 |
N3481 | Wakulla Springs State Park, FL, USA | Southeastern Conifer forests | N | MN711210 |
N3491 | West Point, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MF770951 |
N3492 | West Point, Great Smoky Mountains National Park, USA | Appalachian Blue Ridge Forests | N | MN711211 |
N5508 | Oconaluftee, Great Smoky Mountains National Park, USA | Appalachian-Blue Ridge Forests | N | MN711216 |
N5587 | Tuskegee National Forest, Macon County, AL, USA | Southeastern Mixed Forests | N | KY574626 |
N5592 | Canyonlands-South, Big Thicket National Preserve, TX, USA | Piney Woods forests | N | KY574627 |
N5593 | Canyonlands-South, Big Thicket National Preserve, TX, USA | Piney Woods forests | N | KY574628 |
N5603 | Canyonlands-South, Big Thicket National Preserve, TX, USA | Piney Woods forests | N | MF770959 |
N5638 | Tuskegee National Forest, Macon County, AL, USA | Southeastern Mixed Forests | N | KY574629 |
N5643 | Big Sandy Creek, Big Thicket National Preserve, TX, USA | Piney Woods forests | N | KY574630 |
N5645 | Big Sandy Creek, Big Thicket National Preserve, TX, USA | Piney Woods forests | N | KY574631 |
N5712 | Ichetucknee Springs State Park, FL, USA | Southeastern Conifer forests | N | MN711217 |
N5713 | Ichetucknee Springs State Park, FL, USA | Southeastern Conifer forests | N | MN711218 |
N5714 | Ichetucknee Springs State Park, FL, USA | Southeastern Conifer forests | N | MN711219 |
N5715 | Ichetucknee Springs State Park, FL, USA | Southeastern Conifer forests | N | MN711220 |
N5726 | Black Hills National Forest, Lawrence County, SD, USA | South Central Rockies forests | N | KY574642 |
N5727 | Black Hills National Forest, Lawrence County, SD, USA | South Central Rockies forests | N | MN711221 |
N5728 | Black Hills National Forest, Lawrence County, SD, USA | South Central Rockies forests | N | MN711222 |
N5731 | Black Hills National Forest, Lawrence County, SD, USA | South Central Rockies forests | N | MN711223 |
N5813 | Cuming County, NE, USA | Central Tall Grasslands | Y | KY574637 |
N5814 | Cuming County, NE, USA | Central Tall Grasslands | Y | MN711224 |
N5815 | Cuming County, NE, USA | Central Tall Grasslands | Y | MN711225 |
N5816 | Cuming County, NE, USA | Central Tall Grasslands | Y | KY574632 |
N5943 | Tulare County, CA, USA | Central California Valley | Y | KY574638 |
P74053 | Fresno County, CA, USA | Central California Valley | Y | KJ787911 |
P194033 | Konza Prairie Biological Station, Riley County, KS, USA | Flint Hills Tall Grasslands | Y | KJ787912 |
P231026 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787888 |
P231028 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787889 |
P231030 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787879 |
P231031 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787890 |
P231032 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787891 |
P231034 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787892 |
P231035 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787893 |
P231036 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787894 |
P231037 | Southeastern Fruit and Nut Research Station, Peach County, GA, USA | Southeastern Plains | Y | KJ787895 |
NID, nematode identification number.
Figure 1 displays a maximum-likelihood tree of the unique COI haplotypes. The bootstrap and posterior probability support values for HG are labeled at defining nodes sequentially for neighbor-joining, maximum-likelihood, and Bayesian analyses. Two more ancestral nodes are labeled to illustrate the close relationship between HG 12 and 13, and group 11 as a sister group to the pair. Each of the seven HG are supported by posterior probability values of 1.0 in Bayesian analyses and bootstrap values of 99 to 100 for neighbor-joining and maximum-likelihood trees.
A condensed maximum-likelihood tree of 721-bp sequences isolated from the COI gene for each individual in the morphospecies
The seven HG supported by phylogenetic analyses served as primary species hypotheses for testing by other species delimitation approaches. HG 8, 11, 13, and 14 had significant support from each of the species delimitation methods (Fig. 1). These four groups were characterized by (i) the statistical parsimony network analysis indicating 90% confidence level resulting in a single interconnected network, (ii) distinct recursive partitioning (
Calculations of Birky’s 4× test on the morphospecies
13 | 0.029 | 0.006 | 4.833* |
12 | 0.029 | 0.031 | 0.935 |
11 | 0.060 | 0.006 | 10.000* |
14 | 0.102 | 0.009 | 11.333* |
8 | 0.079 | 0.003 | 26.333* |
10 | 0.086 | 0.006 | 14.333* |
9 | 0.089 | 0.096 | 0.927 |
Interclade divergence calculated as the observed average distance between clades corrected for multiple hits (Birky, 2013).
Interclade variation calculated as π(1–4π/3), with π equal to the relativized nucleotide diversity that is corrected for sample size (Birky, 2013).
Degree of monophyly within the group, where values greater than 4.0 indicate reciprocal monophyly.
95% confidence of reciprocal monophyly.
HG, haplotype group.
Species delimitation statistics corresponding to the neighbor-joining tree HG built from the COI gene sequence for the morphospecies
13 | 12 | 0.004 | 0.03 | 0.13 | 0.95 | 0.98 | 0.002 | 0.24 | 7.00E−14 |
12 | 13 | 0.007 | 0.03 | 0.23 | 0.92 | 0.97 | 0.005 | 1.00 | 7.00E−14 |
11 | 13 | 0.002 | 0.063 | 0.02 | 0.99 | 1.00 | 0.004 | 0.05 | 3.60E−18 |
14 | 8 | 0.008 | 0.118 | 0.07 | 0.97 | 0.99 | 0.004 | 0.05 | 1.40E−24 |
8 | 13 | <0.001 | 0.095 | 0.01 | 0.94 | 1.00 | <0.001 | 0.05 | 6.70E−15 |
10 | 8 | 0.005 | 0.098 | 0.05 | 0.98 | 1.00 | 0.004 | 0.41 | 3.62E−03 |
9 | 8 | 0.03 | 0.104 | 0.29 | 0.90 | 0.97 | 0.020 | 0.56 | 3.90E−26 |
Summary of genetic statistics for the 721-bp sequence isolated from the COI gene for each individual HG among the seven HG identified within the morphospecies
13 | 28 | 11 | 5 | 0.738 | 1.354 | 10 | 0.002 |
12 | 16 | 15 | 9 | 0.858 | 4.683 | 8 | 0.006 |
11 | 16 | 8 | 2 | 0.342 | 1.308 | 3 | 0.002 |
14 | 18 | 10 | 6 | 0.863 | 1.941 | 8 | 0.003 |
8 | 8 | 1 | 1 | 0.429 | 0.428 | 2 | <0.001 |
10 | 23 | 7 | 3 | 0.711 | 1.628 | 6 | 0.002 |
9 | 20 | 60 | 50 | 0.932 | 19.53 | 13 | 0.027 |
Pooled | 132 | 214 | 182 | 0.963 | 56.15 | 53 | 0.078 |
Number of individuals.
Polymorphic sites.
Parsimony informative sites.
Hd.
Average number of nucleotide differences.
Number of haplotypes.
Nucleotide diversity.
Hd, haplotype diversity; HG, haplotype group.
Isolation by distance metric calculated by Mantel test statistic (Mantel R), and associated
13 | 0.048 | 0.731 |
12 | 0.206 | 0.118 |
11 | 0.989 | <0.001* |
14 | –0.108 | 0.443 |
8 | 0.874 | 0.021* |
10 | 0.627 | <0.001* |
9 | 0.284 | 0.005* |
Significant Mantel R scores at
HG, haplotype group.
Nematodes included in HG 12 (
Distribution of
Two HG were predominantly comprised of specimens associated with peach orchards. HG 8 (
HG 14 was the only HG not associated with North American forests. All specimens in this group were collected from remnant tallgrass prairie sites in Iowa, Kansas, and Nebraska. Precise host relationships have not been determined for this group, but collection sites were in the proximity of native woody shrubs that commonly invade prairie habitats, such as roughleaf dogwood (
HG 9 has the greatest Hd, intragroup nucleotide diversity (Pi), and average number of nucleotide differences (k) (Table 5). It is also unique among the seven HG in that its haplotypes were collected from Florida to Texas from Gulf Coast forests. Six haplotypes were recorded in Big Thicket National Preserve in Texas, from collection sites that featured native pine trees. Two specimens recovered from native prairie sites in Wisconsin may suggest the role of the Mississippi River as a corridor for plant and animal dispersal.
Results within the dbRDA analysis identified an autocorrelation (0.964) between minimum and maximum temperatures. Minimum temperature displayed less of the variation within the global model, a model that includes all possible variables; therefore, it was omitted from further analysis. Multivariate dbRDA analysis of the pairwise genetic distances and environmental metadata identified that the global model accounted for 35.7% (
An ordination plot using dbRDA bi-plot showing the seven most influential explanatory environmental variables (arrows) overlain on the first two eigenvectors of the analysis of
Based on the dbRDA results, land cover was influential for the separation of three HG and was the strongest predictor variable. Therefore, land cover was analyzed further using the discriminant analysis of principal components. The first seven principal components were analyzed, saving four discriminant functions with 92.9% of the variation accounted for. The first discriminant axis represented 11.0% of the variation, and the second axis represented 4.2% (Fig. 4). In general, individuals were clustered loosely near herbaceous land cover, which was the most isolated variable on the bi-plot. The pairwise AMOVA indicated differentiation with significant differences between nematodes from herbaceous and woody wetlands (
Scatterplots of discriminant analysis of principal components, calculated on the category of land cover that distinguishes the sequential data the most. Points represent observed individual
Stacked bar plot of the probability of each
Population comparison defined by four categories of land use associated with the morphospecies
Deciduous forest | 25 | – | 0.010 | <0.001 | 0.011 | <0.001 |
Developed open space | 5 | 0.263 | – | 0.92 | 0.168 | 0.001 |
Woody wetlands | 10 | 0.375 | –0.114 | – | 0.027 | <0.001 |
Cultivated crops | 10 | 0.118 | 0.087 | 0.217 | – | 0.009 |
Herbaceous | 9 | 0.654 | 0.725 | 0.757 | 0.330 | – |
The lower triangle displays the statistic
Morphological analysis of each HG in the discriminant function analysis identified that stylet length, total body length, and stylet knob width are the strongest distinguishing features among the seven HG as defined a priori (Fig. 6), and stylet length to be the strongest distinguishing feature. The analysis correctly classified 61 out of 87 (70.1%) of the sequences belonging to their respective groups (misclassifying 26). Canonical discriminant function for the first dimension indicated that stylet length and total body length accounted for most of the variation between groups (85.8% variation;
Canonical plot of the seven
In this study, we sought to investigate the ecological and morphological groupings within
Support for HG 9 and 14, and for the combination of HG 11, 12, and 13, were recognized based on environmental variables. HG 14 stood out as a group found only in native prairies and associated with midwestern prairie ecoregions and herbaceous land cover type. This distribution pattern has previously been observed in other criconematid nematode species (Powers
Our study was the first to analyze the population structure of
A surprising result of this study is the association of specific HG with agroecosystems, which has the potential to influence management strategies for nematodes feeding on agroeconomic hosts. In particular, the results have implications for the process of breeding for resistance in agronomic hosts, as different genetic populations may exhibit different physiological responses. Collectively, our study provides multiple lines of evidence that the morphospecies named