Whole genome sequencing and analysis of a goose-derived Mycoplasma gallisepticum in Guangdong Province, China
Publicado en línea: 28 nov 2024
Páginas: 497 - 508
Recibido: 18 abr 2024
Aceptado: 21 nov 2024
DOI: https://doi.org/10.2478/jvetres-2024-0065
Palabras clave
© 2024 Yuanyuan Zhou et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
Because the 16S ribosomal RNA (rRNA) sequences of MG and MI are very similar and differ by only two nucleotides (as indicated by the sequences logged under GenBank accession Nos M22441 and L24103) (16), primers designed based on the 16S rRNA sequence are not sufficiently specific to reliably distinguish between MG and MI in clinical samples, especially from waterfowl (16). The results of García
The swift advancement of high-throughput technologies, such as microarrays and second-generation, third-generation and single-cell sequencing, has led to the extensive application of bioinformatics analysis. This application serves to identify pivotal genes and signalling pathways in diverse diseases, contributing to precise diagnoses and personalised treatment strategies. After an extensive literature review, it is evident that comprehensive sequencing and bioinformatics analyses have been conducted on only 26 chicken-origin strains of
Partial air-sac tissues were obtained from five suspected
Clinical symptoms in diseased geese were systematically observed, and post-mortem examinations were conducted on deceased geese. Detailed records of the findings were made.
Air-sac tissues from affected geese were collected for the isolation of MG. A small amount of air-sac tissue was added to modified Frey’s liquid medium (Qingdao Haibo Biotechnology Co., Qingdao, China), vortexed and then incubated at 37°C. When the medium colour changed from red to yellow, the sample was plated. Colonies exhibiting clear Mycoplasma characteristics were selected for cultivation and sub-culturing. The purified second-generation bacterial fluid was inoculated onto modified Frey’s solid medium and incubated at 37°C in a 5% CO2 incubator for 7 d to observe colony morphology under a microscope (Nikon, Tokyo, Japan). To rule out the possibility of erroneously studying L-form bacteria, which have certain similarities to mycoplasma, the isolated pathogen was cultured and was observed conditionally on its displaying a stable and typical “fried-egg” appearance.
Twenty healthy 7-day-old Yangzhou geese, obtained from a goose hatching facility (Qingyuan Hongxing, Qingyuan, China), were used in the study. The geese were evenly distributed, with an equal representation of males and females. The criteria for selection included the absence of respiratory symptoms and negative results in a plate agglutination test and tracheal swab PCR. The geese were randomly divided into two groups of 10: one for inoculation and the other as a control group. A 0.5 mL sample of the third-generation bacterial fluid in the logarithmic growth phase was inoculated twice daily into the trachea of goslings in the inoculation group. Continuous monitoring of clinical symptoms was performed in the inoculation group and the control group. After 15 d, post-mortem examinations were conducted to observe pathological changes.
A 10 μL reaction system was prepared according to Table 1, and amplification was carried out using a PCR machine (Touch T960; JingGe Scientific Instrument, Hangzhou, China). The selection of primers was based on a thorough literature review. The intergenic spacer region primers (5'16S-IGSR-23S-3') (31), designed to differentiate between
PCR reaction system for amplification of
Component | Volume (μL) |
---|---|
TaKaRa Taq Version 2.0 plus dye | 5 |
Forward primer (10μM) | 0.3 |
Reverse primer (10μM) | 0.3 |
ddH2O | 3.4 |
Bacteria solution | 1 |
dd – double-distilled
Primers used for PCR analysis of amplicons of
Species | Primer sequences 5’→3’ | Fragment length (bp) | Annealing temperature (°C) |
---|---|---|---|
IGSR-F:GTAGGGCCGGTGATTGGAGTTA | |||
IGSR-R:CCCGTAGCATTTCGCAGGTTTG | 792 | 57 |
bp – base pairs; IGSR – intergenic spacer region; F – forward; R – reverse
The identified clinical isolates were purified again, and part of the samples were sent to the Chinese Center for Type Culture Collection (CCTCC, Wuhan, China). The isolates were named
A minimal inhibitory concentration (MIC) test was performed utilising an agar dilution method with slight modifications incorporating blank controls and quality controls (10). Modified Frey’s solid medium with an appropriate drug concentration was prepared. The MIC of eight antibiotics were determined against the goose-origin MG-GD01/22, the antibiotics spanning four classes: enrofloxacin and danofloxacin as quinolones, spectinomycin as aminoglycosides, tilmicosin, tylosin, and acetylisovaleryltylosin tartrate as macrolides, and tiamulin and valnemulin as pleuromutilins. Bacterial fluid containing the isolate was diluted to 106 colony-forming units (CFU)/mL, and 100 μL of the bacterial fluid was inoculated on the drug-containing solid plates, ensuring even distribution. The plates were then incubated at 37°C in a 5% CO2 incubator for 7 d.
A 5-mL volume of MG-GD01/22 bacterial fluid was inoculated into 45 mL of modified Frey’s liquid medium and cultured at 37°C to the logarithmic phase. After centrifugation at 12,000 rpm and 4°C for 30 min, the culture medium was discarded. The white precipitate was washed with ultrapure water, and the liquid was discarded after centrifugation. The collected precipitate was resuspended in 120 μL of ultrapure water and vortexed evenly. Extraction of total DNA from the MG-GD01/22 strain was achieved with a Mag-Bind Bacterial DNA 96 Kit (Omega Bio-tek, Norcross, GA, USA) as stated in the manufacturer’s manual. The DNA concentration was determined by micro-UV spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). The qualified DNA was stored at −20°C and then sent to Sangon Biotech (Shanghai, China) for third-generation whole-genome sequencing. The whole genome of MG-GD01/22 was sequenced using PacBio combined with Illumina technology, and sequence splicing was performed using Canu 1.3 (23), GapFiller v. 1.11 (7) and PrlnSeS-G 1.0.0 (29). The Prokka tool (32) was utilised to predict basic gene information from the MG-GD01/22 whole-genome sequencing results. Clustered, regularly interspaced short palindromic repeats (CRISPR) were predicted using the CRISPR Recognition Tool (6), gene islands were predicted using IslandPath-DIMOB (5), and prophage prediction analysis using PhiSpy (1). Gene function annotation was performed for the isolate. The bioinformatics software used in this study is listed in Table 3.
The bioinformatics software used for the prediction and annotation of
Database | Website |
---|---|
NR | HYPERLINK |
SwissProt | HYPERLINK |
COG | HYPERLINK |
GO | HYPERLINK |
KEGG | HYPERLINK |
VFDB | HYPERLINK |
CARD | HYPERLINK |
PHI-base | HYPERLINK |
CAZy | HYPERLINK |
NR – Non-redundant Protein Database; COG – Clusters of Orthologous Groups (of proteins); GO – Gene Ontology; KEGG – Kyoto Encyclopedia of Genes and Genomes; VFDB – Virulence Factor Database; CARD – Comprehensive Antibiotic Resistance Database; PHI-base – Pathogen Host Interactions; CAZy – Carbohydrate-Active enZYmes
The 16S rRNA and
The affected geese manifested lethargy, unkempt feathers and decreased food consumption. Some animals presented symptoms including nasal discharge, head shaking, neck extension, open-mouth breathing and elevated body temperature. Auscultation revealed wet rales in the lungs.
Initial dissection revealed mucus accumulation in the trachea and bronchi, pinpoint bleeding and congestion in lung tissues. A minor fibrous exudate between the thoracic and abdominal membranes and the air sac membranes was also found. Subsequent observations indicated patchy deposits on the air sacs accompanied by a degree of air-sac thickening and turbidity. In advanced stages of the disease, the air sacs contained cheesy white or yellowish substances, while the lungs and bronchi were filled with purulent secretions. Localised congestion or parenchymal consolidation was evident in lung tissues (Fig. 1A and B).

Isolation and identification of goose-origin
The inoculation of the purified bacterial fluid on modified Frey’s solid medium yielded mycoplasma colonies exhibiting characteristic of “fried-egg” morphology under microscopic examination (Fig. 1C). None of the bacteria identified as L-form, and the viable cell count surpassed 108 CFU/mL.
A distinctive band aligned with the MG S6 standard strain manifested at around 800 bp (Fig. 1D). This and the colony morphology identification methods collectively confirmed the goose-origin nature of this clinical isolate. The strain was designated as MG-GD01/22.
Animals in the infected group exhibited clinical manifestations, including open-mouthed breathing, head shaking, nasal discharge, depression and decreased food intake over a 15-day period. Necropsies revealed the nasal cavity to be filled with yellow purulent secretions, the air sacs to be slightly thickened and the existence of point-shaped and patchy light-yellow cheesy exudates. Contrastingly, animals in the control group displayed neither evident clinical symptoms nor pathological alterations (Fig. 2).

Changes caused by
The findings suggested that MG-GD01/22 had resistance to quinolones (levofloxacin and enrofloxacin) and aminoglycosides (florfenicol) but susceptibility to macrolides (tilmicosin, tylosin and tiamulin) and pleuromutilins (thiamphenicol and valnemulin) (Table 4).
Minimum inhibitory concentrations (MICs) of antimicrobials against a
Drug name | MG S6 MIC | MG-GD01 MIC |
---|---|---|
Danofloxacin | 0.8 | 3.2 |
Enrofloxacin | 0.4 | 3.2 |
Tilmicosin | 0.2 | 0.4 |
Tylosin | 0.2 | 0.4 |
Acetylisovalery ltylosin tartrate | 0.1 | 0.3 |
Tiamulin | 0.00625 | 0.0125 |
Valnemulin | 0.00625 | 0.0125 |
Spectinomycin | 0.4 | 3.2 |
The concentration of total DNA extracted from MG-GD01/22 was measured at 147.5 ng/μL. The sample purity was evaluated by means of the OD 260 nm : OD 280 nm ratios as 1.91 and OD 260 nm : OD 230 nm ratios as 2.00, which met the prescribed standards for whole-genome sequencing. Following first-generation sequencing validation, the species origin was conclusively identified as

Basic information and bioinformatics prediction for
There were three genomic islands and three prophage structures were anticipated within MG-GD01/22. According to the CRISPR prediction, the repetitive sequence in MG-GD01/22 began at position 867,297 and concluded at position 869,120. The repetitive sequence spanned 36 bp, featuring an interspace of 30 bp between consecutive repetitive sequences. A total of 28 repetitive sequence structures were identified in this study (Fig. 3B).
The COG system was established by the National Center for Biotechnology Information (NCBI) to annotate genes in prokaryotes for the direct homologous relationships among them. It categorises genes into three primary functional divisions. The system’s evolutionary relationship clusters genes with homologous traits from diverse species into distinct groups, wherein genes within the same group exhibit like functionalities. This enables the prognostication and elucidation of the acquired gene sequences. In this study, the predicted protein sequences of genes were aligned with the COG database for annotation, and the annotated genes were subsequently classified into specific groups (Fig. 4). Successfully annotated genes were classified within the MG-GD01/22 genome. The largest gene cluster was predicted to be associated with translation, ribosomal structure and biogenesis and to comprise 174 genes involved in cellular processes and signal transduction. The next-largest category within information storage and processing genes was replication, recombination and repair, encompassing 115 genes. The most prevalent category predicted was the metabolic cluster, which included 49 genes specifically involved in energy production and conversion. Separately, another subset comprised 45 genes associated with carbohydrate transport and metabolism. Additionally, the study identified two other distinct groups of 36 genes each, one involved in nucleotide transport and metabolism, and the other in the transport and metabolism of inorganic ions. The synthesis, transport and metabolism of secondary metabolites had the fewest predicted genes with only one of each. There remained 22 genes with unknown functions (Fig. 4).

Functional analysis of the
Gene ontology serves as an internationally standardised gene function classification system, offering a dynamically updated standard vocabulary for a thorough depiction of the characteristics of genes and their products within an organism. It encompasses three principal categories: molecular function of genes, cellular location and participation in biological processes. In this study, genes were categorised based on the GO system, and the outcomes are delineated in Fig. 5. We conducted GO classification on the acquired genes and identified the predominant gene category associated with biological processes. Among these genes, 724 were linked to metabolic processes, 694 to cellular processes, 103 to localisation, 71 to response to stimulus, 51 to biological regulation processes and 17 to biological adhesion. The next most numerous category comprised molecular function genes, predominantly catalytic activity genes (682) and binding factor genes (664). There were 122 transporter protein activity genes and 82 structural molecular activity genes, while 10 genes were associated with nucleic acid binding transcription factor activity. The category with the lowest predicted gene count was that of cell components, encompassing 577 genes predicted to be involved in cell-related processes, 244 genes related to membranes and 124 genes associated with cellular organelles (Fig. 5).

Functional analysis of the

Functional analysis of the
The Pathogen Host Interactions database (PHI-base), dedicated to cataloguing pathogen–host interactions, was used in this study. Annotation efforts revealed ten genes associated with loss of pathogenicity, seven genes increased the pathogen’s virulence, seven genes acted as effector, three genes with negligible impact on pathogenicity, and two genes reduced virulence (Table 5).
Pathogen Host Interactions database analysis of the
Taxon | Gene name | Gene number |
---|---|---|
Loss of pathogenicity | 10 | |
Increased virulence | 7 | |
Effector | 7 | |
Unaffected pathogenicity | 3 | |
Reduced virulence | 2 |
The Virulence Factor Database (VFDB) collates pathogenic factors in bacteria, chlamydiae and mycoplasmas. The
Predominantly, these genes belonged to the category of adenosine triphosphate synthase–binding cassette transporters (48%). Additional predictions encompassed
The Comprehensive Antibiotic Resistance Database (CARD) is a database of bacterial antibiotic resistance genes. A total of 22 antibiotic resistance genes in MG-GD01 were identified in this study. Notably, the highest number of genes conferred resistance to quinolone antibiotics (10 genes), a lower number were associated with aminocoumarin resistance (7 genes), and a single gene was noted to confer streptomycin resistance (Table 6).
The Comprehensive Antibiotic Resistance Database analysis of the
Antibiotic type | Gene name | Gene number |
---|---|---|
Fluoroquinolone | 10 | |
Streptomycin | 1 | |
Aminocoumarin | 7 | |
Daptomycin | 3 | |
Rifamycin | 2 | |
Other antibiotics | 2 |
The Carbohydrate-Active enzymes (CAZy) database encompasses six major enzyme categories: auxiliary oxidoreductases, glycoside hydrolases, glycosyltransferases, polysaccharide lyases and carbohydrate esterases. A search of the database for genes in the MG-GD01/22 strain found nine encoding proteins related to carbohydrate enzymes, including four proteins from the glycosyltransferase family 2 (carbohydrate structure domains) and two components of the pyruvate dehydrogenase complex dihydrolipoamide dehydrogenase (E3) (glycolipase). There were annotations for one gene each in polysaccharide lyases, auxiliary oxidoreductases and glycosyltransferases, with the highest proportion being proteins containing the found-in-various-architecture domain. No genes encoding proteins related to the glycoside hydrolase superfamily were annotated.
Thirty-two MG strains and one MI strain with high similarity to the 16S rRNA gene sequence (246,487 bp–247,991 bp; 1,505 bp) of MG-GD01/22 were selected by the NCBI basic local alignment search tool (BLAST) as reference strains for nucleotide sequence homology and phylogenetic tree analysis. The current isolate MG-GD01/22 differed minimally in nucleotide sequence from MG reference strains, demonstrating homology ranging from 99.4% to 99.9%. Its closest phylogenetic relationship was identified with MG S6 at 99.9%, whereas the homology with MI was comparatively lower at 95.2% (Fig. 7).

Nucleotide sequence homology analysis of the 16S ribosomal RNA of the
Recovery sequencing confirmed that the sequencing outcomes were entirely consistent with those of the 16S rRNA and

Phylogenetic tree of 16S ribosomal RNA of the
A meticulous comparative analysis was executed on the

Phylogenetic tree of
MUMmer is a suffix tree algorithm designed to find exact maximal unique matches (MUMs) of some minimum length between two input sequences. The match lists produced by MUMmer can be used alone to generate alignment dot plots, or can be passed on to the clustering algorithms for the identification of longer non-exact regions of conservation. The outcomes unveiled a substantial genomic homology between the clinical isolate and MG S6, although the collinearity was comparatively deficient. Noteworthy modifications were observed in the distribution of numerous genes, encompassing instances of gene loss and inversion (Fig. 10).

Collinearity analysis of the genomes of the
Mycoplasma infections in poultry, notably with MG and
In 2022, research focusing on the Guangdong region reported a 42% infection rate of MG among local geese. The open-air farming conditions, complex environments and stringent growth requirements for MG complicate the pathogen’s isolation, identification and detection. Consequently, epidemiological studies on
Considering the high prevalence of CRD, susceptibility to other respiratory pathogens, reduced egg production, prolonged treatment durations and frequent recurrences, which collectively make for significant economic losses, it is imperative for the goose farming industry to prioritise this issue. The sector must foster the conditions for further experimental research to enhance prevention and control strategies for
In this study, IGSR primers were applied to differentiate MG-GD01/22 from other causative agents successfully, confirming the 99.9% homology of its 16S rRNA with that of MG S6, and establishing it as goose-origin MG. Because of its good specificity,
In this investigation, goslings were tracheally inoculated with purified MG-GD01/22. Within a 15-day period, all the birds manifested conspicuous respiratory symptoms, and post-mortem examinations exhibited substantial pathological alterations in the respiratory system, aligning with the natural MG infection observed in afflicted geese and substantiating the pathogenicity of MG-GD01/22 in geese. Numerous virulence-related genes in MG-GD01/22 were annotated by VFDB database analysis. A pivotal stage in the MG invasion process involves adhesion, succeeded by colonisation and infection. Substances such as the elongation factor heat shock protein (GroEL), adhesin (
This investigation encompassed a comprehensive exploration into the pathogenicity, drug susceptibility and whole-genome sequencing analysis of a clinical isolate, MG-GD01/22, belonging to goose-origin
Virulence Factor Database analysis hinted at the presence of various virulence-related factors in MG-GD01/22, potentially influencing invasion mechanisms and inducing host damage. The findings from the CARD annotation aligned with the antimicrobial susceptibility test. Notably, the study pinpointed the