Field samplings and pathogen isolation
Diseased apricot fruits (Prunus armeniaca ‘Bei-zhai-hong-xing’) were collected from orchards in Pinggu District, Beijing, China (40°01′N, 117°01′E). Five symptomatic fruits displaying typical black blotches were randomly sampled. Pathogen isolation followed Guarnaccia et al. (2021) with minor modifications. Symptomatic tissues (0.5–1.0 cm) from lesion margins were surface-disinfected in 70% ethanol (30 s), 1% sodium hypochlorite (30 s), rinsed in sterile distilled water, dried on sterile filter paper, and plated on Luria-Bertani agar (LBA). Plates were incubated at 25 ± 1°C for 48 h. Single colonies were re-streaked onto fresh LBA plates. After 48 h incubation, single colonies were re-isolated on LBA to obtain pure cultures. Twenty isolates were obtained; two (Hongxing and HX-A) were selected for molecular characterization (Table 1). Stock cultures were stored on LBA slants at 4°C for further study.
Test of pathogenicity
The pathogenicity was verified according to the procedures by Ogiso et al. (2002) with minor modifications. A bacterial suspension (104 CFU/mL) from pure cultures of the pathogen prepared from LB was sprayed onto five healthy apricot fruits (Prunus armeniaca “Bei-zhai-hong-xing” ) in the orchard. The fruits inoculated with LB diluted by 10×fold with sterilized distilled water served as controls. Symptoms were recorded 7 days post-inoculation.
Morphological Characterization
Morphological characterization of the pathogen strains was carried out according to the methods described by Khanal et ac., (2022), with minor modifications.The pathogen strain was preliminarily identified by using morphological characteristics. The strain was inoculated on an LBA medium and observed for culture characters after incubation at 25°C for 48h. The strain cultured for 18–24 h on LBA plates was subjected to gram staining. Cell morphology was observed and photographed using an Olympus BX51 microscope. Biochemical tests were performed by using the API 50 CH Identification Kit. All the experiments were evaluated in triplicate to obtain accurate data.
Molecular and Phylogenetic analysis
The pathogen strain was further identified through the analysis of its 16S rRNA gene sequences. Briefly, the DNA of the strain was extracted by using the TIANamp bacterial DNA extraction kit and stored at 4°C. The 16S rDNA was amplified by polymerase chain reaction (PCR) with the bacterial universal primers 27F (5′-AGAGTTTGATCCTGGCTCAG-3′) and 1492R (5′-GGTTACCTTGTTACGACTT-3′). The PCR reaction mixture contained 1 µL of DNA template, 5 µL of 10× PCR buffer, 4 µL of dNTPs (2.5 mmol/L), 3 µL of MgCl2 (25 mmol/L), 1 µLof each primer (1 mmol/L), 0.5 µL of Taq polymerase and 34.5 µL of ultrapure water. The reaction condition of 16S rDNA amplification was as follows: 94°C for 5 min; 30 cycles of 94°C for 1 min, 55°C for 30 s and 72°C for 1 min; and a final extension at 72°C for 10 min. Amplicons were electrophoresed on 1% agarose gels and sequenced (Beijing Tianyihuiyuan Biotechnology Co., Ltd.). DNA sequence homology searches were performed by using the online BLAST search engine in GenBank (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Phylogenetic trees were constructed from 16S rRNA sequences using MEGA7 (neighbor-joining method).
Genome sequencing and genome annotation
Genomic DNA was extracted using Wizard® Genomic DNA Purification Kit (Promega) according to manufacture’s protocol. Purified genomic DNA was quantified by TBS-380 fluorometer (Turner BioSystems Inc., Sunnyvale, CA). High quality DNA (OD260/280 = 1.8 ~ 2.0, >1µg) was used to do further research.
For Illumina sequencing, at least 1µg genomic DNA was used for each strain in sequencing library construction. DNA was sheared (400–500 bp) using a Covaris M220 (per manufacturer’s protocol).. Illumina sequencing libraries were prepared from the sheared fragments using the NEXTflex™ Rapid DNA-Seq Kit. Briefly, 5’ prime ends were first end-repaired and phosphorylated. Next, the 3’ ends were A-tailed and ligated to sequencing adapters. The third step is to enrich the adapters-ligated products using PCR. The prepared libraries then were used for paired-end Illumina sequencing (2 × 150 bp) on an Illumina HiSeq X Ten machine. Bioinformatics analyses used Majorbio Cloud Platform (www.majorbio.com).
The original image data is transferred into sequence data via base calling, which is defined as raw data or raw reads and saved as FASTQ file. Those FASTQ files are the original data provided for users, and they include the detailed read sequences and the read quality information. A statistic of quality information was applied for quality trimming, by which the low quality data can be removed to form clean data. An assembly of the clean reads were performed using SOAPdenovo2 (Koren et al., 2017).
Glimmer (Delcher et al., 2007) was used for CDS prediction, tRNA-scan-SE (Borodovsky et al., 1993) was used for tRNA prediction and Barrnap was used for rRNA prediction. The predicted CDSs were annotated from NR, Swiss-Prot, Pfam, GO, COG and KEGG database using sequence alignment tools such as BLASTP, Diamond and HMMER. Briefly, each set of query proteins were aligned with the databases, and annotations of best-matched subjects (e-value < 10− 5) were obtained for gene annotation.
The phylogeny based on the whole genome data was determined by calculating the average nucleotide identity with closely related Curtobacterium species with the Orthologous Average Nucleotide Identity Tool (OrthoANI)(Lee et al., 2016). Genome-based taxonomy was analyzed using TYGS (https://tygs.dsmz.de; Meier-Kolthoff & Göker, 2019).