Virus name: hCoV/Mexico/CDMX/InDRE_01/2020

GISAID accession ID: EPI_ISL_412972


A novel coronavirus, recently named severe acute respiratory syndrome-related coronavirus type 2 (SARS-CoV-2), was identified in the city of Wuhan, Hubei Province, China, in December of 2019 It is betacoronavirus that affects the lower respiratory tract and manifests as pneumonia in humans [1, 2]. According to the Johns Hopkins University Coronavirus Resource Center and the Daily Technical Statement on New Coronavirus Cases Worldwide (COVID-19) issued by the Secretary of Health of Mexico, as of March 20, 2020, 234,072 confirmed cases have been reported worldwide with 9840 deaths. In Mexico, 203 cases have been confirmed, with two deaths reported [3, 4]. At the time of this writing, 768 complete genomic sequences of SARS Cov-2 have been obtained and deposited in the GISAID database, and 10 clades have been defined from phylogenetic studies carried out with these sequences (A1, A1a, A2, A2a, A3, A5, B, B1, B2 and B4) [5].

Here, we report the complete genome sequence of SARS-CoV-2 from the first imported case of COVID-19 detected in Mexico. The sample hCoV/Mexico/CDMX/InDRE_01/2020 was obtained directly from a nasopharyngeal swab from a 35-year-old male patient (InDRE 1198) residing in Mexico City who had a confirmed diagnosis of SARS-CoV-2 on February 27, 2020 and had traveled to Bergamo, Italy, between February 16 and 22, 2020.

A diagnostic test for COVID-19 was first conducted at the Instituto Nacional de Enfermedades Respiratorias, Mexico City, on February 27, 2020. Following the national guidelines of the Mexican Ministry of Health [6], a sample was sent to the National Influenza Center at the Instituto de Diagnostico y Referencia Epidemiologicos (InDRE); the confirmatory diagnosis was conducted using a real-time RT-PCR protocol to detect SARS-CoV-2 [7], and of a 221-bp fragment of the region coding for the orf1ab polyprotein was sequenced by the Sanger method and will be published elsewhere. The sample from the patient had a cycle threshold value of 19.5 for the E gene and 20.4 for the RdRP gene.

The complete virus genome from the positive case was amplified using the novel coronavirus sequencing multiplex primers [8] (https://artic.network/ncov-2019). cDNA synthesis was conducted using Super Script IV Reverse Transcriptase (Invitrogen) and the concentration of PCR the product was measured using a Qubit dsDNA High Sensitivity Kit on a Qubit 3.0 fluorometer (Thermo Fisher) and an Agilent 2100 Bioanalyzer (Agilent Technologies).

A library without barcodes was generated and sequenced using an Ion Torrent system, using an Ion Xpress™ Plus Fragment Library Kit for sequencing on an Ion 318™ v2 chip. A total of 1,445,751 reads were obtained for hCov/Mexico/CDMX/InDRE_01/2020. The complete genome sequence was assembled using CLC Genomics Workbench 20 (QIAGEN) and SPAdes [9] and mapped against the reference sequence Wuhan-Hu-1 (GenBank accession number NC_045512.2).

The consensus sequence was assembled into a single contig with a total length of 29,849 bp, corresponding to 99.8% of genome, and a deep average coverage of 3,983X. The sequence was deposited in the GISAID database [5] under accession ID EPI_ISL_412972.

A database was constructed including 178 genomic sequences available on the GISAID platform, obtained from December 31, 2019 to March 05, 2020, reported in 22 countries. A phylogenetic tree was constructed in MEGA 6 software using the neighbor-joining method and the Kimura 2-parameter evolution model with 100 bootstrap replicates [10]. The strain hCoV/Mexico/CDMX/InDRE_01/2020 was grouped in the clade A2 subclade A2a (also called lineage G) and is closely related to isolates from Finland, Italy, and Germany as well as an isolate from the first imported SARS-CoV-2 case in Brazil [11] (Fig. 1).

Fig. 1
figure 1

Neighbor-joining tree constructed in MEGA using the Kimura 2-parameter evolution model with 100 bootstrap replicates. hCoV/Mexico/CDMX/InDRE_01/2020 (indicated by an arrow) is grouped in clade A2a with sequences previously reported from Brazil, Germany, Italy, and Finland. The 14408C>T substitution is characteristic of subclade A2a

Genetic analysis showed that the hCoV/Mexico/CDMX/InDRE_01/2020 genome differs by seven nucleotide substitutions compared to the Wuhan-Hu-1 reference strain: 241C>T, 3037C>T, 14408C>T, 23403A>G, 28881G>A, 28882G>A, and 28883G>C. We confirmed the presence of the 14408C>T substitution, which is characteristic of the sequences clustered in subclade A2a.

A second database of 768 sequences deposited until March 20, 2020, was used to construct a phylogenetic tree. This analysis showed that hCoV/Mexico/CDMX/InDRE_01/2020 clusters with European lineage G sequences, mainly from Belgium, Switzerland, and England (Fig. 2). This finding correlates with the transmission events reported by GISAID [5], which suggest that this strain was first introduced into Belgium, then England, and finally, Italy, where the Mexican patient had been traveling.

Fig. 2
figure 2

Neighbor-joining tree constructed with 768 sequences available in the GISAID database on March 20, 2020. The analysis shows that the Mexican strain (indicated by an arrow) clusters with European sequences, corroborating the transmission events reported by GISAID (Belgium-Germany-Italy-Mexico) for this first imported case in our country

This is the first report of the complete genome of this virus in our country, and these data will contribute to the molecular epidemiology of the SARS-CoV-2.