Structure guided prediction of Pyrazinamide resistance mutations in pncA

Karmakar, Malancha; Rodrigues, Carlos H. M.; Horan, Kristy; Denholm, Justin T.; Ascher, David B.

doi:10.1038/s41598-020-58635-x

Article
Open access
Published: 05 February 2020

Scientific Reports volume 10, Article number: 1875 (2020)

Abstract

Pyrazinamide plays an important role in tuberculosis treatment; however, its use is complicated by side-effects and challenges with reliable drug susceptibility testing. Resistance to pyrazinamide is largely driven by mutations in pyrazinamidase (pncA), responsible for drug activation, but genetic heterogeneity has hindered development of a molecular diagnostic test. We proposed to use information on how variants were likely to affect the 3D structure of pncA to identify variants likely to lead to pyrazinamide resistance. We curated 610 pncA mutations with high-confidence experimental and clinical information on pyrazinamide susceptibility. The molecular consequences of each mutation on protein stability, conformation, and interactions were computationally assessed using our comprehensive suite of graph-based signature methods, mCSM. The molecular consequences of the variants were used to train a classifier with an accuracy of 80%. Our model was tested against internationally curated clinical datasets, achieving up to 85% accuracy. Screening of 600 Victorian clinical isolates identified a set of previously unreported variants, for which our model showed 71% agreement with drug susceptibility testing. Here, we have shown the 3D structure of pncA can be used to accurately identify pyrazinamide resistance mutations. SUSPECT-PZA is freely available at: http://biosig.unimelb.edu.au/suspect_pza/.

Introduction

Tuberculosis (TB), caused by Mycobacterium tuberculosis, is the leading cause of infectious disease death worldwide. In 2017, 10 million people fell ill, and 1.6 million died, from tuberculosis¹. While a range of antibiotics are available to treat TB, treatment is prolonged, and the increasing emergence of drug-resistant bacteria is a considerable threat to global health. In 2017 alone, an estimated 558,000 people developed multi-drug-resistant tuberculosis (MDR-TB), resistant to the two first-line drugs rifampicin and isoniazid¹.

Pyrazinamide (PZA) is a first-line drug that exhibits unique sterilizing activity towards both drug-susceptible and MDR-TB². It is responsible for the killing of the persistent tubercle bacilli during the initial intensive phase of chemotherapy, allowing treatment to be shortened from 9 months to 6 months for drug-susceptible cases³. PZA therapy has been linked to improved outcomes for both non-MDR and MDR-TB, and is being considered as part of future regimens in combination with bedaquiline, delamanid, PA-824 and moxifloxacin, which are currently in phase three trials^{4,5}.

Despite the highly important role of PZA in clinical outcomes, resistance has largely been underestimated, with up to 20% of non-MDR-TB patients PZA resistant⁶. Being a central drug in current and future regimens, it is important to be able to rapidly and accurately identify resistant isolates and track the emergence and spread of drug resistant strains. In vitro drug susceptibility testing (DST) is challenging, expensive and time-consuming as PZA is effective against M. tuberculosis only at acidic pH, leading to false resistance rates of up to 70%^{7,8,9,10,11,12,13}. This has led to the WHO recommending the development of molecular genetics tests.

PZA is a structural analog of nicotinamide and is a pro-drug that needs to be converted into its active form, pyrazinoic acid (POA), by the non-essential enzyme pyrazinamidase, encoded by the pncA gene^{14,15}. It has been postulated that the mechanism of action of PZA is through POA, which disrupts the bacterial membrane energetics and inhibits the membrane transport function which is necessary for the survival of the bacterium, at an acidic site of infection¹⁶. PZA resistance has been linked to mutations in a number of genes, including pncA, rpsA¹⁷, panD¹⁸, clpC1¹⁹, and the putative efflux pumps Rv0191, Rv3756c, Rv3008, and Rv1667c²⁰, but mutations in pncA are the major mechanism for PZA resistance (70–97%)²¹. While sequencing the pncA gene can be a more reliable method to determine resistance than DST, which is prone to missing low-level pyrazinamide resistance caused by non-synonymous mutations in pncA²², the development of a genetics-based resistance screen is complicated as resistant and non-resistant mutations are found across the entire protein.

To address the lack of a reliable DST for PZA, we previously showed that protein structural information can be used in a clinical setting to rapidly, accurately and pre-emptively predict drug resistant mutations in pncA²³. This showed that mutations that affected protein folding, flexibility, stability and activity were strongly associated with resistance. Here we have used a comprehensive combination of structure and sequence-based features to develop a predictive tool to characterize novel PncA mutations, which we tested on novel mutations from the Victorian Tuberculosis Program, CRyPTIC²⁴ and the Miotto et al. dataset²⁵. This highlights the potential of using structural information to guide the genetic detection of resistance. We have implemented our model through the webserver SUSPECT-PZA (http://biosig.unimelb.edu.au/suspect_pza/), which will enable the rapid structural evaluation of the molecular and phenotypic consequences of any pncA nonsynonymous mutation to support informed clinical decisions.

Results

We used a structure-guided approach to understand the structural and functional consequences of variants in the drug target PncA, and machine learning to build an empirical tool that could identify likely resistant mutations. The workflow used to analyze the mutations and train a Random Forest algorithm is shown in Fig. 1 and comprises three major steps: (1) data curation, which can be subdivided into mutational data set acquisition and protein structure curation; (2) feature analysis, which involves the generation and evaluation of features selected to develop the predictive model to determine novel drug resistance mutations in PncA; (3) machine learning and webserver development, which aims to train, test and validate a supervised machine learning algorithm to accurately predict the susceptibility of a variant, and to provide a database (SUSPECT-PZA) with information on all possible variants of PncA.

Distribution of the mutations on the structure

We curated a dataset of 1322 nonsynonymous substitutions with high-quality, experimentally measured PZA susceptibility (71 susceptible mutations from GMTV²⁶, 12 resistant mutations from GMTV²⁶, 178 resistant mutations from TBdreamDB²⁷, Fig. 2A, 547 resistant and 514 susceptible mutations from experimental saturation mutagenesis²⁸). After removal of duplicate mutations, we were left with a dataset of 610 mutations, which included 305 susceptible and 305 resistant mutations. Mapping the complete set of 610 curated nsSNVs (Fig. 1) and the clinical variants alone (Fig. 2B) onto the crystal structure of PncA revealed that variants were distributed throughout the entire protein structure, complicating resistance inference from sequence analysis. We also observed that the resistance mutations were not solely localized at the drug binding site but distributed throughout the protein (Fig. 2C).
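
The record tally and the de-duplication step described above can be sketched as follows. This is a minimal illustration, assuming each record is a (mutation, label) pair; the function name `deduplicate` and the toy records are hypothetical, and mutations reported with both labels are only flagged here rather than resolved.

```python
from collections import defaultdict

# Record counts quoted in the text, before de-duplication.
SOURCE_COUNTS = {
    "GMTV susceptible": 71,
    "GMTV resistant": 12,
    "TBdreamDB resistant": 178,
    "saturation mutagenesis resistant": 547,
    "saturation mutagenesis susceptible": 514,
}
TOTAL_RECORDS = sum(SOURCE_COUNTS.values())


def deduplicate(records):
    """Collapse repeated (mutation, label) records.

    Returns (unique, conflicts): `unique` maps each mutation that has one
    consistent label to that label; `conflicts` lists mutations reported
    with more than one label, which would need manual review.
    """
    labels = defaultdict(set)
    for mutation, label in records:
        labels[mutation].add(label)
    unique = {m: next(iter(s)) for m, s in labels.items() if len(s) == 1}
    conflicts = sorted(m for m, s in labels.items() if len(s) > 1)
    return unique, conflicts


# Toy records for illustration only; not taken from the curated dataset.
toy_records = [("A1C", "R"), ("A1C", "R"), ("D2E", "S"), ("F3G", "R"), ("F3G", "S")]
unique, conflicts = deduplicate(toy_records)
```

Keeping conflicting mutations separate, instead of silently picking a label, leaves room for the kind of frequency and clinical review the authors describe for ambiguous cases.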

PncA is a small protein consisting of 186 amino acids. An experimental crystal structure of the drug (PZA) bound to the enzyme (PncA) was unavailable. Therefore, PZA was docked ab initio into the experimental crystal structure of the holo-wild-type PncA protein (PDB ID: 3PL1²⁹). The docked structure revealed that PZA formed key interactions within the protein's active site, which includes the catalytic triad (Asp8, Lys96, and Cys138), substrate-binding residues (Trp68 and Phe13), and the iron center (Asp49, His51, His57, and Fe²⁺). Analysis of the molecular interactions with Arpeggio³⁰ highlighted a strong network of polar and π-interactions between PZA and PncA (Fig. 2D).

Structural, biophysical and evolutionary consequences of PncA mutations

Looking at the SNAP2³¹ and PROVEAN³² scores, which consider evolutionary information to predict functionally important nonsynonymous mutations, we observed that resistant mutations were always associated with deleterious scores, while susceptible mutations were scored neutral (Table S1; Fig. 3). This suggests that although mutations were spread throughout the protein, mutations associated with resistance had a stronger effect on the structure and function of the protein.

The wild-type environment also provided information to differentiate between resistant and susceptible mutations, which included relative solvent accessibility (RSA), residue depth and secondary structure of the wild-type residue (Table S1; Fig. 3). This showed that resistant mutations tended to be found at buried residues that were less solvent exposed (average RSA of 0.18 for resistant mutations compared to 0.39 for susceptible; average residue depth of 1.09 Å for resistant mutations compared to 0.75 Å for susceptible; Table S1). These values were consistent with susceptible mutations being in regions that have milder effects on protein stability and activity than the resistance mutations.

The impact of the resistant and susceptible mutations on protein folding, stability and conformation was assessed using biophysical tools that rely on graph-based signatures to calculate the change in Gibbs free energy, such as mCSM-Stability³³, DUET³⁴ and DynaMut³⁵. The effect of the mutations on the binding affinity for PZA was assessed using mCSM-Lig³⁶. We observed that resistant mutations led to large decreases in PncA stability and conformational flexibility, while susceptible mutations were associated with milder changes (Table S1; Fig. 3). This is consistent with what we have observed previously for non-essential and drug activating proteins³⁷. While resistant mutations tended to be located closer to the PZA binding site (average < 10 Å from the PZA; Fig. 3), we did not see a significant difference in the distribution of the effects of resistant and susceptible mutations on PZA binding affinity (Table S1, Fig. S2), likely due to the importance of other molecular effects leading to resistance.

Machine learning to predict PZA resistance

Building on this structural and sequence-based analysis, we tested whether the information generated from these features could be used to train a supervised machine learning algorithm capable of accurately predicting resistant mutations in PncA. We grouped our features into five distinct categories: stability, dynamics, evolutionary conservation, ligand interactions and backbone geometry (structural environment). The performance of predictive models trained on each class of feature was evaluated separately to explore the contribution of each class to the predictive model (Table S2; Fig. S2). We confirmed that the individual categories of features did not, on their own, yield a reliable predictive model, but in combination, models trained using the Random Forest algorithm with 10-fold cross-validation yielded a more balanced and accurate performance, highlighting the synergistic effect of these features. The final model correctly classified 80.1% and 72.3% of mutations in the training and blind datasets, respectively (Fig. 4; Table 1). The comparative performance across iterative non-redundant blind datasets suggested that the model was not overfitted.
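
The 10-fold cross-validation split mentioned above can be sketched as below. This is a standard-library illustration, not the Weka procedure used in the study: the function name `k_fold_indices` is hypothetical, samples are assigned to folds round-robin without shuffling or class stratification, and the sample count of 100 is arbitrary.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_indices, test_indices) for each of k folds.

    Assumption: simple round-robin assignment; this sketch performs no
    shuffling and no class stratification.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    folds = [list(range(start, n_samples, k)) for start in range(k)]
    for held_out, test in enumerate(folds):
        train = [idx for i, fold in enumerate(folds) if i != held_out for idx in fold]
        yield train, test


# Example with an arbitrary sample count: every sample is held out exactly once.
splits = list(k_fold_indices(100, k=10))
```

Each sample appears in exactly one test fold, so performance averaged over the folds uses every mutation for evaluation once while never testing on data the corresponding model was trained on.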

Table 1 Evaluation metrics across the train and blind test datasets.

Analysis of our model revealed that PncA-resistant mutations were associated with large changes in protein folding and stability (mCSM-Stability scores < −0.9 kcal/mol; p < 0.0001, Welch Two Sample t-test) and conformational flexibility (DynaMut score < 0.78 kcal/mol; p < 0.0001, Welch Two Sample t-test) or located in close proximity to the catalytic triad and substrate-binding site (<10.8 Å; p < 0.0001, Welch Two Sample t-test). In contrast, susceptible mutations had a relative B-factor value of ≥3.19 (p < 0.0001, Welch Two Sample t-test), residue depth of ≥0.9 (p < 0.0001, Welch Two Sample t-test), distance from PZA greater than 11.9 Å and mild effects on protein stability (SDM scores ≥ 2.68 kcal/mol; p < 0.0001, Welch Two Sample t-test).
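
For readers unfamiliar with the Welch two-sample t-test cited above, the statistic and its approximate degrees of freedom can be computed as in the sketch below. The function name `welch_t` and the toy samples are hypothetical; the per-mutation feature values from the study are not reproduced here, and converting t to a p-value would additionally require the t-distribution, which is omitted.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t, df) for Welch's unequal-variance two-sample t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each sample needs at least two values")
    # Squared standard error contributed by each sample.
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    if se2_a + se2_b == 0:
        raise ValueError("both samples have zero variance")
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# Toy feature values for two groups; illustration only.
t_stat, dof = welch_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
```

Unlike the pooled-variance t-test, this form does not assume the resistant and susceptible groups share a variance, which is why the degrees of freedom are generally non-integer.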

Validation using Clinical Datasets

We next validated our model using variants reported in the recently published CRyPTIC dataset²⁴. 355 pncA nsSNVs associated with PZA resistance were reported, of which 75 were not present in our training dataset. Our model correctly classified 79.2% of the mutations across the whole dataset (355 mutations), and 72.0% of those non-redundant in amino acid position with the training data (75 mutations). The positive predictive value was 94.7% (95% CI [92.5% to 96.2%]).
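
A positive predictive value and an accompanying 95% confidence interval can be computed along the lines of the sketch below. The text does not state which interval method was used, so the Wilson score interval here is an assumption, and the function names and counts are hypothetical rather than the confusion-matrix values behind the reported figures.

```python
import math


def positive_predictive_value(true_positives, false_positives):
    """PPV = TP / (TP + FP)."""
    predicted_positive = true_positives + false_positives
    if predicted_positive == 0:
        raise ValueError("no positive predictions")
    return true_positives / predicted_positive


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for ~95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z ** 2 / total
    centre = (p + z ** 2 / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z ** 2 / (4 * total ** 2)) / denom
    return centre - half_width, centre + half_width


# Hypothetical counts: 90 correct resistant calls out of 100 resistant calls.
ppv = positive_predictive_value(90, 10)
low, high = wilson_interval(90, 100)
```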

We also validated our empirical classifier using the dataset reported by Miotto et al.²⁵, which contained 98 nsSNVs graded by the confidence of their association with phenotypic drug resistance. 44 out of the 98 nsSNVs reported in the paper were not present in our training dataset. We accurately predicted the drug susceptibility of 84.8% of the polymorphisms across the whole dataset (98 mutations), with an accuracy of 79.5% for those mutations not included in the training data (44 mutations). The positive predictive value was 95.4% (95% CI [92.1% to 97.3%]). Mutations such as Q10P (21 cases reported), W68G (16 cases reported) and I133T (17 cases reported), which have a 0.98 probability of association with a resistant phenotype²² and were categorized as high, moderate and minimal confidence for association with resistance, respectively²⁵, were all classified as resistant by our predictive model, highlighting the sensitivity of the prediction.

Mutations reported by Miotto et al.²⁵ under the “no association with resistance” category, including I31T, L35R and T47A, were predicted as resistant, and I6L as susceptible. This is consistent with the available experimental data^{24,28}, highlighting the advantage, accuracy and versatility of our approach. A closer look at the different biophysical scores for the resistance-associated mutations revealed that they had large predicted destabilizing values for protein conformational flexibility (I31T, −2.49 kcal/mol) and stability (I31T, −3.46 kcal/mol), and one was located very close to the catalytic triad (T47A, <6 Å).

Our predictive model was further validated against PZA DST screening (at 100 μg/ml) of clinical isolates from culture collections at Stellenbosch University, South Africa (865 isolates) and the Centers for Disease Control and Prevention (CDC), Atlanta, USA (185 isolates)³⁸. That study identified 49 isolates with a susceptible phenotype containing 8 nsSNVs. All nsSNVs with an MIC < 50 μg/ml were correctly classified by our model as susceptible (E37V, D110G, T114M). Whitfield and colleagues suggest that those isolates with an MIC > 50 μg/ml should be considered clinically resistant, of which our model classified three as resistant (A170V, V130A and L35R) and two as susceptible (V163A and V180I). Overall, our model had a 75% agreement with the DST results and a positive predictive value of 100%.

Application within a Clinical Setting

In a prospective genomic sequencing and DST analysis of over 600 Victorian clinical TB isolates, 7 pncA variants were detected in 11 isolates phenotypically resistant to PZA, none of which were present in our training dataset. Our model correctly classified five out of seven variants as resistant (71.4% accuracy). The remaining two mutations, G108V and Q10H, which were susceptible according to the DST results, were predicted to confer resistance, consistent with other experimental findings^{24,25,28}. Both variants had an SNV frequency of <0.5, which is known to affect the reliability of the DST results. This highlights the potential clinical power of our model.

Expanding our analysis, four additional pncA mutations (S104R, V128G, Y95R and E15A) were identified in Victorian clinical TB isolates lacking DST results. Both S104R and V128G were predicted as resistant by our model, consistent with previously reported DST results^{24,25,26,27,28}. The remaining two mutations, Y95R and E15A, have not been reported previously. Our model predicts that both mutations are associated with susceptibility to PZA.

SUSPECT-PZA webserver

We have developed a user-friendly, freely available web server SUSPECT-PZA (StrUctural Susceptibility PrEdiCTion on PZA), http://biosig.unimelb.edu.au/suspect_pza/, which is a database for all possible variants of PncA. There are two different input options (Fig. S2): the first is the “Single Mutation” option, which allows users to input one mutation for analysis. The basic format required by the server for this input option is that the mutation must be specified as a text string containing the wild-type residue one-letter amino acid code, its corresponding position on the structure and the mutant one-letter amino acid code. The second option is the “Mutation List”, which allows the user to upload a list of mutations, in the same format as above but in a file for batch processing (Fig. S3). Sample submission entries are available to assist users in submitting their mutations for analysis, and an additional help page is accessible via the top navigation bar.
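
The mutation string format described above (wild-type one-letter code, position, mutant one-letter code, for example Q10P) can be checked before submission with a small helper such as the one below. This is a client-side sketch only: the function names are hypothetical, the server's own validation rules are not documented in the text, and accepting lower-case input and rejecting identical wild-type and mutant residues are assumptions.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
PNCA_LENGTH = 186  # number of residues in PncA, as stated in the text
_MUTATION_RE = re.compile(rf"^([{AMINO_ACIDS}])(\d+)([{AMINO_ACIDS}])$")


def parse_mutation(text):
    """Parse a string such as 'Q10P' into (wild_type, position, mutant)."""
    match = _MUTATION_RE.match(text.strip().upper())
    if match is None:
        raise ValueError(f"not a valid mutation string: {text!r}")
    wild_type, position, mutant = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= position <= PNCA_LENGTH:
        raise ValueError(f"position {position} is outside 1..{PNCA_LENGTH}")
    if wild_type == mutant:
        raise ValueError("wild-type and mutant residues are identical")
    return wild_type, position, mutant


def parse_mutation_list(lines):
    """Parse one mutation per line, skipping blank lines (batch input)."""
    return [parse_mutation(line) for line in lines if line.strip()]


parsed = parse_mutation_list(["Q10P", "W68G", "", "I133T"])
```

This check does not verify that the stated wild-type residue matches the PncA sequence at that position, since the sequence is not included here.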

Figure 5 shows a snapshot of the output page for the “Single Mutation” option. The web server displays the prediction outcome (Resistant / Susceptible) along with details of the user input data, information on the wild-type residue environment and features used for prediction. In addition, there is an interactive 3D viewer, built using NGL³⁹, which allows analysis of non-covalent inter-residue interactions for the position specified in the input, calculated using Arpeggio³⁰ for both wild-type and mutant structures. The results for the “Mutation List” option are summarized in a downloadable table. Users can access details of each individual mutation as shown in Fig. S4. There is a 3D viewer at the bottom of the page in which the residues in the input list are colored according to the predicted effect (Fig. S5).

Discussion

PZA was discovered in 1948 in an in vivo screen of nicotinamide derivatives in a structure-activity relationship study⁴⁰ and was first used as an anti-tuberculosis drug in 1952. Until the 1970s, PZA was used as a second-line drug to treat TB; it was then found to have sterilizing activity and to reduce treatment duration in combination with isoniazid and rifampicin. Many studies have been conducted since then and, with continued use of the drug to treat TB, the incidence of resistance to it has increased. As PZA is an important first-line drug, accurate and rapid evaluation of PZA susceptibility is crucial for successful management of patients with either susceptible or drug-resistant TB. The existing molecular phenotypic tests are considered poorly reliable and expensive, and have long turnaround times. To address this, there is an urgent requirement to develop a rapid, reliable and affordable molecular PZA DST. As resistance mutations are spread along the full length of the PncA protein, developing such a method is challenging. In this study, we establish a novel computational methodology to better understand the structural and functional consequences of drug resistance mutations by exploiting the protein’s 3D structure. Using a supervised machine learning algorithm, we developed an empirical tool to identify novel drug resistance mutations in PncA, together with a database that has information on all possible variants of PncA.

The primary focus of our work is on missense non-synonymous mutations, as these typically have more subtle molecular effects that can be harder to predict than those of in-frame and frameshift indel mutations, which have a much larger deleterious effect on PncA structure and function and are all classed as high-confidence resistant mutations. The structure-based tools implement the concept of graph-based signatures to predict the effect of single point mutations on protein stability. To assess changes in conformational flexibility, graph-based signatures were integrated with normal mode analysis to predict the impact on the protein structure. Scores for these features, calculated as the change in Gibbs free energy (ΔΔG), provided important molecular information on resistant mutations, signifying larger effects on protein folding and dynamics and minimal effect on PZA binding affinity. Interpreting the results, we observed that resistance mutations affected protein activity and function through destabilization of the protein structure and conformation. This is also consistent with earlier findings that resistant isolates were not associated with a loss of bacterial fitness⁴¹, because PncA is involved in the nicotinamide recycling pathway rather than in its synthesis. These structural insights have been used to guide clinical decisions for novel PZA mutations²³.

Phenotypic DST, the current “gold standard”, which encompasses methods like Wayne and Bactec MGIT 960, suffers from poor reproducibility. Discrepancies among the results lead to considerable doubt over the clinical significance of the method. Next-generation sequencing based diagnostics can be an alternative, offering innovative tools to reduce false detection of PZA resistance and enabling fast and accurate detection of drug resistance by molecular DST⁴². In the past few years, researchers have used different techniques to develop a better and more consistent methodology to detect PZA resistance. Stoffels et al.⁴¹ conducted an elaborate study on a 14-year complete capture of clinical isolates, in which they found the frequency of spontaneously acquired resistance to be 10⁻⁵ bacilli in vitro. The 2014 work of Miotto et al. generated the minimum dataset of mutations that should be included in any molecular test for PZA, paving the way for predicting PZA resistance using new genome-based technologies²². This was followed by the comprehensive web-based dataset of Farhat et al. in 2016⁴³. Though all these approaches were a step up from the existing phenotypic DST, they do not provide information on novel variants. The advantage of our database is that it provides information on all possible variants of PncA. These data provide a basis for use as part of any molecular DST, needed for the valid interpretation of data generated by massive sequencing approaches.

Interestingly, comparing performance of SUSPECT-PZA across datasets used to train earlier methods, we observed that the weakest performance was across variants classified as susceptible. However, many of these mutations have been observed in clinically resistant isolates. Our biophysical analysis and SUSPECT-PZA predictions would be consistent with these mutations potentially being misclassified previously.

We also compared our empirical model’s output to the “revised DST” of Miotto et al.²², where they accounted for enzymatic activity and structural analysis to adjust for possible errors in phenotypic DST. There were 178 missense mutations listed, of which 162 were labelled resistant (R) and 17 were labelled susceptible (S). Our model predicted 88.9% (144/162) of the resistant mutations and 58.8% (10/17) of the susceptible mutations accurately. The positive predictive value was 95.4% (95% CI [92.1% to 97.3%]). The primary divergence from the Miotto classifications was in predicting susceptible mutations. This is likely due to discrepancies in phenotypic and molecular DST results from different laboratory setups¹⁶. For example, mutations reported as susceptible in the “revised DST”, such as L159V, F81S, A102V, T135S, T168I and A46V, were unanimously reported as resistant in other studies^{24,26,27,28}. Our predictive tool also predicts them to be resistant, which supports it as a reliable, reproducible, free and fast alternative to the existing gold standard methods.
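
The per-class percentages quoted above follow directly from the stated counts, as the short check below shows. The helper name `percent_correct` is hypothetical; only the counts 144/162 and 10/17 come from the text.

```python
def percent_correct(correct, total):
    """Percentage of correctly predicted mutations, rounded to one decimal."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * correct / total, 1)


# Counts as stated in the text for the "revised DST" comparison.
resistant_pct = percent_correct(144, 162)
susceptible_pct = percent_correct(10, 17)
```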

This study highlights the power of using computational prediction of the structural consequences of variants in PncA to identify likely resistance mutations for pyrazinamide, a critically important first-line drug in the treatment of tuberculosis. This approach, however, is not limited to pncA and has been developed for application to other antimicrobial agents like bedaquiline⁴⁴, a last-line resort to treat multi-drug and extensively drug-resistant TB. A major advantage of our tool is that it was built using a very well-balanced dataset. In the case of mutations reported as both susceptible and resistant in the same or different datasets, we looked for frequency of occurrence and clinical information. We have extensively evaluated the method through both cross-validation and independent non-redundant blind tests, which provide a measure of a method’s applicability and robustness. Across all test sets the method performed equally well, providing strong confidence in the approach. As with all machine learning approaches, the availability of more phenotypic and clinical data will enable the development and validation of stronger approaches. This will be an iterative approach moving forward. The other route to improving our predictive model is through the inclusion of new features or parameters. We have shown previously that this approach can even capture strain-dependent variations in resistance patterns²³. While we did not have the data available to build into our current model, we next aim to integrate lineage-specific information, which will enable more refined and personalized predictions. This comprehensive web server can be used in clinical settings as an improved diagnostic tool to help realize the power of whole genome sequencing diagnostic approaches.

Methods

Data set

A list of 610 nonsynonymous single-nucleotide mutations (nsSNVs) of pncA was obtained from the GMTV (Genome-wide Mycobacterium tuberculosis Variation) Database Project²⁶, Tuberculosis Drug Resistance Mutation Database²⁷, and saturation mutagenesis²⁸. The clinical validation datasets used in the paper were from CRyPTIC²⁴ and Miotto et al.²⁵.

Modelling the biophysical consequences of missense mutations

We have developed a comprehensive in silico mutational analysis platform that uses graph-based signatures to represent the 3D structure of a protein and quantitatively predict the molecular consequences of point mutations on protein structure, function and interactions^{30,33,34,35,36,45}. This has been used to characterize and preemptively identify likely resistance mutations in drug targets^{23,37,46,47,48,49,50,51,52,53,54}. Using these tools, we assessed the molecular consequences of each mutation on the structure of PncA and drug activation.

The experimental crystal structure of holo-wild-type PncA (PDB ID: 3PL1)²⁹ was minimized in Prime, and PZA docked into the active site using Glide (Schrödinger Suite). The effects of mutations on PncA folding and stability were assessed using SDM⁵⁵, mCSM-Stability³³ and DUET³⁴, and their effects on protein flexibility and conformation were predicted using normal mode analysis by DynaMut³⁵. The effect of the changes on the binding affinity of PZA towards PncA was predicted using mCSM-Lig^{36,56}. These approaches are novel machine-learning algorithms. We also included structural information of the wild-type residue, including relative solvent accessibility, residue depth, secondary structure and dihedral angles of the PncA chain φ (phi) and ψ (psi). Additionally, SNAP2³¹ and PROVEAN³² were used to provide additional evolutionary information. Moreover, the scores calculated for the various structural and sequence-based features are independent of pH and temperature.

Machine learning

Here we used the Random Forest binary classifier implemented in the Weka toolkit⁵⁷ to train our predictive models. Random Forest is a robust ensemble-learning classification algorithm, in which multiple decision trees are each built over a random subset of features and the output is decided by majority voting. The model was trained using 10-fold cross-validation and performance evaluated by area under the Receiver Operating Characteristic (AUROC) curve, precision and accuracy. Further validation of the models was performed using a blind-test set of 184 mutations, which were non-redundant at the position-level with mutations in the training set. Analysis of the final model revealed a set of structural features that distinguished between susceptible and resistant pncA point mutations.
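
The majority-voting step and two of the evaluation metrics named above (accuracy and precision) can be illustrated with the sketch below. It does not train any decision trees and is not the Weka implementation; the function names, the "R"/"S" labels and the toy votes are hypothetical, and ties in the vote are resolved in favour of the label seen first.

```python
from collections import Counter


def majority_vote(tree_predictions):
    """Return the label predicted by the largest number of trees."""
    if not tree_predictions:
        raise ValueError("no tree predictions supplied")
    return Counter(tree_predictions).most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred, positive="R"):
    """Fraction of samples predicted positive that are truly positive."""
    hits = [t == positive for t, p in zip(y_true, y_pred) if p == positive]
    return sum(hits) / len(hits) if hits else 0.0


# Toy example: three hypothetical trees voting on four mutations.
votes_per_sample = [["R", "R", "S"], ["S", "S", "R"], ["R", "S", "R"], ["R", "R", "R"]]
y_true = ["R", "S", "S", "R"]
y_pred = [majority_vote(votes) for votes in votes_per_sample]
```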

Webserver development

The server front-end was built using the Materialize CSS framework (version 1.0.0), while the back-end was built in Python via the Flask framework (version 0.12.2). It is hosted on a Linux server running Apache.

Sequencing and DST of clinical isolates

Genomic DNA was extracted according to the mechanical cell disruption and ethanol precipitation method outlined in Votintseva 2015⁵⁸ with slight modifications. Briefly, no pre-treatment was used and approximately 3 × 1 µL loops of culture were dispersed in 700 µL TE buffer (Sigma Aldrich) as the starting material. The precipitated DNA pellet was washed only once and resuspended in 50 µL EB Buffer (Qiagen) at 55 °C for 10 minutes with regular vortexing. Finally, samples were centrifuged for 3 min at 13,000 rpm and 45 µL of DNA extract was transferred into a clean tube for downstream processing. Each extract was interrogated for Mycobacterium tuberculosis viability by inoculating 15 µL of DNA extract into an MGIT tube (Becton Dickinson, UK), which was incubated in the Bactec MGIT 960 system (Becton Dickinson, UK). Unique dual indexed libraries were prepared using the Nextera XT DNA sample preparation kit (Illumina). Libraries were sequenced on the Illumina NextSeq 500 with 150-cycle paired end chemistry as described by the manufacturer’s protocols.

Sequences were aligned to H37Rv (NC_000962.3) and single nucleotide variants (SNVs) in pncA were identified using LoFreq (http://csb5.github.io/lofreq/). SNVs with a frequency > 0.6 were used to compare the genotype of isolates to the phenotype observed using standard laboratory methods for PZA susceptibility testing.
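
The frequency cut-off described above can be applied with a simple filter such as the one below. The record layout (a dict with `mutation` and `frequency` keys) and the function name `filter_snvs` are hypothetical and do not represent LoFreq's output format; the toy calls are for illustration only.

```python
def filter_snvs(calls, min_frequency=0.6):
    """Keep SNV calls whose frequency is strictly above the threshold."""
    return [call for call in calls if call["frequency"] > min_frequency]


# Toy calls for illustration; not taken from the sequenced isolates.
toy_calls = [
    {"mutation": "A1C", "frequency": 0.95},
    {"mutation": "D2E", "frequency": 0.60},
    {"mutation": "F3G", "frequency": 0.35},
]
kept = filter_snvs(toy_calls)
```

The comparison is strict, matching the "> 0.6" wording in the text, so a call at exactly 0.60 is excluded.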

References

WHO. Global Tuberculosis Report, Executive Summary, 2018. https://www.who.int/tb/publications/global_report/tb18_ExecSum_web_4Oct18.pdf?ua=1 (2018).
Heifets, L. & Lindholm-Levy, P. Pyrazinamide sterilizing activity in vitro against semidormant Mycobacterium tuberculosis bacterial populations. The American review of respiratory disease 145, 1223–1225, https://doi.org/10.1164/ajrccm/145.5.1223 (1992).
Tarshis, M. S. & Weed, W. A. Jr. Lack of significant in vitro sensitivity of Mycobacterium tuberculosis to pyrazinamide on three different solid media. American review of tuberculosis 67, 391–395 (1953).
Dawson, R. et al. Efficiency and safety of the combination of moxifloxacin, pretomanid (PA-824), and pyrazinamide during the first 8 weeks of antituberculosis treatment: a phase 2b, open-label, partly randomised trial in patients with drug-susceptible or drug-resistant pulmonary tuberculosis. Lancet (London, England) 385, 1738–1747, https://doi.org/10.1016/s0140-6736(14)62002-x (2015).
Veziris, N. et al. A once-weekly R207910-containing regimen exceeds activity of the standard daily regimen in murine tuberculosis. American journal of respiratory and critical care medicine 179, 75–79, https://doi.org/10.1164/rccm.200711-1736OC (2009).
Juma, S. P. et al. Underestimated pyrazinamide resistance may compromise outcomes of pyrazinamide containing regimens for treatment of drug susceptible and multi-drug-resistant tuberculosis in Tanzania. BMC infectious diseases 19, 129, https://doi.org/10.1186/s12879-019-3757-1 (2019).
Chang, K. C., Yew, W. W. & Zhang, Y. Pyrazinamide susceptibility testing in Mycobacterium tuberculosis: a systematic review with meta-analyses. Antimicrobial agents and chemotherapy 55, 4499–4505, https://doi.org/10.1128/aac.00630-11 (2011).
Chedore, P., Bertucci, L., Wolfe, J., Sharma, M. & Jamieson, F. Potential for erroneous results indicating resistance when using the Bactec MGIT 960 system for testing susceptibility of Mycobacterium tuberculosis to pyrazinamide. Journal of clinical microbiology 48, 300–301, https://doi.org/10.1128/jcm.01775-09 (2010).
Hewlett, D. Jr., Horn, D. L. & Alfalla, C. Drug-resistant tuberculosis: inconsistent results of pyrazinamide susceptibility testing. Jama 273, 916–917 (1995).
Hoffner, S. et al. Proficiency of drug susceptibility testing of Mycobacterium tuberculosis against pyrazinamide: the Swedish experience. The international journal of tuberculosis and lung disease: the official journal of the International Union against Tuberculosis and Lung Disease 17, 1486–1490, https://doi.org/10.5588/ijtld.13.0195 (2013).
Miller, M. A., Thibert, L., Desjardins, F., Siddiqi, S. H. & Dascal, A. Testing of susceptibility of Mycobacterium tuberculosis to pyrazinamide: comparison of Bactec method with pyrazinamidase assay. Journal of clinical microbiology 33, 2468–2470 (1995).
Pandey, S., Newton, S., Upton, A., Roberts, S. & Drinkovic, D. Characterisation of pncA mutations in clinical Mycobacterium tuberculosis isolates in New Zealand. Pathology 41, 582–584 (2009).
Simons, S. O. et al. Validation of pncA gene sequencing in combination with the mycobacterial growth indicator tube method to test susceptibility of Mycobacterium tuberculosis to pyrazinamide. Journal of clinical microbiology 50, 428–434, https://doi.org/10.1128/jcm.05435-11 (2012).
Scorpio, A. & Zhang, Y. Mutations in pncA, a gene encoding pyrazinamidase/nicotinamidase, cause resistance to the antituberculous drug pyrazinamide in tubercle bacillus. Nature medicine 2, 662–667 (1996).
Konno, K., Feldmann, F. M. & McDermott, W. Pyrazinamide susceptibility and amidase activity of tubercle bacilli. The American review of respiratory disease 95, 461–469, https://doi.org/10.1164/arrd.1967.95.3.461 (1967).
Zhang, Y., Wade, M. M., Scorpio, A., Zhang, H. & Sun, Z. Mode of action of pyrazinamide: disruption of Mycobacterium tuberculosis membrane transport and energetics by pyrazinoic acid. The Journal of antimicrobial chemotherapy 52, 790–795, https://doi.org/10.1093/jac/dkg446 (2003).
Shi, W. et al. Pyrazinamide inhibits trans-translation in Mycobacterium tuberculosis. Science (New York, N.Y.) 333, 1630–1632, https://doi.org/10.1126/science.1208813 (2011).
Shi, W. et al. Aspartate decarboxylase (PanD) as a new target of pyrazinamide in Mycobacterium tuberculosis. Emerging microbes & infections 3, e58, https://doi.org/10.1038/emi.2014.61 (2014).
Yee, M., Gopal, P. & Dick, T. Missense Mutations in the Unfoldase ClpC1 of the Caseinolytic Protease Complex Are Associated with Pyrazinamide Resistance in Mycobacterium tuberculosis. Antimicrobial agents and chemotherapy 61, https://doi.org/10.1128/aac.02342-16 (2017).
Zhang, Y., Zhang, J., Cui, P., Zhang, Y. & Zhang, W. Identification of Novel Efflux Proteins Rv0191, Rv3756c, Rv3008, and Rv1667c Involved in Pyrazinamide Resistance in Mycobacterium tuberculosis. Antimicrobial agents and chemotherapy 61, https://doi.org/10.1128/aac.00940-17 (2017).
Hirano, K., Takahashi, M., Kazumi, Y., Fukasawa, Y. & Abe, C. Mutation in pncA is a major mechanism of pyrazinamide resistance in Mycobacterium tuberculosis. Tubercle and lung disease: the official journal of the International Union against Tuberculosis and Lung Disease 78, 117–122 (1997).
Miotto, P. et al. Mycobacterium tuberculosis pyrazinamide resistance determinants: a multicenter study. mBio 5, e01819-14, https://doi.org/10.1128/mBio.01819-14 (2014).
Karmakar, M. et al. Analysis of a Novel pncA Mutation for Susceptibility to Pyrazinamide Therapy. American journal of respiratory and critical care medicine 198, 541–544, https://doi.org/10.1164/rccm.201712-2572LE (2018).
Allix-Beguec, C. et al. Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing. The New England journal of medicine 379, 1403–1415, https://doi.org/10.1056/NEJMoa1800474 (2018).
Miotto, P. et al. A standardised method for interpreting the association between mutations and phenotypic drug resistance in Mycobacterium tuberculosis. The European respiratory journal 50, https://doi.org/10.1183/13993003.01354-2017 (2017).
Chernyaeva, E. N. et al. Genome-wide Mycobacterium tuberculosis variation (GMTV) database: a new tool for integrating sequence variations and epidemiology. BMC genomics 15, 308, https://doi.org/10.1186/1471-2164-15-308 (2014).
Sandgren, A. et al. Tuberculosis drug resistance mutation database. PLoS medicine 6, e2, https://doi.org/10.1371/journal.pmed.1000002 (2009).
Yadon, A. N. et al. A comprehensive characterization of PncA polymorphisms that confer resistance to pyrazinamide. Nature communications 8, 588, https://doi.org/10.1038/s41467-017-00721-2 (2017).
Petrella, S. et al. Crystal structure of the pyrazinamidase of Mycobacterium tuberculosis: insights into natural and acquired resistance to pyrazinamide. PloS one 6, e15785, https://doi.org/10.1371/journal.pone.0015785 (2011).
Jubb, H. C. et al. Arpeggio: A Web Server for Calculating and Visualising Interatomic Interactions in Protein Structures. Journal of molecular biology 429, 365–371, https://doi.org/10.1016/j.jmb.2016.12.004 (2017).
Hecht, M., Bromberg, Y. & Rost, B. Better prediction of functional effects for sequence variants. BMC genomics 16(Suppl 8), S1, https://doi.org/10.1186/1471-2164-16-s8-s1 (2015).
Choi, Y., Sims, G. E., Murphy, S., Miller, J. R. & Chan, A. P. Predicting the functional effect of amino acid substitutions and indels. PloS one 7, e46688, https://doi.org/10.1371/journal.pone.0046688 (2012).
Pires, D. E., Ascher, D. B. & Blundell, T. L. mCSM: predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics (Oxford, England) 30, 335–342, https://doi.org/10.1093/bioinformatics/btt691 (2014).
Pires, D. E., Ascher, D. B. & Blundell, T. L. DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic acids research 42, W314–319, https://doi.org/10.1093/nar/gku411 (2014).
Rodrigues, C. H., Pires, D. E. & Ascher, D. B. DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic acids research 46, W350–W355, https://doi.org/10.1093/nar/gky300 (2018).
Pires, D. E., Blundell, T. L. & Ascher, D. B. mCSM-lig: quantifying the effects of mutations on protein-small molecule affinity in genetic disease and emergence of drug resistance. Scientific reports 6, 29575, https://doi.org/10.1038/srep29575 (2016).
Portelli, S., Phelan, J. E., Ascher, D. B., Clark, T. G. & Furnham, N. Understanding molecular consequences of putative drug resistant mutations in Mycobacterium tuberculosis. Scientific reports 8, 15356, https://doi.org/10.1038/s41598-018-33370-6 (2018).
Whitfield, M. G. et al. Mycobacterium tuberculosis pncA Polymorphisms That Do Not Confer Pyrazinamide Resistance at a Breakpoint Concentration of 100 Micrograms per Milliliter in MGIT. Journal of clinical microbiology 53, 3633–3635, https://doi.org/10.1128/jcm.01001-15 (2015).
Rose, A. S. et al. NGL viewer: web-based molecular graphics for large complexes. Bioinformatics (Oxford, England) 34, 3755–3758, https://doi.org/10.1093/bioinformatics/bty419 (2018).
Kushner, S. et al. Experimental chemotherapy of tuberculosis; substituted nicotinamides. The Journal of organic chemistry 13, 834–836, https://doi.org/10.1021/jo01164a008 (1948).
Stoffels, K., Mathys, V., Fauville-Dufaux, M., Wintjens, R. & Bifani, P. Systematic analysis of pyrazinamide-resistant spontaneous mutants and clinical isolates of Mycobacterium tuberculosis. Antimicrobial agents and chemotherapy 56, 5186–5193, https://doi.org/10.1128/aac.05385-11 (2012).
Koser, C. U. et al. Routine use of microbial whole genome sequencing in diagnostic and public health microbiology. PLoS pathogens 8, e1002824, https://doi.org/10.1371/journal.ppat.1002824 (2012).
Farhat, M. R. et al. Genetic Determinants of Drug Resistance in Mycobacterium tuberculosis and Their Diagnostic Value. American journal of respiratory and critical care medicine 194, 621–630, https://doi.org/10.1164/rccm.201510-2091OC (2016).
Karmakar, M. et al. Empirical ways to identify novel Bedaquiline resistance mutations in AtpE. PloS one 14, e0217169, https://doi.org/10.1371/journal.pone.0217169 (2019).
Pires, D. E., Chen, J., Blundell, T. L. & Ascher, D. B. In silico functional dissection of saturation mutagenesis: Interpreting the relationship between phenotypes and changes in protein stability, interactions and activity. Scientific reports 6, 19848, https://doi.org/10.1038/srep19848 (2016).
Ascher, D. B. et al. Potent hepatitis C inhibitors bind directly to NS5A and reduce its affinity for RNA. Scientific reports 4, 4765, https://doi.org/10.1038/srep04765 (2014).
Kano, F. S. et al. The Presence, Persistence and Functional Properties of Plasmodium vivax Duffy Binding Protein II Antibodies Are Influenced by HLA Class II Allelic Variants. PLoS Negl. Trop. Dis. 10, e0005177, https://doi.org/10.1371/journal.pntd.0005177 (2016).
Phelan, J. et al. Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance. BMC Med. 14, 31, https://doi.org/10.1186/s12916-016-0575-9 (2016).
Silvino, A. C. et al. Variation in Human Cytochrome P-450 Drug-Metabolism Genes: A Gateway to the Understanding of Plasmodium vivax Relapses. PloS one 11, e0160172, https://doi.org/10.1371/journal.pone.0160172 (2016).
Albanaz, A. T. S., Rodrigues, C. H. M., Pires, D. E. V. & Ascher, D. B. Combating mutations in genetic disease and drug resistance: understanding molecular mechanisms to guide drug design. Expert Opin. Drug Discov 12, 553–563, https://doi.org/10.1080/17460441.2017.1322579 (2017).
Park, Y. et al. Essential but Not Vulnerable: Indazole Sulfonamides Targeting Inosine Monophosphate Dehydrogenase as Potential Leads against Mycobacterium tuberculosis. ACS infectious diseases 3, 18–33, https://doi.org/10.1021/acsinfecdis.6b00103 (2017).
Singh, V. et al. The Inosine Monophosphate Dehydrogenase, GuaB2, Is a Vulnerable New Bactericidal Drug Target for Tuberculosis. ACS infectious diseases 3, 5–17, https://doi.org/10.1021/acsinfecdis.6b00102 (2017).
Hawkey, J. et al. Evolution of carbapenem resistance in Acinetobacter baumannii during a prolonged infection. Microbial Genomics 4, https://doi.org/10.1099/mgen.0.000165 (2018).
Holt, K. E. et al. Frequent transmission of the Mycobacterium tuberculosis Beijing lineage and positive selection for the EsxW Beijing variant in Vietnam. Nat. Genet. 50, 849–856, https://doi.org/10.1038/s41588-018-0117-9 (2018).
Worth, C. L., Preissner, R. & Blundell, T. L. SDM–a server for predicting effects of mutations on protein stability and malfunction. Nucleic acids research 39, W215–222, https://doi.org/10.1093/nar/gkr363 (2011).
Pires, D. E. & Ascher, D. B. CSM-lig: a web server for assessing and comparing protein-small molecule affinities. Nucleic acids research 44, W557–561, https://doi.org/10.1093/nar/gkw390 (2016).
Hall, M. et al. The WEKA data mining software: an update. SIGKDD Explor. Newsl. 11, 10–18, https://doi.org/10.1145/1656274.1656278 (2009).
Votintseva, A. A. et al. Mycobacterial DNA extraction for whole-genome sequencing from early positive liquid (MGIT) cultures. Journal of clinical microbiology 53, 1137–1143, https://doi.org/10.1128/jcm.03073-14 (2015).

Acknowledgements

M.K. and C.H.M.R. were funded by the Melbourne Research Scholarship. Funding for genomic sequencing was provided by the Department of Health and Human Services, Victoria. D.B.A. was funded by a Newton Fund RCUK-CONFAP Grant awarded by The Medical Research Council (MRC) and Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG) (MR/M026302/1), the Jack Brockhoff Foundation (JBF 4186, 2016), and an Investigator Grant from the National Health and Medical Research Council (NHMRC) of Australia (GNT1174405). This work was supported in part by the Victorian Government’s OIS Program.

Author information

Authors and Affiliations

Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
Malancha Karmakar, Carlos H. M. Rodrigues & David B. Ascher
Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
Malancha Karmakar, Carlos H. M. Rodrigues & David B. Ascher
Victorian Tuberculosis Program, Melbourne Health and Department of Microbiology and Immunology, University of Melbourne, Melbourne, Victoria, Australia
Malancha Karmakar & Justin T. Denholm
Microbiological Diagnostic Unit Public Health Laboratory, University of Melbourne at The Peter Doherty Institute for Infection & Immunity, Melbourne, Victoria, Australia
Kristy Horan
Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
David B. Ascher

Authors

Malancha Karmakar
Carlos H. M. Rodrigues
Kristy Horan
Justin T. Denholm
David B. Ascher

Contributions

M.K. performed the analysis and, along with C.H.M.R., developed the analysis tool. K.H. and J.D. contributed to data collection and analysis. D.B.A. conceived, designed and supervised the project. All authors contributed to manuscript writing and editing.

Corresponding author

Correspondence to David B. Ascher.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Karmakar, M., Rodrigues, C.H.M., Horan, K. et al. Structure guided prediction of Pyrazinamide resistance mutations in pncA. Sci Rep 10, 1875 (2020). https://doi.org/10.1038/s41598-020-58635-x

Received: 11 July 2019
Accepted: 28 November 2019
Published: 05 February 2020
DOI: https://doi.org/10.1038/s41598-020-58635-x
