Sreenivas B. K. A, Imran S, Ravi L. Elucidation of Computational 3D Models of Protein Drug Targets for Colletotrichum Falcatum A Fungal Plant Pathogen Causing Red Rod of Sugarcane. Biomed Pharmacol J 2020;13(2).
Manuscript received on :15-03-2020
Manuscript accepted on :14-04-2020
Published online on: 27-04-2020
Plagiarism Check: Yes
Reviewed by: Yahya Wahba  orcid id- Biomedical and Pharmacology Journal https://publons.com/researcher/1412376/yasemin-saygideger/    
Second Review by: Yasemin Saygideger orcid id- Biomedical and Pharmacology Journal https://publons.com/researcher/1412376/yasemin-saygideger/  
Final Approval by: Dr Pallav Sengupta

How to Cite    |   Publication History
Views Views: (Visited 1,132 times, 1 visits today)   Downloads PDF Downloads: 483

Anirudh Sreenivas B. K, Sabia Imran and Lokesh Ravi*

Department of Botany, St.Joseph’s College (Autonomous), Bengaluru-27

Corresponding Author E-mail : Lokesh.ravi@sjc.ac.in

DOI : https://dx.doi.org/10.13005/bpj/1926

Abstract

Objective of this study is to develop 3D structures of the potential drug target proteins in the ascomycte plant pathogen Colletotrichum falcatum that causes ‘red rot disease’ a grave disease of sugarcane crop. This study uses online databases such as UniPort, DrugBank, PDB and PMDB to retrieve and submit biological information. Online webtools such as SwissModel, BLASTp, was used to construct homology models and find similarity, respectively. Total of 72 protein sequences were identified as potential drug targets and were retrieved form UniProt in .fasta file format.  Based on the available template model, total of 52 proteins were successfully modelled using Swiss-Model webtool. Among these 52 predicted models, 41 models were identified as significant based on Ramachandran plot analysis. The 41 predicted models were submitted to PMDB server for public access. This study has created a dataset of 3D homology models of drug target proteins that can greatly benefit in selective drug discovery and drug development against the investigated pathogen Colletotrichum falcatum. This can greatly benefit the pharmaceutical industry in developing agricultural antifungal favouring sugarcane cultivation.

Keywords

Colletotrichum Falcatum; Homology Modeling; PMDB; Ramachandran Plot; Red Rot of Sugarcane; Swiss-Model

Download this article as: 
Copy the following to cite this article:

Sreenivas B. K. A, Imran S, Ravi L. Elucidation of Computational 3D Models of Protein Drug Targets for Colletotrichum Falcatum A Fungal Plant Pathogen Causing Red Rod of Sugarcane. Biomed Pharmacol J 2020;13(2).

Copy the following to cite this URL:

Sreenivas B. K. A, Imran S, Ravi L. Elucidation of Computational 3D Models of Protein Drug Targets for Colletotrichum Falcatum A Fungal Plant Pathogen Causing Red Rod of Sugarcane. Biomed Pharmacol J 2020;13(2). Available from: https://bit.ly/2yIw070

Introduction

The Red-Rot Disease

Saccharum officinarium (sugarcane) is a monocotyledonous plant species that belongs to the family Graminae and is cultivated in most of the tropical and subtropical regions of the world [1]. It is the 2nd most valuable agro-industrial crop (cash-crop) in India next to cotton [2, 3]. It occupies around 4.2%(4.36million hectares) of the total area under cultivation and contributes 7.5% to the gross value of agricultural production of the country [1, 2]. Global contributors to sugarcane production include Brazil, India, China, Thailand, the US, and the UK. India is the 2nd largest producer of sugarcane in the world following  Brazil and is the largest consumer of sugarcane with an average consumption of 19 million MT a year[3].Sugarcane is economically very important as it stores a high concentration of sucrose in its stalk tissues, but several biotic and abiotic factors affects this sucrose yield. Approximately a hundred diseases have been reported globally, with 100 fungi, 10 bacteria,10 viruses, and 50 nematode species known to cause destructive diseases to sugarcane [3]. The most devastating disease of sugarcane is the red rot disease, caused by the fungi Colletotrichum falcatum.

Colletotrichum Falcatum

It is a facultative saprophyte which belongs to the subdivision Ascomycotina [3]. It causes huge loss of 18-31% in sugarcane production. According to the GOI study of 2017, various biotic stress factors, including red rot disease hampers the production of sugarcane in many important parts of the country [3][4]. The pathogen affects the economically valuable stalk by entering through the nodes [5]. Meteorological factors, along with soil pH and waterlogging play a major, but a compound role in aiding the infection mechanism [3, 6–9]. Susceptibility of the cultivar, along with the age of stalk tissues and time of infestation, determines the symptoms shown by the plant [7]. Infections decrease juice purity and deplete important micro and macronutrients like iron, copper, zinc, potassium, and phosphorous, but however, increase the content of calcium, and nitrogen to a large extent [10, 11]. Stalk weight is reduced by 29% furthermore, sugar retrieval capacity is reduced by up to 31%. Affected tissues cannot be commercially utilized because they lead to deterioration of the product [3, 11].

Homology Modeling

In this study, the 3D structures of proteins of Colletotrichum falcatum, are developed using homology modeling technique. These structures are necessary to design and develop drugs that could aim to stop the spread of this dreadful disease.Lack of protein structures has hindered the understanding of binding specificities of proteins and ligands, which are pre-requisites for drug design and development[12]. Well established and recognized databases and tools were employed in this study to obtain the computational structures of potential drug target proteins. These are very necessary to contain this dreadful disease and also reduce the monetary burdens incurred on the farmers due to this disease. Homology modelling was done using the pre-existing fasta sequences of the proteins and structure of the protein was predicted based on the available templates which were similar to the respective query sequence of the protein [13][14][15]. It is of utmost importance that the protein structures are available, which would otherwise hinder the understanding of binding specificities of the protein and ligand leading to low availability of drugs to stop the spread of this disease. The structures developed in this study could further be exploited to design and develop new and efficient drugs which would be a boon to the cultivators of this crop.

Materials and Methods

NCBI Database

This database was used to screen the amount of pre-existing information available on the chosen organism of interest. A simple search with the organism’s name revealed the amount of available data in different databases (https://www.ncbi.nlm.nih.gov/).

Sequence Retrieval

The UniProtknowledgebase (www.uniprot.org)-a centralised, reliable and publically accessable collection of non–reductant protein sequences [16], was used to retrieve the amino acid sequences required for this study. The organism’s name was used  as the search criteria. The best sequences were sorted based on the length of the amino acid chain and were downloaded in.fasta file format. Repetitive sequences were all omitted to prevent ambiguity, and downloaded sequences were all stored along with their respective accession IDs.

Sequence Alignment

The query sequences or amino acid sequences retrieved from UNIPROT were all compared using the BLASTp server (www.uniprot.org), an online tool that compares the query sequence with pre-existing protein sequences in the Protein Data Bank in order to obtain the percentage similarity [17][14][18]. The first BLAST was carried out to get similarity values with pre-existing sequences of the non-reductant database, followed by a second BLAST to get similarity values with the sequences present in the Protein Data Bank (www.rcsb.org). The similarity estimations thus obtained in percentages were noted down for further reference.

Structure Prediction

The 3D structure of the selected drug target proteins were predicted through homology modelling technique, using the online tool SWISS-MODEL (https://swissmodel.expasy.org/). This tool uses respective amino acid sequences of the protein as well as the templates available in the protein databank to predict the 3D structure of the protein[19][20].Sequences retrieved from UNIPROT were uploaded in .fasta file format and the predicted 3D models were retrieved in .pdb file format. The quality of the models developed was dependent on the availability and percentage similarity of the templates.

The models hence developed were analyzed using the Ramachandran plots. These are graphical plots that represent protein structures in terms of torsion angles andhence play a significant role in confirming the predicted structure’s accuracy [21]. The best models were downloaded in .pdb after the analysis was complete.

Model Analysis

Qualitative analysis of the reliability of the predicted model was performed via the Ramachandran plot provided by the SWISS-MODEL as an in-built feature. The degree angles of all the residues were expected to be within the Most-Favoured regions of the Ramachandran plot, that determines the quality of the predicted structure. The residues that are outside of this favoured region are considered to be unfavourable prediction or outliers. These outliers reduce the confidence score of the predicted model.

Model Submission

The 3D models thus predicted with good quality Ramachandran plot (confidence score) were then submitted to public database Protein Model Data Base [PMDB] (https://bioinformatics.cineca.it/PMDB/)  which is a resource that stores manually built protein models that are published in scientific literature [22]. All the models were uploaded in .pdb file formats and each entry was given a unique PMDBID for future reference.

Results and Discussions

Drug Target Selection

A bibliographical search of the organism Colletotrichum falcatum in NCBI website, showed that, there are no known protein structures of the organism, but there are about 600 protein sequences reported. Hence, this organism was ideal for construction of computational protein models. Preliminary screening of the 600 protein sequences, suggested that non-enzymatic protein components such as ribosomal subunits were predominantly reported. Hence, these ribosomal and non-enzymatic proteins were eliminated for the homology model development, resulting in a total of 125 protein sequences for further analysis. These 125 protein sequences were then analysed in the DrugBank database (www.drugbank.ca)  to confirm if they are previously reported as a possible drug targets. Among these 125 protein sequences, 72 sequences were identified as potential drug targets based on literature proof of mechanism of action of known antibiotics.

Building Homology Model

The selected 72 protein sequences were retrieved from the uniprot database (www.uniprot.org) and saved as fasta file format (.fasta) then subjected for BLASTp also known as Protein Blast (https://blast.ncbi.nlm.nih.gov/) to search within Protein Data Bank (www.rcsb.org) to identify the templates for homology modelling. Protein structures with more than 80% sequence similarity were chosen as ideal template. Only 12 sequences had a similarity of more than 80%. The other sequences had more than 1 template used for homology modelling, with less than 80% sequence similarity. All the 72 protein sequences with appropriate template models were subjected for homology model construction using the Swiss-Model web server tool (https://swissmodel.expasy.org/). Homology protein models were built for 52 sequences successfully.

Ramachandran Plot Validation

The Swiss-Model tool, has developed a maximum of 5 different models for each of the query sequences. The best of the 5 models were selected based on the Ramachandran Plot analysis, that was integrated with the server. The ramachandran plot of all developed models were examined, and the model which has majority percentage of the residues within the most preferred regions of the graph were considered as the ideal / final model. The percentage of residues within the most favoured regions was also used as the confidence percentage score. The graphical representation of the ramachandran plot of least preferred model and best preferred models with lowest and highest confidence score are shown in Figure.1(A) & Figure.1(B). The structure analysis of a constructed homology model for WRKY transcription factor 37 protein is represented graphically in Figure.2.

Figure 1: Ramachandran plot analysis of; [A]: Least preferred model with lowest confidence score of 79.3% (Coronative insensitive protein homolog-1); [B]: Most preferred model with highest confidence score of 100% (WRKY transcription factor 37) Figure 1: Ramachandran plot analysis of;

Click here to View Figure

 

Figure 2: Structural analysis of the most preferred homology model with 100% of the residues with the most favoured region (WRKY transcription factor 37) Figure 2: Structural analysis of the most preferred homology model with 100% 

Click here to View Figure

Submission to PMDB

A total of 52 homology models were developed in the Swiss-Model online tool. However, ramachandran plot analysis showed that only 41 protein models exhibited more than 90% residues within the most favoured regions. Hence, these 41 sequences were submitted in the Protein Model Data Base (https://bioinformatics.cineca.it/PMDB/) for public access and further research. A summary of all the 41 protein 3D models submitted to the PMDB database with their PMDB ID, along with drug target name, Uniprot ID and confidence score of the model are tabulated in Table.1.

Table 1: Summary of PMDB entries for Colletotrichum falcatum

Sl. No Drug Target Name UniProt ID Confident Score PMDB ID
1 Chitin synthase J7HAD5 91.89 PM0082471
2 Beta-tubulin(142) Q8J1X0 95.16 PM0082472
3 Elongation factor 2 A0A1B3B2L9 91.43 PM0082473
4 Dirigent protein A0A1B3B2N1 96.97 PM0082474
5 EPL1 protein(212) A0A1C9II66 98.25 PM0082310
6 NAC transcription factor C A0A059VE62 95.65 PM0082475
7 NAC transcription factor J A0A059VK69 91.49 PM0082476
8 NAC transcription factor M A0A059VPC0 92.68 PM0082477
9 Glycosyl hydrolase family 10 A0A1C9II18 98.68 PM0082478
10 WRKY transcription factor 37 A0A059VP87 100 PM0082479
11 Elongation factor 1 alpha A0A1B3B2J0 97.22 PM0082480
12 Translation elongation factor 1 alpha A0A1B3B2K3 94.74 PM0082481
13 BZIP transcription factor 25 A0A059VP32 99.64 PM0082482
14 BZIP transcription factor 9 A0A059VJB5 100 PM0082483
15 BZIP transcription factor 15 A0A059VP28 100 PM0082484
16 BZIP transcription factor 18 A0A059VK55 100 PM0082491
17 BZIP transcription factor 29 A0A059VE58 95.65 PM0082492
18 BZIP transcription factor 4 A0A059VE50 98 PM0082493
19 BZIP transcription factor 24 A0A059VP99 96.23 PM0082494
20 BZIP transcription factor 14 A0A059VP93 92.31 PM0082495
21 TLP transcription factor A A0A059VJE2 91.11 PM0082497
22 TLP transcription factor K A0A059VK79 98.96 PM0082498
23 TLP transcription factor M A0A059VJE8 93.62 PM0082499
24 TLP transcription factor L A0A059VE75 93.62 PM0082500
25 MYB transcription factor 83 A0A059VP36 93.27 PM0082501
26 MYB transcription factor 82 A0A059VPA7 96.2 PM0082502
27 MYB transcription factor 78 A0A059VJC5 90.54 PM0082503
28 Class VII chitinase A0A1B3B2M2 98.68 PM0082504
29 Chitinase B A0A1B3B2K5 95.89 PM0082505
30 RNA -dependent RNA polymerase A0A3Q9NNK8 92.64 PM0082309
31 Putative heat shock protein (90) A0A1B3B2M4 91.43 PM0082506
32 Respiratory burst oxidase-like protein H A0A1B3B2N7 98.44 PM0082507
33 NDR1/HIN1-like protein A0A1B3B2P4 100 PM0082508
34 Type I polyketide synthase C9WLC4 93.83 PM0082509
35 Respiratory burst oxidase-like protein B A0A1B3B2N2 95.45 PM0082510
36 Putative ethylene response sensor A0A1B3B2N0 100 PM0082511
37 Beta-1,3-glucanase D A0A1B3B2K9 97.06 PM0082512
38 EPL1-like protein A0A1C9II08 96 PM0082513
39 Salicylate hydroxylase 1 A0A1C9IIB1 95.45 PM0082514
40 Bys1 family protein A0A1C9II96 97.14 PM0082515
41 Respiratory burst oxidase-like protein C A0A1B3B2N8 97.25 PM0082516

Conclusion

The objective of this study was to prepare 3-D models of proteins present in Colletotrichum falcatum that is a major pathogen responsible for the red rot disease in sugarcane crops. The protein structures of this ascomycete were successfully developed and analyzed using the swiss-model workspace. A similar study by Divya et.al (2018) was performed on a less studied organism P.marinus an endoparasitic pathogen, and other Perkinsus spp. that are responsible for causing devastating losses in the cultivation of shellfish and mollusk species worldwide , where the authors developed and submitted 3D structures of drug target proteins of the organism based on homology modelling [23] .

These developed structures could be further exploited to either develop new antifungal drugs or to revise the pre-existing ones. This is an important aspect of computer-aided, in-silico drug designing approach which gained significant momentum in the recent years [24]. Furthermore active sites of the molecule can be found out aiding in docking studies that would help in studying the interactions between the protein and the ligand. A Study by Daisy et.al (2013) involving the in-silico drug designing approach for biotin protein ligase of  Mycobacterium tuberculosis, provides better insight on the significant role played by predicted protein structures in the field of drug design and development [25].

The structures can also prove to be useful for understanding the mechanism of infection of the disease. They can also be used to design a pathway for understanding the effect of inhibition of a few proteins that can act as good antifungal drug targets which may help design an antifungal compound that might be more effective as well as non-toxic unlike the traditional fungicides and insecticides that are proven to be harmful in the long run. This study provides a basic platform for future in-silico work on Colletotrichum falcatum specific drug design and development.

Acknowledgement

The authors thank the management of St. Joseph’s College (Autonomous), Bengaluru for supporting this research work.

Conflict of Interest

None declared. 

References

  1. Gawade DB, Pawar BH, Gawande SJ, Vasekar VC, Antagonistic Effect of Trichoderma Against Fusarium moniliformae the Causal of Sugarcane Wilt Plant Pathology Section , Division of Crop Protection ,. 12(9): 1236–1241 (2012)
  2. Jayashree J, Selvi A, Nair N V, Characterization of Resistance Gene Analog Polymorphisms in sugarcane cultivars with varying levels of red rot resistance. 1(July): 1191–1199 (2010)
  3. Sharma R, Tamta S, A Review on Red Rot : The ” Cancer ” of Sugarcane. Plant Pathol Microbiol, (2015)
  4. Varma PK, Kumar KVK, Suresh M, Kumar NR, Sekhar VC, Potentiality of Native Pseudomonas spp . in Promoting Sugarcane Seedling Growth and Red Rot ( Colletotrichum falcatum went ) Management. 7(02): 2855–2863 (2018)
  5. Viswanathan R, Ramasamy S, Viswanathan , R . and R . Samiyappan ( 2000 ). Red rot disease in sugarcane : Challenges and prospects . Madras Agricultural. (January): 549–559 (2000)
  6. Kumar A, Sharma SK, Management of red rot of sugarcane caused by colletotrichum falcatum went : A review. 7(3): 5029–5039 (2019)
  7. Patel P, Krishnamurthy R, Physiological profiling of Colletotrichum falcatum , the causal agent of Sugarcane Red rot disease Physiological profiling of Colletotrichum falcatum , the causal agent of Sugarcane Red. (October) (2017)
  8. Paswan S, Kumar M, Sattar A, Impact of Weather Factors on Development of Red Rot Disease of Sugarcane Agro-Ecosystem. 7(02): 8–12 (2018)
  9. Phytopathology I, Agnihotri VP, Current sugarcane disease scenario and management strategies.
  10. Gupta RN, Sah SB, Kumar S, Kumar A, Kishore C, Chand G, Impact of Red Rot Disease on Nutrient Status of Sugarcane. 44(7): 3533–3538 (2018)
  11. Kamat DN, Losses Due to Red Rot Pathogen in Cane Juice Quality. 7(02): 13–16 (2018)
  12. Name C, Name S, Cycle L, Pacific Pests and Pathogens – Fact Sheets Sugarcane red rot ( 221 ). (221)
  13. Cavasotto CN, Phatak SS, Homology modeling in drug discovery : current trends and applications. 14(July) (2009)
  14. Article R, Homology Modeling a Fast Tool for Drug Discovery : Current Perspectives. 1–17
  15. Scientific R, International Journal Of. 7(3) (2016)
  16. Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, et al., UniProt : the Universal Protein knowledgebase. 32 (2004)
  17. Mcginnis S, Madden TL, BLAST : at the core of a powerful and diverse set of sequence analysis tools. 32: 20–25 (2004)
  18. Manuscript A, NIH Public Access. 7(3): 217–227 (2007)
  19. Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, et al., SWISS-MODEL : modelling protein tertiary and quaternary structure using evolutionary information. 42(April): 252–258 (2014)
  20. Arnold K, Bordoli L, Schwede T, Structural bioinformatics The SWISS-MODEL workspace : a web-based environment for protein structure homology modelling. 22(2): 195–201 (2006)
  21. Kumar P, Arya A, Ramachandran Plot : A simplified approach. (December) (2018)
  22. Meo PDO De, Cozzetto D, The PMDB Protein Model Database. (January) (2006)
  23. Jindam D, Ravi L, Krishnan K, Construction of computational protein database by homology modeling for the aquatic pathogen Perkinsus marinus.
  24. Wadood A, Ahmed N, Shah L, Ahmad A, Hassan H, Shams S, In-silico drug design: An approach which revolutionarised the drug discovery process. OA Drug Des Deliv, 1(1): 3 (2013)
  25. Daisy P, Nivedha RP, Bakiya RH, In silico drug designing approach for biotin protein ligase of mycobacterium tuberculosis. Asian J Pharm Clin Res, 6(SUPPL.1): 103–107 (2013)
Share Button
(Visited 1,132 times, 1 visits today)

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.