Abstract
Indolecarboxamidotetrazole compounds are well known as potential anti allergic agents due to their mast cell stabilizing activity on human basophils. A quantitative structure activity relationship (QSAR) model has been generated using Multiple Linear regression (MLR) for the prediction of inhibition efficiency of indolecarboxamidotetrazole derivatives. Twenty-one compounds with their activities expressed as % inhibition (PI) are collected. Descriptors are generated using Chemistry Development Kit. Three models are built and the models are evaluated using multiple correlation coefficient (R) and residual standard deviation (s). Considering the quality and accuracy of the predicted models, model 1 is the best, because it predicts biological activity which is almost closed to that of experimental value. This model is externally validated. This built model can be used to calculate inhibition efficiency of natural mast cell stabilizers containing caroxamidotetrazoles as antiallergic chemical in future.
1 Introduction
Ethnopharmacology is the study of traditionally used drugs. Some approved drugs are identified by using ethanopharmacological data e.g. sodium salt of chromoglicic acid as mast cell stabilizer [1]. Mast cells are master players during type I hypersensitivity reaction in response to allergens. So, it acts as target site for the treatment of allergy. Mast cells are master players during type I hypersensitivity reaction in response to allergens. So, it acts as target site for the treatment of allergy [2]. These sensitized mast cells after exposure to allergen initiate different signaling pathways inside themselves. Finally, different mediators like histamine are released from mast cells after degranulation and acting on surrounding cells produce symptoms of allergy. Mast cell stabilizers are natural, semi-synthetic and synthetic compounds, which can prevent release of mediators from mast cells. But their mechanisms of actions remain unknown till now.
Disodium cromoglycate, a chromone complex, is first discovered as orally active antiallergic drugs for mast cell stabilizer. This most commonly prescribed antiallergic drug, acts by inhibiting the release of mediators (histamine, leukotrienes) from mast cell as a prophylactic drug [3, 4]. It probably interferes with the antigen-mediated calcium ion influx into mast cells, as it acts on Calcium-activated potassium channel subunit alpha-1 [5, 6, 7]. But this drug has other targets for binding in human like Protein S100-P [8]. It plays an important role in cellular calcium signaling, which may cause adverse effect on our body.
Mastocytosis is the adverse reaction, which is reported during administration of Cromolyn Sodium as shown in different drug databases like DrugbindingDB etc. So, the designs of new oral and natural mast cell stabilizers are urgently needed, which are cheaper and with longer half-life period.
Several natural constituents, which are obtained from different herbs, like Holy Basil (Ocimum Tenuiflorum) [9], Chamomile (Matricaria Recutita) [10], flavonoids in peppermint [11], rhizome of Ginger [12], polyphenols of apple [13], are identified as mast cell stabilizers. Different types of chemical compounds are also analyzed in earlier research works as natural mast cell stabilizers like Flavonoids [14], Coumarins [15], phenols [16], terpenoids [17], tetrazoles [18] and amino acids [19].
Unangst et al. [18] in 1989, describe the inhibition of histamine release from human basophils by a group of Indolecarboxamidotetrazole compounds. They synthesize 31 derivatives. The percent inhibition (PI) of basophil histamine release for 31 derivatives along with two well-known inhibitors e.g. nedocromil (CHEMBL 746) and Cromolyn sodium (CHEMBL 74) is calculated at screening concentration of 10 µm. Among them, 16 compounds, which have PI values less than 50, are selected as training set for our QSAR study. For this study, log (PI/100-PI) is used as biological activity.
Quantitative structure–activity relationship (QSAR) analysis is used to identify mast cell stabilizing activity of different types of compounds [20, 21, 22]. This work is important because experimental methods are costly and time consuming. In this QSAR study of Indolecarboxamidotetrazole compounds, some parameters of selected chemical properties are correlated with biological activity by using mathematical equations. Quantitative parameters for physicochemical or biological or toxicological properties of a molecule, derived by the computational or experimental methods, are known as descriptors. Depending on the types of algorithms and dimensions of descriptors, QSAR analysis can be classified into 2D-QSAR, 3D-QSAR and so on. Among them, 2D-QSAR analysis is a less time consuming energy calculation, because its descriptors are simple and two dimensional. This type of analysis can be operated with direct mathematical algorithms [23].
In various models developed using MLR technique, the statistical parameters e.g. cross validated coefficient (CV) defines the goodness of prediction, whereas the non-cross validated conventional correlation coefficient (r2) defines the goodness of fit of the QSAR model [24].
As stated earlier, Unangst et al., in 1989 [18], synthesize and analyze antiallergic potential of a series of novel indolecarboxamidotetrazoles compounds. These series of thirty-one compounds are derivatives of the following compound, where different substitutions are incorporated in R1, R2 and R3 positions. Bioactivity data for 21 inhibitors of IgE receptor α subunit for human basophils are collected [18] and subjected to descriptor determination [25]. First of all, various topological, electronic, geometrical and constitutional descriptors are calculated and among them five are finally selected by systemic search method [26]. Finally, a linear regression model is hypothesized using selected descriptors, which will be quite useful for finding structurally optimum inhibitor for mast cell stabilization (Fig. 1).
Carboxyamidotriazole binds to and inhibits non-voltage-operated calcium channels, blocking both Ca2 + influx into cells and Ca2 + release from intracellular stores, resulting in the disruption of calcium channel-mediated signal transduction.
2 Material and Method
There are several steps in QSAR model development. First a dataset of similar chemical compounds with same biological activity have been identified. Then different types of descriptors are calculated and using systemic search most suitable descriptors have been identified. With the help of these selected descriptors QSAR models are constructed.
2.1 Data
A data set of the compounds, which consists of twenty-one Indolecarboxamidotetrazole substituted compounds as inhibitors for histamine release for IgE Fc receptor, alpha-subunit from allergic donors, are obtained from the ChEMBL database and literature [27]. According to Unangst et al. [18] chemicals are marked as compound key e.g. 8 l, 3 g, 13 m etc. The chemical IDs and standard inhibition (PI) values and log (PI/100-PI) values are presented in Table 1. Among them first sixteen compounds form training set and last five compounds consist of test set for QSAR study.
2.2 Descriptor Generation and Calculation
In this study, SMILES structures, as they are obtained from ChEMBL [27], of all sixteen molecules are used as inputs to calculate 44 different types of molecular descriptors. These descriptors are categorized as topological, electronic, constitutional, and geometric classes for QSAR analysis. For our 2D QSAR study, we choose all 123 descriptors from Chemistry Development Kit (CDK v 1.03), an open source Java library for Chemoinformatics and Bioinformatics [25]. A total of 1,968 descriptors (123 descriptors for 16 compounds) are generated after the calculation using CDK for our experiment (Fig. 2).
2.3 Descriptor Selection
Variable selection method in BuildQSAR software [28] is carried out by using the systematic search method, where controlling parameters are set, as described next. Considering log (PI/100-PI) as chosen biological activity and a correlation coefficient factor R is >0.84, cross validation of results is done by using least-one-out method. The number of descriptors per model is set to be 1 as the ratio of (number of compounds) to (number of descriptors) should be >= 5. Here the total twenty-one compounds are randomly divided into training and validation sets containing first sixteen molecules in training set and last five compounds in validation set.
From the summary table of CDK descriptor (wiki.qspr.thesaurus.eu) the above-mentioned descriptors are discussed here. SPC-6 is a type of Chi Path Cluster descriptor belongs to the class of topological descriptor. This type of descriptor reflects the molecular connectivity of a compound without its geometry information. This descriptor evaluates the value of the Kier & Hall Chi path cluster indices of order 6. The second descriptor MDEC-12 represents molecular distance edge descriptors for C as another topological descriptor. Similarly, another topological descriptor is BCUTw-1 h. This is an Eigen value based descriptor noted for its utility in chemical diversity. SPC-5 is the value of the Kier & Hall Chi path cluster indices of order 5. XLogP is a constitutional descriptor which predicts of logP based on the atom-type method.
2.4 Model Development
Three models are generated using one descriptor in each model by Multiple Linear Regression method in BuildQSAR software [28]. The QSAR Model is represented as QSAR equation, with the correlation coefficients calculated for each descriptor used in the regression model. By plotting the experimental activity (Yexp) vs predicted activity (Ypred), built models are evaluated for their predictive powers to determine activity as mast cell stabilizer. Three linear models are built using three selected descriptors as shown in Table 2.
For the first model, linear regression equation is
This equation is obtained by analyzing the three topological descriptors e.g. Chi Path Cluster descriptor, molecular distance edge descriptor and Eigen value based descriptor. Here n is number of molecules under analysis, R is the correlation coefficient, r2(R) is the squared correlation coefficient, s is the standard deviation and F is the F statistical value. The cross validated squared correlation coefficient, Q2 is 0.643 and standard deviation of sum of square of difference between predicted and observed values, SPress is 0.205.
The second QSAR equation is
This equation is obtained by analyzing the three topological descriptors. The cross validated squared correlation coefficient, Q2 is 0.570 and SPress is 0.225.
The third model can be generated by using two topological and one constitutional descriptor. Here the values of Q2 and SPress are 0.564 and 0.227 respectively.
3 Evaluating the Model
As stated earlier in various models developed using Multiple Linear Regression technique, the cross validated coefficient (CV) defines the goodness of prediction for developed models, whereas the non-cross validated conventional correlation coefficient (r2) defines the goodness of fit of the QSAR model [24].
3.1 On the Basis of Goodness of Fit
Among the above three models, model 1 has produced high statistical quality equation (n = 16; R = 0.887; r 2 (R) = 0.7860; R 2-Adj. = 0.7325; s = 0.159; F = 14.692; p = 0.0003; Q 2 = 0.643; SPress = 0.205; SDEP = 0.184). It is seen that both models 2 and 3 have R values of 0.867 and 0.855 respectively, which are lower compared to the first model. Model 1 contents the lowest standard deviation value (s = 0.159) when compared to the other two models. The value of F, the calculated value of the F-ratio test, is 14.692, which is also highest among others F values for all models (Table 3). So, on the basis of R2, s and F values, model 1 can be considered as the best one among the three models.
3.2 On the Basis of Predictive Power
According to the predictive powers of models, the models can be ranked (from the best to the worst) (considering the values of Q2 in descending order), as 1, 2 and 3. When minimum value of SDEP is considered, the model 1 is the best model again on the basis of predictive power (Tables 4 and 5).
4 Graphical Analysis
The graphical analysis has been performed and the graph is shown in following Figs. 3, 4, 5 and 6. The graph has been plotted between the predicted and observed log (PI/100-PI) values (Fig. 3). The predicted activity log(PI/100-PI)pred shows linear relationship with observed activity log(PI/100-PI)obs, because fit of the data to the regression line is good. The higher the value for r2, less likely proves that the relationship is due to chance.
This QSAR investigation indicates that the descriptors, namely SPC-6, MDEC-12 and BCUT-1 h, for the set of Indolecarboxamidotetrazole compounds inhibitors studied, are found to have a great deal to positively contribute to biological activity.
The graph is plotted for observed activity versus residual (Fig. 4) and predicted activity versus residual (Fig. 5). The finalized descriptors are found to be the members of topological descriptors in model 1.
5 External Validation
All the three models are externally validated with validation set of eight compounds. On the basis of predictive power model 1 is selected by internal validation. A validation set is constructed with the other five molecules and external validation is done using descriptors of the model 1. Observed activity Vs calculated activity graph for validation set shows that all compounds of this set located symmetrically around the best fit line (Fig. 6). Thus model 1 are externally validated with five compounds with the values 0.980 and 0.9606 for the correlation coefficient R and the squared correlation coefficient r2(R) respectively.
6 Conclusion
Innumerable QSAR models have been built in last 50 years drug designing of antimycobacterial agents [30], antituberculosis agents [31], acetylcholinesterase inhibitors [31] and estrogen receptor agonist and antagonists [32]. Mast cell stabilizers can act as inhibitors on human basophil cells and thus they are potent anti allergic drugs.In earlier work, Unangst et al. in 1989 concludes that N –phenyl analogue of indolecarboxamidotetrazole inhibits histamine release from human leukocytes after stimulating with anti-IgE antibody (18), compared to the substitution N-H and N-methyl compounds. These compounds are marked as compound no. 7, 11 and 16 in our dataset respectively and their calculated activity using QSAR model correlates with their observed activity. In this QSAR model developed by using multiple linear regression (MLR) analysis, the cross-validated values of maximum Q2 and minimum SDEP correlates with the goodness of prediction, whereas the non-cross-validated conventional correlation coefficient (R2) defines the goodness of fit of the model. Based on the most predictable QSAR model 1, it can be inferred that inhibitory activity will be decreased with the following substituent, namely halogen substitution in 5th position at R1 of indole ring, OEt, OCH(Me)2 substitutions in R3 of indole ring. The model is also valid for other five carboxamidotetrazole derivatives of furan, thiophene, naphthalene and benzothiophene derivatives. So, it can be concluded that irrespective of nature of substituents the basic structure of this twenty one compounds are responsible for their action as mast cell stabilizer.
We hope that the derived models and effect of substituents can be used in searching more potential mast cell stabilizers from the natural resources prior to experimental evaluation for our future work.
References
Atanasov, A.G., Waltenberger, B., Pferschy-Wenzig, E.-M., Linder, T., Wawrosch, C., Uhrin, P., Temml, V., Wang, L., Schwaiger, S., Heiss, E.H., Rollinger, J.M., Schuster, D., Breuss, J.M., Bochkov, V., Mihovilovic, M.D., Kopp, B., Bauer, R., Dirsch, V.M., Stuppner, H.: Discovery and resupply of pharmacologically active plant-derived natural products: a review. Biotechnol. Adv. 33(8), 1582–1614 (2015). http://doi.org/10.1016/j.biotechadv.2015.08.001
Finn, D.F., Walsh, J.J.: Twenty first century mast cell stabilizers. Br. J. Pharmacol. 170(1), 23–37 (2013)
Margit, N., Fedina, L., Balazs, R., Barta, B., nee Gyarmati, G.T., Marczis, J., Szasz, A.: Antiasthmatic aerosol preparation of sodium cromoglycate. U.S. Patent US5753208, issued June 1965
Heinke, S., Szucs, G., Norris, A., Droogmans, G., Nilius, B.: Inhibition of volume-activated chloride currents in endothelial cells by chromones. Br. J. Pharmacol. 115(8), 1393–1398 (1995)
Overington, J.P., Al-Lazikani, B., Hopkins, A.L.: How many drug targets are there? Nat. Rev. Drug Discov. 5(12), 993–996 (2006)
Imming, P., Sinning, C., Meyer, A.: Drugs, their targets and the nature and number of drug targets. Nat. Rev. Drug Discov. 5(10), 821–834 (2006)
Brogden, R.N., Speight, T.M., Avery, G.S.: Sodium cromoglycate (cromolyn sodium): a review of its mode of action, pharmacology, therapeutic efficacy and use. Drugs 7(3–4), 164–282 (1974)
Arumugam, T., Ramachandran, V., Logsdon, C.D.: Effect of cromolyn on S100P interactions with RAGE and pancreatic cancer growth and invasion in mouse models. J. Natl Cancer Inst. 98(24), 1806–1818 (2006)
Sridevi, G., Gopkumar, P., Ashok, S., Shastry, C.: Pharmacological basis for antianaphylactic, antihistaminic and mast cell stabilization activity of ocimum sanctum. Internet J. Pharmacol. 7(1) (2008)
Chandrashekhar, V.M., et al.: Anti-allergic activity of German chamomile (Matricaria recutita L.) in mast cell mediated allergy model. J. Ethnopharmacol. 137(1), 336–340 (2011)
Inoue, T., et al.: Antiallergic effect of flavonoid glycosides obtained from Mentha piperita L. Biol. Pharm. Bull. 25(2), 256–259 (2002)
Chen, B.-H., et al.: Antiallergic potential on RBL-2H3 cells of some phenolic constituents of Zingiber Officinale (Ginger). J. Nat. Prod. 72(5), 950–953 (2009)
Kanda, T., et al.: Inhibitory effects of apple polyphenol on induced histamine release from RBL-2H3 cells and rat mast cells. Biosci. Biotechnol. Biochem. 62(7), 1284–1289 (1998)
Kimata, M., Shichijo, M., Miura, T., Serizawa, I., Inagaki, N., Nagai, H.: Effects of luteolin, quercetin and baicalein on immunoglobulin E-mediated mediator release from human cultured mast cells. Clin. Exp. Allergy 30, 501–508 (2000)
Ninomiya, M., Itoh, T., Ishikawa, S., Saiki, M., Narumiya, K., Yasuda, M., et al.: Phenolic constituents isolated from Fragaria ananassa Duch. inhibit antigen-stimulated degranulation through direct inhibition of spleen tyrosine kinase activation. Bioorg. Med. Chem. 18, 5932–5937 (2010)
El-Agamy, D.S.: Anti-allergic effects of nilotinib on mast cell-mediated anaphylaxis like reactions. Eur. J. Pharmacol. 680, 115–121 (2012)
Itoh, T., Oyama, M., Takimoto, N., Kato, C., Nozawa, Y., Akao, Y., et al.: Inhibitory effects of sesquiterpene lactones isolated from Eupatorium chinense L. on IgE-mediated degranulation in rat basophilic leukemia RBL-2H3 cells and passive cutaneous anaphylaxis reaction in mice. Bioorg. Med. Chem. 17, 3189–3197 (2009)
Unangst, P.C., Connor, D.T., Stabler, S.R., Weikert, R.J., Carethers, M.E., Kennedy, J.A., Thueson, D.O., Chestnut, J.C., Adolphson, R.L., Conroy, M.C.: Novel indolecarboxamidotetrazoles as potential antiallergy agents. J. Med. Chem. 32(6), 1360–1366 (1989)
Connor, D.T., Cetenko, W.A., Mullican, M.D., Sorenson, R.J., Unangst, P.C., Weikert, R.J., Adolphson, R.L., Kennedy, J.A., Thueson, D.O.: Novel benzothiophene-, benzofuran-, and naphthalene Hocarboxamidotetrazoles as potential antiallergy agents. J. Med. Chem. 35(5), 958–965 (1992)
Kim, N.H., Jeong, H.J., Kim, H.M.: Theonine is a candidate amino acid for pharmacological stabilization of mast cells. Amino Acids 42, 1609–1618 (2011)
Raj, N., Jain, S.K.: 2D QSAR study of substituted 2-Phenyl-Benzimidazole derivatives as potent anti allergic agents. Int. J. PharmTech Res. 4(4), 1350–1360 (2012)
Zhou, Y.X., Xu, L., Wu, Y.P., Liu, B.L.: A QSAR study of the antiallergic activities of substituted benzamides and their structures. Chemometr. Intell. Lab. Syst. 45(1), 95–100 (1999)
Ford, R.E., Knowles, P., Lunt, E., Marshall, S.M., Penrose, A.J., Ramsden, C.A., Summers, A.J.H., Walker, J.L., Wright, D.E.: Synthesis and quantitative structure-activity relationships of antiallergic 2-hydroxy-N-(1H-tetrazol-5-yl) benzamides and N-(2-hydroxyphenyl)-1H-tetrazole-5-carboxamides. J. Med. Chem. 29(4), 538–549 (1986)
Roy, K., Narayan Das, R.: A review on principles, theory and practices of 2D-QSAR. Curr. Drug Metab. 15(4), 346–379 (2014)
Golbraikh, A., Tropsha, A.: Predictive QSAR modeling diversity sampling of experimental data set and test set selection. J. Comput. Aided Mol. Des. 5, 231–243 (2002)
Guha, R., Van Drie, J.H.: Structure-activity landscape index: identifying and quantifying activity cliffs. J. Chem. Inf. Model. 48(3), 646–658 (2008)
Gonzalez, M.P., Teran, C., Saiz-Urra, L., Teijeira, M.: Variable selection methods in QSAR: an overview. Curr. Top. Med. Chem. 8(18), 1606–1627 (2008)
Bento, A.P., Gaulton, A., Hersey, A., Bellis, L.J., Chambers, J., Davies, M., Kruger, F.A., Light, Y., Mak, L., McGlinchey, S., Nowotka, M., Papadatos, G., Santos, R., Overington, J.P.: The ChEMBL bioactivity database: an update. Nucleic Acids Res. 42, 1083–1090 (2014)
De Oliveira, D.B., Gaudio, A.C.: BuildQSAR: a new computer program for QSAR analysis. Quant. Struct.-Activ. Relat. 19(6), 599–601 (2001)
Stavrakov, G., Valcheva, V., Philipova, I., Doytchinova, I.: Design of novel camphane-based derivatives with antimycobacterial activity. J. Mol. Graph. Model. 51, 7–12 (2014)
Arvind, A., et al.: Uridine monophosphate kinase as potential target for tuberculosis: From target to lead identification. Interdisc. Sci. Comput. Life Sci. 5(4), 296 (2013)
Lee, S., Barron, M.G.: A mechanism-based 3D-QSAR approach for classification and prediction of acetylcholinesterase inhibitory potency of organophosphate and carbamate analogs. J. Comput. Aided Mol. Des. 30(4), 347–363 (2016)
Lee, S., Barron, M.G.: Structure-based understanding of binding affinity and mode of estrogen receptor α agonists and antagonists. PLoS ONE 12(1), e0169607 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Basu, A., Sarkar, A., Basak, P. (2017). QSAR Model for Mast Cell Stabilizing Activity of Indolecarboxamidotetrazole Compounds on Human Basophils. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 776. Springer, Singapore. https://doi.org/10.1007/978-981-10-6430-2_2
Download citation
DOI: https://doi.org/10.1007/978-981-10-6430-2_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6429-6
Online ISBN: 978-981-10-6430-2
eBook Packages: Computer ScienceComputer Science (R0)