Abstract
In 1967 Bhattacharya, a biologist from India, presented a method for identifying juvenile-fish subgroups from random samples (Bhattacharya 1967). By now this test, based on Gaussian curves, has become a key-method for the analysis and sustainability of this important resource in the eco-system, and is recommended by the Food and Agricultural Organization of the United Nations Guidelines (FAO 2011). As Gaussian curves are the mainstream not only with fish population research, but also with clinical data, it is peculiar that, so far, this method has not been widely applied in clinical research. When searching Pub Med we only found a few clinical-laboratory studies (Guerin et al. 1992; Watson et al. 1999; Pottel et al. 2008; Baadenhuijsen and Smit 1985), epidemiological (Metz et al. 2002; Zhang et al. 2004) and genetic studies (Miescke and Musea 1994; Evans et al. 1983), and not a single cardiovascular study. In clinical research data-files are, usually, summarized by their means and standards deviations (SDs). Standard deviations are a convenient way of estimating the spread in your data, but they are only valid if your data can be assumed to follow a clock-like Gaussian curve. Under this assumption the mean ± 1.96 × SDs covers 95% of the data. Of course, many cardiovascular data samples are not perfectly Gaussian-like. Mean and SDs are, therefore, just approximations. There may be better methods to find the best fit Gaussian curves for your data. Instead of the mean, the mode or median can be used, and instead of histograms consistent of bins, more refined Kernel histograms consistent of multiple similarly sized small Gaussian curves can be drawn (Metz et al. 2002). Also, distribution-free statistical methods like non-parametric tests can be applied to “quasi-gaussianize” the data. However, all of these methods massage the data. Bhattacharya modeling does not massage the data, but, instead, unmasks Gaussian curves, as truly present in the data, and removes outlier frequencies. In clinical research it could be used (1) for unmasking normal values of diagnostic tests, (2) for improving the p-values of data testing, and (3) for objectively searching subsets in your data. The current chapter uses as examples simulated vascular lab scores to investigate the performance of Bhattacharya modeling as compared to standards methods, and was written to acquaint the clinical research community with this novel method.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Altman DG (1995) Statistical notes: the normal distribution. BMJ 310:298
Armitage P, Berry G (1994) Statistical methods in medical research. Blackwell Scientific Publications, Oxford, pp 66–71
Baadenhuijsen H, Smit JC (1985) Indirect estimation of clinical chemical reference intervals from total hospital patient data. J Clin Chem Clin Biochem 23:829–839
Bhattacharya CG (1967) A simple solution of a distribution into Gaussian components. Biometrics 23:115–135
Evans DA, Harmer D, Downham DY, Whibley EJ, Idle JR, Ritchie J, Smith RL (1983) The genetic control of sparteine and debrisoquine metabolism in man with new methods of analysing bimodal distributions. J Med Genet 20:321–329
Feng Z, McLerran D, Grizzle J (1996) A comparison of statistical methods for clustered data analysis with Gaussian error. Stat Med 15:1793–1806
Food and Agricultural Organization of the United Nations (2011) Manuals and guides BOBP/MAG/14. Separating mixtures of normal distributions: basic programs for Bhattacharya’s method and their applications to fish analysis. Copyright@fao.org. Accessed 15 Dec 2011
Guerin MD, Sikaris KA, Martin A (1992) Pathology informatics: an expanded role for laboratory information systems. Pathology 24:523–529
Janecki JM (2008) Application of statistical features of the Gaussian distribution hidden in sets of unselected medical laboratory results. Biocyb Biomed Eng 28:71–81
Metz J, Maxwell EL, Levin MD (2002) Changes in folate concentrations following voluntary food fortification in Australia. Med J Aust 176:90–91
Miescke KJ, Musea MN (1994) On mixtures of three normal populations caused by Monogenic inheritance: application to desipramine metabolism. J Psychiatry Neurosci 19:295–300
Pottel H, Vrydags N, Mahieu B, Vandewynckele E, Croes K, Martens F (2008) Establishing age/sex related serum creatinine reference intervals from hospital laboratory data based on different statistical methods. Clin Chim Acta 396:49–55
Watson N, Sikaris KA, Morris G, Mitchell DK (1999) Confirmation of age related rise in reference intervals for fasting glucose using the Bhattacharya method and patient data. Clin Biochem Rev 20:92–98
Zhang L, Liu C, Davis CJ (2004) A mixture model-based approach to the classification of habitats using forest inventory and analysis data. Can J For Res 34:1150–1156
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Cleophas, T.J., Zwinderman, A.H. (2012). Bhattacharya Modeling. In: Statistics Applied to Clinical Studies. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2863-9_26
Download citation
DOI: https://doi.org/10.1007/978-94-007-2863-9_26
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2862-2
Online ISBN: 978-94-007-2863-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)