Skip to main content

Efficient Test for Nonlinear Dependence of Two Continuous Variables

  • Chapter
  • First Online:
Book cover Applied Computational Genomics

Part of the book series: Translational Bioinformatics ((TRBIO,volume 13))

Abstract

A new method to test nonlinear dependence between two continuous variables (X and Y) is proposed. This is achieved by using continuous analysis of variance (CANOVA). The software is available at https://sourceforge.net/projects/canova. First, a neighborhood for each data point related to its X value was defined. Then, the variance of the Y value within the neighborhood was calculated. Last, permutations to evaluate the significance of the observed values within the neighborhood variance were conducted. To examine the strength of CANOVA compared to six other methods, extensive simulations were completed to examine the false-positive rates and statistical power. Both simulation and real datasets (kidney cancer RNA-seq data) were used. From these analyses, it was concluded that CANOVA is efficient as a method in testing nonlinear correlation and has several advantages for real data application.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Aitken AC. Statistical mathematics. Edinburgh: Oliver and Boyd; 1942.

    Google Scholar 

  • Albanese D, Filosi M, Visintainer R, Riccadonna S, Jurman G, Furlanello C. Minerva and minepy: a C engine for the MINE suite and its R, Python and MATLAB wrappers. Bioinformatics. 2013;29(3):407–8.

    Article  CAS  PubMed  Google Scholar 

  • Altman NS. An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat. 1992;46(3):175–85.

    Google Scholar 

  • Burke DS, Brundage JF, Redfield RR, Damato JJ, Schable CA, Putman P, et al. Measurement of the false positive rate in a screening program for human immunodeficiency virus infections. N Engl J Med. 1988;319(15):961–4.

    Article  CAS  PubMed  Google Scholar 

  • Cleveland WS, Devlin SJ, Grosse E. Regression by local fitting–methods, properties, and computational algorithms. J Econ. 1988;37(1):87–114.

    Article  Google Scholar 

  • Cohen J. Statistical power analysis for the behavioral sciences. Hillsdale: L Erlbaum Associates; 1988.

    Google Scholar 

  • Croxton FE, Cowden DJ. Applied general statistics. New Jersey: Prentice-Hall Inc.; 1939.

    Google Scholar 

  • Devlin SJ, Gnanadesikan R, Kettenring JR. Robust estimation and outlier detection with correlation-coefficients. Biometrika. 1975;62(3):531–45.

    Article  Google Scholar 

  • Dieter MZ, Freshwater SL, Miller ML, Shertzer HG, Dalton TP, Nebert DW. Pharmacological rescue of the 14CoS/14CoS mouse: hepatocyte apoptosis is likely caused by endogenous oxidative stress. Free Radic Biol Med. 2003;35(4):351–67.

    Article  CAS  PubMed  Google Scholar 

  • Dietrich CF. Uncertainty, calibration and probability: the statistics of scientific and industrial measurement. Boca Raton: CRC Press; 1991.

    Google Scholar 

  • Galton F. Typical laws of heredity. 1877. 5.

    Google Scholar 

  • Galton F. Regression towards mediocrity in hereditary stature. J Anthropol Inst Great Brit Ireland. 1886;15:246–63.

    Article  Google Scholar 

  • Good P. Permutation tests. New York: Springer; 2000.

    Book  Google Scholar 

  • Gretton A, Bousquet O, Smola A, Schölkopf B. Measuring statistical dependence with Hilbert-Schmidt norms. In: Algorithmic learning theory. Heidelberg: Springer; 2005. p. 63–77.

    Chapter  Google Scholar 

  • Grosse L, Campeau AS, Caron S, Morin FA, Meunier K, Trottier J, et al. Enantiomer selective glucuronidation of the non-steroidal pure anti-androgen bicalutamide by human liver and kidney: role of the human UDP-glucuronosyltransferase (UGT)1A9 enzyme. Basic Clin Pharmacol Toxicol. 2013;113(2):92–102.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Ha SA, Shin SM, Namkoong H, Lee HJ, Cho GW, Hur SY, et al. Cancer-associated expression of minichromosome maintenance 3 gene in several human cancers and its involvement in tumorigenesis. Clin Cancer Res. 2004;10(24):8386–95.

    Article  CAS  PubMed  Google Scholar 

  • Heller R, Heller Y, Gorfine M. A consistent multivariate test of association based on ranks of distances. Biometrika 2012:ass070.

    Google Scholar 

  • Hmisc: Harrell Miscellaneous. http://CRAN.R-project.org/package=Hmisc.

    Google Scholar 

  • Horn PS. Introduction to robust estimation and hypothesis testing. Technometrics. 1998;40(1):77–8.

    Article  Google Scholar 

  • Huber P. Robust statistics. In: Lovric M, editor. International encyclopedia of statistical science. Berlin/Heidelberg: Springer; 2011. p. 1248–51.

    Chapter  Google Scholar 

  • Jiang J, Lin N, Guo S, Chen J, Xiong M. Methods for joint imaging and RNA-seq data analysis. arXiv preprint. 2014;arXiv:14093899.

    Google Scholar 

  • Kendall MG. A new measure of rank correlation. Biometrika. 1938;30:81–93.

    Article  Google Scholar 

  • Kinney JB, Atwal GS. Equitability, mutual information, and the maximal information coefficient. Proc Natl Acad Sci U S A. 2014;111(9):3354–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Kirikoshi H, Katoh M. Molecular cloning and characterization of human GIPC2, a novel gene homologous to human GIPC1 and Xenopus Kermit. Int J Oncol. 2002;20(3):571–6.

    PubMed  CAS  Google Scholar 

  • Kosorok MR. On Brownian distance covariance and high dimensional data. Ann Appl Stat. 2009;3(4):1266–9.

    Article  PubMed  PubMed Central  Google Scholar 

  • Li B, Reed JC, Kim HR, HJ C. Proteomic profiling of differentially expressed proteins from Bax inhibitor-1 knockout and wild type mice. Mol Cells. 2012;34(1):15–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Lockyer N. Nature: Macmillan Journals Limited. 1885.

    Google Scholar 

  • Murrell B, Murrell D, Murrell H. R2-equitability is satisfiable. Proc Natl Acad Sci. 2014;111(21):E2160.

    Article  CAS  PubMed  Google Scholar 

  • Myers JL, Well AD, Lorch RF Jr. Research design and statistical analysis. New York: Routledge; 2010.

    Google Scholar 

  • Natrajan R, Little SE, Reis-Filho JS, Hing L, Messahel B, Grundy PE, et al. Amplification and overexpression of CACNA1E correlates with relapse in favorable histology Wilms’ tumors. Clin Cancer Res. 2006;12(24):7284–93.

    Article  CAS  PubMed  Google Scholar 

  • Pearson K. Note on regression and inheritance in the case of two parents. Proc R Soc Lond. 1895;58(347–352):240–2.

    Article  Google Scholar 

  • Reshef DN, Reshef YA, Finucane HK, Grossman SR, McVean G, Turnbaugh PJ, et al. Detecting novel associations in large data sets. Science. 2011;334(6062):1518–24.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Reshef D, Reshef Y, Mitzenmacher M, Sabeti P. Equitability analysis of the maximal information coefficient, with comparisons. arXiv preprint. 2013;arXiv:13016314.

    Google Scholar 

  • Reshef DN, Reshef YA, Mitzenmacher M, Sabeti PC. Cleaning up the record on the maximal information coefficient and equitability. Proc Natl Acad Sci. 2014;111(33):E3362–3.

    Article  CAS  PubMed  Google Scholar 

  • Scheffe H. The analysis of variance, vol. 72. New York: Wiley; 1999.

    Google Scholar 

  • Stigler SM. Francis Galton’s account of the invention of correlation. Stat Sci. 1989;4:73–9.

    Article  Google Scholar 

  • Stroustrup B. The C++ programming language: Pearson Education India. 1995.

    Google Scholar 

  • Székely GJ, Rizzo ML. Energy statistics: a class of statistics based on distances. J Stat Plan Inference. 2013;143(8):1249–72.

    Article  Google Scholar 

  • Székely GJ, Rizzo ML, Bakirov NK. Measuring and testing dependence by correlation of distances. Ann Stat. 2007;35(6):2769–94.

    Article  Google Scholar 

  • Tanaka Y, Hirata H, Chen Z, Kikuno N, Kawamoto K, Majid S, et al. Polymorphisms of catechol-O-methyltransferase in men with renal cell cancer. Cancer Epidemiol Biomark Prev. 2007;16(1):92–7.

    Article  CAS  Google Scholar 

  • The Cancer Genome Atlas Research Network. Comprehensive molecular characterization of clear cell renal cell carcinoma. Nature. 2013;499(7456):43–9.

    Article  CAS  PubMed Central  Google Scholar 

  • Tierney L, Rossini A, Li N. Snow: a parallel computing framework for the R system. Int J Parallel Prog. 2009;37(1):78–90.

    Article  Google Scholar 

  • Wang Y, Li Y, Cao H, Xiong M, Shugart YY, Jin L. Efficient test for nonlinear dependence of two continuous variables. BMC Bioinformatics. 2015;16(1). https://doi.org/10.1186/s12859-015-0697-7.

  • Wilding GE, Mudholkar GS. Empirical approximations for Hoeffding’s test of bivariate independence using two Weibull extensions. Stat Meth. 2008;5(2):160–70.

    Google Scholar 

  • Zhang T, Niu X, Liao L, Cho EA, Yang H. The contributions of HIF-target genes to tumor growth in RCC. PLoS One. 2013;8(11):e80544.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li Jin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Ritter, M., Li, Y., Wang, Y., Yao, Y., Jin, L. (2018). Efficient Test for Nonlinear Dependence of Two Continuous Variables. In: Yao, Y. (eds) Applied Computational Genomics. Translational Bioinformatics, vol 13. Springer, Singapore. https://doi.org/10.1007/978-981-13-1071-3_8

Download citation

Publish with us

Policies and ethics