Abstract
In this article, the performance enhancement of statistically significant bicluster using analysis of variance is articulated. Various statistical methods are used to analyze the gene expression level. It is found that analysis of variance is one of the efficient methods for aggregation between a pair of genes. It computes the values by comparing the mean value of each group, and results are tested using the hypothesis to calculate the p-value. Various tests are conducted to increase the performance of the gene pair. Various clustering techniques are functional to investigate the gene expression information for both homogeneous and heterogeneous. Statistical approaches are used to identify the relevant information from the subset of genes. Various testing methods were conducted to enhance the performance of correlated genes. When compared with the biclustering methods such as the paired t-test, two-sample tests, the ANOVAs One sample test produces better result.
References
Busygin, S., Prokopyev, O., Paradalos, P.: Biclustering in data mining. Comput. Oper. Res. 35(9), 2964–2687 (2000)
Ji, J., Pang, W., Zhou, C., Han, X., Wang, Z.: A fuzzy k-prototype clustering algorithm for mixed numeric and categorical data. Knowl.-Based Syst. 30, 129–135 (2012)
Rajkumar, P., Vennila, I., Nirmalakumari, K.: A novel hybrid method for gene selection in microarray based cancer classification. IJEST 5 (2013)
Hanczer, B., Nadif, M.: Using the bagging approach for biclustering of gene expression data. Neuro-Comput. 74, 568–584 (2011)
Mederia, S., Oliveira, A.: Biclustering algorithm for biological data analysis: a survey. IEEE Trans. Comput. Biol. Bioinf. 1(1), 24–45 (2004)
Dai, D., Yan, H.: Matrix decomposition for feature generation from high dimensional data. Pattern Recogn. Theor. Appl. 48, 194–205 (2007)
Reiss, D., Beliga, N., Bonneau, R.: Integrated biclustering of heterogeneous genome-wide data set for the inference of global regulatory networks. BMC Bioinf. 7 (2006)
Yang, W., Dai, D.: Finding correlated biclusters from gene expression data. IEEE Trans. Knowl. Data Eng. 23, 568–584 (2011)
Belacel, N., Wang, Q.: Cuperlovic culf, clustering methods for microarray gene expression data. OMICS 1, 507–531 (2006)
Ayadi, W., Elloumi, M., Hao, J.: BiMNine+: An efficient algorithm for discovering relevant biclusters of DNA microarray data. Knowl. Based Syst. (KDS) 35, 224–234 (2012)
Deng, Z., Choi, K., Chiung, F., Wang, S.: EEW-SC enhanced entropy-weighting subspace clustering for high dimensional gene expression data cluster analysis. Appl. Soft Comput. 41, 1041–1050 (2011)
Dueck, D., Morris, Q., Frey, B.: Multi way clustering of microarray data using probabilistic sparse matrix factorization. Bioinformatics 21, 1144–1151 (2005)
Liu, J., Yang, J., Wang, W.: Op-cluster: clustering by tendency in high dimensional space. In: IEEE International Conference on Data Mining (2003)
Han, L., Yan, H.: Hybrid method for the analysis of time series gene expression data. Knowl.-Based Syst. (KBS) 35, 14–20 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Vengatesan, K., Mahajan, S.B., Sanjeevikumar, P., Moin, S. (2018). The Performance Enhancement of Statistically Significant Bicluster Using Analysis of Variance. In: Konkani, A., Bera, R., Paul, S. (eds) Advances in Systems, Control and Automation. Lecture Notes in Electrical Engineering, vol 442. Springer, Singapore. https://doi.org/10.1007/978-981-10-4762-6_64
Download citation
DOI: https://doi.org/10.1007/978-981-10-4762-6_64
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4761-9
Online ISBN: 978-981-10-4762-6
eBook Packages: EngineeringEngineering (R0)