Abstract
The popularity of herbal medicines has greatly increased in worldwide countries over recent years. Herbal formula is a form of traditional medicine where herbs are combined to heal patient to heal faster and more efficiency. Herbal formulae can be divided into categories. Some formulae can be classified as more than one category. The categories are usually based on indications of herbs in formulae. To support experts for classifying a formula to one or more therapeutic categories, the normalized score multi-label k-nearest neighbors (NSML k-NN) algorithm, is proposed for multi-label herbal formulae classification. The k-NN classifiers with several term weight schemes are explored. The normalized scores are calculated. The values of k, strategies to assign categories are investigated to adjust the decision for multi-label herbal formulae. The experiment is done using a mixed data set of herbal formulae collected from the Natural List of Essential Medicine and the list of common household remedies for traditional medicine. Moreover, a set of well-known commercial products are used for evaluating the effectiveness of the proposed method. From the results, the NSML k-NN is an efficient method to classify multi-label herbal formulae.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lovell-Smith, H.D.: In defence of ayurvedic medicine. The New Zealand Medical Journal 119, 1–3 (2006)
Aziz, Z., Peng, T.N.: Herbal medicines: prevalence and predictors of use among malaysian adults. Complementary Therapies in Medicine 44, 44–50 (2009)
Roiger, R., Geatz, M.: Data Mining: A Tutorial Based Primer. Addison-Wesley, Boston (2002)
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34, 1–47 (2002)
Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using em. Machine Learning 39, 103–134 (2000)
Duwairi, R., Al-Zubaidi, R.: A hierarchical k-nn classifier for textual data. The International Arab Journal of Information Technology 8, 251–259 (2011)
Lertnattee, V., Theeramunkong, T.: Effect of term distributions on centroid-based text categorization. Information Sciences 158, 89–115 (2004)
Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management 24, 513–523 (1988)
Singhal, A., Salton, G., Buckley, C.: Length normalization in degraded text collections. Technical Report TR95-1507 (1995)
Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. International Journal Data Warehousing and Mining 3, 1–13 (2007)
Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Machine Learning 85, 333–359 (2011)
Fujino, A., Isozaki, H., Suzuki, J.: Multi-label text categorization with model combination based on f1-score maximization. In: Proceeding of The 3rd International Joint Conference on Natural Language Processing, pp. 823–828 (2008)
Hua, L.: Research on multi-classification and multi-label in text categorization. In: Proceeding of International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 86–89 (2009)
Zhang, M.L., Zhou, Z.H.: ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition 40, 2038–2048 (2007)
Younes, Z., Abdallah, F., Denœux, T.: An Evidence-Theoretic k-Nearest Neighbor Rule for Multi-label Classification, pp. 297–308 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Lertnattee, V., Chomya, S., Lueviphan, C. (2013). Using a Normalized Score Multi-Label KNN to Classify Multi-label Herbal Formulae. In: Prasath, R., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science(), vol 8284. Springer, Cham. https://doi.org/10.1007/978-3-319-03844-5_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-03844-5_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03843-8
Online ISBN: 978-3-319-03844-5
eBook Packages: Computer ScienceComputer Science (R0)