MapReduce distributed parallel computing framework for diagnosis and treatment of knee joint Kashin-Beck disease

Dang, Chenpo; Yi, Guirong; Zhu, Zhaomin; Zhou, Peng; Shao, Hongbin; Yao, Yanbin; Zhao, Maosheng; Li, Lintao; Li, Shensong

doi:10.1007/s11227-020-03608-0

Published: 02 February 2021

Volume 77, pages 9088–9101, (2021)

The Journal of Supercomputing

Chenpo Dang¹^na1,
Guirong Yi²^na1,
Zhaomin Zhu³^na1,
Peng Zhou¹,
Hongbin Shao¹,
Yanbin Yao¹,
Maosheng Zhao¹,⁴,
Lintao Li⁵ &
Shensong Li¹

Abstract

This study aims to improve the accuracy and computational efficiency of the MapReduce distributed parallel computing framework so that it can be used to mine diagnosis and treatment data for Kashin-Beck disease (KBD) of the knee joint. To address the shortcomings of the traditional K-means clustering algorithm (KCA), a simplified distance calculation was proposed in which the Manhattan distance replaces the Euclidean distance. Further improvement strategies were then proposed, and the MapReduce-based KCA (MR-KCA) and the improved MR-KCA (IMR-KCA) were implemented and compared. On the same data, the sum of squared errors of both MR-KCA and IMR-KCA decreased as the number of center points increased. The clustering quality of IMR-KCA was higher than that of MR-KCA, and the difference was most evident at a data capacity of 8 GB. The total execution time of both algorithms increased with the number of center points. Compared with MR-KCA, IMR-KCA significantly reduced the total execution time, especially at a data capacity of 8 GB; with 5000 center points, IMR-KCA reduced the total execution time by 50%. The experiments showed that IMR-KCA better presents the diagnosis and treatment data of patients with knee joint KBD. The scalability rates of both algorithms decreased as the number of nodes increased but remained above 0.80, indicating good scalability, and IMR-KCA was significantly more scalable than MR-KCA. The IMR-KCA proposed in this study offers high accuracy and computing efficiency and could be used for visualizing KBD diagnosis and treatment data.
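The abstract does not give the implementation details of MR-KCA or IMR-KCA, so the following is only a minimal single-process sketch of the general idea: one K-means iteration written as a map step (assign each point to its nearest center by Manhattan distance) and a reduce step (recompute each center). The function names, the mean-based center update, and the use of squared Euclidean distance for the sum of squared errors are assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def manhattan(p, q):
    """Manhattan (L1) distance: sum of absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(p, q))


def map_assign(point, centers):
    """Map step (hypothetical): emit (index of nearest center, point)."""
    idx = min(range(len(centers)), key=lambda i: manhattan(point, centers[i]))
    return idx, point


def reduce_update(points):
    """Reduce step (hypothetical): new center is the coordinate-wise mean.

    Assumption: the mean is kept as in standard K-means, even though the
    assignment uses Manhattan distance.
    """
    n = len(points)
    return tuple(sum(coord) / n for coord in zip(*points))


def kmeans_iteration(data, centers):
    """Run one map/shuffle/reduce round and return the updated centers."""
    groups = defaultdict(list)  # plays the role of the shuffle phase
    for point in data:
        idx, p = map_assign(point, centers)
        groups[idx].append(p)
    # A center with no assigned points keeps its previous position.
    return [
        reduce_update(groups[i]) if groups[i] else centers[i]
        for i in range(len(centers))
    ]


def sse(data, centers):
    """Sum of squared errors (assumed: squared Euclidean distance to the
    center chosen by the Manhattan-based assignment)."""
    total = 0.0
    for point in data:
        idx, _ = map_assign(point, centers)
        total += sum((a - b) ** 2 for a, b in zip(point, centers[idx]))
    return total


if __name__ == "__main__":
    data = [(0, 0), (0, 2), (10, 10), (10, 12)]
    centers = [(0, 0), (10, 10)]
    updated = kmeans_iteration(data, centers)
    print(updated, sse(data, centers), sse(data, updated))
```

In a real MapReduce deployment the map and reduce functions would run on separate nodes and the grouping would be done by the framework's shuffle phase; the dictionary here only stands in for that step.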

Acknowledgements

This work was supported by the Fund of the Gansu Health Care Research Plan (GSWSKY-2019-12).

Author information

Chenpo Dang, Guirong Yi and Zhaomin Zhu contributed equally to this work.

Authors and Affiliations

Department of Orthopedics (Sport Medicine), The 940th Hospital of Joint Logistics Support Force of Chinese People’s Liberation Army, Lanzhou, 730050, People’s Republic of China
Chenpo Dang, Peng Zhou, Hongbin Shao, Yanbin Yao, Maosheng Zhao & Shensong Li
Department of Gastroenterology, The Second Affiliated Hospital of Lanzhou University, Lanzhou, 730050, People’s Republic of China
Guirong Yi
Department of Orthopedics, General Hospital of Tibet Military Region, Lasa, 850007, People’s Republic of China
Zhaomin Zhu
Clinical Medical Colleges, Gansu University of Chinese Medicine, Lanzhou, 730050, People’s Republic of China
Maosheng Zhao
Department of Orthopedics, School of Medicine, Jinling Hospital, Nanjing University, Nanjing, 210093, People’s Republic of China
Lintao Li

Corresponding authors

Correspondence to Lintao Li or Shensong Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Dang, C., Yi, G., Zhu, Z. et al. MapReduce distributed parallel computing framework for diagnosis and treatment of knee joint Kashin-Beck disease. J Supercomput 77, 9088–9101 (2021). https://doi.org/10.1007/s11227-020-03608-0

Accepted: 28 December 2020
Published: 02 February 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s11227-020-03608-0
