Background

In December 2019, a disease outbreak was noticed after a massive admission of patients with common clinical symptoms of pneumonia in the local hospitals of Wuhan City, China. Upon further investigations, the World Health Organization confirmed that the novel coronavirus, Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was responsible for these clinical symptoms and further denominated this disease as coronavirus disease (COVID-19) [1]. Its clinical course is diverse, ranging from mild self-limited illness to life-threatening organ dysfunctions [2,3,4].

Identifying disease sub-phenotypes could improve the understanding of the pathophysiology of critical care syndromes and lead to the discovery of new treatment targets by allowing future therapeutic trials to focus on predicted responders [5]. COVID-19 cluster analysis was previously used to identify distinct sub-phenotypes based on clinical and biochemical characteristics [6,7,8,9,10], for other heterogeneous syndromes, such as acute respiratory distress syndrome, sepsis, and acute kidney injury [11]. However, the main factors in the cluster analysis and methodology differed among these studies, as did the characteristics of the sub-phenotypes. Moreover, most studies used not only baseline characteristics but also laboratory test results and radiographic patterns [7,8,9,10].

Previous reports, including ours, revealed that baseline characteristics, such as age, sex, and comorbidities, can predict meaningful outcomes of COVID-19 [12, 13]. The clinical characteristics of COVID-19 may differ depending on the population. For instance, COVID-19 is milder in Japan than in other countries [14, 15]. Population differences may be influenced by complex factors, including the number of patients, medical infrastructure, resources of medical personnel, and patient background [15]. To the best of our knowledge, no clinical studies to date have examined the phenotypes of COVID-19 patients in Japan.

Based on the above, we hypothesized that cluster analysis using baseline characteristics reportedly related to COVID-19 outcomes may allow for simple meaningful phenotyping of Japanese COVID-19 patients, and that sub-phenotypes may differ according to population differences and cluster analysis methods. The present study aimed to demonstrate the usefulness of phenotyping in predicting meaningful outcomes of Japanese COVID-19 patients and to capture the patients’ post-hospitalization course.

Methods

Study design and settings

All COVID-19 cases in this retrospective cohort study were recruited through the Japan COVID-19 Task Force [16, 17]. From April 2020 to May 2021, data from consecutive inpatients aged ≥ 18 years diagnosed with COVID-19, using SARS-CoV2 polymerase chain reaction (PCR) test results at one among the > 100 affiliated hospitals, and who agreed to cooperate in the study were registered in an electronic case record form by the study subspecialist at the affiliated research institute. Patients meeting any of the following exclusion criteria were excluded: (i) non-Japanese patients, (ii) patients with incomplete medical records, such as missing outcome information, and (iii) patients lacking any of the selected 12 variables for cluster analysis (Fig. 1). All patients provided written informed consent. This study was approved by the ethics committees of Keio University School of Medicine (20200061) and related research institutions. All aspects of the study conformed to the principles of the Declaration of Helsinki adopted by WMA General Assembly, Fortaleza, Brazil, October 2013.

Fig. 1
figure 1

Process of patient selection in this study. Data from patients with known clinical outcomes and not missing any of the 12 variables used in the cluster analysis were analyzed

Data collection

The following information was extracted from the electronic case record form: age, sex, height, weight, clinical symptoms and signs, laboratory findings on admission, comorbidities, disease severity (supplementary oxygen, intensive care unit (ICU) entry, need for invasive mechanical ventilation, and survival status), and treatment details. We defined disease severity as follows: most severe, need for support by high-flow oxygen devices, invasive mechanical ventilation, extracorporeal membrane oxygenation, or death; severe, need for support of low-flow oxygen devices; mild, symptomatic patients not requiring oxygen support; asymptomatic, asymptomatic patients without oxygen support [18]. All laboratory tests were performed according to the patients’ clinical care needs. Symptoms and signs were included not only at the time of referral and admission, but also during hospitalization. Blood tests such as biochemistry, peripheral blood analysis, and coagulation were performed within 48 h of the initial visit or admission. The collected data were reviewed by a team of respiratory clinicians. If core data were missing, the clinician who first diagnosed the disease was contacted to collect it. Missing or absent data in the patient background were noted as unknown.

Identification of COVID-19 phenotypes using cluster analysis

We selected 12 clinically relevant patient baseline characteristics reportedly associated with the severity or prognosis of COVID-19 [12, 19,20,21,22,23,24,25], namely, age, sex, obesity, smoking history, hypertension, diabetes mellitus, malignancy, chronic obstructive pulmonary disease, hyperuricemia, cardiovascular disease, chronic liver disease, and chronic kidney disease. We defined obesity as body mass index (BMI) > 25 and treated it as a nominal variable.

Statistical analysis

Data are presented as means ± standard deviation (SD). Data were compared among groups using analysis of variance (ANOVA) and χ2 tests. Hierarchical cluster analysis using the 12 variables mentioned above was performed using the Ward’s minimum-variance method [26, 27]. The results are graphically depicted by a dendrogram. Statistical significance was set at p < 0.05. All data were analyzed using the JMP 16 software (SAS Institute, Cary, NC, USA).

Results

Characteristics of the study population

Table 1 shows the baseline clinical characteristics of the participants. A total of 1322 inpatients (men, 65.1%; mean age, 58 ± 18.1 years) were enrolled in this study. The mean BMI was 24.4 ± 4.7 kg/m2, and 597 (45.2%) had a history of smoking. Based on their clinical presentation, participants were classified into the most severe (n = 63, 4.8%), severe (n = 426, 32.2%), mild (n = 777, 58.8%), and asymptomatic (n = 56, 4.2%) disease groups. The most common comorbidities were hypertension (n = 449, 34%), diabetes mellitus (n = 263, 19.9%), and hyperuricemia (n = 134, 10.1%).

Table 1 Baseline clinical characteristics of the study patients

Comparison of baseline characteristics among clusters

We performed Ward’s cluster analysis based on 12 factors reportedly associated with the severity or prognosis of COVID-19 [12, 19,20,21,22,23,24,25]. Based on visual assessment of the resulting dendrogram (Fig. 2), data could be optimally grouped into four clusters, with each cluster corresponding to a potential phenotype. Table 2 presents the baseline characteristics of each cluster. Cluster 1 (young healthy cluster: n = 266) included the youngest population and tended to have fewer comorbidities than the other clusters. Cluster 3 (middle-aged obese cluster: n = 435) included mostly middle-aged patients, had the highest percentage of men with higher BMI and numerous comorbidities, such as hypertension, diabetes mellitus, and hyperuricemia. Although patients in Cluster 2 (middle-aged cluster: n = 245) were in the same age group as those in Cluster 3, they tended to have a lower BMI and fewer comorbidities compared to those in Cluster 3. Compared to other clusters, Cluster 4 (elderly: n = 376) included the oldest patients who tended to have numerous comorbidities, such as malignancy, cardiovascular diseases, and chronic kidney disease.

Fig. 2
figure 2

Dendrogram illustrating the results of cluster analysis of 1322 COVID-19 patients using Ward’s hierarchical clustering method

Table 2 Baseline characteristics for each cluster

Comparison of clinical characteristics and laboratory findings among clusters

Table 3 shows a comparison of the subjective symptoms and physical findings among the four clusters. Sore throat, dysosmia, and dysgeusia, all reported as good prognostic factors [12, 28, 29], tended to be more frequent in Cluster 1 than in other clusters. In contrast, shortness of breath, reported as a poor prognostic factor [30], tended to be less frequent in Cluster 1 than in other clusters. Cluster 4 exhibited the lowest prevalence of sore throat, dysosmia, and dysgeusia among the four clusters, but more frequent consciousness disturbance, reportedly a poor prognostic factor [31], than other clusters. Table 4 shows a comparison of the laboratory findings among the clusters. Platelet count, reported as a poor prognostic factor [32], tended to be lower in Clusters 3 and 4, while lactate dehydrogenase (LDH), ferritin, Krebs von den Lungen-6 (KL-6), d-dimer, and C-reactive protein (CRP), also considered poor prognostic factors [33,34,35], tended to be lower in Cluster 1 and higher in Clusters 3 and 4. These results imply that Cluster 1 had COVID-19 related symptoms and laboratory findings associated with good prognosis, while Clusters 3 and 4 had poor prognosis.

Table 3 Comparison of subjective symptoms and physical findings among the four clusters
Table 4 Comparison of laboratory findings among the four clusters

Comparison of clinical outcomes between the four clusters

A comparison of the rate of supplemental oxygen needs, ICU admission, mechanical ventilation, and mortality is shown in Fig. 3. Cluster 3 exhibited a higher rate of patient receiving supplementary oxygen and/or mechanical ventilation, admitted to the ICU, and mortality compared to Clusters 1 and 2. Cluster 2 had intermediate rates of the above factors, between Clusters 1 and 3, and Cluster 1 exhibited the most favorable outcomes among all the clusters. Similar to Cluster 3, Cluster 4 also tended to have poor outcomes, coupled with a higher mortality rate. These results suggest that middle-aged obese men tend to have a similarly serious course as the elderly but with a lower risk of death. Consistent with the high rate of severe disease in Clusters 3 and 4, patients in these clusters received intensive drug treatment, including remdesivir and glucocorticoids, of current frequent use and considered to be effective in the treatment of COVID-19 [36] (Table 5).

Fig. 3
figure 3

Comparison of clinical outcomes among the four clusters. a Comparison of the rate of receiving supplementary oxygen. b Comparison of the rate of ICU admission. c Comparison of the rate of requiring mechanical ventilation. d Comparison of the mortality rate. *p < 0.05 and **p < 0.005

Table 5 Comparison of drug treatment among the four clusters

Discussion

This study was the first in Japan to perform a cluster analysis of COVID-19 patients. We identified four clinical sub-phenotypes, namely the “young healthy cluster” (Cluster 1), “middle-aged cluster” (Cluster 2), “middle-aged obese cluster” (Cluster 3), and “elderly cluster” (Cluster 4), which were associated with different outcomes in Japanese patients with COVID-19. Previous reports, including ours, have shown that comorbidities and mortality rates in Japan differed from inpatient studies in other countries [15, 17]. Thus, the identification of the meaningful sub-phenotypes of Japanese COVID-19 patients is important. Notably, our study used simple baseline characteristics as variables for cluster analysis. Several previous studies have shown that cluster analysis is useful for phenotyping and predicting COVID-19 outcomes [6,7,8,9,10]. However, most of these studies used complicated variables, combining a wide range of blood test results for clustering. Promptly indefinable is an important feature for defining COVID-19 sub-phenotypes [37]. We believe that the present simple clustering may be of great help to clinicians in predicting prognosis and performing individualized therapy.

Cluster 3 included mainly middle-aged patients with a high BMI, and a high rate of complications from lifestyle-related diseases, such as hypertension, diabetes, and hyperuricemia. Even though hyperuricemia has been previously reported to be associated with prognosis [38, 39], its rate was higher in Cluster 3 than in Cluster 4, which showed the highest mortality rate. This finding may be due to a possible association between obesity and hyperuricemia [40, 41]. Cluster 2 patients were similarly middle-aged but had lower BMI and lifestyle-related diseases. Cluster 3 revealed poorer outcomes, including need for oxygen, ICU admission, and intubation, than Cluster 2. This result is consistent with the fact that obesity has already been reported as a poor prognostic factor for COVID-19 [20], as have lifestyle-related diseases [12, 21, 22]. However, the mortality rate of Cluster 3 was lower than that of Cluster 4. Despite the high risk of severe disease, there is still lifesaving potential, suggesting that this cluster is likely to benefit from aggressive intensive care.

Cluster 1 consisted mainly of younger patients with fewer comorbidities. They showed the highest frequency of sore throat, dysosmia, and dysgeusia of all the clusters, and the outcomes were generally the most favorable. These results were consistent with previous reports showing that upper respiratory tract symptoms are related to a good prognosis. [12, 28, 29]. In addition, several biomarkers (LDH, ferritin, KL-6, d-dimer, and CRP) [33,34,35] reported as poor prognosis predictors were lower in Cluster 1 than in other clusters. A majority of young people with COVID-19 are reported to be asymptomatic or have few symptoms [42], and this cluster also tended to have fewer symptoms than other clusters, except for upper respiratory tract symptoms. It is possible that this group may have contributed to the spread of the disease.

Cluster 4 included predominantly older patients with comorbidities such as hypertension, diabetes, malignant disease, cardiovascular disease, and chronic kidney disease. They had the poorest outcomes in terms of oxygen demand, ICU admission, ventilator use, and death. These results were consistent with previous reports showing that old age and comorbidities are related with poor prognosis [12, 19, 21,22,23,24]. In addition, several poor prognostic biomarkers (LDH, ferritin, KL-6, d-dimer, and CRP) [33,34,35] were higher than those in Clusters 1 and 2. Lymphocyte count, which has been linked to severe disease and mortality, was also lowest in Cluster 4 [43]. The mechanism of this lymphocytopenia has been previously reported to be hypercytokinemia, leading to inhibition of hematopoiesis by TNF-α [44]. In fact, Cluster 4 patients with low lymphocyte count also showed a trend toward low hemoglobin level and platelet count, consistent with previous reports. Among patients in Cluster 4, 4% were admitted to the ICU and 17.6% of intubated patients died, indicating their potential as a target for future development of COVID-19 therapy.

One of the characteristics of the present study is the inclusion of a single racial group only. Many of the previous studies on cluster analysis of COVID-19 patients included multiple racial groups in their analyses [6, 7], and each cluster had different proportions of racial groups, suggesting that the clinical characteristics also reflect the racial differences. In contrast, since only Japanese patients were analyzed in this study, we focused more on basic clinical information, such as age, weight, and comorbidities, and the characteristics of the clusters can be easily grasped.

Some potential limitations of our study need to be discussed. First, the phenotyping of infectious diseases requires consideration of both the host and pathogen. SARS-CoV-2 is prone to genetic evolution, resulting in multiple variants with different characteristics compared to ancestral strains. Specifically, the transmissibility and virulence of these variants can greatly differ [45]. However, our study had no detailed data on viral load and/or strain. Second, we had no validation cohort data, necessitating additional studies. Third, we could not compare the differences in treatment response among the clusters. Five essential criteria could help define COVID-19 subtypes: (1) biologically plausible, (2) promptly identifiable, (3) nonsynonymous, (4) reproducible, and most importantly, (5) treatment responsive. To establish precision medicine against COVID-19 disease, further studies with more detailed and representative data are warranted.

Conclusions

We developed a simplified tool for clustering COVID-19 patients with diverse characteristics into sub-phenotypes. We identified four clusters that predicted in-hospital outcomes in a large nationwide series of Japanese COVID-19 patients. This simple clustering will be of great help to clinicians in predicting prognosis and performing individualized therapy. Further studies are needed to develop precision medicine for COVID-19.