Background

Sepsis is now defined as a syndrome of poor host response to an infectious illness, followed by a succession of life-threatening organ dysfunctions. Sepsis is one of the leading causes of inpatient death globally, and its prevalence has gradually grown in recent years [1, 2]. According to epidemiological data, one million individuals with sepsis are hospitalized in the United States each year, with a death rate of up to 20%. A recent study based on adult hospitalization data from seven high-income nations found a 19.4 million yearly incidence of sepsis and nearly 5.3 million patient fatalities related to sepsis.

Unfortunately, despite significant efforts to discover novel medications and study the etiology of sepsis, mortality from sepsis has not improved in the 14 years from 2002 to 2016. The bottleneck is that there are no sufficiently reliable models that reliably forecast sepsis prognosis and so give a foundation for prognostic enrichment, leaving clinical trials of novel medications with insufficient power.However, clarifying the real link between the variables in the model and the outcome is a precondition for excellent performance prediction models.Hemoglobin is the exposure variable of interest in this article.

Hemoglobin is an intravascular macromolecular material that is generally maintained in human blood circulation. Hemoglobin is also important in the defense against pathogens and in the function of white blood cells. It promotes non-specific immunity by chelating iron [3] and thereby denying bacteria of nutrition, processing carbon dioxide, maintaining hemodynamic stability, and absorbing and transporting antibiotics [4].

Different levels of hemoglobin concentrations in sepsis patients may be due to presenting conditions [4], nutritional deficiencies, inflammatory anemia [5,6,7], hemolysis [8, 9], loss of bloodletting, loss of gastrointestinal tract, transfusion expansion, fluid load resulting in hemodilution [10], decreased bone marrow response to erythropoietin [11], inhibition of erythropoietin production by pro-inflammatory cytokines [7, 12], response to drugs [13], inappropriate blood transfusion [14] and so on.

Little is known regarding the predictive usefulness of hemoglobin in ICU(surgical intensive care unit) patients with sepsis. Clinical research is now focusing on the link between anemia markers such as erythrocyte distribution width [15] and platelets [5, 13, 16, 17] and sepsis prognosis. Sepsis patients frequently have anemia, and a few studies have found that hemoglobin levels equal to or below 8 g/dL in the first 48 h after ICU admission are associated with a higher risk of death from sepsis [18, 19]. However, there is still debate about the importance of higher hemoglobin levels in contributing to a better prognosis in these patients. It is unknown which hemoglobin levels give the best prognosis for sepsis patients [9, 20, 21].

As a result, this study aims to investigate the relationship between hemoglobin and the 28-day risk of mortality from sepsis using the MIMIC-IV database for sepsis in a large US sample. The enormous sample size and sensitivity analysis of numerous databases in diverse populations will allow us to deliver more consistent and accurate results, allowing us to get a better understanding of the relationship between hemoglobin and the 28-day risk of mortality in sepsis.

Methods

Description of data sources

The Medical Information Mart for Intensive Care IV (MIMIC-IV) database was utilized to gather the data for this investigation. The MIMIC-IV database was built on the successful implementation of MIMIC-III [15]. It addresses the shortcomings of the MIMIC-III database, which was intended as a retrospective cohort study. It gathered clinical data on patients who visited Beth Israel Deaconess Medical Center(BIDMC) between 2008 and 2019. The database is free to download after completing an authorized course on their official website. The author, Lu Chen, has completed the accredited course, has access to the database, and is in charge of data extraction(Record ID: 28,572,693). Our work adheres to the RECORD (Research Reports Using Routine Collection of Observational Health Data) declaration.

Cohort information

The MIMIC-IV database, containing a total of 377,207 adult patient records, was used in this study. From this population, we extracted 35,010 individuals diagnosed with sepsis using the ICD code information in the database. Specifically, patients were screened based on the following sepsis-related ICD codes: ICD-9 codes (99,591–99,592) and ICD-10 codes (R652, R6520 and R6521).The extraction phase lasted from 2008 to 2019 saw these patients in BIDMC. We looked at the relationship between 28-day death as the outcome variable (binary variable, Y = 1, death; Y = 0, survival) and basal hemoglobin as the exposure variable (recorded as a continuous variable). The hemoglobin measurement taken on the first day of admission to the ICU is defined as basal hemoglobin.Patients with missing exposure variable information were not included in this research.

The following covariates were chosen: demographic factors: gender (male or female), age (years), ethnicity, Charlson comorbidity index, SOFA (Sequential Organ Failure Assessment) score, APACHE(Acute Physiology And Chronic Health Evaluation) score, use of mechanical ventilation, use of glucocorticoids (dexamethasone, methylprednisolone, hydrocortisone), use of vasoactive drugs (dopamine,dobutamine), use of immunoglobulins, antibiotics (capreonan, cephalosporins, penicillin antibiotics, and vancomycin), heart rate and respiratory rate within the first hour of admission, lactate, heart failure, blood transfusions, gastrointestinal bleeding (GI), chronic ostructive pulmonary disease (COPD), ischemic heart disease, obstructive sleep apnea (OSA) and body temperature within 24 h of admission. The rationale for selecting these factors was mostly based on our clinical experience and the literature [15, 22].

Statement of ethics and informed consent

For this study, the MIMIC-IV database was downloaded from the Internet. The Institutional Review Boards of Beth Israel Deaconess Medical Center in Boston, Massachusetts (2001-P-001699/14) and the Massachusetts Institute of Technology approved this study (0403000206).This study was approved by the Institutional Review Board (IRB) of the Affiliated Hospital of Guizhou Medical University and deemed exempt from the requirement for ethical approval.Through the Protection of Human Research Participants Examination, the authors have the right to use and download the database. Given that the data is public and patient identifiable information is unknown, informed patient permission was revoked.

Missing data description

The variables used in this study were missing at a rate of less than 5% (0-4.1%), therefore, multiple interpolation was not used for missing data.

Statistical analysis

Continuous variables are represented by the mean, standard deviation (normal distribution), or median (minimum, maximum) (skewed distribution). Categorical variables are expressed as percentages or frequencies. Given that this was a cohort study, we divided the exposure variables into 4 groups and examined the distribution of patient baseline information across the various subgroups. To evaluate any statistical differences between the means and proportions of the groups, one-way ANOVA (normal distribution), Kruskal-Wallis H (skewed distribution) tests, and chi-square tests (categorical variables) were performed. We also looked at the relationship between hemoglobin and 28-day mortality in septic patients using univariate and multivariate binary logistic regression models.We present unadjusted models (no covariates adjusted, Model 1), minimally-adjusted models (adjusted for demographic factors only, Model 2), and fully-adjusted models (adjusted for all covariates presented in Table 1, Model 3) with 95% confidence intervals, as well as the ratio (OR) with 95% confidence intervals.To improve the robustness of our results, we initially categorized hemoglobin levels and performed a trend test to determine its consistency as a continuous or categorical variable. Next, we performed a stratified analysis using variables related to hemoglobin levels as stratification variables to evaluate the consistency of hemoglobin with respect to risk of death within 28 days across different populations, and to test for interaction using a log-likelihood ratio test. Because hemoglobin is a continuous variable, a non-linear relationship cannot be ruled out. Given the limitations of binary logistic regression models to handle non-linear correlations, we investigated the relationship between hemoglobin and 28-day mortality in sepsis patients using a generalized additive model (GAM) and a smoothed curve fit. If there was a non-linear relationship, we used a recursive algorithm to calculate the inflection point value, and then a two-piecewise linear model to calculate OR values and 95% confidence intervals on either side of the inflection point.

All analyses were carried out using the statistical software packages R (http://www.R-project.org, The R Foundation)and EmpowerStats (http://www.empowerstats.com, X&Y Solutions, Inc, Boston, MA), with P values less than 0.05 (two-sided) deemed statistically significant.

Results

Description of the patient screening process

This research initially enrolled 377,207 patients. 342,197 individuals were eliminated because they did not match the diagnostic criteria for sepsis, and 94 patients had missing HGB information, leaving a final balance of 34,916 patients for the final data analysis. The flow chart provides further information (Fig. 1).

Fig. 1
figure 1

Patient selection process

Patient baseline characteristics

Table 1 presents baseline characteristics of the patients, stratified into four groups (Q1-Q4) based on quadratic grouping of hemoglobin levels. We observed trends in variable distribution among the subgroups after grouping. The mean age of the patients was (66.67 ± 16.01) years, and 28-day mortality (Y = 1 for death) was 16.23% (5,668/34,926). In comparison to the Q4 group, the Q1, Q2, and Q3 groups had patients who were older, had a higher proportion of Caucasians, had higher Charlson co-morbidity index, SOFA score, and APACHE III score, and were more likely to have received dopamine, immunoglobulin, methylprednisolone, hydrocortisone, carbapenems, cephalosporins, cilins, blood transfusions, and vancomycin. These groups also had a higher rate of OSA, heart failure, GI bleeding, COPD, and ischemic heart disease. Additionally, we noticed that the Q1, Q2, and Q3 groups had lower body temperatures compared to the Q4 group. Similarly, we observed a lower proportion of patients in the Q2 and Q3 groups in terms of heart rate, respiratory rate, use of dexamethasone and dobutamine, and mechanical ventilation, as compared to the Q4 group.

Table 1 Description of baseline characteristics of patients with sepsis in the MIMIC-IV database

Results of univariate and multifactorial analyses of HGB and 28-day mortality

We investigated the relationship between HGB and 28-day mortality in patients with sepsis using various covariate correction procedures, and the results are shown in Table 2. In the unadjusted model, we found that for every 1-unit increase in HGB, the risk of 28-day mortality was reduced by 9% (odds ratio: 0.91, 95% CI: 0.90 to 0.93). After controlling for demographic characteristics, the risk of death within 28 days decreased by 9% for each unit increase in HGB (odds ratio: 0.91, 95% CI: 0.90 to 0.92). However, when adjusting for all covariates listed in Table 1, we observed that each unit increase in HGB was associated with only a 1% decrease in the likelihood of death after 28 days, and this association was not statistically significant (odds ratio: 0.99, 95% CI: 0.97 to 1.01).

Table 2 The findings of univariate and multivariable analyses

In order to perform sensitivity analysis, we transformed HGB into categorical variables based on quartiles and calculated p-values for trend tests. However, we found inconsistent results when comparing HGB as a continuous or categorical variable (Table 2). The non-linear association between HGB and 28-day mortality is suggested by the unequal variance in odds ratio values across the different hemoglobin groups. Therefore, our findings suggest a possible non-linear relationship between HGB and 28-day mortality in sepsis patients.

Results of the non-linear association between HGB and 28-day mortality

To investigate the nonlinear relationship between HGB and 28-day mortality, we used smoothed curve fitting and generalized summation models. After controlling for the covariates presented in Table 1, our results (Fig. 2) showed a U-shaped relationship between HGB and 28-day mortality. We determined the inflection points using a two-stage linear model and recursive techniques, which were found to be at 104 g/L and 128 g/L. Our analyses indicated (Table 3) that when HGB levels were between 41 and 104 g/L, there was a 10% decrease in the risk of 28-day mortality (OR: 0.90; 95% CI: 0.87 to 0.94, p-value = 0.0001). However, in the range of 104–128 g/L, we did not observe a significant association between hemoglobin and 28-day mortality (OR: 1.17; 95% CI: 1.00 to 1.35, P value = 0.0586). When HGB was in the range of 128–207 g/L, there was a 7% increase in the risk of 28-day mortality for every 1 unit increase in HGB (OR: 1.07; 95% CI: 1.01 to 1.15, P value = 0.0424). These findings suggest that the relationship between HGB and 28-day mortality risk may not follow a linear trend, but instead exhibit a U-shaped pattern, with different ranges of HGB associated with varying levels of mortality risk.

Fig. 2
figure 2

The association between HGB and 28-day mortality

Table 3 The addressing of nonlinear association between hemoglobin and 28-day mortality

Stratified analyses

We conducted a stratified analysis to investigate the trend of association between hemoglobin and 28-day risk of death in different subgroups of the population, using variables such as blood transfusion, GI bleeding, COPD, OSA, heart failure, and ischemic heart disease as stratification variables (Table 4). Our results showed that we did not observe any significant association between hemoglobin and 28-day risk of death, regardless of whether patients received transfusions or had OSA, heart failure, or ischemic heart disease (P for interaction was greater than 0.05). However, we found that in patients with combined GI bleeding (OR: 0.83, 95% CI: 0.72 to 0.94) and COPD (OR: 0.90, 95% CI: 0.84 to 0.96), a higher hemoglobin level was associated with a lower risk of death compared to patients without combined GI bleeding and COPD. There was a significant difference observed in P for interaction. These findings highlight the importance of considering the presence of comorbidities such as GI bleeding and COPD when evaluating the relationship between hemoglobin and mortality risk in sepsis patients.

Table 4 Use of blood transfusion or not as a stratification variable

The adjustment strategy used in this analysis was similar to the Adjust II model, with the exception that the stratification variables were not adjusted for.

Discussion

In this multicenter study, we examined the relationship between hemoglobin and 28-day mortality in patients with sepsis. Our analysis included a large sample of critically ill cohorts consisting of 34,916 individuals from the MIMIC-IV database. We found a U-shaped association between hemoglobin and the probability of 28-day mortality in patients with sepsis.

Our results indicated that there was a negative association between hemoglobin and the 28-day risk of death in patients with sepsis in the range of 41–104 g/L, and a positive association in the range of 128–207 g/L. Moreover, we observed that in the interval of 104–128 g/L, the risk of death of patients was the lowest. It is worth noting that despite significant efforts to discover new treatments and understand the causes of sepsis, mortality from sepsis has increased over the past 14 years. Our study highlights the importance of considering the hematologic system when evaluating patients with severe sepsis, as organ dysfunction is known to be associated with increased mortality in severe sepsis [23]. While previous studies have focused on the relationship between anemia indicators such as erythrocyte distribution width [24], hematocrit [15], and platelets [5, 13, 16, 17] and the prognosis of sepsis, we explored the potential nonlinear association between hemoglobin and 28-day mortality in septic patients.

Our study results are consistent with those of previous studies, such as the cohort analysis of 235 sepsis patients conducted by De Shengqi and Mei-Lin Peng [22], which showed that patients with early hemoglobin levels less than or equal to 80 g/L had significantly lower survival rates than those with levels greater than 80 g/L. Tan SMY et al. [10] confirmed similar findings in a cohort analysis of 8,132 sepsis patients, reporting that individuals with moderate anemia (hemoglobin 70 to 100 g/L) had worse prognosis than those without moderate anemia (> 100 g/L). Cai N et al. [25] found that a significant decrease in hemoglobin concentration was an independent risk factor for necrotizing small bowel colitis in preterm infants with sepsis. However, all of these studies reported only linear correlations, rather than non-linear correlations. In addition to establishing a negative association between hemoglobin and death, our study discovered for the first time that high hemoglobin levels are also associated with a significant risk of death. We hypothesize several reasons why this trend (higher hemoglobin associated with higher risk of death) was not found in previous comparable studies: (1) the population in our study consisted of adult critically ill patients in the United States, whereas previous studies with similar results studied populations in China or other regions; (2) our study had a relatively large sample size, making statistical analysis easier; (3) we used various nonlinear relationship algorithms, which have not been used in previous studies.

It is worth mentioning that Luo et al. [15] studied the relationship between hematocrit and 30-day mortality in sepsis patients using the same MIMIC-IV dataset. However, their results were different from ours: they found that mortality decreased with increasing hematocrit, rather than showing the U-shaped curve that we observed. The reason for this difference may be related to several factors: first, we had different extraction strategies for patient data. The sample size in our study was more than 10 times larger than that of Luo et al., and we extracted data using ICD-9 and ICD-10 codes (99,591–99,592, R652, R6520, and R6521), whereas they did not provide specific ICD codes used for patient selection. Additionally, our statistical models were significantly different from theirs: we used the GAM model for nonlinear associations and the two-piecewise linear model, whereas they did not use these models. Furthermore, it is important to note that Luo et al. used Cox regression, whereas we used logistic regression. We chose logistic regression because the follow-up time was short and consistent for all participants, minimizing the effect of time on the results.

The present study has several strengths. Firstly, it has a larger sample size than previous studies. Secondly, the study adjusts for more variables and has more stable results. Thirdly, the study uses algorithms that elucidate nonlinearity to better reflect the true relationship between hemoglobin and sepsis and 28-day death. Fourthly, to further explore the association between hemoglobin and the risk of 28-day death in different populations, we used diseases closely related to hemoglobin as stratification variables and found a trend where an increase in hemoglobin was associated with a lower risk of death in patients with GI bleeding and COPD. However, it should be noted that this is only an exploratory analysis and not corrected for type 1 errors caused by multiple comparisons, so the results need to be interpreted with caution.

However, this study has certain limitations: (1) Because this is an observational study, it is inevitably subject to confounding. However, we rigorously adjusted for confounding and used sensitivity analysis to assess the consistency of the results. (2) Due to the nature of observational studies, we can only observe the relationship between the two but not determine causality;(3) Because the database used in this study is a large single-center database with limited diversity, our results are subject to uniformity and may have unavoidable bias; (4) Because we can only adjust for measurable confounders but not for difficult-to-measure confounders, future clinical studies with higher levels of evidence in larger populations are required to validate our findings; (5) Since the immune status is closely related to the prognosis of sepsis, the use of immunosuppressive or chemotherapeutic agents could be a significant confounding factor. However, the MIMIC4 database did not include any information on such variables.

Conclusions

In patients with sepsis, baseline hemoglobin was related to a U-shaped risk of 28-day death. When HGB levels were between 4.1 and 10.4 g/dL,there was a 10% decrease in the risk of 28-day mortality.However, in the range of 10.4–12.8 g/dL, we did not observe a significant association between hemoglobin and 28-day mortality.When HGB was in the range of 12.8–20.7 g/dL, there was a 7% increase in the risk of 28-day mortality for every 1 unit increase in HGB.