Introduction

The coronavirus disease 2019 (COVID-19) is highly infectious (R0 = 3) and caused by SARS-CoV-2, a single-stranded RNA virus whose name stands for "severe acute respiratory syndrome coronavirus 2." The disease leads to complications such as pneumonia, acute respiratory distress syndrome (ARDS), cardiac damage, acute strokes, and even systemic hyper-inflammation syndrome, which, in turn, leads to multiorgan failure [1]. As of 20 August 2020, nearly 23 million people had been infected by COVID-19, and nearly 800,000 deaths had been recorded worldwide [2]. Most of the mortalities occurred within eight countries, namely the USA, Brazil, the UK, Mexico, Italy, France, India, and Spain [2].

COVID-19 affects the lungs and causes respiratory difficulties. Common symptoms include breathlessness, dry cough, fatigue, and fever [3]. Relatively uncommon symptoms include a loss of taste or smell, sore throat, and vomiting [4]. The danger posed by COVID-19, as well as its spread, is worsened by the fact that many infected people are asymptomatic [3]. COVID-19 attacks the pulmonary tissues of the lungs, resulting in ARDS [5], and a considerable percentage of patients end up needing ventilator support [6]. Many of the initial victims of COVID-19 in China were hospitalised because they exhibited lower respiratory tract (LRT) symptoms [3,7], though these symptoms varied considerably among patients. Some patients exhibited minimal symptoms, while others suffered from hypoxia due to ARDS. For some patients, LRT symptoms progressed to ARDS within nine days [7]. It has also been found that patients suffering from COVID-19-induced ARDS are prone to organ failure [8,9].

Radiologists primarily use radiography, computerised tomography (CT), or ultrasound to diagnose lung disease [10,11,12]. These methods allow symptomatic patients to be tested for COVID-19 quickly when tests like reverse transcription polymerase chain reaction (RT-PCR) are not available [13]. Researchers have demonstrated that CT is a more sensitive COVID-19 detection method than traditional techniques for symptomatic patients [14]. One recent study showed that chest radiography could not be used to detect the opaque image features of COVID-19 [15]. Lung ultrasound can be used as an alternative to CT to detect COVID-19, although CT is still considered the gold standard for detecting pulmonary infections [16].

Apart from conventional techniques, many researchers have employed artificial intelligence (AI)-based machine learning (ML), deep learning (DL), and transfer learning (TL) techniques to diagnose COVID-19. One group of researchers proposed a novel weakly supervised DL technique to classify COVID-19 infection from lung CT images; the method was also used to localise the inflammation caused by COVID-19 [17]. In other work, Xiao et al. developed a multiple-instance learning module based on ResNet34 to predict the severity of COVID-19 cases using lung CT scans [18].

Meanwhile, other researchers used the UNet++ architecture for segmenting COVID-19-infected lung areas using CT images [19]. They transformed their study into an online platform to provide fast COVID-19 diagnostic tools that are accessible worldwide [20]. Another group of researchers created a DL and "deep reinforcement learning" model that can automatically quantify COVID-19-related lung abnormalities such as ground-glass opacities and consolidations [21,22]. Their proposed architecture produces two metrics that accurately quantify the spread of COVID-19.

Several other studies have proposed new methods for diagnosing COVID-19 using TL on lung CT scans. TL is used when COVID-19 data are scarce or when existing deep learning models can be improved by judiciously reusing them [22,23,24]. However, TL works efficiently only if the model is trained on data similar to the target problem [25] (i.e., COVID-19 lung CT data). Otherwise, performance gains are minimal or insignificant.

In this study, we compared six state-of-the-art AI models (two traditional ML models, two TL models, and two DL models) using K-fold cross-validation to solve the COVID-19 detection problem on lung CT data. To the best of our knowledge, no study has benchmarked the comparative efficacy of traditional machine learning, deep learning, and transfer learning architectures on COVID-19 lung CT data; doing so is one objective of the present study. Another important objective is to estimate COVID-19 severity from the output class probability values of the AI models and then clinically validate these estimates against radiologists' greyscale feature scores. As part of the clinical validation, we demonstrate the correlation between the AI-derived severity and ground-glass opacity (GGO) values, thus validating our hypothesis on COVID severity estimation. We also performed 2D and 3D bispectrum analyses to classify COVID pneumonia (CoP) patients using CT images. Our results show that even though TL can reduce the training time of a model, DL and ML models match or surpass TL on the performance benchmarks of COVID-19 classification.

The aggressiveness of COVID-19 can be assessed using imaging-based tests. Just as troponin release indicates that a heart attack is likely, a hyper-intensity distribution in lung CT (which cannot be inferred from a swab sample) can indicate COVID-19 severity, allowing more aggressive care to be given to the patient. The main clinical advantage of CT-based imaging is therefore the determination of how aggressive the patient's care needs to be.

A second benefit of this study is the development of an AI-based tool that avoids the bias of the expert radiologist or pulmonologist. Owing to fatigue from long hospital shifts, readings can vary from radiologist to radiologist and even within a single radiologist, the so-called inter- and intra-observer variability. AI-based solutions can overcome this major weakness. Third, if troponin is released while the COVID-19 pneumonia CT shows GGO, a heart attack is also likely. Lastly, if the CT shows pathology, the patient has pneumonia, and it is therefore important to quantify the risk using CT.

The rest of the paper is organised as follows. Section 2 discusses the pathophysiology of COVID-19 cases that develop into ARDS. Section 3 overviews the methodology. Section 4 discusses the experimental results using the K10 protocol and bispectrum analysis. The AI models' performance is evaluated in Sect. 5 based on the ROC curve and multiple classification metrics. We discuss our findings in Sect. 6. Sections 7 and 8 provide the conclusions and references, respectively.

Methodology

Patient demographics

CT images were collected from 130 patients. There were 100 CoP patients (68 males and 32 females) aged 17–93 years (mean age = 61.49 ± 16 years). The remaining 30 cases (nine males and 21 females), aged 17–93 years (mean age = 51.4 ± 2 years), were non-COVID pneumonia (NCoP) patients.

Data acquisition and baseline characteristics

The methodology of this study consists of the design and development of a computer-aided diagnosis (CADx) system with three components, divided by functionality. The first component is region-of-interest extraction, which envelops the CT lung region. The second component is the automatic classification of CoP vs NCoP patients. The final stage of the CADx system is a performance evaluation that implements (1) a standardised analysis (e.g., ROC), (2) DOR validation (see Fig. S8, Online Resources 1), and (3) CoP validation using a bispectrum analysis paradigm. Before we dive into these three subsystems, we present the patient demographics and data acquisition systems.

Data acquisition

CT images were collected using a Philips Ingenuity Core CT Scanner while patients were in a deep inspiration breath-hold (DIBH) supine position. The patients were not given any oral contrast or intravenous agents. The CT scan was done at 120 kV and 225 mAs. The spiral pitch factor, gantry rotation time, and detector configuration were fixed at 1.08, 0.5 s, and 65 × 0.625, respectively. A 768 × 768 lung window and a 512 × 512 mediastinal window were used to reconstruct 1-mm-thick images with a soft-tissue kernel. The CT images were reviewed using twin 35 × 43 EIZO PACS displays with a 2048 × 1536 matrix. The final data comprised 2788 CT images for CoP patients and 990 CT images for NCoP patients. For the 100 COVID-19 patients, we took 27–28 scans per patient, giving between 100 × 27 = 2700 and 100 × 28 = 2800, i.e., 2788 CT scans in total. Similarly, for the NCoP patients, we took around 33 scans for each of the 30 patients, resulting in 30 × 33 = 990 CT scans.

Baseline characteristics

The baseline characteristics of the Italian cohort's COVID-19 data are presented in Table 1. We used the R package to perform a t-test on the data, with the level of significance set to P ≤ 0.05; a minimal sketch of this test is given after Table 1. The table shows the essential characteristic traits of CoP patients. The baseline characteristics reflect the visual characteristics of the CT lung data (rows #3 to #6). Ground-glass opacity (GGO) is significant in differentiating between the CoP and NCoP classes (P = 0.00001). Lung consolidations (CONS) also differentiate the two classes (P = 0.00453). The pleural effusion (PLE) attribute is likewise significant in the classification of CoP and NCoP patients (P = 0.00413). The most common physiological symptom of CoP is fever, which also correlates with body temperature (P = 0.00313).

Table 1 Baseline characteristics of CoP and NCoP patients
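
For illustration, the significance test can be sketched in Python (rather than R) as follows; the CSV file and column names are hypothetical stand-ins for the cohort data, not the study's actual files.

```python
# Hedged sketch: Welch's two-sample t-test on one baseline
# characteristic (GGO), mirroring the R-based analysis above.
# The CSV file and column names are illustrative assumptions.
import pandas as pd
from scipy import stats

df = pd.read_csv("baseline_characteristics.csv")   # hypothetical file
cop_ggo = df.loc[df["label"] == "CoP", "GGO"]
ncop_ggo = df.loc[df["label"] == "NCoP", "GGO"]

t_stat, p_value = stats.ttest_ind(cop_ggo, ncop_ggo, equal_var=False)
print(f"GGO: t = {t_stat:.3f}, P = {p_value:.5f}")  # significant if P <= 0.05
```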

Three kinds of AI architectures for classification

We shortlisted two representative candidates from the ML algorithms, namely k-nearest neighbours (k-NN) and random forest (RF). The developed framework is a modified version of our previous work [26].

For TL, we utilised the VGG19 and InceptionV3 pre-trained models [27] (see Figs. S5 and S6, Online Resources 1) and changed only the model top. VGG19 is a 19-layer deep model consisting of sixteen convolution layers to extract visual features, five max-pool filters to reduce the spatial size of the extracted features, and three fully connected layers for classifying the image. InceptionV3 is a 42-layer deep model consisting of 11 inception modules (each comprising multiple convolution layers and max-pooling filters), followed by three fully connected layers and a softmax activation layer.

The initial layers of the TL models were made non-trainable, and only the last layers were made trainable. The reason for not training the entire network in the case of transfer learning is that it saves computation time, because the network is already able to extract generic features from images and does not have to learn this from scratch. A neural network works by abstracting and transforming information in steps: in the initial layers, the extracted features are generic and independent of any particular task, while the later layers are tuned to a specific task. So, by freezing the initial stages, we obtain a network that can already extract meaningful general features, and we unfreeze only the last few stages (or just the new untrained layers), which are then tuned for our paradigm. It is not recommended to unfreeze all layers when the model contains new untrained layers: these layers would train as if initialised with random (rather than pre-trained) weights, defeating the basic idea of transfer learning.
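
As a minimal sketch of this freezing strategy, the following Keras snippet replaces the VGG19 top with a new head; the input size and head dimensions are illustrative assumptions, not the exact configuration used in our experiments.

```python
# Hedged sketch: freeze the pre-trained VGG19 convolutional base and
# train only a new, randomly initialised classification head.
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG19

base = VGG19(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False  # freeze all pre-trained convolution layers

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(128, activation="relu"),   # new trainable head
    layers.Dense(2, activation="softmax"),  # CoP vs NCoP probabilities
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```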

For DL, we developed our custom architectures (CNN and iCNN), consisting of a multi-layer convolution network (see Fig. S7 and Table S5, Online Resources 1). It contains three convolution layers, each followed by a max-pooling filter, and two fully connected layers. A two-class probability score is obtained by passing the output to a softmax activation function. In iCNN, we slightly changed the ReLU activation function in the hidden layers to σ = (max(0, x))^1.00001. Here, x is the input value, σ is the activated output value, max is a function that returns the maximum of zero and the input value, and the exponent 1.00001 slightly scales the output.

Several lightweight convolutional neural network models with 3, 4, and 5 convolution layers were tested for COVID-19 identification, and these models provided very good results, with the 3-convolution-layer model giving the best accuracy. In the proposed three-convolution-layer model, hidden layers 1, 2, and 3 contain 32, 16, and 8 hidden units, respectively, and each convolution layer is followed by a max-pooling layer. After the last max-pooling layer, a flatten layer converts the 2D feature maps into a 1D vector, which is densely connected to a layer with 128 hidden units, followed by the output layer. To provide nonlinearity in the model, the modified ReLU activation function described above is used in the hidden layers; a sketch of this architecture follows.
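
The following Keras sketch illustrates the three-convolution-layer design with the modified ReLU; kernel sizes and the input shape are our illustrative assumptions.

```python
# Hedged sketch of the proposed three-convolution-layer architecture,
# using the iCNN activation sigma = (max(0, x))^1.00001.
import tensorflow as tf
from tensorflow.keras import layers, models

def modified_relu(x):
    # ReLU output raised to a slightly super-linear exponent
    return tf.pow(tf.nn.relu(x), 1.00001)

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation=modified_relu,
                  input_shape=(128, 128, 1)),            # hidden layer 1: 32 units
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(16, (3, 3), activation=modified_relu), # hidden layer 2: 16 units
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(8, (3, 3), activation=modified_relu),  # hidden layer 3: 8 units
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),                                    # 2D feature maps -> 1D vector
    layers.Dense(128, activation=modified_relu),         # dense layer, 128 units
    layers.Dense(2, activation="softmax"),               # two-class probability score
])
```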

Results

Accuracy of the two ML, two TL, and two DL models

We compared the K10 classification accuracy of all six AI models on the COVID-19 data, as shown in Table S2 (Online Resources 1). Our observations demonstrate that the accuracies follow the order DL > TL > ML. The DL-based iCNN and CNN architectures had accuracies of 99.69 ± 0.66% and 99.53 ± 1.05%, respectively, making them the two most accurate of the six tested models. Of the TL architectures, only VGG19 fared well against the DL architectures, with a classification accuracy of 99.53 ± 0.75%; the other TL architecture (InceptionV3) achieved a classification accuracy of only 94.84 ± 2.85%. The two ML architectures varied considerably in performance: RF scored 96.84 ± 1.28%, while k-NN scored only 74.58 ± 2.24%. The mean accuracy figures of all six AI models are summarised in Fig. 1.

Fig. 1

Mean K10 classification accuracies (in %) of two ML, two TL, and two DL architectures. The bar chart is presented in increasing order of accuracy
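
As a minimal sketch of the K10 protocol under stated assumptions (synthetic stand-in features rather than our CT features), the tenfold cross-validated accuracy for the RF baseline can be computed as follows.

```python
# Hedged sketch: K10 (stratified tenfold) cross-validation accuracy,
# shown for the RF baseline on stand-in data.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import StratifiedKFold

X, y = make_classification(n_samples=500, n_features=64, random_state=0)  # stand-in data

skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
accuracies = []
for train_idx, test_idx in skf.split(X, y):
    clf = RandomForestClassifier(n_estimators=100, random_state=0)
    clf.fit(X[train_idx], y[train_idx])
    accuracies.append(accuracy_score(y[test_idx], clf.predict(X[test_idx])))

print(f"K10 accuracy: {np.mean(accuracies)*100:.2f} ± {np.std(accuracies)*100:.2f}%")
```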

CT lung characterisation using bispectrum analysis

We characterised CoP and NCoP CT lung tissues using bispectrum analysis based on the higher-order spectrum (HOS). Bispectrum analysis is based on the principle of coupling between components of spectral signals. If there is a sudden change in greyscale image density (as is the case for COVID-19-infected tissues), then higher bispectrum (B) values are generated. This property of bispectrum analysis can be exploited to identify COVID-19-infected tissue quickly. This analysis is intended to identify NCoP and CoP patients without using AI-based techniques.

Generally, COVID-19-infected lungs are characterised by a hyper-intensity region. We separated those pixels from the lung CT images and passed them into a Radon transform, whose projections act as the 1-D signals from which the HOS generates B values. The images of CoP patients have much higher B values. The 2D and 3D bispectrum plots for CoP and NCoP patients are shown in Figs. 2 and 3; a minimal sketch of the computation follows Fig. 3.

Fig. 2

Comparison of bispectrum (2D) plots of CoP and NCoP patients

Fig. 3

Comparison of bispectrum (3D) plots of CoP and NCoP patients
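
A minimal sketch of this pipeline, under stated assumptions (a synthetic slice, an assumed intensity threshold, and a single projection angle), is given below; it uses the direct FFT estimate B(f1, f2) = X(f1) X(f2) X*(f1 + f2).

```python
# Hedged sketch: hyper-intensity masking, Radon projection, and a
# direct-FFT bispectrum estimate B(f1, f2) = X(f1) X(f2) conj(X(f1+f2)).
import numpy as np
from skimage.transform import radon

def bispectrum(signal, nfft=64):
    X = np.fft.fft(signal, n=nfft)
    half = nfft // 2
    B = np.zeros((half, half), dtype=complex)
    for i in range(half):
        for j in range(half):
            B[i, j] = X[i] * X[j] * np.conj(X[(i + j) % nfft])
    return np.abs(B)  # magnitude surface of the kind plotted in Figs. 2 and 3

ct_slice = np.random.rand(256, 256)          # stand-in lung CT slice
mask = ct_slice > 0.7                        # assumed hyper-intensity threshold
projection = radon(ct_slice * mask, theta=[0.0], circle=False)[:, 0]
B = bispectrum(projection)                   # CoP slices yield higher B values
```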

Performance evaluation of AI models and their clinical validation

Receiver operating characteristics

The ability of all six AI models to differentiate the CoP and NCoP data sets is illustrated in Fig. 4. We used the K10 protocol to compute receiver operating characteristic (ROC) curves. As expected, the simplest ML model (k-NN) performed the worst, achieving an area under the curve (AUC) of just 0.744 (P < 0.0001). The best-performing model was the novel iCNN DL model, whose AUC was 0.993 (P < 0.0001). In order of increasing AUC, the remaining models were the TL-based InceptionV3, the ML-based RF, the TL-based VGG19, and our custom DL CNN.

Fig. 4

ROC plots for the six AI models (two ML, two TL, and two DL), along with their corresponding AUC values
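
A minimal sketch of the AUC computation from pooled out-of-fold probability scores follows; the labels and scores are stand-in values, not our experimental outputs.

```python
# Hedged sketch: ROC curve and AUC from out-of-fold probability scores.
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])                     # stand-in labels
y_score = np.array([0.1, 0.4, 0.85, 0.9, 0.7, 0.3, 0.95, 0.2])  # stand-in scores

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # points of the ROC plot
print(f"AUC = {roc_auc_score(y_true, y_score):.3f}")
```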

A comparison of six AI models based on multiple classification metrics

We compared the six AI models on a COVID-19 test set containing 377 samples (99 NCoP and 278 CoP samples). We chose ten classification metrics for this comparison: sensitivity, specificity, precision, negative predictive value (NPV), false positive rate (FPR), false discovery rate (FDR), false negative rate (FNR), F1 score, Matthews correlation coefficient (MCC), and Cohen's kappa coefficient. Cohen's kappa and the F1 score are performance metrics calculated from the true positive, false positive, true negative, and false negative counts. The F1 score [37] can be calculated using the formula:

$$ F_{1} = \frac{\text{TP}}{\text{TP} + \frac{1}{2}\left( \text{FP} + \text{FN} \right)} $$
(1)

We adopted the Matthews correlation coefficient [28] for quantifying the quality of the binary classification, since it is widely used in machine learning. The biochemist Brian W. Matthews introduced this measure in 1975. Given the truth table values TP (true positive), FP (false positive), TN (true negative), and FN (false negative), MCC is expressed mathematically in Eq. 2.

$$ \text{MCC} = \frac{\text{TP} \times \text{TN} - \text{FP} \times \text{FN}}{\sqrt{\left( \text{TP} + \text{FP} \right)\left( \text{TP} + \text{FN} \right)\left( \text{TN} + \text{FP} \right)\left( \text{TN} + \text{FN} \right)}} $$
(2)

Note that MCC represents the correlation between the predicted and observed binary classifications. It returns a value between −1 and +1: +1 represents perfect prediction, and −1 represents total disagreement between prediction and observation.
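
For illustration, the metrics of Eqs. 1 and 2 can be computed directly from a confusion matrix, as in the following sketch with stand-in predictions.

```python
# Hedged sketch: F1 (Eq. 1), MCC (Eq. 2), and Cohen's kappa from
# stand-in ground-truth labels and model predictions (1 = CoP).
from sklearn.metrics import (cohen_kappa_score, confusion_matrix,
                             f1_score, matthews_corrcoef)

y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1, 1, 1]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(f"TP={tp}, FP={fp}, TN={tn}, FN={fn}")
print(f"F1    = {f1_score(y_true, y_pred):.3f}")
print(f"MCC   = {matthews_corrcoef(y_true, y_pred):.3f}")
print(f"Kappa = {cohen_kappa_score(y_true, y_pred):.3f}")
```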

The results of the study are summarised in Table 2. Both the DL models (CNN and iCNN) and one of the TL models (VGG19) performed equally well. Both ML models (RF and k-NN) and the second TL model (InceptionV3) did not perform well in comparison with the DL models.

Table 2 Comparison of the six AI models on the basis of multiple classification metrics

COVID risk stratification

Figure 5 presents the COVID-19 risk levels of patients as predicted by our custom CNN DL model. We created the frequency distribution (Fig. 5a) by using a softmax function in the output layer of the model so that the model produces a probability score (ranging from 0 to 1) indicating a patient's COVID-19 risk. We divided the overall probability range into ten bins and assigned each CT image sample to one of the bins based on the output of the model. We considered three levels of risk: low risk (probability score of 0 to 0.3), moderate risk (0.3 to 0.7), and high risk (0.7 to 1). A cumulative distribution plot of all 3778 lung CT samples is given in Fig. 5b; this distribution was computed by cumulatively summing the sample counts across the COVID-19 risk probability bins, as sketched after Fig. 5.

Fig. 5

COVID risk assessment: a frequency distribution of COVID-19 risk for CoP and NCoP patients; b cumulative distribution of COVID-19 risk
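
The binning and cumulative summation can be sketched as follows; the probability scores here are random stand-ins for the CNN outputs.

```python
# Hedged sketch: ten-bin frequency and cumulative distributions of
# CNN probability scores, plus the three-level risk assignment.
import numpy as np

scores = np.random.rand(3778)                # stand-in CNN probability scores
bins = np.linspace(0.0, 1.0, 11)             # ten equal-width probability bins
counts, _ = np.histogram(scores, bins=bins)  # frequency distribution (Fig. 5a)
cumulative = np.cumsum(counts)               # cumulative distribution (Fig. 5b)

# low: [0, 0.3), moderate: [0.3, 0.7), high: [0.7, 1]
risk = np.select([scores < 0.3, scores < 0.7], ["low", "moderate"], default="high")
```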

Clinical validation of COVID risk stratification

For each patient, the correlation of ground-glass opacity (GGO) values with the CNN model output was determined. For this, the mean probability score over all CT scan slices of a patient was calculated and compared with that patient's GGO value. Similarly, the mean bispectrum value for each patient was calculated and compared with the GGO values. CONS values were also tested for their correlation with COVID severity and bispectrum values. A list of all patients' GGO, CONS, severity, and bispectrum B values is given in Table S3 (Online Resources 1), and the correlations between these fields are given in Table S4 (Online Resources 1). A minimal sketch of the per-patient correlation is shown below.
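
Under stated assumptions (hypothetical per-patient slice scores and GGO values), the severity-vs-GGO correlation can be computed as follows.

```python
# Hedged sketch: per-patient severity (mean slice probability) vs
# radiologist GGO score, correlated with Pearson's r.
import numpy as np
from scipy.stats import pearsonr

# stand-in slice probability scores for five hypothetical patients
slice_probs = {
    "P1": [0.91, 0.88, 0.95], "P2": [0.20, 0.35, 0.15],
    "P3": [0.75, 0.80, 0.70], "P4": [0.10, 0.05, 0.12],
    "P5": [0.60, 0.55, 0.65],
}
ggo = np.array([3.5, 1.0, 3.0, 0.5, 2.0])    # stand-in GGO scores per patient

severity = np.array([np.mean(v) for v in slice_probs.values()])
r, p_value = pearsonr(severity, ggo)
print(f"Severity vs GGO: r = {r:.3f}, P = {p_value:.5f}")
```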

The linear association between COVID severity and GGO is shown in Fig. 6, that between bispectrum (B) values and GGO in Fig. 7, and that between bispectrum and COVID severity in Fig. 8.

Fig. 6

Association between GGO and COVID severity

Fig. 7

Association between GGO and bispectrum B values

Fig. 8

Association between COVID severity and bispectrum B values

Discussion

In this study, we tested our two custom DL models against two state-of-the-art TL models, using two popular ML models as baselines, to resolve the CoP vs NCoP classification problem. We used the K10 protocol to compare these models' accuracy, with COVID-19 data collected from patients in accordance with the applicable privacy laws. Our relatively simple nine-layer iCNN model was the most accurate of the investigated models, achieving the highest AUC of 0.993 (P < 0.0001). Surprisingly, we found that architectures even more straightforward than the iCNN model (e.g., RF) can match the state-of-the-art TL models (e.g., InceptionV3) in terms of accuracy and AUC when used for COVID-19 classification. The TL models' unremarkable performance could be because these models were not trained on CT images or any other radiology data. Moreover, the high separability in the training data, which is captured by the other AI models, is not exploited by the TL models.

The COVID risk stratification for each patient was validated by showing a strong correlation with the ground-glass opacity values of the patient's CT scans. Similarly, the bispectrum values were validated against the GGO values. The clinical tests also reveal which AI models have similar classification capabilities and which differ significantly in accuracy; this is clearer than a visual inspection of each AI model's accuracy and standard deviation values.

Benchmarking

Table 3 presents benchmarking data comparing the six AI models examined in our research with those considered in existing work on COVID-19 classification. We shortlisted four criteria for benchmarking: (1) the COVID-19 dataset used, (2) the AI model used, (3) the accuracy of the proposed models, and (4) any other performance measures used by the authors. Rows R1 to R5 present the work of other researchers, and row R6 represents our research. It can be observed that the performance of our custom iCNN model is on par with the models proposed by other researchers.

Table 3 Benchmarking of six AI models with the existing work on COVID-19 classification

3D validation

The lung CT data of our Italian cohort were processed so that we could evaluate the degradation and fibrosis of the lung parenchyma of CoP vs NCoP patients (Fig. 9). We used an image segmentation tool to process the data in DICOM format. Using profile lining, we applied segmentation based on the Hounsfield value (grey value) of the pixels belonging to the lung section [33]. A stacking process [34] was then applied to obtain a union, forming a 3D volume of the segmented region of interest [35]. This was followed by region growing to develop the region of interest (in this case, the lung). The 3D volume was computed for the grown region to evaluate the volume and spatial distribution of the lung parenchyma. We computed the spatial distribution of the parenchyma associated with the rear end of the lung, because the influence of the spike proteins of COVID-19 is more significant in the deeper volume of the lung parenchyma [36]. A minimal sketch of this pipeline follows Fig. 9.

Fig. 9

(a1), (a2), and (a3): CoP lung samples showing the degradation and fibrosis of lung parenchyma; (b1), (b2), and (b3): three NCoP lung samples
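
The following sketch illustrates the Hounsfield-thresholding and volume-computation steps under stated assumptions (a synthetic HU stack, an assumed lung HU window, and assumed voxel spacing); it is not the exact tool used in the study.

```python
# Hedged sketch: HU thresholding, largest-component selection (a simple
# analogue of region growing), and 3D volume computation.
import numpy as np
from scipy import ndimage

volume_hu = np.random.randint(-1000, 400, size=(200, 512, 512))  # stand-in HU stack
lung_mask = (volume_hu > -950) & (volume_hu < -300)  # assumed lung HU window

labels, n = ndimage.label(lung_mask)                 # connected components
sizes = ndimage.sum(lung_mask, labels, range(1, n + 1))
lung = labels == (np.argmax(sizes) + 1)              # keep the largest component

voxel_mm3 = 0.7 * 0.7 * 1.0   # assumed in-plane spacing x 1-mm slice thickness
print(f"Segmented lung volume: {lung.sum() * voxel_mm3 / 1e6:.2f} L")
```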

Interpretation

DL models, particularly the CNN model that we used, are very good at recognising the spatial features of images without human intervention, which supports our hypothesis. Both of our custom models likely performed well because of the visual features of COVID-19 in the lung CT images (e.g., ground-glass opacities, consolidations, and pleural effusions), which are very distinct for CoP compared to NCoP. This notion is supported by the data representing the baseline characteristics of the patients. For traditional ML classifiers to work efficiently, their features need to be handcrafted, and their performance depends on the ingenuity of the model's designer. TL models work better than DL models when relatively little data and training time are available; however, they must be pre-trained on a dataset similar to the one for which they are to be used. This limits the application of TL models in medical imaging unless such a model has been pre-trained on similar data.

Strengths, weakness, and extensions

Strengths: The architectures that we designed and developed in this work are relatively simple and easy to use in research and clinical settings. Even without augmentation, we demonstrated that their classification accuracies are high enough to be considered within the clinical range according to recent publications. Although the pilot trials were successful, the data sets that we used could be more balanced and more multi-ethnic.

Weakness: Due to the lack of additional non-COVID pneumonia data sets, the current models could not be tested more broadly; we intend to extend this work to multiclass paradigms in future research [37]. Due to the data sets' limitations regarding "censorship" and "survival", it was not possible to compute survival analyses such as hazard curves and survival curves. However, we will collect this information in the future, even though vaccine distribution has started.

Extensions: Even though the pilot study showed powerful results, a more robust automated segmentation step could be designed using stochastic segmentation strategies [38,39,40]. Extensive ML features can be computed under the ML framework in future work [41,42]. More validations using multimodality spatial images, such as PET and CT based on registration methods, can be conducted [43,44]. Superior lung CAD models can be designed to improve scientific validation [12,45]. Since AI has developed quickly and more transfer learning approaches are now available, the TL models could be extended using pre-trained weights [37]. While the six AI models were tried on a single data set, a multi-centre study could be conducted using the same models to avoid any bias. Thus, the current study can be a launching pad for multi-centre, multimodality, multi-ethnic, and multi-regional analysis.

Conclusion

We presented six AI-based models for CoP vs NCoP classification using CT lung scans taken from an Italian cohort. The proposed CNN-based AI model outperformed the TL and ML systems that were investigated. Further, we showed that, using higher-order spectra, the bispectrum can differentiate CoP patients from NCoP patients, further validating our hypothesis. As part of the clinical validation, a novel COVID risk factor calculation was introduced using the CNN output probability values and validated against the GGO values of all patients.

Our AI system was implemented on a multi-GPU system such that online processing took only a few seconds per scan. The system can be extended to multiclass data sets that also include community-acquired pneumonia or interstitial viral pneumonia. The system was validated against well-accepted existing data sets (e.g., a biometric data set and a DL animal data set).