Introduction

The year 2019 witnessed the outbreak of a viral pneumonia of initially unknown origin in Wuhan, China. The causative virus was soon termed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the World Health Organization1,2,3, and the resulting disease is called coronavirus disease 2019 (COVID-19)2,3,4,5,6. To date, more than 120 million cases have been confirmed worldwide, and the number is still rising. Reverse transcription-polymerase chain reaction (RT-PCR), which relies on nasopharyngeal swabs to detect the ribonucleic acid (RNA) of SARS-CoV-27, remains the most widely used test for the disease. Despite the high specificity (Sp) of RT-PCR, its sensitivity (Sn) can be relatively low8,9, and its efficacy varies considerably with the sampling method and the time since symptom onset9,10.

Apart from confirmation by pathogenic laboratories, other useful methods for COVID-19 diagnosis include examination of clinical characteristics and computed tomography (CT) imaging11,12. Owing to its high sensitivity, CT imaging has been proposed as an essential complementary screening tool for COVID-19, particularly alongside RT-PCR, and it can deliver results more rapidly than RT-PCR. In particular, research has shown that in routine CT scans conducted for non-COVID-19 reasons, such as pre-operative or nervous system examinations, CT is considerably useful for detecting COVID-19 infection13,14. In other cases where CT imaging is adopted, for example when patients suffer worsening respiratory complications yet test negative by RT-PCR, the imaging evidence may show patterns similar to those of COVID-19-positive cases. Early research has suggested that CT images contain many potential indicators of infection8,10, but such findings can also be unrelated to COVID-19, which makes it challenging for radiologists to identify COVID-19 infections specifically from CT images15,16. In addition, visual analysis of CT images is time-consuming, especially in large-scale studies or with large numbers of patients. A further common problem in medical image analysis is that most CT images used for diagnostic purposes are not openly accessible owing to privacy concerns, so results from training a neural network on any single dataset cannot be replicated or applied in other hospitals. The absence of open-source COVID-19 CT image datasets thus presents a tremendous obstacle to the development of more advanced artificial intelligence technologies for CT-based COVID-19 detection17.

Given the urgent need for solutions to the COVID-19 pandemic, and building on recent efforts among researchers to promote open-source data and open access18,19, we studied how transfer learning can improve the performance of convolutional neural networks on COVID-19 detection from CT images, and found that models pre-trained on larger out-of-domain datasets perform better at COVID-19 detection. Compared with a model architecture discovered automatically via a machine-driven design exploration process using generative synthesis, our model performs better on every evaluation metric. We aimed to make the following contributions:

  • We varied the number of training steps and the input resolution, with and without mixup, to test the impact of these hyperparameters on the results, and found that a higher resolution and an appropriate number of training steps effectively improve model performance. Because the model itself already yields excellent results when the data are sufficient, applying mixup has little effect on the results.

  • With five different strategies for parameter initialization, we studied the impact of the initial parameters on model performance. Our results demonstrate that different pre-training parameters influence the final performance of the fine-tuned models: pre-training on a larger out-of-domain dataset allows the model to generalize more effectively.

  • By comparing our results with those of previous studies, we demonstrate that our transfer-learning-based models outperform those based on architecture design and achieve state-of-the-art performance. Furthermore, we evaluated our model with only a small quantity of downstream data and found that it still identifies COVID-19 with excellent performance.

  • Using visualization, we investigated the mechanism behind the model's COVID-19 predictions to better aid clinical decision-making.

Related work

COVID-19 research

Currently, research on COVID-19 is being carried out effectively in various areas. Reference20 reviews the various types of scalable telehealth services used to support patients infected with COVID-19 and other diseases. Reference21 discusses the wearable monitoring devices and respiratory support systems frequently used to assist people affected by the coronavirus. Reference22 presents an overview of the existing technologies frequently used to support infected patients with respiration, and outlines a comparative analysis of the developed devices, the necessary challenges, and possible future directions for the proper selection of affordable technologies. Reference23 proposes a system that restricts the spread of COVID-19 by detecting people not wearing a facial mask in a smart city network.

Given the potential of CT images as a complementary screening method for COVID-19, and the challenges of interpreting CT scans for this purpose, extensive studies have examined how to detect COVID-19 from CT images. Deep learning is now widely used in all aspects of COVID-19 research aimed at controlling the ongoing outbreak24,25,26,27,28. Reference29 gives an overview of recently developed systems based on deep learning techniques using different medical imaging modalities such as CT and X-ray. Reference17 established a database of hundreds of CT scans of COVID-19-positive cases and developed a sample-efficient deep learning approach based on self-supervision30 and transfer learning31. In addition, researchers have developed an artificial intelligence system capable of diagnosing COVID-19 and distinguishing it from common pneumonia and normal cases32. Furthermore, reference33 created a library containing CT images of 1,521 pneumonia patients (including those with COVID-19), 130 clinical features (including biochemical and cellular analyses of blood and urine), and the clinical symptoms of SARS-CoV-2 infection, and predicted whether each patient was a negative, mild, or severe case. With machine-driven design exploration, reference34 proposed a deep convolutional neural network architecture, COVIDNet-CT, based on CT images. Similarly, leveraging 104,009 CT images from 1,489 patients collected from the China National Center for Bioinformation (CNCB) (China)32, combined with data cleaning and preparation in a format suitable for benchmarking, the COVIDx-CT dataset was built, along with explainability-driven performance validation and analysis using the GSInquire technology35. Building upon this progress, researchers proposed the COVIDx CT-2 datasets, which increase the number and diversity of patients36.

Transfer learning

Transfer learning is a cornerstone of computer vision. For many image classification tasks37, transfer learning achieves better performance on datasets of limited size than training from scratch. Previous work has shown that strong performance can be achieved by fine-tuning pre-trained models on specific tasks38,39.
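As a concrete illustration of this pattern (not the exact configuration used later in this paper), the following minimal PyTorch sketch loads a backbone pre-trained on ImageNet and replaces its classification head with a fresh three-way layer for the downstream classes; the torchvision weight identifier is an assumption and depends on the installed version (older versions use pretrained=True instead).

import torch.nn as nn
from torchvision import models

# Load an ImageNet pre-trained backbone (weight identifier assumed; older
# torchvision versions take pretrained=True instead of a weights string).
backbone = models.resnet50(weights="IMAGENET1K_V2")

# Replace the classification head with a fresh 3-way layer
# (Normal / CP / NCP) and fine-tune all parameters on the CT data.
backbone.fc = nn.Linear(backbone.fc.in_features, 3)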

Methods

Datasets

With the global spread of the COVID-19 pandemic, access to first-hand CT images and clinical data is critical for guiding clinical decisions, deepening our understanding of infection patterns, and providing systematic models for early diagnosis and timely medical intervention. A key approach is to establish a comprehensive, openly accessible database of CT images and associated clinical symptoms to facilitate the global fight against COVID-19. As mentioned in the Related work section, several datasets have been built and opened to researchers, doctors, and data scientists for COVID-19-related research. Although the COVIDx-CT dataset is evidently larger than many other CT datasets used in the COVID-19 testing literature, a potential limitation of using it for deep neural network training lies in its limited patient demographic diversity. Specifically, because COVIDx-CT was collected from the CNCB, it contains information only from provinces in China, so the COVID-19 manifestations in its CT images may not generalize appropriately to cases beyond China. Increasing the number and diversity of patients makes the data seen by deep neural networks more varied and comprehensive, so that the resulting models are more generalizable and applicable in different clinical environments around the world. By carefully processing and organizing CT images from patients scanned with various CT devices, protocols, and levels of validation, previous researchers36 established the COVIDx CT-2A and COVIDx CT-2B datasets. COVIDx CT-2A comprises 194,922 images from 3,745 patients aged between 0 and 93, with a median age of 51. Each patient's CT scan consists of many slices; we use these slices as the input images for COVID-19 detection, casting the detection problem as image classification. The CT images are provided at \(512 \times 512\) pixels. The sources of the images in COVIDx CT-2A are as follows:

  • China National Center for Bioinformation (CNCB) (China)32

  • National Institutes of Health Intramural Targeted Anti-COVID-19 (ITAC) Program (countries unknown)40

  • Negin Radiology Medical Center (Iran)41

  • Liyuan Hospital and Wuhan Union Hospital of Tongji Medical College of Huazhong University of Science and Technology (China)33

  • COVID-19 CT lung and infection segmentation project, annotated and verified by Nanjing Drum Tower Hospital (China)42

  • Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI) (countries unknown)43

  • Open access online collaborative radiology resource (Radiopaedia) (countries unknown)44

Building upon COVIDx CT-2A, COVIDx CT-2B augments the dataset with weakly verified data (MosMed) from the Research and Practical Clinical Center of Diagnostics and Telemedicine Technologies, Department of Health Care of Moscow (Russian Federation)45. The purpose of this addition is to investigate, for instance, whether adding weakly verified training data (i.e., findings made without RT-PCR and radiological confirmation) boosts model performance; it also further increases the breadth and diversity of the dataset. For comparability with previous models and in view of the openness of the data, in the present study we employed COVIDx CT-2A for COVID-19 testing. Figure 1 illustrates representative examples from the COVIDx CT-2A dataset, covering three types of CT scans: novel coronavirus pneumonia (NCP) caused by SARS-CoV-2, common pneumonia (CP), and normal controls. We applied some modifications to the images to facilitate our models. Specifically, because contrast variations in the background of the images may bias the models, we removed the background with an automatic cropping algorithm that standardizes the field of view to the body area (as shown by the red frames in Fig. 1). Comparing across the classes, we identified ground-glass opacity (GGO), lung consolidation46, and even "white lung" appearances in the CP and NCP groups. However, owing to the considerably subtle visual differences between common pneumonia and SARS-CoV-2 infection, the ability to distinguish between the diseases can vary tremendously, even among radiologists. Figure 2 presents the distribution of infection types and images across the training, validation, and test sets.
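The exact cropping procedure is not detailed here; the sketch below is one minimal way such an automatic crop to the body area could be implemented, assuming the background is substantially darker than the body region and the slice has been scaled to [0, 1].

import numpy as np

def crop_to_body(image, threshold=0.1):
    """Crop a CT slice (2-D array in [0, 1]) to the bounding box of the
    body region by thresholding out the dark background."""
    mask = image > threshold                      # foreground (body) pixels
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    if rows.size == 0 or cols.size == 0:          # nothing above threshold
        return image
    return image[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]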

Figure 1
figure 1

Representative examples of CT images in COVIDx CT-2. The red frames are the marker frames of the CT images provided with the dataset. The first row shows normal controls, the second row shows CP cases, and the third row shows NCP cases caused by SARS-CoV-2.

Figure 2
figure 2

Composition of the COVIDx CT-2A training, validation, and test sets. (A) CT image distribution. (B) Patient distribution.

Model selection

Through a machine-driven design exploration process, previous researchers34 designed the deep convolutional neural network COVID-Net-CT for COVID-19 testing based on CT images; the subsequent COVID-Net CT-236 was designed using this architecture as its basis. In our experiments, we adopted ResNet-v2, a modified version of ResNet47, in which we replaced batch normalization49 with group normalization48 and applied weight standardization50 to all convolutional layers. To investigate how transfer learning leverages external data for CT-based COVID-19 testing, we used parameters pre-trained on CIFAR-1051, ILSVRC-201252, and ImageNet-21k53 to initialize the models.
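For reference, a minimal PyTorch sketch of these two modifications is given below: a convolution with weight standardization and a pre-activation (ResNet-v2 style) unit that uses group normalization in place of batch normalization. The channel sizes and the group count of 32 are illustrative assumptions rather than the exact configuration of the backbone.

import torch
import torch.nn as nn
import torch.nn.functional as F

class StdConv2d(nn.Conv2d):
    """Convolution with weight standardization: each filter is normalized
    to zero mean and unit variance before being applied."""
    def forward(self, x):
        w = self.weight
        mean = w.mean(dim=(1, 2, 3), keepdim=True)
        var = w.var(dim=(1, 2, 3), keepdim=True, unbiased=False)
        w = (w - mean) / torch.sqrt(var + 1e-10)
        return F.conv2d(x, w, self.bias, self.stride,
                        self.padding, self.dilation, self.groups)

class PreActUnit(nn.Module):
    """Pre-activation unit: GroupNorm -> ReLU -> weight-standardized conv."""
    def __init__(self, channels, groups=32):
        super().__init__()
        self.norm = nn.GroupNorm(groups, channels)
        self.conv = StdConv2d(channels, channels, kernel_size=3, padding=1, bias=False)

    def forward(self, x):
        return self.conv(F.relu(self.norm(x)))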

Hyperparameter settings for training

The general flowchart of the deep learning-based COVID-19 diagnosis system is illustrated in Fig. 3. The overall system consists of two stages. In the training stage, the training data are used to update the model parameters, and the performance of the developed model is appraised on the test data. In the testing stage, the model extracts features from an input CT image and assigns a class label based on those features. Finally, the developed model is assessed with evaluation metrics such as accuracy, sensitivity, and specificity.

Figure 3
figure 3

A general flowchart of the deep learning-based COVID-19 diagnosis system.

The pseudocode for fine-tuning the convolutional neural network (CNN) and computing the accuracy is given in Algorithm 1. In each iteration, we randomly selected b CT images to compute the gradient and update the network parameters. Unlike the standard training procedure, we did not fix the number of epochs but instead fixed the number of training steps. For the hyperparameters, we used stochastic gradient descent (SGD) with a learning rate of 0.003, a momentum of 0.9, and a batch size of 64. RGB reordering was applied, and the final input to the proposed model was a \(512 \times 512 \times 3\) image. For data augmentation on the training set, we first cropped the images according to the annotated cropping frame, resized them to \(512 \times 512\) pixels, randomly cropped them to \(480 \times 480\) pixels, and then applied random horizontal flips and normalization. For the test set, we simply cropped the images according to the annotation and resized them to \(480 \times 480\). We used 10,000 training steps in our experiments. To fine-tune the model, we first applied a learning-rate warmup54 and then decayed the learning rate by a factor of 10 at three points during training; the details are provided in the Hyperparameter sensitivity section. Finally, we used mixup (Eq. (1)) for data augmentation.

$$\begin{aligned} \left\{ \begin{array}{ll}{\tilde{x}}=\lambda x_{i}+(1-\lambda ) x_{j}, \\ {\tilde{y}}=\lambda y_{i}+(1-\lambda ) y_{j}, \end{array}\right. \end{aligned}$$
(1)

Here, \(x_{i}\) and \(x_{j}\) are input vectors and \(y_{i}\) and \(y_{j}\) are their labels; mixup produces the new vector \({\tilde{x}}\) and label \({\tilde{y}}\). Because the cross-entropy loss is convex in the model output and therefore has good convergence properties under gradient descent, we used cross entropy as the loss function (Eq. 2).
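A minimal sketch of Eq. (1) applied to a training batch is shown below; the Beta-distribution parameter alpha is an assumed value, as the mixing distribution is not restated in the text.

import numpy as np
import torch

def mixup(x, y, alpha=0.2):
    """Blend each image and label with a randomly permuted partner (Eq. (1))."""
    lam = np.random.beta(alpha, alpha)   # mixing coefficient; alpha is assumed
    index = torch.randperm(x.size(0))
    x_mix = lam * x + (1 - lam) * x[index]
    return x_mix, y, y[index], lam

# With cross entropy, training on the mixed label of Eq. (1) is equivalent to
# mixing the losses: lam * criterion(out, y_a) + (1 - lam) * criterion(out, y_b)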

$$\begin{aligned} \begin{aligned} {\text {loss}}(x, \text{ class } )&=-\log \left( \frac{\exp (x[\text {class}])}{\sum _{j} \exp (x[j])}\right) \\&=-x[\text {class}]+\log \left( \sum _{j} \exp (x[j])\right) \end{aligned} \end{aligned}$$
(2)

where \(x \in {\mathbb {R}}^{N \times C}\) is the model output, \(\text {class} \in {\mathbb {R}}^{N}\) contains the labels of the CT images, and \(0 \le {\text {class}}[i] \le C-1\).
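As a quick numerical check of Eq. (2), the sketch below evaluates the right-hand side directly and compares it with the built-in cross entropy on random logits.

import torch
import torch.nn.functional as F

logits = torch.randn(4, 3)                 # N = 4 CT images, C = 3 classes
labels = torch.tensor([0, 2, 1, 1])        # class indices in [0, C-1]

# Direct evaluation of Eq. (2): -x[class] + log(sum_j exp(x[j])), averaged over N
manual = (-logits.gather(1, labels[:, None]).squeeze(1)
          + torch.logsumexp(logits, dim=1)).mean()

builtin = F.cross_entropy(logits, labels)  # identical up to floating-point error
assert torch.allclose(manual, builtin)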

figure a
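Because Algorithm 1 is reproduced as an image, the following is a hedged sketch of the step-based fine-tuning loop it describes, using the stated SGD settings, a 500-step warmup, and three 10x learning-rate decays over 10,000 steps; the batch size of 64 is assumed to be set on the data loader, and the helper names are illustrative.

import torch

def lr_at_step(step, base_lr=0.003, warmup=500, decay_steps=(3000, 6000, 9000)):
    """Linear warmup to base_lr, then decay by 10x at each node."""
    if step < warmup:
        return base_lr * step / warmup
    return base_lr * 0.1 ** sum(step >= s for s in decay_steps)

def fine_tune(model, train_loader, device="cuda", total_steps=10_000):
    model.to(device).train()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.003, momentum=0.9)
    criterion = torch.nn.CrossEntropyLoss()
    step, data_iter = 0, iter(train_loader)
    while step < total_steps:
        try:
            x, y = next(data_iter)
        except StopIteration:                   # steps, not epochs, are fixed
            data_iter = iter(train_loader)
            x, y = next(data_iter)
        for group in optimizer.param_groups:
            group["lr"] = lr_at_step(step)
        optimizer.zero_grad()
        loss = criterion(model(x.to(device)), y.to(device))
        loss.backward()
        optimizer.step()
        step += 1
    return model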

Results

In this section, we investigate the model performance in testing for COVID-19. Specifically, we endeavor to address the following questions:

  • How do different hyperparameters, including the input resolution, the number of training steps, and the use of mixup, affect model performance?

  • How do weight initializations pre-trained on different datasets affect the results?

  • Can the proposed model achieve satisfactory results with a limited number of CT images?

  • How can we understand the decisions made by the deep convolutional neural network to assist in clinical decision-making?

Test performance

We trained the models with the settings described in the Hyperparameter settings for training section. The results are summarized in Table 1 and compared with those of the most advanced current methods. Random, BiT-S, and BiT-M are the models adopted in our laboratory and refer to random initialization and to pre-training on ILSVRC-2012 and ImageNet-21k, respectively, as introduced in the Impact of parameter initialization section. We compared our models with the most advanced COVID-Net CT-2 L. Table 1 reveals that our transfer-learning-based BiT-S and BiT-M models increase accuracy by 0.71% and 1.12% over the COVID-Net CT-2 L model, respectively. In addition, the accuracy of our randomly initialized model was 3.60% higher than that of COVID-Net CT-1, suggesting that, even compared with models obtained by architecture search, our model with random parameter initialization performs excellently. Figure 4 shows the distribution of the CT image representations after dimensionality reduction, which highlights the clear separation of the different categories. The confusion matrix55 in Fig. 5 demonstrates that even though radiologists may sometimes fail to distinguish between CP and NCP, our model provides accurate classifications. For a better quantitative analysis of the models, four indicators were introduced, namely sensitivity (Sn), specificity (Sp), positive predictive value (PPV), and negative predictive value (NPV), as summarized in Table 2. We found that the BiT-M model based on transfer learning achieved state-of-the-art performance with respect to sensitivity for COVID-19 (98.7%), positive predictive value (98.5%), specificity (99.5%), and negative predictive value (99.6%). Our proposed technique outperforms previous work because we pre-train the model on a larger out-of-domain dataset, which enables it to learn more generalizable knowledge.

From a clinical perspective, high sensitivity ensures that there are few false negatives leading to missed diagnoses in patients with COVID-19 infection, and high PPV ensures few false positives, which would add an unnecessary burden to the health care system. The high specificity and NPV achieved by our BiT-M model ensure that COVID-19-negative predictions are indeed true negatives in the vast majority of cases, so the predictions are reliable for COVID-19-negative patients. False negatives and false positives are both costly: we cannot afford to diagnose a COVID-19-positive patient as negative, because such a patient may return to the community believing themselves free of COVID-19, leading to community transmission of the disease56; conversely, diagnosing too many COVID-19-negative people as positive increases the burden on the healthcare system, causes public panic, and may inflict psychological stress on those who are misdiagnosed.
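The four indicators reported in Table 2 can be computed one-versus-rest from the three-class confusion matrix; the sketch below shows one way to do so, using hypothetical counts rather than the actual values behind Fig. 5.

import numpy as np

def one_vs_rest_metrics(cm):
    """Per-class Sn, Sp, PPV, and NPV from a confusion matrix cm[true, pred]."""
    total, metrics = cm.sum(), {}
    for k in range(cm.shape[0]):
        tp = cm[k, k]
        fn = cm[k, :].sum() - tp
        fp = cm[:, k].sum() - tp
        tn = total - tp - fn - fp
        metrics[k] = {"Sn": tp / (tp + fn),    # sensitivity (recall)
                      "Sp": tn / (tn + fp),    # specificity
                      "PPV": tp / (tp + fp),   # positive predictive value
                      "NPV": tn / (tn + fn)}   # negative predictive value
    return metrics

# Hypothetical 3x3 confusion matrix ordered [Normal, CP, NCP]
cm = np.array([[9800,   60,   40],
               [  50, 7300,  150],
               [  30,   70, 4500]])
print(one_vs_rest_metrics(cm)[2])              # metrics for the NCP (COVID-19) class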

Table 1 Accuracy on the COVIDx CT-2A benchmark dataset.
Figure 4
figure 4

Distribution of CT image representations after dimensionality reduction with t-SNE57. Each node represents a CT image; the colors indicate the categories defined in the legend.
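A sketch of how such a projection can be produced is given below, assuming the penultimate-layer features of the test images and their labels have been exported beforehand to the hypothetical files named here.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

features = np.load("test_features.npy")   # hypothetical export: [n_images, feature_dim]
labels = np.load("test_labels.npy")       # hypothetical export: class index per image

emb = TSNE(n_components=2, init="pca", random_state=0).fit_transform(features)
for cls, name in enumerate(["Normal", "CP", "NCP"]):
    pts = emb[labels == cls]
    plt.scatter(pts[:, 0], pts[:, 1], s=2, label=name)
plt.legend()
plt.show()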

Figure 5
figure 5

Confusion matrix for COVID-19 testing using the fine-tuned BiT-M model. The color bar indicates the normalized intensity.

Table 2 Sensitivity, PPV, specificity, and NPV on the test data of the COVIDx CT-2A benchmark dataset.

Hyperparameter sensitivity

In this section, we explore the effect of various hyperparameters on model performance, specifically the number of training steps, the image resolution, and whether mixup is used. We used four combinations of training schedules and input resolutions. For the CT image resolutions, we adopted the settings (160, 128), (256, 224), (448, 384), and (512, 480), where the first value in each doublet indicates the resize scale during training and the second indicates the random crop size used during training and testing. For the training schedules, we used [100, 200, 300, 400, 500], [500, 1,500, 3,000, 4,500, 5,000], [500, 3,000, 6,000, 9,000, 10,000], and [500, 6,000, 12,000, 18,000, 20,000], where the first value is the number of warmup steps, the last is the final step, and the intermediate values are the steps at which the learning rate decays by a factor of 10. Figure 6 displays the test accuracy for different resolutions and training schedules with and without mixup. The results show that a higher resolution increases identification accuracy, which suggests that clearer CT images contain more diagnostically relevant clinical information. A longer training schedule can also improve accuracy, but the effect is less significant beyond 10,000 steps. For the (512, 480) resolution with 10,000 training steps, the ordering of the with- and without-mixup accuracies is reversed between Fig. 6a and b, even though the hyperparameter settings of the two experiments are the same; this is a result of random sampling. It also indicates that mixup does not enhance the performance of the model, because the data are already rich enough.
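The resolution doublets map onto a standard augmentation pipeline as sketched below; the normalization constants (ImageNet statistics) are assumptions, and the crop to the annotated body frame is taken to happen before these transforms.

from torchvision import transforms

RESOLUTIONS = [(160, 128), (256, 224), (448, 384), (512, 480)]   # (resize, crop) doublets

def build_transforms(resize, crop):
    normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],  # assumed statistics
                                     std=[0.229, 0.224, 0.225])
    train_tf = transforms.Compose([
        transforms.Resize((resize, resize)),
        transforms.RandomCrop(crop),
        transforms.RandomHorizontalFlip(),
        transforms.ToTensor(),
        normalize,
    ])
    test_tf = transforms.Compose([
        transforms.Resize((crop, crop)),   # test images are resized directly to the crop size
        transforms.ToTensor(),
        normalize,
    ])
    return train_tf, test_tf

train_tf, test_tf = build_transforms(*RESOLUTIONS[-1])            # the (512, 480) setting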

Figure 6
figure 6

Test accuracy of COVIDx CT-2 with various hyperparameters. (A) Resolution. (B) Schedule.

Impact of parameter initialization

To evaluate the impact of parameter initialization on task performance, we used pre-trained ResNet-50x1 models to investigate how upstream pre-training affects fine-tuning performance. Random means the model parameters were randomly initialized. BiT-M was pre-trained on the complete ImageNet-21k dataset, a public dataset with 14,200,000 images and 21,000 categories, in which images may carry multiple labels. BiT-S was pre-trained on the ILSVRC-2012 subset of ImageNet, which includes 1,280,000 images and 1,000 categories. BiT-M-S was first pre-trained on ImageNet-21k and then fine-tuned on ILSVRC-2012. BiT-M-C was first pre-trained on ImageNet-21k and then fine-tuned on CIFAR-10, which contains 60,000 images (\(32 \times 32\) pixels) across 10 categories. These weight initializations pre-trained on out-of-domain data were taken from a previous study58. For a fair comparison, we set the number of training steps to 10,000 and used mixup, with the other settings the same as in the Hyperparameter settings for training section. The impact of weight initialization is illustrated in Table 3. We repeated the experiment, and the results differ slightly from Table 1 because of random sampling and the random initialization of model parameters. We found that the parameters pre-trained on ImageNet-21k generalized better than those pre-trained on ILSVRC-2012, and this advantage was not diminished even by intermediate fine-tuning on out-of-domain datasets. We then evaluated the models every 100 steps, as presented in Fig. 7. The models pre-trained on ImageNet-21k (BiT-M, BiT-M-S, and BiT-M-C) performed better on the evaluation set in the later stages of training than the ILSVRC-2012-initialized weights (BiT-S). This result highlights that pre-training on the larger dataset yields greater generalizability.
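Operationally, the five strategies differ only in which checkpoint, if any, initializes the backbone before fine-tuning. The sketch below illustrates this pattern; the checkpoint file names and the "head" prefix of the classification layer are hypothetical.

import torch
from collections import OrderedDict

CHECKPOINTS = {                              # hypothetical file names
    "Random":  None,                         # no pre-trained weights
    "BiT-S":   "bit_s_ilsvrc2012.pth",       # pre-trained on ILSVRC-2012
    "BiT-M":   "bit_m_imagenet21k.pth",      # pre-trained on ImageNet-21k
    "BiT-M-S": "bit_m_in21k_ft_ilsvrc2012.pth",
    "BiT-M-C": "bit_m_in21k_ft_cifar10.pth",
}

def initialize(model, strategy):
    """Load pre-trained backbone weights (if any) and keep a fresh classifier head."""
    path = CHECKPOINTS[strategy]
    if path is not None:
        state = torch.load(path, map_location="cpu")
        # Drop the upstream classification head so only the backbone is transferred.
        state = OrderedDict((k, v) for k, v in state.items() if not k.startswith("head"))
        model.load_state_dict(state, strict=False)
    return model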

Table 3 Classification accuracy on the test and validation sets with different weight initializations.
Figure 7
figure 7

Validation accuracy curves of various initialized models.

Influence of the size of labeled training data on model performance

To evaluate how the models perform on small downstream datasets akin to those encountered in real-world situations, we randomly selected a fixed number of images from each category for a performance test. For each category, we randomly chose 50, 100, 500, and 1,000 samples for training and evaluated the trained model's identification rate on the test set. The results are presented in Fig. 8. The histogram on the right shows the outcomes of the ImageNet-21k pre-trained model using the entire training set, together with COVID-Net CT-2 L, CT-2 S, and CT-1. In these tests, BiT-M achieved high test accuracy with a limited number of labeled images. When 100 images were selected from each category, the accuracy (94.8%) already exceeded the experimental result of COVID-Net CT-1 (94.5%); when 1,000 images were selected, the accuracy (98.0%) matched that of COVID-Net CT-2 S (97.9%). This supports the immense potential of our transfer learning models, which still function well with limited data, and suggests that the prior knowledge learned through pre-training on large out-of-domain datasets ensures excellent performance even when the training data are limited.
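A sketch of this per-class subsampling is shown below; train_dataset and train_labels are hypothetical placeholders for the full COVIDx CT-2A training data and its label array.

import numpy as np
from torch.utils.data import Subset

def subsample_per_class(dataset, labels, n_per_class, seed=0):
    """Randomly keep n_per_class training images from each category."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    keep = []
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)
        keep.extend(rng.choice(idx, size=n_per_class, replace=False))
    return Subset(dataset, sorted(keep))

# e.g. 50, 100, 500, or 1,000 images per category, as in these experiments
small_train_set = subsample_per_class(train_dataset, train_labels, n_per_class=100)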

Figure 8
figure 8

Impact of the number of training images per category on the performance of the model on the test set.

Qualitative analysis of the model's COVID-19 testing

Although performance indicators are useful for model evaluation, they fail to explain the decision-making behavior of the network. We therefore employed the Grad-CAM59 visualization technique to explore the regions the models attend to during COVID-19 testing, to better understand which characteristics of the CT images are key for diagnostic accuracy and thus aid clinical decision-making. As shown in Fig. 9, we first cropped the images using the detection frame (introduced in the Hyperparameter settings for training section), enlarged them to \(480 \times 480\) pixels, and applied Grad-CAM for visual explanation. All of the model's predictions for the CT images in Fig. 9 match the actual detection results. In most cases, the behavior of the model is consistent with typical human visual cognition. This is particularly true for CP, where the model successfully focuses on the diseased areas and highlights the affected regions of the lungs. Radiologists can further use the Grad-CAM color visualizations to make efficient and confident decisions60. For the normal case, the model focuses more on the lower region. Although NCP due to SARS-CoV-2 was detected correctly in the first and third CT images (third row in Fig. 9), the model was more interested in the texture at the periphery. Such visual heuristics, which differ from human visual perception, merit further exploration to better understand how the model detects COVID-19 and which features it considers most diagnostic. Identifying these features would help explain the power of the model in COVID-19 testing and assist clinical doctors in discovering new visual indicators of COVID-19 infection for manual screening based on CT images.
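For reference, a minimal Grad-CAM sketch in PyTorch is shown below; the choice of target layer (typically the last convolutional block) is left to the caller, and the hook-based implementation is one of several equivalent ways to obtain the class-activation map.

import torch
import torch.nn.functional as F

def grad_cam(model, image, target_layer, class_idx=None):
    """Weight the target layer's activations by the spatially averaged
    gradients of the class score, then apply ReLU and upsample."""
    feats, grads = {}, {}
    h1 = target_layer.register_forward_hook(lambda m, i, o: feats.update(a=o))
    h2 = target_layer.register_full_backward_hook(lambda m, gi, go: grads.update(g=go[0]))

    logits = model(image.unsqueeze(0))            # image: (3, 480, 480)
    if class_idx is None:
        class_idx = logits.argmax(dim=1).item()
    model.zero_grad()
    logits[0, class_idx].backward()

    a, g = feats["a"], grads["g"]                 # both (1, C, h, w)
    weights = g.mean(dim=(2, 3), keepdim=True)    # global-average-pooled gradients
    cam = F.relu((weights * a).sum(dim=1, keepdim=True))
    cam = F.interpolate(cam, size=image.shape[1:], mode="bilinear", align_corners=False)
    cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
    h1.remove(); h2.remove()
    return cam[0, 0]                              # heat map in [0, 1], overlaid on the CT slice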

Figure 9
figure 9

Grad-CAM visualization of BiT-M. The first row depicts the normal case, the second row the CP case, and the third row the COVID-19 (NCP) case caused by SARS-CoV-2 infection. The model yielded accurate predictions for all of these CT images.

Discussion

Our study applied transfer learning to COVID-19 testing using CT images and examined the impact of various initialization parameters on the results, demonstrating that our model pre-trained on ImageNet-21k generalizes strongly to CT images. The proposed model achieves an accuracy of 99.2% in detecting COVID-19 cases. Compared with the neural architecture search model, our model shows state-of-the-art performance across all the metrics we have described. These results ensure that COVID-19-negative patients are correctly diagnosed as negative in the vast majority of cases, reduce the probability of diagnosing COVID-19-negative cases as positive, and reduce the burden on the health care system. Additionally, we examined the performance of the model with limited data and found that it still performs satisfactorily. This shows that our model remains applicable with limited data, which is characteristic of real-world situations where large and diverse datasets may not be readily available. Finally, we explored the mechanism of COVID-19 testing using the Grad-CAM visualization technique to make the proposed deep learning model more interpretable and explainable. This interpretability-driven validation shows that, for CP, the model behaves in a manner consistent with radiologists' interpretation, and the investigation of normal and NCP CT images helps in exploring new visual indicators to assist clinical doctors in further manual screening. The experiments demonstrate that our models are effective in COVID-19 testing.

In future work, we will focus on evaluating the severity of COVID-19 and attempt to extract more valuable information from CT images to combat the pandemic. We will also conduct further explanatory analyses of the models to shed light on the COVID-19 detection mechanism, identify key characteristics in the CT images, and facilitate screening by clinical doctors. Although the system performs well on public datasets, the work is still at the theoretical research stage, and the models have not been validated in actual clinical routine. Therefore, we will test our system in the clinical routine and communicate with physicians to understand how they use it and their opinions of the models, so that we can further improve the models in future work.