A Transformer-Based Framework for Automatic COVID19 Diagnosis in Chest CTs

Zhang, Lei and Wen, Yan (2021) A Transformer-Based Framework for Automatic COVID19 Diagnosis in Chest CTs. In: The IEEE/CVF International Conference on Computer Vision (ICCV).

Full content URL: https://doi.org/10.1109/ICCVW54120.2021.00063

This is the latest version of this item.

A Transformer-Based Framework for Automatic COVID19 Diagnosis in Chest CTs
Authors' Accepted Manuscript
Zhang_A_Transformer-Based_Framework_for_Automatic_COVID19_Diagnosis_in_Chest_CTs_ICCVW_2021_paper.pdf - Whole Document

Item Type:Conference or Workshop contribution (Presentation)
Item Status:Live Archive


Automated diagnosis of covid19 in chest CTs is becoming a clinically important technique to support precision and efficient diagnosis and treatment planning. A few efforts have been made to automatically diagnose the COVID-19 in CTs using CNNs, and the task still remains a challenge. In this paper, we present a transformer-based framework for COVID19 classification. We attempt to expand the adaption of vision transformer as a robust feature learner to the 3D CTs to diagnose the COVID-19. The framework consists of two main stages: lung segmentation using UNet followed by the classification, in which the features extracted from each CT slice using Swin transformer in a CT scan are aggregated into 3D volume level feature. We also investigated the performance of using the robust CNNs (BiT and EfficientNetV2) as backbones in the framework. The dataset from the ICCV workshop: MIA-COV19D, is used in our experiments. The evaluation results show that the method with the backbone of Swin transformer gain the best F1 score of 0.935 on the validation dataset, while the CNN based backbone of EfficientNetV2 has the competitive classification performance with the best precision of 93.7%. The final prediction model with Swin transformer achieves the F1 score of 0.84 on the test dataset, which doesn't require an additional post-processing stage.

Keywords:COVID19 Diagnosis, Visual Transformer, Chest CTs
Subjects:G Mathematical and Computer Sciences > G700 Artificial Intelligence
G Mathematical and Computer Sciences > G760 Machine Learning
G Mathematical and Computer Sciences > G740 Computer Vision
Divisions:College of Science > School of Computer Science
ID Code:48044
Deposited On:10 Feb 2022 11:08

Available Versions of this Item

Repository Staff Only: item control page