
Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks

In this paper, the authors explore the use of deep convolutional neural networks (DCNNs) to classify tuberculosis (TB) in chest radiographs. One advantage of deep learning is that it excels on high-dimensional data such as images, which can be represented at multiple levels of abstraction.

Dataset

The study used four de-identified, HIPAA-compliant datasets comprising a total of 1007 posteroanterior chest radiographs. The datasets were exempted from review by the institutional review board.

DCNN Models and Training

AlexNet and GoogLeNet were each used in two forms: pre-trained on ImageNet (weights from the Caffe Model Zoo) and untrained. The pre-trained networks achieved higher AUCs than the untrained ones. The following solver parameters were used for training: 120 epochs, stochastic gradient descent, and a base learning rate of 0.01 for the untrained models and 0.001 for the pre-trained models. Both DCNNs in this study used dropout or model regularization strategies to help overcome overfitting.

Data Augmentation

The following data augmentation techniques further increased performance:

  1. Random cropping to 227x227 pixels
  2. Mean subtraction and mirror images
  3. Rotation by 90, 180, and 270 degrees
  4. Contrast Limited Adaptive Histogram Equalization processing
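The first three augmentations in the list can be sketched with NumPy as follows. This is only an illustration, not the authors' Caffe pipeline: the function name `augment`, the 256x256 input size, the scalar mean, and the 50% mirroring probability are all assumptions made for the example, and the CLAHE step is left out.

```python
import numpy as np

CROP = 227  # crop size named in the list above


def augment(image, mean, rng):
    """Return one randomly augmented CROP x CROP view of a 2-D grayscale image.

    `mean` is a scalar subtracted from every pixel (an assumption; a
    per-pixel mean image would work the same way).
    """
    height, width = image.shape
    # Random crop: pick a top-left corner so the window fits inside the image.
    top = int(rng.integers(0, height - CROP + 1))
    left = int(rng.integers(0, width - CROP + 1))
    patch = image[top:top + CROP, left:left + CROP].astype(np.float32)
    # Mean subtraction.
    patch = patch - np.float32(mean)
    # Mirror image, applied with an assumed probability of 0.5.
    if rng.random() < 0.5:
        patch = np.fliplr(patch)
    # Rotation by 0, 90, 180, or 270 degrees, chosen uniformly.
    quarter_turns = int(rng.integers(0, 4))
    return np.rot90(patch, quarter_turns)


rng = np.random.default_rng(0)
# Hypothetical 256x256 radiograph filled with random pixel values.
image = rng.integers(0, 256, size=(256, 256)).astype(np.uint8)
view = augment(image, mean=float(image.mean()), rng=rng)
```

Because the crop is square, every rotation keeps the 227x227 shape, so the augmented views can be batched together without resizing.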

Ensembling


Ensembling was used to increase the AUC even further. Ensembles were formed by taking different weighted averages of the probability scores generated by the classifiers.
The best-performing ensemble model had an AUC of 0.99. Refer to the results table in the paper for complete results.
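A weighted average of two classifiers' probability scores can be sketched as below. The function name `weighted_ensemble`, the weight of 0.5, and the example scores are hypothetical; the post does not say which weights the paper's best ensemble used.

```python
import numpy as np


def weighted_ensemble(prob_a, prob_b, weight_a=0.5):
    """Blend two arrays of TB probability scores with weights that sum to 1."""
    prob_a = np.asarray(prob_a, dtype=float)
    prob_b = np.asarray(prob_b, dtype=float)
    return weight_a * prob_a + (1.0 - weight_a) * prob_b


# Hypothetical scores for three radiographs from each network.
alexnet_probs = [0.9, 0.2, 0.6]
googlenet_probs = [0.7, 0.4, 0.2]
ensemble_probs = weighted_ensemble(alexnet_probs, googlenet_probs, weight_a=0.5)
```

Trying several values of `weight_a` and keeping the one with the highest validation AUC is one way to read "different weighted averages", though that selection procedure is an assumption here.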

The sensitivity of pre-trained AlexNet was 92.0% and the specificity was 94.7%. The sensitivity of pre-trained GoogLeNet was 92.0% and the specificity was 98.7%. The sensitivity of the ensemble was 97.3% and the specificity was 94.7%. 
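For reference, sensitivity is the fraction of TB cases correctly flagged and specificity is the fraction of normal cases correctly cleared. The sketch below computes both from binary labels. The counts in the example (75 TB and 75 normal cases) are an assumption chosen because they reproduce the percentages quoted for the pre-trained AlexNet; this post does not state the test-set size.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 means TB."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for t, p in pairs if t == 1 and p == 1)
    false_neg = sum(1 for t, p in pairs if t == 1 and p == 0)
    true_neg = sum(1 for t, p in pairs if t == 0 and p == 0)
    false_pos = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Hypothetical test set: 75 TB cases (69 detected) and 75 normal cases (71 cleared).
y_true = [1] * 75 + [0] * 75
y_pred = [1] * 69 + [0] * 6 + [0] * 71 + [1] * 4
sens, spec = sensitivity_specificity(y_true, y_pred)
```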

Radiologist-augmented approach

This is where the paper goes beyond pure deep learning: a human reader classifies the images that the models cannot settle on their own.

For cases where the AlexNet and GoogLeNet classifiers disagreed, an independent board-certified cardiothoracic radiologist (B.S., with 18 years of experience) blindly interpreted the images as either having manifestations of TB or as normal. This resulted in a sensitivity of 97.3% and a specificity of 100%.
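The decision rule described above can be sketched as a small function: keep the label when the two classifiers agree, and defer to the radiologist otherwise. The function name `radiologist_augmented`, the `radiologist_read` callable, and the example labels are hypothetical stand-ins for illustration only.

```python
def radiologist_augmented(labels_a, labels_b, radiologist_read):
    """Combine two classifiers' binary labels, deferring disagreements to a human.

    `radiologist_read` is a callable that takes a case index and returns the
    radiologist's label (1 for TB, 0 for normal).
    """
    final = []
    for index, (a, b) in enumerate(zip(labels_a, labels_b)):
        if a == b:
            final.append(a)  # classifiers agree: keep their shared label
        else:
            final.append(radiologist_read(index))  # disagreement: human decides
    return final


# Hypothetical labels for four cases; the classifiers disagree on cases 1 and 3.
alexnet_labels = [1, 0, 0, 1]
googlenet_labels = [1, 1, 0, 0]
reads = {1: 1, 3: 0}  # hypothetical radiologist interpretations
final_labels = radiologist_augmented(alexnet_labels, googlenet_labels, reads.get)
```

Only the disagreement cases reach the radiologist, which keeps the human workload small compared with reading every image.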
