The performance of VCS(volume, conductivity, light scatter) parameters in distinguishing latent tuberculosis and active tuberculosis by using machine learning algorithm

Dec 16, 2023BMC infectious diseases

Using machine learning to tell hidden from active tuberculosis by analyzing blood cell size, electrical properties, and light scattering

AI simplified

Abstract

A total of 220 subjects were analyzed to differentiate between active and latent tuberculosis infections using machine learning algorithms.

  • Logistic regression and random forest classifiers achieved perfect predictive performance with an area under the receiver operating characteristic curve () of 1.
  • Support vector machine and k-nearest neighbor classifiers demonstrated strong performance, with AUROC values of 0.967 and 0.943, respectively, in the training set.
  • In the testing set, logistic regression maintained the highest performance with an AUROC of 0.977.
  • The machine learning algorithm classifier utilizes blood routine indicators and leukocyte volume, conductivity, and light scatter parameters to classify tuberculosis infection types.
  • These findings suggest that machine learning could effectively assist in distinguishing active from latent tuberculosis infections.

AI simplified

Key numbers

1
Training Set
Logistic regression and random forest performance metrics.
0.977
Testing Set
Logistic regression performance in the testing phase.
220
Total Participants
Composition of active and latent tuberculosis patients.

Full Text

What this is

  • This research evaluates the effectiveness of volume, conductivity, and light scatter (VCS) parameters in distinguishing between latent and active tuberculosis (TB) using machine learning algorithms.
  • A total of 220 subjects, including patients with active TB and latent TB, were analyzed.
  • The study aims to improve diagnostic accuracy for TB, which is crucial for effective treatment and management.

Essence

  • Machine learning classifiers using can accurately differentiate between active and latent tuberculosis infections, with logistic regression and random forest showing the highest performance.

Key takeaways

  • Logistic regression and random forest achieved perfect performance in the training set with an area under the receiver operating characteristic curve () of 1.
  • In the testing set, logistic regression maintained strong performance with an of 0.977, indicating high diagnostic accuracy for identifying active TB.
  • The study underscores the potential of machine learning in enhancing TB diagnostics, which is critical for timely treatment and management.

Caveats

  • The study's retrospective design may introduce biases, and differences in age and gender between active and latent TB groups need further investigation.
  • Sample size limitations could affect the generalizability of the findings and the robustness of the machine learning models.

Definitions

  • VCS parameters: Volume, conductivity, and light scatter measurements of blood cells used to assess immune responses.
  • AUROC: Area under the receiver operating characteristic curve; a performance measurement for classification models.

AI simplified

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • 🧘‍♂️always free