Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study

Nov 30, 2020JMIR medical informatics

Using Machine Learning to Identify Rheumatoid Arthritis Patients from Electronic Health Records

AI simplified

Abstract

The support vector machine classifier identified 2873 patients with rheumatoid arthritis in less than 7 seconds from a dataset of 23,300 patients.

  • Machine learning methods outperformed traditional word-matching algorithms in identifying patients with rheumatoid arthritis.
  • The best performing support vector machine achieved an area under the receiver operating characteristic curve (AUROC) of 0.98 and a positive predictive value of 0.94.
  • In the Erlangen dataset, gradient boosting demonstrated an AUROC of 0.94 and a positive predictive value of 0.97.
  • The developed workflow is language and center independent, potentially applicable to various diagnoses.
  • This approach allows for the extraction of patient records with high precision, facilitating large-scale observational studies.

AI simplified

Full Text

We can’t show the full text here under this license. Use the link below to read it at the source.

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • ✅direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • đŸ§˜â€â™‚ïžalways free