by Sauer, Christopher M.; Dam, Tariq A.; Celi, Leo A.;
Faltys, Martin; de la Hoz, Miguel A. A.; Adhikari, Lasith; Ziesemer, Kirsten
A.; Girbes, Armand; Thoral, Patrick J.; Elbers, Paul
Critical Care Medicine: June 2022 -
Volume 50 - Issue 6 - p e581-e588
OBJECTIVE:
As data science and artificial intelligence continue to
rapidly gain traction, the publication of freely available ICU datasets has
become invaluable to propel data-driven clinical research. In this guide for
clinicians and researchers, we aim to: 1) systematically search and identify
all publicly available adult clinical ICU datasets, 2) compare their
characteristics, data quality, and richness and critically appraise their
strengths and weaknesses, and 3) provide researchers with suggestions, which
datasets are appropriate for answering their clinical question.
DATA SOURCES:
A systematic search was performed in Pubmed, ArXiv, MedRxiv,
and BioRxiv.
STUDY SELECTION:
We selected all studies that reported on publicly available
adult patient-level intensive care datasets.
DATA EXTRACTION:
A total of four publicly available, adult, critical care,
patient-level databases were included (Amsterdam University Medical Center data
base [AmsterdamUMCdb], eICU Collaborative Research Database eICU CRD], High
time-resolution intensive care unit dataset [HiRID], and Medical Information
Mart for Intensive Care-IV). Databases were compared using a priori defined
categories, including demographics, patient characteristics, and data richness.
The study protocol and search strategy were prospectively registered.
DATA SYNTHESIS:
Four ICU databases fulfilled all criteria for inclusion and
were queried using SQL (PostgreSQL version 12; PostgreSQL Global Development
Group) and analyzed using R (R Foundation for Statistical Computing, Vienna,
Austria). The number of unique patient admissions varied between 23,106
(AmsterdamUMCdb) and 200,859 (eICU-CRD). Frequency of laboratory values and
vital signs was highest in HiRID, for example, 5.2 (±3.4) lactate values per
day and 29.7 (±10.2) systolic blood pressure values per hour. Treatment
intensity varied with vasopressor and ventilatory support in 69.0% and 83.0% of
patients in AmsterdamUMCdb versus 12.0% and 21.0% in eICU-CRD, respectively.
ICU mortality ranged from 5.5% in eICU-CRD to 9.9% in AmsterdamUMCdb.
CONCLUSIONS:
We identified four publicly available adult clinical ICU
datasets. Sample size, severity of illness, treatment intensity, and frequency
of reported parameters differ markedly between the databases. This should guide
clinicians and researchers which databases to best answer their clinical
questions.
No comments:
Post a Comment