The repository consists of a number of Structured Query Language (SQL) scripts which build the MIMIC-IV database in a number of systems and extract useful concepts from the raw data. Subfolders include:
- buildmimic - Scripts to build MIMIC-IV in various relational database management system (RDMS), in particular postgres is a popular open source option
- concepts - Useful views/summaries of the data in MIMIC-IV, e.g. demographics, organ failure scores, severity of illness scores, durations of treatment, easier to analyze views, etc. The paper above describes these in detail, and a README in the subfolder lists concepts generated.
The MIMIC-IV concepts are written in an SQL syntax compatible with BigQuery. These scripts have been converted to PostgreSQL by a script. To generate the concepts in PostgreSQL, see the MIMIC-IV postgresql concepts subfolder.
Tables in the BigQuery physionet-data.mimic_derived
dataset are generated using the concepts made available in this folder. These tables are generated using the code in the latest release on GitHub.