Responsible use of MIMIC data with online services like GPT

April 18, 2023

We have received inquiries regarding the use of credentialed data (MIMIC-III, MIMIC-IV, MIMIC-CXR) with online services such as GPT. The PhysioNet Credentialed Data Use Agreement explicitly prohibits sharing access to the data with third parties, including sending it through APIs provided by companies like OpenAI, or using it in online platforms like ChatGPT.

If you are interested in using the GPT family of models, we suggest using one of the following services:

  • Azure OpenAI service. You'll need to opt out of human review of the data, as (1) you are processing sensitive data where the likelihood of harmful outputs and/or misuse is low, and (2) you do not have the right to permit Microsoft to process the data for abuse detection due to the data use agreement you have signed. The form for opting out of the review process is available here: https://aka.ms/oai/additionalusecase
  • Amazon Bedrock. Bedrock provides options for fine-tuning foundation models using private labeled data. After creating a copy of a base foundation model for exclusive use, data is not shared back to the base model for training.

If you have any questions about this policy, feel free to reach out: https://physionet.org/about/#contact_us