Use a variety of metrics to compare and validate the quality of generated synthetic data. Derive synthetic data from the harmonized electronic health record (EHR) data. Access to synthetic data requires creation of an N3C Data Enclave account, a signed Data Use Agreement (DUA), and submission of a data use request (DUR). The dataset is open to the broad research community, including domestic and international investigators and citizen scientists.
View workstream details in the Github Repository.
Connect with Us:
- Onboard to N3C using the link below.
- In there you will provide your email address. We will add that email address to the CD2H workspace.
- Go to our workstream Slack channel directly using the link provided below.
- Login with your Slack credentials.
The N3C platform produces synthetic data from the limited dataset (LDS) that a site submits. Comparisons between source limited data and ensuing synthetic data are an essential component of the data quality assurance, verification, and validation processes used by N3C. Therefore, sites are required to submit an LDS to N3C in order to create a synthetic dataset. (See the NCATS webpage for more details on the levels of data access.)