Controlled Access


Data in the GDC is considered either open or controlled access. Access to controlled-access (i.e. protected) data in the GDC is granted on a per project basis via the database of Genotypes and Phenotypes (dbGaP).


While much of the data in the GDC is open access, many file types are controlled access. This primarily includes raw sequencing data such as BAM or FASTQ files as well as VCF files and protected MAF files. The informed consent under which the data or samples were collected is the basis for the submitting institution to determine whether the data should be available through unrestricted or controlled access. To gain access to these files a user must apply for access via dbGaP to individual projects1. Each project has a Data Access Committee (DAC) that will approve or disapprove data access requests. Before gaining access through dbGaP users also need to obtain an eRA Commons ID for authentication purposes2.

Genomic data access is governed by the NIH's Genomic Data Sharing Policy.


  1. GDC Granting Access to Controlled Data

Categories: General