The GDC Legacy Archive hosts unharmonized legacy data from repositories that predate the GDC (e.g. CGHub). Legacy data is not actively maintained, processed, or harmonized by the GDC. Legacy users are encouraged to migrate to harmonized datasets.
The GDC Legacy Archive can be accessed from the GDC Data Portal front page as well as from the "GDC Apps" menu.
The GDC Legacy Archive contains a limited set of features of the GDC Data Portal:
- Facet search: Ability to look for legacy files or legacy annotations based on case, file and annotation facets.
- File and Annotation tables: List of all the legacy files and list of all the legacy annotations.
- File and Annotation detail pages: Information page for each legacy file and annotation.
- Cart: The GDC Legacy Archive and the GDC Data Portal are separate systems with separate download carts.
The file page of the GDC Legacy Archive is similar to the file page of the GDC Data Portal. It does not include the Workflow, Reference Genome, and Read Groups sections as these are only applicable to harmonized data available in the GDC Data Portal. The Legacy Archive includes additional archive information as described below.
If a file was originally produced as part of an archive containing other files, the archive information (Archive ID and number of files in the archive) is displayed in the file properties and, if selected, the user will see a list of files containing all other files in that archive.
If a file has any associated MAGE-TAB or SRA XML metadata files, these files will be listed at the bottom of the page. These files will can be downloaded directly from here. Alternatively, metadata files can be downloaded from the file cart.
The file cart in the GDC Legacy Archive is analogous to the file cart of the GDC Data Portal. It provides an additional button to download any SRA-XML and MAGE-TAB metadata files associated with the files in the cart.