Skip to content

Tutorial Videos

Overview

The GDC Data Portal 2.0 centers around the idea of building cohorts, or groups of cases, before analyzing or downloading data.

In this GDC 2.0 Video Tutorial, learn how to:

  • Build a cohort
  • Analyze a cohort using GDC analysis tools
  • Download data associated with a cohort
  • View projects and available data in the GDC, and filter to create custom cohorts

GDC 2.0 Videos are available in the NCI GDC YouTube Playlist. For additional details, please see the GDC 2.0 User's Guide.

GDC 2.0 Workflow Flowchart

Build Cohort

Cohorts are created using the Cohort Builder, which allows users to specify cases with custom filters for things like disease characteristics, patient demographics, data type, and more.

In this video, learn how to use the Cohort Builder to build a cohort of patients with lung cancer, that are over the age of 50, and have harmonized RNA-Seq data.

Analyze Cohort

Analysis Center

Once a cohort is created, the cohort can be analyzed using an analysis tool in the Analysis Center.

In this video, learn how to analyze a cohort in the Analysis Center. Instructional videos are available for each analysis tool.

Analysis Tools

A cohort can be analyzed by selecting an analysis tool in the Analysis Center. The GDC provides analysis tools for performing both gene-level variant analysis and clinical data analysis. The table below identifies and describes each tool, and provides video instruction.

Tool DescriptionVideo
Clinical Data Analysis Use clinical variables to perform basic statistical analysis of your cohort
Cohort Comparison Display the survival analysis of your cohorts and compare characteristics such as gender, vital status and age at diagnosis
Cohort Level MAF Download an aggregated set of variants in MAF format based on the cases in your active cohort.
Gene Expression Clustering Display gene expression visualization
Mutation Frequency Visualize most frequently mutated genes and somatic mutations
OncoMatrix Visualize the top most mutated cases and genes affected by high impact mutations in your cohort
ProteinPaint Visualize mutations in protein-coding genes by consequence type and protein domain
Sequence Reads Visualize sequencing reads for a given gene, position, SNP, or variant
Set Operations Display a Venn diagram and compare/contrast your cohorts or sets of the same type
BAM Slicing Slice a specific region, gene, or chromosome from a GDC-harmonized BAM file.

Download Cohort Data

Genomic, clinical, and biospecimen data associated with a cohort can be downloaded using the Repository.

In this video, learn how to download data associated with a cohort in the Repository.

View Projects

GDC data is organized by projects within a larger cancer research program. Projects are generally composed of a particular type of cancer, such as the low-grade glioma (LGG) and sarcoma (SARC) projects within the TCGA program.
In this video, learn how to use The GDC Data Portal to access project-level information via the Projects tool and Project Summary Page and navigate and filter data to create custom cohorts.