Aggregated Somatic Mutation


Aggregated Somatic Mutation is a file type that reports somatic mutations for all of the cases in a project and includes information associated with each mutation.


Aggregated Somatic Mutation files are generated by aggregating all of the case-level annotated VCFs (each from a single case) associated with a project and variant-calling pipeline1. The GDC used a modified version of the vcf2maf script to generate aggregated somatic mutation files2,3.

Data Formats

Aggregated Somatic Mutation files are available in MAF format. MAF files are tab-delimited and associate each mutation with biologically relevant data. See the GDC MAF File Format documentation for a detailed description of this file type4. Aggregated Somatic Mutation files can be downloaded from the GDC Data Portal5.


  1. GDC Data Dictionary - Aggregated Somatic Mutation
  2. GDC DNA-Seq Documentation
  3. vcf2maf GitHub
  4. GDC MAF File Format
  5. GDC Data Portal
  • N/A

Categories: Data Type