Protein Expression - RPPA

Introduction

Reverse Phase Protein Array (RPPA) is a high-throughput antibody-based technique with a procedure similar to that of Western blots. In the procedure carried by MD Anderson Cancer Center, hundreds to thousands of different cell lysates are immobilized on a nitrocellulose-coated slide as many individual spots, followed by incubations with one protein-specific antibody, and detection. A group (often several hundreds) of antibodies form a set, which are used for each assay. Occasionally, antibodies may be added to or removed from the set depending on feasibility/functionality, which forms a new set.

To quantify protein expression, a "standard curve" is constructed from spots on each slide (one slide probed for one antibody). These spots include serial dilutions of each sample plus QC spots of standard lysates at different concentrations.

The technique is capable of the following types of analyses:

  • Patient tumor classification
  • DNA, RNA, and Protein correlation
  • Prognosis
  • Response prediction for targeted therapies
  • Pharmacodynamics and biologically relevant dose
  • Determination of appropriate handling procedures for clinical samples (based on antigen stability analysis)

RPPA Data in the GDC

The antigens used for RPPA are available at ref [5], with the following information available:

  • AGID: The antigen unique ID
  • peptide_target: The unique ID for the target site that the antigen binds to
  • gene_symbol: The unique gene name abbreviation that codes the peptide
  • antibody_origin: The species that the antibody originated from
  • source: The antibody vendor company
  • catalog_number: Antibody vendor's catalog number
  • validation_status: Indicating how trustworthy those antibodies are, based on QC tests of antibody quality by the MD Anderson.

The GDC protein expression quantification data set is available in TSV format and contains AGID, catalog_number, and peptide_target from the reference file, plus the following fields:

  • lab_id: The unique antibody ID
  • set_id: The ID for a set, ie list of antibodies (eg refs [3] & [4]).
  • protein_expression: Relative levels of protein expression - interpolation of each dilution curve to the "standard curve" (supercurve) of the slide (antibody).

References

[1]. https://bioinformatics.mdanderson.org/public-software/tcpa/

[2]. https://www.mdanderson.org/research/research-resources/core-facilities/functional-proteomics-rppa-core/rppa-process.html

[3]. https://www.mdanderson.org/content/dam/mdanderson/documents/core-facilities/Functional%20Proteomics%20RPPA%20Core%20Facility/RPPA_Expanded_Ab_List_Updated.xlsx

[4]. https://www.mdanderson.org/content/dam/mdanderson/documents/core-facilities/Functional%20Proteomics%20RPPA%20Core%20Facility/RPPA_Standard_Ab_List_Updated.xlsx

[5]. https://gdc.cancer.gov/about-data/gdc-data-processing/gdc-reference-files