ICGC data

The International Cancer Genome Consortium (ICGC) coordinates a global network of research groups that aims to generate and publicly release comprehensive catalogues of genomic, transcriptomic, and epigenomic information across 50 different cancer types and/or subtypes of clinical and societal importance. The ICGC also supports the standardization of clinical information reporting and the dissemination of analytical tools to promote the integration of other datasets with data generated by ICGC member organizations.  

ICGC data is available through several distributed repositories. Through the CGC, authorized users can access all data hosted in ICGC's AWS-Virginia repository, which includes whole genome sequencing and RNA sequencing data generated as part of the PanCancer Analysis of Whole Genomes (PCAWG) Study as well as other files which are analyzed using a common set of alignment and variant calling workflows. This data, which is derived from 12 tumor sites represented in more than 2100 (1300 for PCAWG) tissue donors, aims to document the full range of somatic mutations present in the studied tumors, including single-nucleotide variants, insertions, deletions, copy number changes, translocations, and other chromosomal rearrangements at high resolution.

Note that all ICGC data is Controlled Data and that while you can analyze, view, and  add ICGC files to your projects on the CGC, you will not be able to download the file or use the RAW text file viewer.

Learn more about the metadata associated with ICGC data on the CGC.

Data type

Number of donors

Number of files

Size

Data format

Data access tier

Aligned reads

1944

7868

513.62 TB

BAM

Controlled Data

Copy number somatic mutation

1338

2862

71.96

VCF

Controlled Data

Simple germline variation

1338

4293

262.19 GB

VCF

Controlled Data

Simple somatic mutation

1338

12879

98.38 GB

VCF

Controlled Data

Structural germline variation

1338

2862

4.52 GB

VCF

Controlled Data

Structural somatic mutation

1338

7137

504.24 MB

VCF

Controlled Data

Unaligned reads

601

1290

134.58 TB

BAM, FASTQ

Controlled Data

The latest data in AWS Virginia is available as of Apr 12 2021 and it includes 2410 donors, 39191 files that are 648.57 TB in size.


Did this page help you?