ICGC data
The International Cancer Genome Consortium (ICGC) coordinates a global network of research groups that aims to generate and publicly release comprehensive catalogues of genomic, transcriptomic, and epigenomic information across 50 different cancer types and/or subtypes of clinical and societal importance. The ICGC also supports the standardization of clinical information reporting and the dissemination of analytical tools to promote the integration of other datasets with data generated by ICGC member organizations.
ICGC data is available through several distributed repositories. Through the CGC, authorized users can access all data hosted in ICGC's AWS-Virginia repository, which includes whole genome sequencing and RNA sequencing data generated as part of the PanCancer Analysis of Whole Genomes (PCAWG) Study as well as other files which are analyzed using a common set of alignment and variant calling workflows. This data, which is derived from 12 tumor sites represented in more than 2100 (1300 for PCAWG) tissue donors, aims to document the full range of somatic mutations present in the studied tumors, including single-nucleotide variants, insertions, deletions, copy number changes, translocations, and other chromosomal rearrangements at high resolution.
Note that all ICGC data is Controlled Data and that while you can analyze, view, and add ICGC files to your projects on the CGC, you will not be able to download the file or use the RAW text file viewer.
Learn more about the metadata associated with ICGC data on the CGC.
Data type | Number of donors | Number of files | Size | Data format | Data access tier |
---|---|---|---|---|---|
Aligned reads | 1944 | 7868 | 513.62 TB | BAM | Controlled Data |
Copy number somatic mutation | 1338 | 2862 | 71.96 | VCF | Controlled Data |
Simple germline variation | 1338 | 4293 | 262.19 GB | VCF | Controlled Data |
Simple somatic mutation | 1338 | 12879 | 98.38 GB | VCF | Controlled Data |
Structural germline variation | 1338 | 2862 | 4.52 GB | VCF | Controlled Data |
Structural somatic mutation | 1338 | 7137 | 504.24 MB | VCF | Controlled Data |
Unaligned reads | 601 | 1290 | 134.58 TB | BAM, FASTQ | Controlled Data |
The latest data in AWS Virginia is available as of Apr 12 2021 and it includes 2410 donors, 39191 files that are 648.57 TB in size.
Updated less than a minute ago