The [International Cancer Genome Consortium (ICGC)](https://icgc.org/) coordinates a global network of research groups that aims to generate and publicly release comprehensive catalogues of genomic, transcriptomic, and epigenomic information across 50 different cancer types and/or subtypes of clinical and societal importance. The ICGC also supports the standardization of clinical information reporting and the dissemination of analytical tools to promote the integration of other datasets with data generated by ICGC member organizations.

ICGC data is available through several distributed repositories. Through the CGC, authorized users can access all data hosted in ICGC's [AWS-Virginia repository](https://dcc.icgc.org/repositories?filters=%7B%22file%22:%7B%22repoName%22:%7B%22is%22:%5B%22AWS%20-%20Virginia%22%5D%7D,%22study%22:%7B%22is%22:%5B%22PCAWG%22%5D%7D%7D%7D&files=%7B%22from%22:1,%22size%22:25%7D), which includes whole genome sequencing and RNA sequencing data generated as part of the [PanCancer Analysis of Whole Genomes (PCAWG) Study](https://dcc.icgc.org/pcawg), as well as other files that are analyzed using a common set of alignment and variant calling workflows. This data is derived from 12 tumor sites represented in more than 2,100 tissue donors (1,300 for PCAWG). It aims to document, at high resolution, the full range of somatic mutations present in the studied tumors, including single-nucleotide variants, insertions, deletions, copy number changes, translocations, and other chromosomal rearrangements.

Note that all ICGC data is [Controlled Data](doc:dbgap-controlled-data-access). While you can analyze, view, and add ICGC files to your projects on the CGC, you will not be able to [download the files or use the RAW text file viewer](http://docs.icgc.org/cloud/guide/#aws).

Learn more about the [metadata](doc:icgc-metadata) associated with ICGC data on the CGC.

| Data type | Number of donors | Number of files | Size | Data format | Data access tier |
| --- | --- | --- | --- | --- | --- |
| Aligned reads | 1761 | 7347 | 475.71 TB | BAM | Controlled Data |
| Copy number somatic mutation | 1338 | 2862 | 71.96 | VCF | Controlled Data |
| Simple germline variation | 1338 | 4293 | 262.19 GB | VCF | Controlled Data |
| Simple somatic mutation | 1338 | 12879 | 98.38 GB | VCF | Controlled Data |
| Structural germline variation | 1338 | 2862 | 4.52 GB | VCF | Controlled Data |
| Structural somatic mutation | 1338 | 7137 | 504.24 MB | VCF | Controlled Data |
| Unaligned reads | 492 | 1059 | 113.44 TB | BAM, FASTQ | Controlled Data |

As of January 24, 2020, the AWS-Virginia repository contains 38,439 files from 2,119 donors, totaling 589.52 TB.
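
The per-type file counts listed for the AWS-Virginia repository can be cross-checked against the stated total with a few lines of Python. This is only a minimal sketch: the names `files_by_type`, `stated_total`, and `total_files` are illustrative and not part of any CGC or ICGC API, and the values are copied by hand from the figures above. Sizes are left out because the copy number somatic mutation row does not state a unit.

```python
# Number of files per data type, copied by hand from the figures above.
# The dictionary and variable names are illustrative assumptions.
files_by_type = {
    "Aligned reads": 7347,
    "Copy number somatic mutation": 2862,
    "Simple germline variation": 4293,
    "Simple somatic mutation": 12879,
    "Structural germline variation": 2862,
    "Structural somatic mutation": 7137,
    "Unaligned reads": 1059,
}

# Total number of files stated for the repository as of Jan 24 2020.
stated_total = 38439

total_files = sum(files_by_type.values())
matches = total_files == stated_total

print(f"Sum of per-type file counts: {total_files}")
print(f"Matches stated total: {matches}")
```

The file counts add up to the stated total. The same check does not work for donors, because one donor can contribute files of several data types, so the per-type donor counts are not expected to sum to 2119.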