{"_id":"59025b784c3b710f0082157f","parentDoc":null,"user":"5613e4f8fdd08f2b00437620","version":{"_id":"55faf11ba62ba1170021a9aa","project":"55faf11ba62ba1170021a9a7","__v":38,"createdAt":"2015-09-17T16:58:03.490Z","releaseDate":"2015-09-17T16:58:03.490Z","categories":["55faf11ca62ba1170021a9ab","55faf8f4d0e22017005b8272","55faf91aa62ba1170021a9b5","55faf929a8a7770d00c2c0bd","55faf932a8a7770d00c2c0bf","55faf94b17b9d00d00969f47","55faf958d0e22017005b8274","55faf95fa8a7770d00c2c0c0","55faf96917b9d00d00969f48","55faf970a8a7770d00c2c0c1","55faf98c825d5f19001fa3a6","55faf99aa62ba1170021a9b8","55faf99fa62ba1170021a9b9","55faf9aa17b9d00d00969f49","55faf9b6a8a7770d00c2c0c3","55faf9bda62ba1170021a9ba","5604570090ee490d00440551","5637e8b2fbe1c50d008cb078","5649bb624fa1460d00780add","5671974d1b6b730d008b4823","5671979d60c8e70d006c9760","568e8eef70ca1f0d0035808e","56d0a2081ecc471500f1795e","56d4a0adde40c70b00823ea3","56d96b03dd90610b00270849","56fbb83d8f21c817002af880","573c811bee2b3b2200422be1","576bc92afb62dd20001cda85","5771811e27a5c20e00030dcd","5785191af3a10c0e009b75b0","57bdf84d5d48411900cd8dc0","57ff5c5dc135231700aed806","5804caf792398f0f00e77521","58458b4fba4f1c0f009692bb","586d3c287c6b5b2300c05055","58ef66d88646742f009a0216","58f5d52d7891630f00fe4e77","59a555bccdbd85001bfb1442"],"is_deprecated":false,"is_hidden":false,"is_beta":true,"is_stable":true,"codename":"","version_clean":"1.0.0","version":"1.0"},"__v":0,"project":"55faf11ba62ba1170021a9a7","category":{"_id":"58458b4fba4f1c0f009692bb","project":"55faf11ba62ba1170021a9a7","version":"55faf11ba62ba1170021a9aa","__v":0,"sync":{"url":"","isSync":false},"reference":false,"createdAt":"2016-12-05T15:44:15.650Z","from_sync":false,"order":6,"slug":"datasets-hub","title":"DATASETS HUB"},"updates":[],"next":{"pages":[],"description":""},"createdAt":"2017-04-27T20:58:32.965Z","link_external":false,"link_url":"","githubsync":"","sync_unique":"","hidden":true,"api":{"results":{"codes":[]},"settings":"","auth":"required","params":[],"url":""},"isReference":false,"order":7,"body":"##Overview\n\n[The Cancer Imaging Archive (TCIA)](http://www.cancerimagingarchive.net/) contains radiological imaging data from [The Cancer Genome Atlas (TCGA)](http://cancergenome.nih.gov/) and is part of an effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects. TCIA includes radiological images which represent 21 types of cancer detailed in TCGA. All images are accessible for public use. These images are de-identified to ensure that images are free of protected health information (PHI), and are stored in a standard [DICOM](https://en.wikipedia.org/wiki/DICOM) format.\n\n<div align=\"right\"><a href=\"#top\">top</a></div>\n\n##Distribution of the data\n\nSee below for an overview of the number of subjects and the image modalities (such as MRI or CT) of the data, grouped by different cancer types (“Collections”)  in the TCIA public project. See a [full list of cancer type abbreviations](https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/tcga-study-abbreviations) and a [full list of DICOM image modality abbreviations](https://wiki.cancerimagingarchive.net/display/Public/DICOM+Modality+Abbreviations).\n[block:parameters]\n{\n  \"data\": {\n    \"h-0\": \"Collection\",\n    \"h-1\": \"Subjects\",\n    \"h-2\": \"Modalities\",\n    \"0-0\": \"TCGA-KIRC\",\n    \"0-1\": \"267\",\n    \"0-2\": \"CT, MR, CR\",\n    \"1-0\": \"TCGA-GBM\",\n    \"1-1\": \"262\",\n    \"1-2\": \"MR, CT, DX\",\n    \"2-0\": \"TCGA-LGG\",\n    \"2-1\": \"199\",\n    \"2-2\": \"MR, CT\",\n    \"3-0\": \"TCGA-HNSC\",\n    \"3-1\": \"192\",\n    \"3-2\": \"CT, MR, PT, RTSTRUCT, RTPLAN, RTDOSE\",\n    \"4-0\": \"TCGA-OV\",\n    \"4-1\": \"143\",\n    \"4-2\": \"CT, MR\",\n    \"5-0\": \"TCGA-BRCA\",\n    \"5-1\": \"139\",\n    \"5-2\": \"MR, MG\",\n    \"6-1\": \"97\",\n    \"6-0\": \"TCGA-BLCA\",\n    \"6-2\": \"CT, CR, MR, PT\",\n    \"7-0\": \"TCGA-LIHC\",\n    \"7-1\": \"97\",\n    \"7-2\": \"MR, CT, PT\",\n    \"8-0\": \"TCGA-LUAD\",\n    \"8-1\": \"69\",\n    \"8-2\": \"CT, PT, NM\",\n    \"9-0\": \"TCGA-UCEC\",\n    \"9-1\": \"58\",\n    \"9-2\": \"CT, CR, MR, PT\",\n    \"10-0\": \"TCGA-CESC\",\n    \"10-1\": \"54\",\n    \"10-2\": \"MR\",\n    \"11-0\": \"TCGA-STAD\",\n    \"11-1\": \"46\",\n    \"12-0\": \"TCGA-LUSC\",\n    \"11-2\": \"CT\",\n    \"12-1\": \"37\",\n    \"12-2\": \"CT, NM, PT\",\n    \"13-0\": \"TCGA-KIRP\",\n    \"13-1\": \"33\",\n    \"13-2\": \"CT, MR, PT\",\n    \"14-0\": \"TCGA-COAD\",\n    \"14-1\": \"25\",\n    \"14-2\": \"CT\",\n    \"15-0\": \"TCGA-ESCA\",\n    \"15-1\": \"16\",\n    \"15-2\": \"CT\",\n    \"16-0\": \"TCGA-KICH\",\n    \"16-1\": \"15\",\n    \"16-2\": \"CT, MR\",\n    \"17-0\": \"TCGA-PRAD\",\n    \"17-1\": \"14\",\n    \"17-2\": \"CT, PT, MR\",\n    \"18-0\": \"TCGA-THCA\",\n    \"18-1\": \"6\",\n    \"18-2\": \"CT, PT\",\n    \"19-0\": \"TCGA-SARC\",\n    \"19-1\": \"5\",\n    \"19-2\": \"CT, MR\",\n    \"20-0\": \"TCGA-READ\",\n    \"20-1\": \"3\",\n    \"20-2\": \"CT, MR\"\n  },\n  \"cols\": 3,\n  \"rows\": 21\n}\n[/block]\n<div align=\"right\"><a href=\"#top\">top</a></div>\n\n##TCIA Metadata\n\nEach TCIA file on the CGC contains a set of images acquired during the same scanning mode in a compressed file format. The following metadata are also set for each file when available:\n[block:parameters]\n{\n  \"data\": {\n    \"0-0\": \"Case UUID\",\n    \"h-0\": \"Property\",\n    \"h-1\": \"Description\",\n    \"0-1\": \"A Universally Unique Identifier (UUID) for the sample or files of a case.\",\n    \"1-0\": \"Case ID\",\n    \"1-1\": \"A human-readable identifier, such as a number or a string that may contain metadata information. This identifier is often referred as submitter ID.\",\n    \"2-0\": \"Ethnicity\",\n    \"2-1\": \"A socially defined category of people based on common ancestral, cultural, biological, and social factors. See NCI Thesaurus Code: C29933.\",\n    \"3-0\": \"Gender\",\n    \"3-1\": \"The collection of behaviors and attitudes that distinguish people on the basis of the societal roles expected for the two sexes. See NCI Thesaurus Code: C17357.\",\n    \"4-0\": \"Race\",\n    \"4-1\": \"A classification of humans characterized by certain heritable traits, common history, nationality, or geographic distribution. See NCI Thesaurus Code: C17049.\",\n    \"5-0\": \"Investigation\",\n    \"5-1\": \"A value denoting the project or study that generated the data. See NCI Thesaurus Code: C41198.\",\n    \"6-0\": \"Age at diagnosis\",\n    \"6-1\": \"The age in years of the case at the initial pathological diagnosis of disease or cancer. See NCI Thesaurus Code: C15220.\",\n    \"7-0\": \"Primary site\",\n    \"8-0\": \"Disease type\",\n    \"7-1\": \"The anatomical site where the primary tumor is located in the organism. See NCI Thesaurus Code: C43761.\",\n    \"8-1\": \"The type of the disease or condition studied. See NCI Thesaurus Code: C2991.\",\n    \"9-0\": \"Vital status\",\n    \"9-1\": \"The state of being living or deceased for cases that are part of the investigation. See NCI Thesaurus Code: C25717.\",\n    \"10-0\": \"Days to death\",\n    \"10-1\": \"The number of days from the date of the initial pathological diagnosis to the date of death for the case in the investigation.\",\n    \"11-0\": \"Series date\",\n    \"11-1\": \"Date the Series was acquired.\",\n    \"12-0\": \"Manufacturer\",\n    \"12-1\": \"Manufacturer's name of the equipment that produced the composite instances.\",\n    \"13-0\": \"Body part examined\",\n    \"13-1\": \"Text description of the part of the body examined.\",\n    \"14-0\": \"Modality\",\n    \"14-1\": \"Type of equipment that originally acquired the data.\",\n    \"15-0\": \"Protocol name\",\n    \"15-1\": \"User-defined description of the conditions under which the Series was performed.\",\n    \"16-0\": \"Manufacturer model name\",\n    \"16-1\": \"Manufacturer's model name of the equipment that produced the composite instances.\",\n    \"17-0\": \"Series description\",\n    \"17-1\": \"User provided description of the Series.\",\n    \"18-0\": \"Software versions\",\n    \"18-1\": \"Manufacturer's designation of software version of the equipment that produced the composite instances.\",\n    \"19-0\": \"Image count\",\n    \"19-1\": \"Number of images in this series.\"\n  },\n  \"cols\": 2,\n  \"rows\": 20\n}\n[/block]\n<div align=\"right\"><a href=\"#top\">top</a></div>\n\n##Access TCIA data\n\nAccess a repository of TCIA files via the [TCIA public project](doc:the-cancer-imaging-archive-tcia-project) .\n\n**Note that you cannot currently query the TCIA dataset via the Data Browser.**\n\n<div align=\"right\"><a href=\"#top\">top</a></div>","excerpt":"","slug":"tcia-data","type":"basic","title":"TCIA data"}
##Overview [The Cancer Imaging Archive (TCIA)](http://www.cancerimagingarchive.net/) contains radiological imaging data from [The Cancer Genome Atlas (TCGA)](http://cancergenome.nih.gov/) and is part of an effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects. TCIA includes radiological images which represent 21 types of cancer detailed in TCGA. All images are accessible for public use. These images are de-identified to ensure that images are free of protected health information (PHI), and are stored in a standard [DICOM](https://en.wikipedia.org/wiki/DICOM) format. <div align="right"><a href="#top">top</a></div> ##Distribution of the data See below for an overview of the number of subjects and the image modalities (such as MRI or CT) of the data, grouped by different cancer types (“Collections”) in the TCIA public project. See a [full list of cancer type abbreviations](https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/tcga-study-abbreviations) and a [full list of DICOM image modality abbreviations](https://wiki.cancerimagingarchive.net/display/Public/DICOM+Modality+Abbreviations). [block:parameters] { "data": { "h-0": "Collection", "h-1": "Subjects", "h-2": "Modalities", "0-0": "TCGA-KIRC", "0-1": "267", "0-2": "CT, MR, CR", "1-0": "TCGA-GBM", "1-1": "262", "1-2": "MR, CT, DX", "2-0": "TCGA-LGG", "2-1": "199", "2-2": "MR, CT", "3-0": "TCGA-HNSC", "3-1": "192", "3-2": "CT, MR, PT, RTSTRUCT, RTPLAN, RTDOSE", "4-0": "TCGA-OV", "4-1": "143", "4-2": "CT, MR", "5-0": "TCGA-BRCA", "5-1": "139", "5-2": "MR, MG", "6-1": "97", "6-0": "TCGA-BLCA", "6-2": "CT, CR, MR, PT", "7-0": "TCGA-LIHC", "7-1": "97", "7-2": "MR, CT, PT", "8-0": "TCGA-LUAD", "8-1": "69", "8-2": "CT, PT, NM", "9-0": "TCGA-UCEC", "9-1": "58", "9-2": "CT, CR, MR, PT", "10-0": "TCGA-CESC", "10-1": "54", "10-2": "MR", "11-0": "TCGA-STAD", "11-1": "46", "12-0": "TCGA-LUSC", "11-2": "CT", "12-1": "37", "12-2": "CT, NM, PT", "13-0": "TCGA-KIRP", "13-1": "33", "13-2": "CT, MR, PT", "14-0": "TCGA-COAD", "14-1": "25", "14-2": "CT", "15-0": "TCGA-ESCA", "15-1": "16", "15-2": "CT", "16-0": "TCGA-KICH", "16-1": "15", "16-2": "CT, MR", "17-0": "TCGA-PRAD", "17-1": "14", "17-2": "CT, PT, MR", "18-0": "TCGA-THCA", "18-1": "6", "18-2": "CT, PT", "19-0": "TCGA-SARC", "19-1": "5", "19-2": "CT, MR", "20-0": "TCGA-READ", "20-1": "3", "20-2": "CT, MR" }, "cols": 3, "rows": 21 } [/block] <div align="right"><a href="#top">top</a></div> ##TCIA Metadata Each TCIA file on the CGC contains a set of images acquired during the same scanning mode in a compressed file format. The following metadata are also set for each file when available: [block:parameters] { "data": { "0-0": "Case UUID", "h-0": "Property", "h-1": "Description", "0-1": "A Universally Unique Identifier (UUID) for the sample or files of a case.", "1-0": "Case ID", "1-1": "A human-readable identifier, such as a number or a string that may contain metadata information. This identifier is often referred as submitter ID.", "2-0": "Ethnicity", "2-1": "A socially defined category of people based on common ancestral, cultural, biological, and social factors. See NCI Thesaurus Code: C29933.", "3-0": "Gender", "3-1": "The collection of behaviors and attitudes that distinguish people on the basis of the societal roles expected for the two sexes. See NCI Thesaurus Code: C17357.", "4-0": "Race", "4-1": "A classification of humans characterized by certain heritable traits, common history, nationality, or geographic distribution. See NCI Thesaurus Code: C17049.", "5-0": "Investigation", "5-1": "A value denoting the project or study that generated the data. See NCI Thesaurus Code: C41198.", "6-0": "Age at diagnosis", "6-1": "The age in years of the case at the initial pathological diagnosis of disease or cancer. See NCI Thesaurus Code: C15220.", "7-0": "Primary site", "8-0": "Disease type", "7-1": "The anatomical site where the primary tumor is located in the organism. See NCI Thesaurus Code: C43761.", "8-1": "The type of the disease or condition studied. See NCI Thesaurus Code: C2991.", "9-0": "Vital status", "9-1": "The state of being living or deceased for cases that are part of the investigation. See NCI Thesaurus Code: C25717.", "10-0": "Days to death", "10-1": "The number of days from the date of the initial pathological diagnosis to the date of death for the case in the investigation.", "11-0": "Series date", "11-1": "Date the Series was acquired.", "12-0": "Manufacturer", "12-1": "Manufacturer's name of the equipment that produced the composite instances.", "13-0": "Body part examined", "13-1": "Text description of the part of the body examined.", "14-0": "Modality", "14-1": "Type of equipment that originally acquired the data.", "15-0": "Protocol name", "15-1": "User-defined description of the conditions under which the Series was performed.", "16-0": "Manufacturer model name", "16-1": "Manufacturer's model name of the equipment that produced the composite instances.", "17-0": "Series description", "17-1": "User provided description of the Series.", "18-0": "Software versions", "18-1": "Manufacturer's designation of software version of the equipment that produced the composite instances.", "19-0": "Image count", "19-1": "Number of images in this series." }, "cols": 2, "rows": 20 } [/block] <div align="right"><a href="#top">top</a></div> ##Access TCIA data Access a repository of TCIA files via the [TCIA public project](doc:the-cancer-imaging-archive-tcia-project) . **Note that you cannot currently query the TCIA dataset via the Data Browser.** <div align="right"><a href="#top">top</a></div>