CCLE metadata


[block:callout]
{
  "type": "warning",
  "title": "On this page:",
  "body": "* [Overview](#section-overview)\n* [CCLE Cell line](#section-ccle-cell-line)\n* [Aliquot](#section-aliquot)\n* [File](#section-file)"
}
[/block]
##Overview

Metadata is data that describes other data. This page details the CCLE metadata available for viewing and filtering Cancer Cell Line Encyclopedia (CCLE) data in the Data Browser, the Datasets API, and the SPARQL endpoint on the CGC. The CCLE contains Open Access sequencing data in the form of reads aligned to the hg19 reference genome for nearly 1000 cancer cell line samples, as available from cgHub on May 11, 2016.

CCLE metadata on the CGC consists of **entities** and their **properties**.

**Entities** are particular resources with UUIDs, such as files, cases, samples, and cell lines.

**Properties** can either describe an entity or relate that entity to another entity. For instance, properties include an entity's vital status, gender, data format, or experimental strategy.

Entities for CCLE include:
  * **CCLE Cell line**, which represents data generated for each cell line. Dependent elements include biospecimen data such as **Sample** and clinical data such as **Investigation**.
  * **Aliquot**
  * **File**

Below, each of these three entities is followed by a table of its related properties.

<div align="right"><a href="#top">top</a></div>

##CCLE Cell line

The CCLE Cell line entity represents cell lines, which are permanently established cell cultures that will proliferate indefinitely given appropriate fresh medium and space. The CCLE Cell line entity contains these cell lines' clinical and biospecimen data. See the table below for the clinical and biospecimen properties of **CCLE Cell line** and their descriptions.
[block:parameters]
{
  "data": {
    "h-0": "Property",
    "h-1": "Description",
    "0-0": "ID",
    "1-0": "Program",
    "2-0": "Investigation",
    "3-0": "Gender",
    "4-0": "Disease type",
    "5-0": "Disease type abbreviation",
    "6-0": "Primary site",
    "7-0": "Histologic diagnosis",
    "8-0": "Histology",
    "9-0": "Note",
    "10-0": "Sample name",
    "11-0": "Sample type",
    "12-0": "Sample type code",
    "13-0": "Source",
    "0-1": "A human-readable identifier, such as a number or a string that may contain information about the entity. This identifier is often referred to as the submitter ID.",
    "1-1": "The research program under which the data was generated. See NCI Thesaurus Code: C82662.",
    "2-1": "A value denoting the project or study that generated the data. See NCI Thesaurus Code: C41198.",
    "3-1": "The collection of behaviors and attitudes that distinguish people on the basis of the societal roles expected for the two sexes. See NCI Thesaurus Code: C17357.",
    "4-1": "The type of the disease or condition studied. See NCI Thesaurus Code: C2991.",
    "5-1": "An acronym or initials for the disease or condition studied. See NCI Thesaurus Code: C2991.",
    "6-1": "The anatomical site where the primary tumor is located in the organism. See NCI Thesaurus Code: C43761.",
    "7-1": "Diagnosis of a disease based on the type of tissue, where the type is determined by microscopic examination of the tissue. See NCI Thesaurus Code: C61478.",
    "8-1": "The study of the structure of the cells and their arrangements to constitute tissues and the association among these to form organs. In pathology, the microscopic process of identifying normal and abnormal morphologic characteristics in tissues, by employing various cytochemical and immunocytochemical stains. See NCI Thesaurus Code: C16681.",
    "9-1": "A brief written record that provides information on relations between cell lines. For instance, a note may mention that two cell lines come from the same patient. See NCI Thesaurus Code: C42619.",
    "10-1": "A specific name given to material taken from a biological entity for testing, diagnosis, propagation, treatment, or research purposes, including but not limited to tissues, body fluids, cells, organs, embryos, and body excretory products. See NCI Thesaurus Code: C70713.",
    "11-1": "The type of material taken from a biological entity for testing, diagnosis, propagation, treatment, or research purposes. This includes tissues, body fluids, cells, organs, embryos, and body excretory products. See NCI Thesaurus Code: C70713.",
    "12-1": "A code that identifies the type of material taken from a biological entity for testing, diagnosis, propagation, treatment, or research purposes. This includes tissues, body fluids, cells, organs, embryos, and body excretory products. See NCI Thesaurus Code: C70713.",
    "13-1": "The commercial vendor or academic lab from which the cell line was obtained."
  },
  "cols": 2,
  "rows": 14
}
[/block]
<div align="right"><a href="#top">top</a></div>

##Aliquot

The aliquot entity in the CCLE metadata schema refers to aliquots: products or units extracted from a portion of a sample or specimen and prepared for analysis. Members of the aliquot entity can be identified by a Universally Unique Identifier (UUID). See below for metadata properties and descriptions relating to the aliquot entity.
[block:parameters]
{
  "data": {
    "h-0": "Property",
    "h-1": "Description",
    "0-0": "ID",
    "0-1": "A human-readable identifier, such as a number or a string that may contain metadata information. This identifier is often referred to as the submitter ID."
  },
  "cols": 2,
  "rows": 1
}
[/block]
<div align="right"><a href="#top">top</a></div>

##File

The file entity in the CCLE metadata schema refers to the files in CCLE produced by aliquot analyses. See below for metadata properties and descriptions relating to the file entity.
[block:parameters]
{
  "data": {
    "h-0": "Property",
    "h-1": "Description",
    "0-0": "Analyte type",
    "1-0": "File size",
    "2-0": "Data format",
    "3-0": "Experimental strategy",
    "4-0": "Platform",
    "5-0": "Data submitting center",
    "6-0": "Data submitting center code",
    "7-0": "Last modified date",
    "8-0": "Published date",
    "9-0": "Storage path",
    "10-0": "Reference genome",
    "11-0": "Access level",
    "12-0": "Submitter ID",
    "0-1": "The type of the analyte, defined on a molecular basis.",
    "1-1": "Size of a file measured in bytes (B), kilobytes (KB), megabytes (MB), gigabytes (GB), terabytes (TB), or larger units.",
    "2-1": "The type of format that determines data content.",
    "3-1": "The method or protocol used to perform the laboratory analysis. See NCI Thesaurus Code: C43622.",
    "4-1": "The version (for instance, manufacturer or model) of the technology that was used for sequencing or assaying. See NCI Thesaurus Code: C45378.",
    "5-1": "A string denoting the name of the center that submitted the data.",
    "6-1": "An alphanumeric code assigned to the center that submitted the data.",
    "7-1": "Date the file was last modified.",
    "8-1": "Date the file was published.",
    "9-1": "The storage path of the file.",
    "10-1": "The reference assembly (such as HG19 or GRCh37) to which the nucleotide sequence of a case can be aligned.",
    "11-1": "A boolean value indicating Controlled Data or Open Data. Controlled Data is data from public datasets that has limitations on use and requires approval by dbGaP. Open Data is data from public datasets that doesn't have limitations on its use.",
    "12-1": "Analytical identification assigned by the center that submitted the data."
  },
  "cols": 2,
  "rows": 13
}
[/block]
<div align="right"><a href="#top">top</a></div>
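
To make the relationship between entities and properties more concrete, the sketch below models the three CCLE entities as plain Python records and filters files by their properties. It is only an illustration: it does not call the Datasets API or the SPARQL endpoint, and the `Entity` class, the `filter_entities` helper, the relation names, and all sample values are hypothetical placeholders rather than actual CCLE records or CGC identifiers.

```python
from dataclasses import dataclass, field
from typing import Dict, List
from uuid import uuid4


@dataclass
class Entity:
    """A resource identified by a UUID (hypothetical model, not a CGC class)."""
    kind: str                    # "CCLE Cell line", "Aliquot", or "File"
    properties: Dict[str, str]   # properties that describe the entity
    # Properties that relate this entity to another entity.
    relations: Dict[str, "Entity"] = field(default_factory=dict)
    uuid: str = field(default_factory=lambda: str(uuid4()))


def filter_entities(entities: List[Entity], kind: str,
                    criteria: Dict[str, str]) -> List[Entity]:
    """Return entities of one kind whose properties match every criterion."""
    return [
        e for e in entities
        if e.kind == kind
        and all(e.properties.get(name) == value
                for name, value in criteria.items())
    ]


# Made-up example records; the values are placeholders, not real CCLE data.
cell_line = Entity("CCLE Cell line",
                   {"ID": "EXAMPLE-CELL-LINE", "Primary site": "Lung"})
aliquot = Entity("Aliquot", {"ID": "EXAMPLE-ALIQUOT"},
                 relations={"cell line": cell_line})
bam = Entity("File",
             {"Data format": "BAM", "Experimental strategy": "RNA-Seq",
              "Access level": "Open", "Reference genome": "HG19"},
             relations={"aliquot": aliquot})
other = Entity("File",
               {"Data format": "BAM", "Experimental strategy": "WXS",
                "Access level": "Open"},
               relations={"aliquot": aliquot})

entities = [cell_line, aliquot, bam, other]

# Filter files by two descriptive properties, then follow the relations
# from each matching file back to the cell line it was derived from.
matches = filter_entities(
    entities, "File",
    {"Experimental strategy": "RNA-Seq", "Access level": "Open"},
)
for f in matches:
    source = f.relations["aliquot"].relations["cell line"]
    print(f.uuid, "->", source.properties["ID"])
```

Keeping descriptive properties and relations in separate dictionaries mirrors the distinction drawn in the Overview: a property either describes an entity or links it to another one. A real query against the CGC would express the same filter through the Data Browser, the Datasets API, or SPARQL instead of an in-memory list.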