##Overview

The <a href="https://www.simonsfoundation.org/life-sciences/simons-genome-diversity-project-dataset/" target="blank">Simons Genome Diversity Project (SGDP) dataset</a> is made possible by the <a href="https://www.simonsfoundation.org/about-us/" target="blank">Simons Foundation</a>. The dataset contains complete genome sequences from more than one hundred diverse human populations. It is the largest dataset of diverse, high-quality human genome sequences ever reported. To represent as much anthropological, linguistic, and cultural diversity as possible, the dataset includes many deeply divergent human populations that are not well represented in other datasets.

<div align="right"><a href="#top">top</a></div>

##Distribution of the data

The SGDP public project contains Open Access whole-genome sequencing data for **279 samples**.
[block:image]
{
  "images": [
    {
      "image": [
        "https://files.readme.io/cd9225a-sgdp.jpg",
        "sgdp.jpg",
        2828,
        904,
        "#e0dfe0"
      ],
      "border": false
    }
  ]
}
[/block]
By geographic region, the SGDP dataset comprises 44 Africans, 22 Native Americans, 27 Central Asians or Siberians, 47 East Asians, 25 Oceanians, 39 South Asians, and 75 West Eurasians.

<div align="right"><a href="#top">top</a></div>

##SGDP metadata

Learn more about SGDP metadata:

1. Access the [Nature article about SGDP](http://www.nature.com/nature/journal/v538/n7624/full/nature18964.html#supplementary-information).
2. Look under **Excel files**.
3. Select **Supplementary Table 1**. Note that this downloads a local copy of the spreadsheet.
4. Open your local copy of the spreadsheet and filter for **X** in **Column G**. This displays all the Open Access SGDP data that the CGC has made available in its [Simons Genome Diversity Project (SGDP) public project](doc:simons-genome-diversity-project-sgdp-dataset).
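
If you prefer to apply the Column G filter in a script rather than in a spreadsheet application, the filtering step above can be sketched in Python. This is a minimal sketch under stated assumptions, not an official CGC or SGDP tool: it assumes you have exported Supplementary Table 1 to CSV yourself, that the first row is a header, and that Open Access rows are marked with exactly `X` in the seventh column (Column G). The function names and the sample rows are made up for illustration and do not reflect the real table's columns or values.

```python
import csv
import io

COLUMN_G = 6  # zero-based index of spreadsheet column G (A=0, B=1, ..., G=6)


def open_access_rows(rows, column=COLUMN_G, marker="X"):
    """Return the rows whose cell in `column` equals `marker`.

    Surrounding whitespace in the cell is ignored. Rows that are too
    short to have the requested column are skipped.
    """
    selected = []
    for row in rows:
        if len(row) > column and row[column].strip() == marker:
            selected.append(row)
    return selected


def read_csv_rows(text):
    """Parse CSV text into (header, data_rows), assuming one header row."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    return header, list(reader)


# Hypothetical stand-in for an exported copy of the spreadsheet.
SAMPLE_CSV = (
    "A,B,C,D,E,F,G\n"
    "s1,,,,,,X\n"
    "s2,,,,,,\n"
    "s3,,,,,, X \n"
)

header, data = read_csv_rows(SAMPLE_CSV)
kept = open_access_rows(data)
print([row[0] for row in kept])
```

To use this on your own export, read the CSV file's text and pass it to `read_csv_rows` in place of `SAMPLE_CSV`. Check the header row first to confirm that the seventh column is the one you expect.
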

<div align="right"><a href="#top">top</a></div>

##Access SGDP data

Access a repository of SGDP files via the [SGDP public project](doc:simons-genome-diversity-project-sgdp-dataset).

**Note that you cannot currently query the SGDP dataset via the Data Browser.**

<div align="right"><a href="#top">top</a></div>