Import data from the CDS

For information on currently available CDS data on the CGC, see the details and history of CDS data updates.

About the CDS

The Cancer Data Service (CDS) is a data repository under the NCI's Cancer Research Data Commons (CRDC) infrastructure for storing cancer research data generated by NCI-funded programs. Its data is stored in the Database of Genotypes and Phenotypes (dbGaP), provided by the National Center for Biotechnology Information (NCBI). The CDS hosts datasets that contain controlled-access data, with access permissions controlled by dbGaP.

The process of importing files from the CDS to the Cancer Genomics Cloud (CGC) consists of the following two stages:

  • Downloading a manifest file from the CDS website.
  • Importing files to the CGC based on the downloaded manifest file.

Download a manifest file from the CDS

Manifest files that are downloaded from the CDS contain information about the data you want to import in the second stage of this process.

To download a manifest file from the CDS:

  1. Open the CDS website.

  2. Click EXPLORE CDS PORTAL.

  3. (Optional) In the Filters pane, use the available filtering options to narrow down the search results.

  4. Select the Files tab below the chart. A list of all files is displayed below.

  5. Select the files you want to import to the CGC and click Add Selected Files.

  6. Next, click the shopping cart icon in the upper-right corner. The icon also shows the number of selected files.

  7. Click Download manifest and save the manifest to your computer.
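
Before moving on, it can help to check that the downloaded manifest looks as expected. The following is a minimal sketch, not part of the CDS or CGC tooling: it assumes the manifest is a comma- or tab-delimited text file with a header row, and the function name `summarize_manifest` is hypothetical. It makes no assumptions about the column names and only reports what it finds.

```python
import csv
import io
from pathlib import Path


def summarize_manifest(path):
    """Return (column_names, row_count) for a delimited manifest file.

    Assumption: the file is CSV or TSV with a single header row.
    """
    # utf-8-sig drops a byte-order mark if the file starts with one.
    text = Path(path).read_text(encoding="utf-8-sig")
    header = text.splitlines()[0] if text else ""
    # Guess the delimiter from the header line: tabs win only if more frequent.
    delimiter = "\t" if header.count("\t") > header.count(",") else ","
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    rows = list(reader)
    return list(reader.fieldnames or []), len(rows)


if __name__ == "__main__":
    import sys

    columns, count = summarize_manifest(sys.argv[1])
    print("Columns:", ", ".join(columns))
    print("Data rows:", count)
```

If the manifest lists one file per row (an assumption to verify against your own download), the row count should match the number of selected files shown on the cart icon.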

The rest of the steps are done on the CGC.

Import files to the CGC

  1. Access the project you want to import the files to.

  2. Click the Files tab.

  3. Click Add files and choose "Import from a manifest file".

  4. Next, choose Cancer Data Service (CDS) from the menu.

  5. Click Browse files and locate the manifest file you have previously downloaded from the CDS Portal.

  6. (Optional) In the Add tags field, add the keywords (tags) that describe the imported items.

  7. Resolve naming conflicts - Select the action to be taken if a naming conflict occurs. Available actions are Skip and Auto Rename. Read more about naming conflict resolution.

  8. Click Import.

The files are now imported to your project.
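
To anticipate which option to pick in the Resolve naming conflicts step, you could pre-check the manifest locally before importing. The sketch below is illustrative only: it assumes a naming conflict means two files sharing the same name, that manifest rows have already been loaded as dictionaries (for example with `csv.DictReader`), and that you supply the name column yourself. The column name `file_name` used in the usage note is hypothetical, and the CGC's own conflict handling remains authoritative.

```python
from collections import Counter


def duplicate_names(rows, name_column):
    """Return {name: count} for names that appear more than once in the manifest rows."""
    counts = Counter(row[name_column] for row in rows if row.get(name_column))
    return {name: n for name, n in counts.items() if n > 1}


def conflicts_with_project(rows, name_column, existing_names):
    """Return the sorted manifest names that already exist in the target project."""
    existing = set(existing_names)
    return sorted({row[name_column] for row in rows if row.get(name_column) in existing})
```

For instance, `duplicate_names(rows, "file_name")` lists names repeated within the manifest, and `conflicts_with_project(rows, "file_name", names_in_project)` lists names that a Skip or Auto Rename choice would affect, where `names_in_project` is a list you compile yourself from the project's Files tab.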