Public projects overview

The CGC maintains public projects which act as repositories for examples of specific analyses as well as the associated data and tools you need to replicate these analyses on the CGC.

To preserve the integrity of the public projects' resources, you only have read permissions for these projects. You can browse through a public project to review the available data, workflows, and tools. To conduct further analyses, you can also make a copy of the entire project or copy specific resources from the public project into your personal projects. At this point, the copied project becomes your own project: you will have full project admin permissions and control over the project. Your actions in your own project will not affect the public project from which you obtained the resources.

Public projects on the CGC include:

  • Human Cell Atlas Preview Datasets - Human Cell Atlas released the first three single-cell sequencing datasets to the research community as “Human Cell Atlas Preview Datasets”. All datasets presented in this public project are single-cell RNA sequencing datasets (scRNA-seq).
  • Smart Variant Filtering - Smart Variant Filtering (SVF) uses machine learning algorithms trained on features from the existing Genome In A Bottle (GIAB) variant-called samples (HG001-HG005) to perform variant filtering (classification).
  • Cancer Cell Line Encyclopedia (CCLE) project: This project contains data from the Cancer Cell Line Encyclopedia dataset which you can use on the CGC.
  • ICGC-TCGA Dream Somatic Mutation Calling Challenge public project: This project contains data from the ICGC-TCGA DREAM Somatic Mutation Calling – RNA Challenge (SMC-RNA).
  • Simons Genome Diversity Project (SGDP) dataset: This project contains data from the Simons Genome Diversity Project dataset which you can use on the CGC.
  • Personal Genome Project UK (PGP-UK) pilot dataset: This project contains data from the Personal Genome Project UK dataset which you can use on the CGC.
  • Data Studio Interactive Analyses: This project contains interactive analyses that can be executed in Data Studio, and are additional resources that help users mitigate some of the challenges related to interpretation of data obtained as a result of secondary analysis. The project contains the following analyses:
  • Ballgown Interactive Analysis
  • VCF Visualization Interactive Analysis
  • Structural Variation Interactive Analysis
  • ChIP-seq Interactive Analysis
  • Microbiome Differential Abundance Analysis

Learn more about accessing public projects below.

Access public projects

To access public projects on the CGC:

  1. Select Public Projects from the top navigation bar.
  2. Choose a public project from the drop-down menu.


Access public projects via the API

You can also access public projects via the API by using the naming convention of {project_owner}/{project}. Locate the project's owner and name from the URL bar on the visual interface.

For instance, the CCLE public project is located at You can refer to the CCLE public project in the API as sevenbridges/cancer-cell-line-encyclopedia-ccle.