Can I bring my own tools to the CGC?

Who qualifies for credits?

How to add data from a GCS bucket?

There's world-readable data in a Google Cloud Storage bucket. I can access it from curl from the https URL without authenticating, or from boto or gsutil using the gs:// URL. Is there a way to add this data to my CGC project?

How billing works w.r.t. AWS biling mechanism

Hi, I'm struggling to understand how your CGC billing is tied to individual's AWS accounts?

Problems using bash script in a CGC tool

Hello! I have created a tool based on Samtools, I used the following repository: images.sbgenomics.com/marouf/samtools:1.3 which i found in the public Samtools app. My goal is to make a tool that extracts the sample's name starting from .bam files, because i need them in the subsequent tool of my workflow. As input i'll give to samtools an array composed by two files (1 Tumor and 1 Normal tissue from the same patient), and i want the tool to discriminate if the name extracted belongs to a tumor sample, or a normal sample. So i wrote this bash script: for i in /sbgenomics/Projects/<myprojectpath>/*.bam; do if [ `/opt/samtools-1.3/samtools view -H $i | grep '^@RG' | sed "s/.*SM:............-\(...\)-.*/\1/g" | uniq` == "01A" ]; then /opt/samtools-1.3/samtools view -H $i | grep '^@RG' | sed "s/.*SM:\([^\t]*\).*/\1/g" | uniq; fi; done > tumor_name.txt In other words, for every bam in my folder (1 tumor and 1 normal) it should extract the 3 numbers that identify the sample type, compare them to "01A" (which is specific for tumor samples), if they are correct then it prints the entire sample name and puts it into a file. it returns the subsequent error log: 2017-10-13T18:10:34.886498415Z sh: 1: [: 11A: unexpected operator 2017-10-13T18:10:34.891268970Z sh: 1: [: 01A: unexpected operator 11A and 01A should be the "3 number ID" extracted as the first argument of the "if loop" (01A for the tumor sample, 11A for the normal sample), so apparently it seems that the if statement doesn't like them as arguments, as well as the opened square brackets. In the end, the tool returns me the file tumor_name.txt which is unfortunately empty (reasonably because the if statement didn't work). I thought that should have put #!/bin/bash before the "for loop" as my script is using bash commands. However when i use #!/bin/bash the standard output command ">" stops working, and this doesn't make sense to me. I tried a simple bash script to test it: #!/bin/bash echo 123 > file.txt and it doesn't work, while echo 123 > file.txt (without #!/bin/bash) works perfectly. Any help? Am i missing something in order to use bash scripts in CGC? Thank you very much in advance!

TCGA meta-data from BAM files

I'm developing a tool that uses multiple TCGA BAM and MAF files. I need to associate a mutation by its sample name to a BAM file or vice versa. Is there any metadata available to associate the BAM files to their sample name? Thanks in advance

How can I contribute to the CGC?

Who is the CGC designed for?

Local file upload fails with cgc_uploader

Initializing upload... Starting upload of 1 file(s) to MY-PROJECT The upload stays at 0.00%, then fails. I'm using the auth token from the Developer Dashboard. The command-line: ~/cgc-uploader/bin/cgc-uploader.sh -t MY-TOKEN -p MY-USERNAME/MY-PROJECT MY-FILE (Side note: I'm unable to use the GUI uploader, because my IT department blocks it as an unknown app.)

How can I get started with Docker?

I'm new to Docker. How can I get started? Any helpful tips?