TCGA GRCh38 metadata

On this page:

Overview

Investigation

Case

Demographic

Diagnosis

Treatment

Exposure

Drug therapy

Radiation therapy

Follow up

New tumor event

Sample

Portion

Slide

Analyte

Aliquot

Read group

Read group QC

File

Analysis

Overview

Metadata is data that describes other data. This page details the TCGA metadata that is available for viewing and filtering TCGA data in the Data Browser and through the Datasets API. TCGA metadata on the CGC consists of properties that describe the entities of the TCGA dataset.

Entities are particular resources with UUIDs, such as files, cases, samples, and cell lines.

Properties can either describe an entity or relate that entity to another entity. For instance, properties include an entity's vital status, sex, data format, or experimental strategy.
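The distinction between descriptive and relational properties can be illustrated with a small in-memory model. This is only a sketch: the `Entity` class, the `relations` field, and the example values are hypothetical and do not reflect the actual CGC schema or API.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class Entity:
    """A resource identified by a UUID (hypothetical model, not the CGC schema)."""
    entity_type: str
    id: str = field(default_factory=lambda: str(uuid.uuid4()))
    # Properties that describe the entity itself.
    properties: dict = field(default_factory=dict)
    # Properties that relate this entity to another entity (name -> UUID).
    relations: dict = field(default_factory=dict)


# Illustrative values only; real property values come from the dataset.
case = Entity("case", properties={"disease type": "example disease"})
demographic = Entity("demographic", properties={"sex": "example value"})

# A relational property points at another entity by its UUID.
demographic.relations["case"] = case.id

print(demographic.entity_type, "->", demographic.relations["case"])
```

Keeping relations as UUID references rather than nested objects mirrors the idea that every entity is an independently addressable resource.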

Investigation

Property	Description
dbGaP accession number	The dbGaP accession number provided for each study. See NCI Thesaurus Code: C25402.
Investigation name	The full name of the project or study that generated the data. See NCI Thesaurus Code: C41198.
Submitter ID	A human-readable identifier, such as a number or a string that may contain metadata information for investigations.

Case

Property	Description
Batch number	A set of related analytes prepared for further analysis and numbered sequentially from the same disease. Once a Case has been assigned to a batch number, subsequent shipments from that case are assigned the same batch number as the original. This is a Seven Bridges-only field.
Submitter ID	Usually a human-readable identifier, such as a number or a string that may contain metadata information. In some instances, this can also be a UUID.
Disease type	The type of the disease or condition studied. See NCI Thesaurus Code: C2991.
Primary site	The anatomical site where the primary tumor is located in the organism. See NCI Thesaurus Code: C43761.
Tissue source site ID	The identifier of a clinical site that collects and provides patient samples and clinical metadata for research use. The site is identified by a UUID. See NCI Thesaurus Code: C103264.
Tissue source site name	The full name of a clinical site that collects and provides patient samples and clinical metadata for research use. See NCI Thesaurus Code: C103264.
Tissue source site code	The alphanumeric code for a clinical site that collects and provides patient samples and clinical metadata for research use. See NCI Thesaurus Code: C103264.
Tissue source site BCR ID	The BCR (Biospecimen Core Resource) provided ID for a tissue source site. See NCI Thesaurus Code: C103264.

Demographic

Property	Description
Submitter ID	Usually a human-readable identifier, such as a number or a string that may contain metadata information. In some instances, this can also be a UUID.
Ethnicity	A socially-defined category of people based on common ancestral, cultural, biological, and social factors. See NCI Thesaurus Code: C29933.
Race	A classification of humans characterized by certain heritable traits, common history, nationality, or geographic distribution. See NCI Thesaurus Code: C17049.
Sex	The collection of behaviors and attitudes that distinguish people on the basis of the societal roles expected for the two sexes. See NCI Thesaurus Code: C17357.
Year of birth	A numeric value to represent the calendar year in which an individual was born. See CDE (Common Data Element) Public ID: 2896954.
Year of death	A numeric value to represent the year of the death of an individual. See CDE (Common Data Element) Public ID: 2897030.

Diagnosis

Property	Description
Submitter ID	Usually a human-readable identifier, such as a number or a string that may contain metadata information. In some instances, this can also be a UUID.
Age at diagnosis	The age in years of the Case at the initial pathological diagnosis of the disease or cancer. See NCI Thesaurus Code: C15220.
Classification of tumor	Text that describes the kind of disease present in the tumor specimen as related to a specific point in time. See CDE (Common Data Element) Public ID: 3288124.
Days to birth	The time interval from a person's date of birth to the date of initial pathologic diagnosis, represented as a calculated negative number of days. See CDE (Common Data Element) Public ID: 3008233.
Days to death	The time interval from a person's date of death to the date of initial pathologic diagnosis, represented as a calculated number of days. See CDE (Common Data Element) Public ID: 3165475.
Days to last follow up	The time interval from the date of the last follow up to the date of the initial pathologic diagnosis, represented as a calculated number of days. See CDE (Common Data Element) Public ID: 3008273.
Days to last known disease status	The time interval from the date of the last follow up to the date of the initial pathologic diagnosis, represented as a calculated number of days. See CDE (Common Data Element) Public ID: 3008273.
Days to recurrence	The time interval from the date of new tumor event, including progression, recurrence and new primary malignancies, to the date of the initial pathologic diagnosis, represented as a calculated number of days. See CDE (Common Data Element) Public ID: 3392464.
Last known disease status	The state or condition of an individual's neoplasm at a particular point in time. See CDE (Common Data Element) Public ID: 3392464.
Morphology	The morphology code which describes the characteristics of the tumor itself, including its cell type and biologic activity, according to the third edition of the International Classification of Diseases for Oncology (ICD-O). See CDE (Common Data Element) Public ID: 3226275.
Primary diagnosis	Text term for the structural pattern of cancer cells used to define a microscopic diagnosis. See CDE (Common Data Element) Public ID: 3081934.
Prior malignancy	Text term to describe the patient's history of prior cancer diagnosis and the spatial location of any previous cancer occurrence. See CDE (Common Data Element) Public ID: 3081934.
Progression or recurrence	Yes/No/Unknown indicator to identify whether a patient has had a new tumor event after initial treatment. See CDE (Common Data Element) Public ID: 3121376.
New tumor event after initial treatment	A Boolean value denoting whether a neoplasm developed after the initial treatment was finished.
Site of resection or biopsy	The topography code which describes the anatomical site of origin of the neoplasm according to the third edition of the International Classification of Diseases for Oncology (ICD-O). See NCI Thesaurus Code: C37978. See CDE (Common Data Element) Public ID: 3226281.
Tissue or organ of origin	The text term that describes the anatomic site of the tumor or disease. See CDE (Common Data Element) Public ID: 3226281.
Tumor grade	The numeric value to express the degree of abnormality of cancer cells, a measure of differentiation and aggressiveness. See CDE (Common Data Element) Public ID: 2785839.
Tumor stage	The extent of a cancer in the body. Staging is usually based on the size of the tumor, whether lymph nodes contain cancer, and whether the cancer has spread from the original site to other parts of the body. See NCI Thesaurus Code: C16899; also see NCI Thesaurus Code: C28257 for Pathological stage.
Vital status	The state of being living or deceased for Cases that are part of the investigation. See NCI Thesaurus Code: C25717.
Histological diagnosis	The diagnosis of a disease based on the type of tissue as determined based on the microscopic examination of the tissue. See NCI Thesaurus Code: C61478.
Histological diagnosis other	Additional options for histological diagnosis (see Histological diagnosis) that are not among the predetermined values listed for that property.
Year of diagnosis	The numeric value to represent the year of an individual's initial pathologic diagnosis of cancer. See CDE (Common Data Element) Public ID: 2896960.
Clinical T (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The T category describes the original (primary) tumor. See NCI Thesaurus Code: C48881 and C253840.
Clinical M (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The M category tells whether there are distant metastases (spread of cancer to other parts of the body). See NCI Thesaurus Code: C48881 and C25385.
Clinical N (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The N category describes whether or not the cancer has reached nearby lymph nodes. See NCI Thesaurus Code: C48881 and C25384.
Clinical stage	The extent of a cancer in the body. Staging is usually based on the size of the tumor, whether lymph nodes contain cancer, and whether the cancer has spread from the original site to other parts of the body. See CDE (Common Data Element) Public ID: 5243162.
Pathologic T (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The T category describes the original (primary) tumor. See NCI Thesaurus Code: C48881 and C48739.
Pathologic N (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The N category describes whether or not the cancer has reached nearby lymph nodes. See NCI Thesaurus Code: C48881 and C48740.
Pathologic M (TNM)	The TNM Staging System is based on the extent of the tumor (T), the extent of spread to the lymph nodes (N), and the presence of metastasis (M). The M category tells whether there are distant metastases (spread of cancer to other parts of the body). See NCI Thesaurus Code: C48881 and C48741.
Performance status scale: Timing	A time reference for the Karnofsky score and/or the ECOG score using the defined categories.
Performance status scale: Karnofsky score	An index designed for classifying patients 16 years of age or older by their functional impairment. A standard way of measuring the ability of cancer patients to perform ordinary tasks. See NCI Thesaurus Code: C28013.
Performance status scale: ECOG	A performance status scale designed to assess disease progression and its effect on the daily living abilities of the patient. See NCI Thesaurus Code: C105721.
Tumor status	The condition or state of the tumor at a particular time. See NCI Thesaurus Code: C96643.
Primary therapy outcome success	A value denoting the result of therapy for a given disease or condition in a patient or group of patients. See NCI Thesaurus Code: C18919.
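
Several Diagnosis properties are day intervals measured relative to the date of initial pathologic diagnosis, and Days to birth is a negative number. The sketch below shows one way such values might be handled after export. The function names, the 365.25 days-per-year conversion, and the rule of preferring Days to death over Days to last follow up are assumptions made for illustration, not part of the CGC metadata definition.

```python
from typing import Optional, Tuple

# Assumption: average year length used for converting days to years.
DAYS_PER_YEAR = 365.25


def age_at_diagnosis_years(days_to_birth: int) -> float:
    """Convert a (negative) Days to birth value into an approximate age in years."""
    if days_to_birth > 0:
        raise ValueError("days_to_birth is expected to be zero or negative")
    return -days_to_birth / DAYS_PER_YEAR


def survival_interval(
    days_to_death: Optional[int],
    days_to_last_follow_up: Optional[int],
) -> Tuple[Optional[int], bool]:
    """Return (days, event_observed) relative to initial diagnosis.

    Assumption: use Days to death when it is present, otherwise fall back
    to Days to last follow up and mark the record as not having an event.
    """
    if days_to_death is not None:
        return days_to_death, True
    return days_to_last_follow_up, False


print(age_at_diagnosis_years(-7305))   # 20.0
print(survival_interval(None, 400))    # (400, False)
print(survival_interval(350, 400))     # (350, True)
```

Because the result is derived from day counts, it may differ slightly from the reported Age at diagnosis value.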
