R for Cancer Bioinformatics

Decoding Cancer: TCGA Data Analysis with R

🧑 Instructor: Md. Jubayer Hossain
🗓 July 5, 2024 - July 6, 2024 | 9:30 am - 12:30pm (Bangladesh Time)
🏨 Medium - Zoom
💥 Register with Google Forms
💥 Registration Fee: 1020BDT (for students), 1530BDT (for professionals)
📝 To join private Telegram group for the course, follow instructions in the email you received after registration.

Welcome!

The TCGA Data Analysis with R course is designed to equip participants with the knowledge and skills necessary to effectively analyze and interpret data from The Cancer Genome Atlas (TCGA) using the R programming language. TCGA is a valuable resource for cancer researchers, providing comprehensive genomic and clinical data on various cancer types. This course will cover essential concepts, tools, and techniques for data preprocessing, exploratory data analysis, differential gene expression analysis, survival analysis, and data visualization using R. Participants will gain hands-on experience by working with real TCGA datasets and will learn to derive meaningful insights from complex cancer genomics data.

Learning Objectives

  • Master the Fundamentals of R: Gain proficiency in basic R programming essential for bioinformatics workflows.

  • Perform Data Wrangling with R: Learn to manipulate and prepare large datasets for analysis using R.

  • Utilize TCGAbiolinks: Develop skills to efficiently download and manage genomic data from the GDC portal using TCGAbiolinks.

  • Analyze Somatic Mutations with maftools: Acquire the ability to conduct comprehensive somatic mutation analysis using the maftools package.

  • Conduct Pan-Cancer Downstream Analysis: Explore methods for multi-cancer comparative studies, focusing on BRCA and Low-Grade Glioma (LGG).

  • Integrate Multi-Omic Data: Learn techniques to integrate methylation and expression data, with a focus on Adrenocortical Carcinoma (ACC).

  • Apply the ELMER Pipeline: Implement the ELMER pipeline for integrative analysis of DNA methylation and gene expression, particularly in Kidney Renal Clear Cell Carcinoma (KIRC).

Pre-requisites

Course Format

The course will be delivered through a combination of lectures, hands-on practical sessions, and interactive discussions. Participants will have access to real TCGA datasets and will be guided through step-by-step analysis using R. Additionally, participants will be provided with relevant learning resources, including code examples and data repositories, to support their learning outside the course hours.

What you’ll learn?

Case Study 1: Pan Cancer Downstream Analysis BRCA

In this case study, we will delve into the world of bioinformatics and cancer research by performing a downstream analysis of the BRCA (Breast Invasive Carcinoma) cancer dataset from the Pan Cancer Atlas. Our primary focus will be on utilizing R programming and basic concepts of bioinformatics and data analysis to gain insights into this specific cancer type.

Case Study 2: Pan Cancer Downstream Analysis of LGG (Low-Grade Glioma)

In this case study, we will embark on a bioinformatics journey to analyze the LGG (Low-Grade Glioma) cancer dataset from the Pan Cancer Atlas. Our focus will be on utilizing R programming and basic bioinformatics concepts to gain valuable insights into the genetic and molecular characteristics of LGG.

Case Study 3: Integration of Methylation and Expression Data for Adrenocortical Carcinoma (ACC)

In this case study, we will dive into the field of bioinformatics and cancer research by exploring the integration of methylation and gene expression data for Adrenocortical Carcinoma (ACC). ACC is a rare and aggressive cancer that originates in the adrenal cortex. Our primary focus will be on utilizing R programming to integrate these two types of data and gain deeper insights into the molecular characteristics of ACC.

Case Study 4: ELMER Pipeline for Integrative Analysis of DNA Methylation and Gene Expression in Kidney Renal Clear Cell Carcinoma (KIRC)

In this case study, we will delve into the world of bioinformatics and cancer research by exploring the ELMER (Enhancer Linking by Methylation/Expression Relationships) pipeline for integrative analysis of DNA methylation and gene expression data in Kidney Renal Clear Cell Carcinoma (KIRC). KIRC is the most common type of kidney cancer, and understanding the epigenetic regulation of gene expression in this cancer can provide valuable insights into its molecular mechanisms.