Skip to content

Analysis of synthetic head and neck cancer patient data modeled after real cases from the Ocean Road Cancer Institute. The project aims to explore the distribution of demographics, tumor characteristics and treatment modalities among patients. Utilizing R and various data visualization techniques provides insights.

Notifications You must be signed in to change notification settings

rshungu/head-neck-cancer_synthetic-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Data Analysis for Head and Neck Cancer

This project involves the analysis of synthetic patient data, focusing on head and neck cancer cases. The data has been generated to simulate real-world cases while maintaining the confidentiality of the original dataset.

Project Description

This project involves analyzing of synthietic patient data focusing on head and neck cancer cases. The data has been generated to simulate real-world cases while maintaining the confidentiality of the original dataset.

Data Source

The dataset is a collection of patient-level data, containing variables such as:

  • Patient demographics: Age, gender, location, occupation, marital status, alcohol and tobacco use.
  • Tumor characteristics: Histology, tumor site, staging (T, N, M stages), and grade.
  • Treatment details: Surgery, chemotherapy regimen, radiotherapy (XRT) dates and doses, concurrent treatment modalities.
  • Other medical data: HIV status, insurance coverage, date of histological diagnosis, ECG readings.

Methods

The analysis includes:

  • Descriptive statistics for summarizing patient demographics and tumor characteristics.
  • Visualization of the distribution of variables like tumor site, treatment modalities, and histology.
  • Analysis of associations between demographic factors and tumor stages.
  • Evaluation of the prevalence of different treatment modalities (surgery, chemo, radiotherapy) among patients.

Requirements

This analysis requires the following R packages:

  • tidyverse - A collection of packages for data manipulation and visualization
  • gtsummary - For creating publication-ready summary tables and regression tables
  • ggsci - Provides color palettes for ggplot2 based on scientific themes.
  • cowplot - For combining multiple ggplot2 plots into a single figure.
  • patchwork - An intuitive syntax for combining ggplot2 plots
  • sf - For handling spatial data and simple features.
  • tmap - For creating thematic maps and visualizing spatial data
install.packages(c("tidyverse", "ggsci","gtsummary","cowplot","patchwork","sf","tmap"))

About

Analysis of synthetic head and neck cancer patient data modeled after real cases from the Ocean Road Cancer Institute. The project aims to explore the distribution of demographics, tumor characteristics and treatment modalities among patients. Utilizing R and various data visualization techniques provides insights.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages