8000 GitHub - shanecrinion/I2EHR: An integrative web app using R Shiny using genomic expression data and simulated patient data (Synthea) to address the lack of framework for integrated analysis.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

An integrative web app using R Shiny using genomic expression data and simulated patient data (Synthea) to address the lack of framework for integrated analysis.

Notifications You must be signed in to change notification settings

shanecrinion/I2EHR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 

Repository files navigation

Interactive Integrated Electronic Health Records (I2EHR)

This is the R Shiny based Interactive Integrated Electronic Health Records (I2EHR). This app integrates clinical and genomic data to identify patient subgroups and biomarkers related to disease.

This application was originally created as a deliverable for my MSc in Biomedical Genomics. The original app used data from Synthea, a synthetic patient generator (https://github.com/synthetichealth/synthea), and microarray samples from the Gene Expression Omnibus (GEO) to create a biobank consisting of artificial patients linked to real genomic samples.

The microarray CEL files are used to construct a R ExpressionSet object which also contained patient clinical observations that were used in lieu of Synthea. The decision to remove Synthea was because I wanted to focus on improving analytical power with machine learning and improved interactivity. This version of the I2EHR app was constructed using R packages flexdashboard and Shiny. The analyses were performed many R packages which include multiclust, affy and topGO.

I will discussed the application by each of the 5 tabs. Namely these are: Clustering, Heatmap, Clinical Integration, Data Summary, Filtered Data and About.

Clustering

Clustering Tab

This tab uses expression data to cluster samples.

Heatmap

Clustering Tab

The heatmap tab contains a visualisation of a heatmap for each sample and probe. The heatmap is interactive and hover can be used to indicates the sample and probe. Scaling is column-based.

Clinical Integration

Clustering Tab

The Clinical Integration tab provides further insight into the cluster assignment. Cluster Breakdown is a stacked histogram for a clinical measurement chosen by the user. The colour indicates the colour and can be used to identify if a cluster is associated with a clinical phenotype. Principal Component Analysis and MultiDimensional Scaling represent group relationships and separation. The Clinical Phenotypes tables show cluster assignment and clinical info for each sample.

Filtered Data

The Filtered Data tab allows you to view visualisations related to a certain sample or probe. If sample is selected, a histogram will be generated of the expression distribution by either case/control status or cluster assignment. If sample is selected, a t-test is performed for expression change from baseline to 1 year. A slope plot of expression change is also created which includes all samples but highlights the sample of interest.

Data Summary

Clustering Tab

The Data Summary tab allows you to visualise the significant genes found from the expression data. The thresholds for p and log2FC can be changed using radio buttons which will be represented in the volcano plot axis lines. The parameter thresholds will also be used to recalculate the pathways of greatest significance.

About

An integrative web app using R Shiny using genomic expression data and simulated patient data (Synthea) to address the lack of framework for integrated analysis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0