Data Science Hub: Directory of data tools

ClinPhen

A fast, high-accuracy algorithm that scans clinical notes and generates a prioritized list of patient phenotypes.
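To make "scans clinical notes and generates a prioritized list" concrete, here is a toy sketch of the general idea. It is not ClinPhen's actual algorithm or code. The four-entry term dictionary, the negation cue list, and the ranking rule (more mentions first, then earlier first mention) are all simplifying assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical mini-dictionary mapping note phrases to HPO terms.
# A real tool would use the full Human Phenotype Ontology and its synonyms.
PHENOTYPE_TERMS = {
    "seizure": ("HP:0001250", "Seizure"),
    "seizures": ("HP:0001250", "Seizure"),
    "microcephaly": ("HP:0000252", "Microcephaly"),
    "developmental delay": ("HP:0001263", "Global developmental delay"),
}

# Assumed negation cues: a sentence containing one of these is skipped entirely.
NEGATION = re.compile(r"\b(no|denies|without)\b")


def rank_phenotypes(note):
    """Return (hpo_id, label, count) tuples, most frequently mentioned first."""
    counts = Counter()
    first_seen = {}
    labels = {}
    sentences = re.split(r"[.\n]+", note.lower())
    for index, sentence in enumerate(sentences):
        if NEGATION.search(sentence):
            continue
        for phrase, (hpo_id, label) in PHENOTYPE_TERMS.items():
            hits = len(re.findall(r"\b" + re.escape(phrase) + r"\b", sentence))
            if hits:
                counts[hpo_id] += hits
                first_seen.setdefault(hpo_id, index)
                labels[hpo_id] = label
    # Rank by mention count (descending), then by earliest sentence index.
    ranked = sorted(counts, key=lambda h: (-counts[h], first_seen[h]))
    return [(h, labels[h], counts[h]) for h in ranked]


note = (
    "Patient has seizures. Seizure onset at age 2. "
    "Microcephaly noted. No developmental delay."
)
print(rank_phenotypes(note))
# [('HP:0001250', 'Seizure', 2), ('HP:0000252', 'Microcephaly', 1)]
```

The negated mention of developmental delay is dropped, and seizure ranks first because it is mentioned twice.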

Epic Caboodle

Epic’s enterprise data warehouse, an abstracted database that supports the clinical information system and allows for the exploration and analysis of patient data from hospitals around the nation.

Epic Clarity Reports

A tool for generating reports from the Epic database, suited to reports that span longer timeframes or require complex analysis.

Epic Cosmos

A database of de-identified data from inpatient and outpatient electronic health records for use in clinical research. It includes records from over 180 million patients and over 6.6 billion encounters.

Epic SlicerDicer

A data exploration and visualization tool that allows users to analyze aggregate patient data.

Natural Language Processing

A tool that uses machine learning to process and analyze natural-language text, such as free-text notes in electronic health records.

OMOP (Observational Medical Outcomes Partnership)

An open-science collaborative that aims to standardize the way healthcare data is structured and analyzed for observational research.
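One practical consequence of standardized structure is that the same query can run unchanged at any site that uses the shared model. The sketch below builds a tiny in-memory database with two tables named after the OMOP Common Data Model (`person` and `condition_occurrence`) and counts distinct patients with a given condition. Only a few columns are included, the rows are made up, and the concept IDs are example values; check table and column names against the CDM specification for the version in use.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a small, simplified slice of the CDM."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE person (
            person_id INTEGER PRIMARY KEY,
            gender_concept_id INTEGER,
            year_of_birth INTEGER
        );
        CREATE TABLE condition_occurrence (
            condition_occurrence_id INTEGER PRIMARY KEY,
            person_id INTEGER,
            condition_concept_id INTEGER,
            condition_start_date TEXT
        );
        """
    )
    # Fabricated demo rows; concept IDs are example values only.
    conn.executemany(
        "INSERT INTO person VALUES (?, ?, ?)",
        [(1, 8507, 2010), (2, 8532, 2012), (3, 8532, 2015)],
    )
    conn.executemany(
        "INSERT INTO condition_occurrence VALUES (?, ?, ?, ?)",
        [
            (1, 1, 201826, "2020-01-05"),
            (2, 2, 201826, "2021-03-10"),
            (3, 2, 201826, "2022-07-01"),  # second record for the same patient
            (4, 3, 317009, "2022-02-02"),
        ],
    )
    return conn


def count_patients_with_condition(conn, concept_id):
    """Count distinct patients with at least one record of the condition."""
    row = conn.execute(
        """
        SELECT COUNT(DISTINCT p.person_id)
        FROM person AS p
        JOIN condition_occurrence AS co ON co.person_id = p.person_id
        WHERE co.condition_concept_id = ?
        """,
        (concept_id,),
    ).fetchone()
    return row[0]


conn = build_demo_db()
print(count_patients_with_condition(conn, 201826))  # 2
```

Counting distinct `person_id` values rather than rows keeps a patient with repeated records from being counted twice.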

PhenoGPT

A specialized version of the GPT language model designed to analyze clinical text, enabling tasks such as phenotype extraction, disease coding, and clinical decision support in electronic health records.

PHIS (Pediatric Health Information System) Database

A comparative database with clinical and resource utilization data from inpatient, ambulatory surgery, emergency department, and observation unit patient encounters from more than 49 children’s hospitals.

Population Builder: Stratification Module

A tool that allows organizations to rapidly identify patients best suited for population health programs through predefined criteria and customizable filters.

REDCap

A secure web application used to build and manage online surveys and databases.
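Data collected in REDCap is often pulled into analysis scripts through the application's API. The sketch below only builds a record-export request and does not send it. The URL and token are placeholders, the helper name `build_export_request` is hypothetical, and the parameter names (`token`, `content`, `format`, `type`, `fields[n]`) are assumed from the REDCap API's record export method; confirm them against the API documentation on your own REDCap instance.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_export_request(api_url, token, fields=None):
    """Build (but do not send) a POST request asking for records as flat JSON."""
    payload = {
        "token": token,       # project-specific API token (placeholder here)
        "content": "record",  # assumed: export records
        "format": "json",
        "type": "flat",       # one row per record
    }
    # Optional field filter, passed as indexed form keys (assumed convention).
    for index, name in enumerate(fields or []):
        payload[f"fields[{index}]"] = name
    data = urlencode(payload).encode("utf-8")
    return Request(api_url, data=data, method="POST")


request = build_export_request(
    "https://redcap.example.org/api/",  # placeholder URL
    "YOUR_API_TOKEN",                   # placeholder token
    fields=["record_id"],
)
print(request.get_method(), request.full_url)
```

Sending the request (for example with `urllib.request.urlopen`) requires a valid token, which should be kept out of source control.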

Bioconductor

Bowtie

BWA (Burrows-Wheeler Aligner)

GATK (Genome Analysis Toolkit)

HISAT2 (Hierarchical Indexing for Spliced Alignment of Transcripts 2)

Salmon

Scanpy

Seurat

STAR (Spliced Transcripts Alignment to a Reference)

G*Power

JAMOVI

JASP

Minitab

SAS (Statistical Analysis System)

SPSS (Statistical Package for the Social Sciences)

STATA

ggplot2 (R)

Matplotlib (Python)

Plotly (Python/R)

Informatica

Keras (Python)

PyTorch (Python)

Scikit-learn (Python)

TensorFlow (Python)

MATLAB

Python

R

SQL

Nextflow

Snakemake