Skip to content

Latest commit

 

History

History
28 lines (20 loc) · 843 Bytes

README.md

File metadata and controls

28 lines (20 loc) · 843 Bytes

Description

The underlying question is to build a model that can predict the cancer class from the given features. In this project, 3 machine learning models were used to predict the class of five cancers (BRCA, KIRC, COAD, LUAD and PRAD) using 20531 genes according to their level of expression in 801 individuals. The collection of data is part of the RNA-Seq (HiSeq) PANCAN data set: https://archive.ics.uci.edu/ml/datasets/gene+expression+cancer+RNA-Seq

BRCA: Breast Invasive Carcinoma
KIRC: Kidney Renal Clear Cell Carcinoma
COAD: Colon Adenocarcinoma
LUAD: Lung Adenocarcinoma
PRAD: Prostate Adenocarcinoma

Get started

Requirements

conda create -n yourenvname python=x.x 
conda activate yourenvname
pip install -r requirements.txt

Run

python main.py