Forest-Cover-Classification

Description:

In this project, I use deep learning to predict forest cover type (the most common kind of tree cover) based only on cartographic variables. The actual forest cover type for a given 30 x 30 meter cell was determined from US Forest Service (USFS) Region 2 Resource Information System data. The cover types are the following:

Spruce/Fir
Lodgepole Pine
Ponderosa Pine
Cottonwood/Willow
Aspen
Douglas-fir
Krummholz

Independent variables are derived from data obtained from the US Geological Survey and USFS. The original data is raw and has not been scaled or preprocessed. It contains binary columns of data for qualitative independent variables such as wilderness areas and soil type.
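
Because the data is unscaled and mixes continuous measurements with binary indicator columns, one reasonable preprocessing step is to standardize only the continuous columns and leave the 0/1 columns alone. The following is a minimal sketch of that idea, not the notebook's actual pipeline. The function name `standardize_continuous` and the assumption that the continuous columns come first in the array are hypothetical.

```python
import numpy as np


def standardize_continuous(train, test, n_continuous):
    """Standardize the first `n_continuous` columns; leave binary columns as-is.

    Assumption (hypothetical layout): continuous features occupy the first
    `n_continuous` columns and the binary indicator columns follow them.
    Statistics come from the training split only, to avoid leaking
    information from the test split.
    """
    train = np.asarray(train).astype(float)  # astype returns a copy
    test = np.asarray(test).astype(float)

    mean = train[:, :n_continuous].mean(axis=0)
    std = train[:, :n_continuous].std(axis=0)
    std[std == 0] = 1.0  # guard against division by zero for constant columns

    train[:, :n_continuous] = (train[:, :n_continuous] - mean) / std
    test[:, :n_continuous] = (test[:, :n_continuous] - mean) / std
    return train, test
```

Fitting the mean and standard deviation on the training split and reusing them for the test split keeps the evaluation honest.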

This study area includes four wilderness areas located in the Roosevelt National Forest of northern Colorado. These areas represent forests with minimal human-caused disturbances, so existing forest cover types are mainly a result of ecological processes rather than forest management practices.

Project Objectives:

Develop a classifier for this multi-class classification problem
Use TensorFlow with Keras to build my classifier
Use my knowledge of hyperparameter tuning to improve the performance of my model
Test and analyze performance
Create clean and modular code
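
To make the objectives above concrete, here is a minimal sketch of a Keras multi-class classifier. It is not the architecture used in the notebook: the function name `build_classifier`, the layer sizes, and the learning rate are illustrative assumptions, and the labels are assumed to be integers starting at 0 (one per cover type).

```python
import tensorflow as tf


def build_classifier(n_features, n_classes=7, hidden_units=(64, 32), learning_rate=1e-3):
    """Build and compile a small feed-forward classifier.

    All default values here are illustrative assumptions, not tuned settings.
    """
    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=(n_features,)))

    # Hidden layers: their number and width are hyperparameters to tune.
    for units in hidden_units:
        model.add(tf.keras.layers.Dense(units, activation="relu"))

    # One softmax output per cover type.
    model.add(tf.keras.layers.Dense(n_classes, activation="softmax"))

    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=learning_rate),
        # Sparse loss expects integer labels in the range 0..n_classes-1.
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

Exposing `hidden_units` and `learning_rate` as arguments makes manual hyperparameter tuning a matter of calling the function with different values and comparing validation results.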

What I learned from the project:

How to build a multi-class classifier
How to analyse performance of a model and improve it through manual hyperparameter tuning
How to use GitHub for version control

How to use:

Fork the repository.
Open the 'CoverType-classification.ipynb' notebook file.
Run the notebook.
Use model.predict(your_data) to predict the forest cover type for your unclassified data.
Use the suggestions given to improve the model and get even better predictions (current metrics are: precision: recall: accuracy: )
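
For a softmax classifier, model.predict typically returns one row of class probabilities per sample rather than a cover type name. The sketch below shows one way to turn those rows into readable labels. It assumes the model has seven outputs ordered as in the cover type list in the Description, with index 0 for Spruce/Fir. The helper name `decode_predictions` is hypothetical, so check the ordering against the notebook before relying on it.

```python
import numpy as np

# Assumed ordering: same as the list in the Description, index 0 first.
COVER_TYPES = [
    "Spruce/Fir",
    "Lodgepole Pine",
    "Ponderosa Pine",
    "Cottonwood/Willow",
    "Aspen",
    "Douglas-fir",
    "Krummholz",
]


def decode_predictions(probabilities):
    """Map each row of class probabilities to the most likely cover type name."""
    probabilities = np.asarray(probabilities)
    class_indices = probabilities.argmax(axis=1)  # most probable class per row
    return [COVER_TYPES[i] for i in class_indices]
```

Usage would look like `decode_predictions(model.predict(your_data))`, where `your_data` has been preprocessed the same way as the training data.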

Technologies:

Python: main language
Jupyter notebooks: main computing platform
pandas and NumPy: for exploratory data analysis (EDA)
TensorFlow: main machine learning library, used to create the data pipeline and model
scikit-learn: secondary machine learning library, used to compute performance metrics at the end
seaborn and Matplotlib: for plotting graphs
Markdown: for this README.md file
Git: for file version control
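
As an illustration of the scikit-learn metrics step, here is a minimal sketch that computes accuracy, precision, and recall from true and predicted class labels. The helper name `summarize_metrics` and the choice of macro averaging (every cover type weighted equally) are assumptions, and the notebook may report metrics differently.

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score


def summarize_metrics(y_true, y_pred):
    """Return accuracy plus macro-averaged precision and recall.

    Macro averaging is an assumption made for this sketch: it treats every
    class equally, which matters when some cover types are rare.
    """
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        # zero_division=0 scores a class as 0 when it is never predicted.
        "precision": precision_score(y_true, y_pred, average="macro", zero_division=0),
        "recall": recall_score(y_true, y_pred, average="macro", zero_division=0),
    }
```

Here `y_pred` would be the integer class indices obtained by taking the argmax of the model's predicted probabilities.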

Collaborators:

None

License:

None


jaimeggb/Forest-Cover-Classification
