This repository is aimed to provide step by step methodology from creation of dataset to classifying slums through transfer learning. Slums detection in cities of major developing nations has been experimented with different methods and techniques. Majority of which are difficult to reiterate and follow to implement in different cities. While Very High Resolution datasets have primarily been used for such tasks, some studies have also worked with Medium Resolution datasets in Slum detection and mapping. This study utilised both VHR and MR datasets and compared the results. We demonstrate a simple step by step approach to map slums. Similar procedure can be followed for slum or any landform detection in any city.
The training dataset is manually created with the help of Google Earth Pro. GE provides interface to quickly zoom and locate the various classes. In this study, the samples for four classes such as (a) Slums (b) Non-Slums/Other Built (c) Green, and (d) Water are created with the help of the path tool. The paths created by the path tool is saved as a lat/long pair as a csv file. Two satellite imageries are used in this study (a) Pleiades: 0.5m resolution and (b) Sentinel: 10m resolution. Natural color (RGB) bands are used from Pleiades while False Color (8,4,3) are used from Sentinel product. The lat/long pair are treated as centre pixel of the varying-size image patches such as 10x10 pixel (Sentinel) and 200x200 pixel (Pleiades) created as a part of the final dataset.
The image patches created for each satellite dataset are rescaled to 128x128 pixels and fed into the Inception Network to collect 1024-D transfer values. These values can be plotted in 2-D representation with the help of t-SNE and PCA to visualize the samples of classes or formation of clusters.
For each satellite dataset, corresponding transfer values are then fed into the Neural Network for further train the clssifier to detect 4 classes.
To map the prediction results in entire area of the city, each satellite map is sequentially clipped and fed into the Inception Net to collect transfer values which are further fed into the Neural Network to obtain model results. These results are then mapped over the corresponding satellite patches.
data_generation
includes the data files and the scripts to create datasets for training and prediction./point-based-samples/
include kml file created from path tool and the notebook to convert kml to csv./scripts/
include python scripts to create fixed size buffers given lat long/pair in QGIS platform./creating_clips_using_gdal.ipynb
creates sequential clips of the satellite imagery, which are used for final predictions./data_generation_mumbai.ipynb
converts patches to array format for further processing through Transfer Learning.data_processing
consists notebook to implement Transfer Learning.data_plotting
consists of notebook to assign the Transfer Learning predictions to the corresponding sequential clips.
The Entire procedure and other details are further discussed in the Research Paper.