Machine learning Model for Crop Type Classification in Central Asia

Summary: This computational report was developed using Python in a Jupyter Notebook environment as the final assignment for GEOG3300 (Advanced Spatial Analysis) unit. It describles a spatial data analysis workflow that develops and evaluates a crop type classification maching learning model for agricultural landscapes in Central Asia.

0. Crop type classification workflow diagram

1. Import data from MLHub and explore data. Create a bounding box for query
(Example from one of the 4 sampling areas)

2. Use the bounding box to get NDVI (left) and Rededge (right) data
during the cropping season from Sentinel-2 to use as predictors

3. Visualize the differences in band signatures of each crop type

4. Divide dataset into training and test set, then run the Random Forest Classifier.
Evaluate the model with classification report and confusion matrix

5. Compute permutation importance to determine the perforomance of each predictor

6. Visualize ground truth labels and predicted labels of each cropping field