Lab 3 - Classification and segmentation of Hyperspectral Images

  • Short Description: In this third lab we begin our investigation of the Deep Learning framework. The problem at hand is the classification of hyperspectral images, which has been a vibrant area of research in recent years. Given an image, usually obtained from a satellite, the goal is to assign each pixel to a class from a predetermined set of classes. For satellite images, one is usually interested in mapping each pixel to the corresponding land use, e.g. urban fabric, forest, water, etc. In this lab our goal is to develop the model that generalizes best, in order to make pixel-wise predictions for three satellite images.

  • The Dataset: The dataset that we use in this lab is the HyRANK dataset, which was developed by a scientific initiative of the ISPRS, WG III/4. It is composed of five hyperspectral images acquired with the Hyperion sensor on board the Earth Observing-1 satellite. After a preprocessing step, these images were provided with 176 surface reflectance bands. Ground truth was provided for two of the five images (Loukia and Dioni). 14 Land Use and Land Cover (LULC) classes were annotated in the ground truth following the CORINE Land Cover principles. The following table lists the 14 categories with their corresponding labels and color codes; label 0 marks undefined pixels.

The 14 LULC classes of the HyRANK dataset

Label Color Code Category
0 #000000 Non Defined
1 #ff0000 Dense Urban Fabric
2 #a600cc Mineral Extraction Sites
3 #ffffa8 Non Irrigated Arable Land
4 #f2a64d Fruit Trees
5 #e6a600 Olive Groves
6 #80ff00 Broad-leaved forest
7 #00a600 Coniferous-Forest
8 #a600cc Mixed Forest
9 #819c25 Dense Sclerophyllous Vegetation
10 #e6cc4d Sparse Sclerophyllous Vegetation
11 #e6e64d Sparsely Vegetated Areas
12 #cccccc Rocks and Sand
13 #4d4dff Water
14 #80f2e6 Coastal Water
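
To make the table concrete, here is a minimal sketch of how a ground-truth or prediction map could be rendered with these color codes. The names `HYRANK_COLORS`, `build_palette` and `colorize` are hypothetical helpers introduced for illustration, not part of the lab code, and the 2x2 label map is made up. The hex values are copied from the table as given.

```python
import numpy as np

# Label -> hex color, copied from the table above (label 0 is "Non Defined").
HYRANK_COLORS = {
    0: "#000000", 1: "#ff0000", 2: "#a600cc", 3: "#ffffa8", 4: "#f2a64d",
    5: "#e6a600", 6: "#80ff00", 7: "#00a600", 8: "#a600cc", 9: "#819c25",
    10: "#e6cc4d", 11: "#e6e64d", 12: "#cccccc", 13: "#4d4dff", 14: "#80f2e6",
}


def hex_to_rgb(code):
    """Convert '#rrggbb' to an (r, g, b) tuple of ints in 0..255."""
    code = code.lstrip("#")
    return tuple(int(code[i:i + 2], 16) for i in (0, 2, 4))


def build_palette(colors):
    """Return a (num_labels, 3) uint8 lookup table indexed by label."""
    palette = np.zeros((max(colors) + 1, 3), dtype=np.uint8)
    for label, code in colors.items():
        palette[label] = hex_to_rgb(code)
    return palette


def colorize(label_map, palette):
    """Map an (H, W) integer label array to an (H, W, 3) RGB image."""
    return palette[label_map]


PALETTE = build_palette(HYRANK_COLORS)
demo_labels = np.array([[0, 1], [13, 14]])  # made-up 2x2 label map
demo_rgb = colorize(demo_labels, PALETTE)
```

Indexing the palette with the label array lets NumPy do the per-pixel lookup in one step, so the same helper works for a full-size label map.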

The images of Dioni and Loukia and their corresponding ground truth labels are depicted in the following figures.

The image of Dioni in RGB composite and the corresponding ground truth labels. The size of the image is 250x1376 with 176 spectral bands and the spatial resolution is 30m. Pixels shown in black color correspond to undefined land uses.

The image of Loukia in RGB composite and the corresponding ground truth labels. The size of the image is 249x945 with 176 spectral bands and the spatial resolution is 30m. Pixels shown in black color correspond to undefined land uses.
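
Because the ground truth contains undefined pixels, any pixel-wise model probably needs a step that flattens the image cube and drops those pixels. The sketch below shows one way to do that, assuming the image is already loaded as a NumPy array of shape (height, width, bands) and that label 0 means undefined, as in the table. The function name `to_pixel_samples` and the random demo data are hypothetical; loading the actual HyRANK files is not shown.

```python
import numpy as np


def to_pixel_samples(cube, labels, undefined=0):
    """Flatten an (H, W, B) cube into (N, B) features and (N,) labels.

    Pixels whose label equals `undefined` are dropped.
    """
    h, w, b = cube.shape
    if labels.shape != (h, w):
        raise ValueError("labels must have the same height and width as cube")
    features = cube.reshape(h * w, b)   # one row per pixel, one column per band
    targets = labels.reshape(h * w)
    keep = targets != undefined         # boolean mask of labeled pixels
    return features[keep], targets[keep]


# Made-up stand-in for a real image: 4x5 pixels, 176 bands, two labeled pixels.
rng = np.random.default_rng(0)
demo_cube = rng.random((4, 5, 176))
demo_gt = np.zeros((4, 5), dtype=int)
demo_gt[1, 2] = 7
demo_gt[3, 0] = 13
X, y = to_pixel_samples(demo_cube, demo_gt)
```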

Below you can see the three validation images, Erato, Kirki and Nefeli, in RGB composite.

The validation images in RGB composite. Each image has a spatial resolution of 30m and 176 spectral bands. At the top is Erato with size 241x1623, in the middle is Kirki with size 245x1626 and at the bottom is Nefeli with size 249x772. The ground truth labels are not given and the end goal is to make predictions for these images.

  • Approach to the classification problem: For the classification of the hyperspectral images we develop several approaches and different models. At first, we train some classic ML classifiers, namely a Random Forest and a Multi-Layer Perceptron (MLP). These classifiers correspond to the pixel-based approach, where the samples consist of all the available pixels from the images of Dioni and Loukia. This approach is not the optimal one for making predictions on unknown images, e.g. Erato, Kirki and Nefeli, because of the high dependency between neighboring pixels. To this end, we develop more sophisticated approaches and models to deal with this problem. A CNN model is designed and a patch-based approach is utilized. We train the CNN both with maximum overlap between the patches and with no overlap at all. Furthermore, we use data augmentation and transfer learning techniques. In the maximum overlap case we achieve the highest score on the training images. However, this approach is not the appropriate one for making predictions on Erato, Kirki and Nefeli, but we can still use this model to completely annotate our training images. In the following figure we see the annotation of Dioni using the CNN model.

Predictions made by the CNN with maximum overlapping on Dioni.
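
The difference between maximum overlap and no overlap can be illustrated with a short patch-extraction sketch. It assumes square patches and labels each patch with its centre pixel, which is a common choice but not something this README confirms. The function name `extract_patches`, the patch size and the demo arrays are all hypothetical.

```python
import numpy as np


def extract_patches(cube, labels, patch_size, stride, undefined=0):
    """Collect square patches and the label of each patch's centre pixel.

    stride=1 gives maximum overlap; stride=patch_size gives no overlap.
    Patches whose centre pixel is undefined are skipped (an assumption).
    """
    h, w, bands = cube.shape
    half = patch_size // 2
    patches, targets = [], []
    for top in range(0, h - patch_size + 1, stride):
        for left in range(0, w - patch_size + 1, stride):
            centre = labels[top + half, left + half]
            if centre == undefined:
                continue
            patches.append(cube[top:top + patch_size, left:left + patch_size])
            targets.append(centre)
    if not patches:
        empty = np.empty((0, patch_size, patch_size, bands), dtype=cube.dtype)
        return empty, np.empty((0,), dtype=labels.dtype)
    return np.stack(patches), np.array(targets)


# Made-up 9x9 image with 4 bands where every pixel is labeled as class 1.
demo_cube = np.arange(9 * 9 * 4, dtype=float).reshape(9, 9, 4)
demo_gt = np.ones((9, 9), dtype=int)
dense_x, dense_y = extract_patches(demo_cube, demo_gt, patch_size=3, stride=1)
sparse_x, sparse_y = extract_patches(demo_cube, demo_gt, patch_size=3, stride=3)
```

With a stride of 1, neighboring patches share most of their pixels, so a random train/test split places nearly identical samples on both sides. This is one plausible reason why the maximum overlap score on the training images does not carry over to unseen images.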

The best model for making predictions on Erato, Kirki and Nefeli turns out to be the one following the UNet principles. It is trained on non-overlapping images and achieves an accuracy of 80% on the test set. Furthermore, with this approach we create a dataset with enough samples to train and test the model. In the following figures you can see the confusion matrix of the model on the test set and the predictions made on Erato, Kirki and Nefeli.

The Confusion matrix of the UNet model on the test set.

Predictions made by the UNet on Erato.

Predictions made by the UNet on Kirki.

Predictions made by the UNet on Nefeli.
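
For reference, the accuracy score and confusion matrix reported above can be computed from flat arrays of true and predicted labels as in the following sketch. The helper names `confusion_matrix` and `overall_accuracy` are hypothetical, and the label arrays are made-up toy values, not results from the lab.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Count matrix with true classes as rows and predicted classes as columns."""
    y_true = np.asarray(y_true, dtype=np.int64)
    y_pred = np.asarray(y_pred, dtype=np.int64)
    cm = np.zeros((num_classes, num_classes), dtype=np.int64)
    # np.add.at accumulates correctly when the same (row, col) pair repeats.
    np.add.at(cm, (y_true, y_pred), 1)
    return cm


def overall_accuracy(cm):
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = cm.sum()
    return float(cm.trace() / total) if total else 0.0


# Toy labels for illustration only.
demo_true = [1, 1, 2, 2, 3, 3]
demo_pred = [1, 2, 2, 2, 3, 1]
demo_cm = confusion_matrix(demo_true, demo_pred, num_classes=4)
demo_acc = overall_accuracy(demo_cm)
```

Undefined pixels (label 0) would normally be filtered out before calling these helpers, so that they do not count towards the score.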