Free public images: a Sentinel dataset of very-high-resolution images (1973 by 2263 pixels) at 0.5 m per pixel was used. Masks (960 by 960 pixels) at 1 m per pixel were used for both building and damage detection.
Data : https://s3.eu-central-1.amazonaws.com/corupublic/AAAI_harvey_data/harvey.zip
Bicubic resampling: Bicubic resampling computes new pixel values using cubic splines. When upsampling, it operates on the 4 by 4 block of pixels surrounding each new pixel location. It is the recommended resampling method for most images because it offers a good trade-off between accuracy and speed.
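As a rough illustration of the 4 by 4 neighbourhood described above, the following pure-Python sketch upsamples a small grid with a cubic convolution kernel. The kernel parameter a = -0.5, the border clamping, the pixel-centre alignment, the scale factor, and all function names are assumptions made for this example; library implementations may differ in these details.

```python
import math


def cubic_kernel(x, a=-0.5):
    """Cubic convolution kernel; a=-0.5 is an assumed, commonly used value."""
    x = abs(x)
    if x <= 1:
        return (a + 2) * x**3 - (a + 3) * x**2 + 1
    if x < 2:
        return a * x**3 - 5 * a * x**2 + 8 * a * x - 4 * a
    return 0.0


def bicubic_sample(img, y, x):
    """Interpolate img (a list of rows) at fractional (y, x) from 4x4 neighbours."""
    h, w = len(img), len(img[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for j in range(y0 - 1, y0 + 3):          # 4 rows around the sample point
        wy = cubic_kernel(y - j)
        jj = min(max(j, 0), h - 1)           # clamp row index at the border
        for i in range(x0 - 1, x0 + 3):      # 4 columns around the sample point
            wx = cubic_kernel(x - i)
            ii = min(max(i, 0), w - 1)       # clamp column index at the border
            value += wy * wx * img[jj][ii]
    return value


def upsample_bicubic(img, scale):
    """Upsample img by an integer factor, aligning pixel centres."""
    h, w = len(img), len(img[0])
    out = []
    for r in range(h * scale):
        sy = (r + 0.5) / scale - 0.5         # output row centre in source coords
        row = []
        for c in range(w * scale):
            sx = (c + 0.5) / scale - 0.5     # output column centre in source coords
            row.append(bicubic_sample(img, sy, sx))
        out.append(row)
    return out
```

For example, `upsample_bicubic(mask, 2)` would map a hypothetical 1 m per pixel grid onto a 0.5 m per pixel grid. In practice an image library's bicubic filter would be used instead of a hand-written loop.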
The U-Net is a convolutional network architecture for fast and precise image segmentation. It is an encoder-decoder style network.
Using a U-Net with a pretrained ResNet encoder means that the encoder part of the U-Net is replaced by a ResNet initialized with pretrained weights. This is transfer learning, i.e., we don't need to train the model from scratch.
Adaptive Moment Estimation (Adam) is an optimizer that computes adaptive learning rates for each parameter. In addition to storing an exponentially decaying average of past squared gradients v_t, like RMSprop, Adam also keeps an exponentially decaying average of past gradients m_t, similar to momentum.
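The two running averages can be sketched for a single scalar parameter as follows. This is a minimal sketch, not the training code used here: the function name is hypothetical, the default hyperparameters are the ones suggested in the Adam paper, and the bias-correction step is part of standard Adam rather than something stated above.

```python
import math


def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar parameter; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad           # decaying average of gradients (m_t)
    v = beta2 * v + (1 - beta2) * grad * grad    # decaying average of squared gradients (v_t)
    m_hat = m / (1 - beta1 ** t)                 # bias correction for the zero initialisation
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


# Usage: minimise f(x) = x^2, whose gradient is 2x.
x, m, v = 1.0, 0.0, 0.0
for t in range(1, 101):
    x, m, v = adam_step(x, 2 * x, m, v, t)
```

Because the step is divided by the square root of v_hat, each parameter effectively gets its own learning rate, which is the "adaptive" part of the name.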
The 1cycle policy has three steps. First, we progressively increase the learning rate from base_lr to lr_max while progressively decreasing the momentum from mom_max to mom_min.
Second, we do the exact opposite: we progressively decrease the learning rate from lr_max to lr_max/div_factor while progressively increasing the momentum from mom_min to mom_max.
Third, we decrease the learning rate further, from lr_max/div_factor to lr_max/(div_factor x 100), and keep the momentum steady at mom_max.
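The three steps can be sketched as a schedule function that returns the learning rate and momentum for a given training step. This is an illustrative sketch under several assumptions: base_lr is taken to equal lr_max/div_factor, the interpolation is linear (some implementations use cosine annealing instead), the last 10% of steps form the third phase, and the default values and the `final_frac` parameter are hypothetical.

```python
def one_cycle(step, total_steps, lr_max=1e-3, div_factor=25.0,
              mom_max=0.95, mom_min=0.85, final_frac=0.1):
    """Return (learning_rate, momentum) for a 0-based step of a 1cycle schedule."""
    base_lr = lr_max / div_factor               # assumed starting learning rate
    final_lr = lr_max / (div_factor * 100)      # learning rate at the very end
    tail = max(1, round(total_steps * final_frac))  # length of phase 3 (assumption)
    ramp = (total_steps - tail) // 2            # length of phase 1 and of phase 2

    if step < ramp:                             # phase 1: lr up, momentum down
        p = step / ramp
        return base_lr + p * (lr_max - base_lr), mom_max + p * (mom_min - mom_max)
    if step < 2 * ramp:                         # phase 2: lr down, momentum up
        p = (step - ramp) / ramp
        return lr_max + p * (base_lr - lr_max), mom_min + p * (mom_max - mom_min)
    # phase 3: lr decays further, momentum stays at mom_max
    p = (step - 2 * ramp) / max(total_steps - 2 * ramp - 1, 1)
    return base_lr + p * (final_lr - base_lr), mom_max


# Usage: inspect the schedule over 100 steps.
schedule = [one_cycle(s, 100) for s in range(100)]
```

With these assumed defaults the learning rate peaks at lr_max at the start of the second phase, which is also where the momentum is lowest.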
Validation set Dice score (IoU): 0.86
Validation set Dice score (IoU): 0.87
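The scores above are labelled with both "Dice" and "IoU", which are related but distinct overlap metrics; the text does not say which one the numbers refer to. The sketch below computes both for flat binary masks so the difference is visible. The function name and the convention of returning 1.0 for two empty masks are assumptions made for this example.

```python
def dice_and_iou(pred, target):
    """Compute (Dice, IoU) for two equal-length flat binary masks (0/1 values)."""
    inter = sum(1 for p, t in zip(pred, target) if p and t)  # pixels set in both
    pred_sum = sum(1 for p in pred if p)
    target_sum = sum(1 for t in target if t)
    union = pred_sum + target_sum - inter
    if union == 0:
        return 1.0, 1.0   # both masks empty: treated as perfect agreement (a convention)
    dice = 2 * inter / (pred_sum + target_sum)
    iou = inter / union
    return dice, iou


# Usage: two masks that overlap on 2 of their 3 foreground pixels.
dice, iou = dice_and_iou([1, 1, 1, 0], [0, 1, 1, 1])
```

The two metrics are tied by Dice = 2 x IoU / (1 + IoU), so for any partial overlap the Dice score is the larger of the two.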
We used an end-to-end trainable neural network architecture for multiresolution, multisensor, and multitemporal satellite images and showed that it can perform building footprint and flooded building segmentation tasks, and demonstrated that publicly available imagery alone can be used for effective segmentation of flooded buildings. Our approach is applicable to different types of flood events, and could be used to predict damage caused by other types of disasters. It substantially reduces the amount of time needed to produce flood maps for first responders compared to current methods.