Skip to content

Pytorch code for "CoinNet: Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention."

Notifications You must be signed in to change notification settings

saeed-anwar/CoinNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 

Repository files navigation

CoinNet: Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention

This repository is for CoinNet: Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention (CoinNet) introduced in the following paper

Hafeez Anwar, Saeed Anwar, Sebastian Zambanini, Fatih Porikli, "CoinNet: Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention", Pattern Recognition, 2021, PDF and arXiv

The model is built in

  • PyTorch 1.1.0,
  • Ubuntu 14.04/16.04 environment
  • Python 3.7.5
  • CUDA9.0
  • cuDNN5.1

Contents

  1. Introduction
  2. Network
  3. Test
  4. Dataset
  5. Results
  6. Citation

Introduction

We perform classification of ancient Roman Republican coins via recognizing their reverse motifs where various objects, faces, scenes, animals, and buildings are minted along with legends. Most of these coins are eroded due to their age and varying degrees of preservation, thereby affecting their informative attributes for visual recognition. Changes in the positions of principal symbols on the reverse motifs also cause huge variations among the coin types. Lastly, in-plane orientations, uneven illumination, and a moderate background clutter further make the task of classification non-trivial and challenging. To this end, we present a novel network model, CoinNet, that employs compact bilinear pooling, residual groups, and feature attention layers. Furthermore, we gathered the largest and most diverse image dataset of the Roman Republican coins that contains more than 18,000 images belonging to 228 different reverse motifs. On this dataset, our model achieves a classification accuracy of more than 98% and outperforms the conventional bag-of-visual-words based approaches and more recent state-of-the-art deep learning methods. We also provide a detailed ablation study of our network and its generalization capability.

Variations in the anatomy of the reverse motifs due to the positions of the symbol (Red-dotted line border), main object(Blue-Solid line border), and legend(Orangedashed line border).

Network

The following figure shows the architecture of our network

Network Our model highlighting the Compact Bilinear Pooling, residual blocks, skip connections, and feature attention. The green and yellow cubes indicate the embedded features via CNN networks.

Test

Quick start

  1. Download the trained models for our paper and place them in '..'. The model can be downloaded from Google Drive. The size is 409MB

  2. Cd to 'CoinNet_Model', run the following scripts. You can use the following script to test the algorithm

    #CoinNet
    CUDA_VISIBLE_DEVICES=0 python test.py

Dataset

The dataset can be downloaded from Google Drive. The size of the dataset is 8.65GB

Samples images of the 100 classes that constitute the RRCD-Main.

Results

Quantitative Results

Comparison of our method with state-of-the-art methods on train-test split of 30%-70%. All results reported as top-1 mean accuracy on the test set

Ablation Study

Comparison of different input features combinations to our CoinNet. Our network is robust to the change in the input features such as generated via ResNet50 (r50), DenseNet161 (d161) and Vgg19.

The effect of the vocabulary size on the classification performance for BoVWs and rectangular tiling.

Accuracy on the unseen coin types for competing CNNs

Visual Results

Visualization results from Grad-CAM. The visualization is computed for the last convolutional outputs, and the ground-truth labels are shown on the left column the input images.

The correctly classified images are represented with green circles while the wrongly classified ones are in red circles. In the first row, the confidence of the NasNet is always low although the model can classify correctly. The second shows that the confidence of the VGG, which is consistently high even for wrongly classified classes. The traditional classifiers as the CNN methods may be benefiting from the weights of ImageNet.

Citation

If you find the code helpful in your resarch or work, please cite the following papers.

@article{Anwar2021CoinNet,
    title = {Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention},
    journal = {Pattern Recognition},
    pages = {107871},
    year = {2021},
    issn = {0031-3203},
    doi = {https://doi.org/10.1016/j.patcog.2021.107871},
    url = {https://www.sciencedirect.com/science/article/pii/S0031320321000583},
    author = {Hafeez Anwar and Saeed Anwar and Sebastian Zambanini and Fatih Porikli},
}

About

Pytorch code for "CoinNet: Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention."

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published