Xi Shen¹, Ilaria Pastrolin², Oumayma Bounou², Spyros Gidaris³, Marc Smith², Olivier Poncet², Mathieu Aubry¹
¹ LIGM (UMR 8049) - Ecole des Ponts, UPE; ² Ecole Nationale des Chartes; ³ Valeo AI
Historical watermark recognition is a highly practical, yet unsolved challenge for archivists and historians. With a large number of well-defined classes, cluttered and noisy samples, different types of representations, subtle differences between classes and high intra-class variation, historical watermarks are also challenging for pattern recognition. In this paper, overcoming the difficulty of data collection, we present a large public dataset with more than 6k new photographs, which makes it possible for the first time to tackle at scale the scenarios of practical interest for scholars: one-shot instance recognition and cross-domain one-shot instance recognition amongst more than 16k fine-grained classes. We demonstrate that this new dataset is large enough to train modern deep learning approaches, and show that standard methods can be improved considerably by using mid-level deep features. More precisely, we design both a matching score and a feature fine-tuning strategy based on filtering local matches using spatial consistency. This consistency-based approach provides an important performance boost compared to strong baselines. Our model achieves 55% top-1 accuracy on our very challenging 16,753-class one-shot cross-domain recognition task, in which each class is described by a single drawing from the classic Briquet catalog. In addition to watermark classification, we show that our approach provides promising results on fine-grained sketch-based image retrieval.
A watermark was made by pressing water-coated metal onto the paper during manufacturing. Watermarks appear on almost all paper from the XIVth to the XIXth century.
Recognizing watermarks helps historians, archivists, auction houses and collectors to locate and date papers, which is crucial for analysing and assessing documents.
We release the watermark dataset composed of 4 parts targeting 4 different tasks:
It contains 100 classes: 50 images / class for training and 10 images / class for validation. This part can be used to train a traditional classification neural network.
It contains 100 classes: 1 clean watermark as reference (without any text) + 2 query photographs in each class. This part can be used to evaluate one-shot recognition.
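One-shot recognition of this kind is usually scored by ranking the single reference of every class for each query and checking whether the true class comes first (top-1) or within the first k (top-k). The following is a minimal sketch of such an evaluation, assuming each image has already been turned into a feature vector by some network and that similarity is plain cosine similarity. The function name `one_shot_topk_accuracy`, the array layout and the toy data are illustrative assumptions; this is a generic baseline, not the consistency-based matching score described in the paper.

```python
import numpy as np


def one_shot_topk_accuracy(query_feats, query_labels, ref_feats, k=1):
    """Fraction of queries whose true class is among the k most similar references.

    Assumption: ref_feats[c] is the single reference feature of class c
    (one shot per class), and query_labels holds the class index of each query.
    """
    q = np.asarray(query_feats, dtype=np.float64)
    r = np.asarray(ref_feats, dtype=np.float64)
    labels = np.asarray(query_labels)

    # L2-normalise so that the dot product equals cosine similarity.
    q = q / np.linalg.norm(q, axis=1, keepdims=True)
    r = r / np.linalg.norm(r, axis=1, keepdims=True)

    sims = q @ r.T  # shape: (num_queries, num_classes)

    # Indices of the k highest-scoring classes for each query.
    topk = np.argsort(-sims, axis=1)[:, :k]
    hits = (topk == labels[:, None]).any(axis=1)
    return float(hits.mean())


# Toy data (hypothetical): 3 classes, 1 reference each, 2 queries per class.
refs = np.eye(3)
queries = np.array([
    [0.9, 0.1, 0.0],
    [0.8, 0.0, 0.2],
    [0.1, 0.9, 0.0],
    [0.0, 0.7, 0.3],
    [0.2, 0.1, 0.9],
    [0.6, 0.0, 0.4],  # class 2 query that looks more like class 0
])
labels = [0, 0, 1, 1, 2, 2]

top1 = one_shot_topk_accuracy(queries, labels, refs, k=1)
top2 = one_shot_topk_accuracy(queries, labels, refs, k=2)
print(f"top-1: {top1:.3f}, top-2: {top2:.3f}")
```

In the toy data the last query is ranked closest to the wrong class, so it counts as a top-1 miss but a top-2 hit. For this part of the dataset, `ref_feats` would hold 100 rows (one clean reference per class) and `query_feats` 200 rows (two photographs per class).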
It contains:
This part can be used to evaluate one-shot cross-domain recognition.
It contains 16,753 classes, with only 1 drawing in each class. The drawings in Dataset B-cross-domain are included in Briquet. This part can be used to evaluate large-scale one-shot cross-domain recognition.
Click here to download the whole dataset (~400M). It also includes the synthetic references mentioned in the paper.
We show some top retrieval results here and provide more visual results in the following webpages:
To cite our paper,
@inproceedings{shen2020watermark,
  title={Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach},
  author={Shen, Xi and Pastrolin, Ilaria and Bounou, Oumayma and Gidaris, Spyros and Smith, Marc and Poncet, Olivier and Aubry, Mathieu},
  booktitle={ICPR},
  year={2020}
}
This work was partly supported by the ANR project EnHerit ANR-17-CE23-0008, the PSL Filigrane pour tous project, and gifts from Adobe to Ecole des Ponts.