Template Generation from Postmarks Using Cascaded Unsupervised Learning

Elisa H. Barney Smith, Gernot Fink

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Information in historical datasets comes in many forms. We are working with a set of World War I era postcards that contain hand written text, some preprinted text, postage stamps and postmark/cancellation stamps. The postmarks are of considerable interest to collectors looking for images of samples they had not previously seen. The postmarks also provide information on the originating location of the card that complements the information in the address block.

The postmarks vary considerably with towns and dates, but also styles. The styles can be grouped into categories. A method for automatically extracting templates for each category of these postmark stamps is described. The problem is complicated by the high levels of degradation present in the cards. The approach uses a cascade of unsupervised learning steps separated with image cleaning. This introduces averaging steps, which reduces noise. It also provides a reduction in the number of comparisons between samples. While merges happen at each stage, the number of times merges are needed within each stage is reduced. The templates once extracted can be used to group the postmarks, and will contribute information about the postmark content to better separate the postmark from the paper and other interfering marks to extract further information about the postmarks and postcards.

Original languageAmerican English
JournalHIP '15: Proceedings of the 3rd International Workshop on Historical Document Imaging and Processing
DOIs
StatePublished - 1 Jan 2015

Keywords

  • document seal recognition
  • image clustering
  • sequential learning

EGS Disciplines

  • Electrical and Computer Engineering

Fingerprint

Dive into the research topics of 'Template Generation from Postmarks Using Cascaded Unsupervised Learning'. Together they form a unique fingerprint.

Cite this