Reading handwritten German words in historical documents
- Resource Type
- Conference
- Authors
- Steinke, Karl-Heinz; Zhang, Yuanchen
- Source
- 2012 5th International Congress on Image and Signal Processing (CISP), pp. 1294-1298, Oct. 2012
- Subject
- Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Hidden Markov models
Handwriting recognition
Time series analysis
Feature extraction
Estimation
Training
handwriting recognition
writer recognition
HMM
DTW
Fourier
The research project “Herbar Digital” was started in 2007 with the aim of digitizing 3.5 million dried plants on paper sheets belonging to the Botanic Museum Berlin in Germany. The sheets frequently carry printed labels with handwritten annotations, written by the determiner or the finder of the plant. These annotations often describe the plant and give its name and the place where it was found. Procedures therefore have to be developed to read the most important handwritten words on the sheets. An HMM approach, a Fourier approach and a DTW approach are compared. With a limited number of words, all three methods achieve a recognition rate of about 95%.
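To illustrate the idea behind the DTW approach named in the abstract, here is a minimal sketch of dynamic time warping used for template-based word matching. The abstract does not state which features the authors extract or how they set up the comparison, so everything here is an assumption: each word is taken to be a one-dimensional feature sequence (for example a column-wise ink profile of the word image), and the names `dtw_distance`, `classify` and the sample templates are hypothetical.

```python
from math import inf


def dtw_distance(a, b):
    """Dynamic time warping distance between two 1-D numeric sequences."""
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(sample, templates):
    """Return the label of the template closest to `sample` under DTW."""
    return min(templates, key=lambda label: dtw_distance(sample, templates[label]))


# Hypothetical feature sequences for two vocabulary words (illustrative only).
templates = {
    "Berlin": [1, 3, 2, 4],
    "Flora": [5, 5, 1, 0],
}
stretched_sample = [1, 3, 3, 2, 4]  # same shape as "Berlin", written wider
predicted = classify(stretched_sample, templates)
```

Because DTW allows elements to repeat during alignment, a word written wider or narrower than its template can still match closely, which is one reason the technique suits a small, fixed vocabulary of handwritten words.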