Results

Training

This section details the results of training the neural nets to the various characters. 250 samples of the digits 0 through 9 were used. As expected, the neural nets converge to have very few misrecognized values. In this case, only 3 of 250 were incorrectly identified.

Figure 1 shows a graphical breakdown of errors. It is a graph of expected values versus determined values. A perfect system would be a diagonal of complete squares. Any white bars outside the diagonal are misrecognitions. (Note: This graph is actually a combination of Training + High Resolution tests, but in general shows the desired training.)

Figure 1 - Training + High Res errors

0 - 0 / 25
1 - 0 / 25
2 - 1 / 25
3 - 0 / 25
4 - 0 / 25
5 - 0 / 25
6 - 2 / 25
7 - 0 / 25
8 - 0 / 25
9 - 0 / 25
Total: 3 / 250 (1.2%)

Low Resolution Tests

Originally we had planned for this to be our objective test sample. However, it was a full page of numbers scanned at too low a resolution. This resulted in many difficulties identifying numbers, even to the human eye. The number of misrecognized values is pretty severe, 28 of 128 or 23.33%. In one case, for nines, half were incorrectly detected.

Figure 2 - Low Resolution Test Sample

Figure 3 - Low Resolution Test Errors

In Figure 3, notice the numerous bands outside the diagonal.

0 - 1 / 12
1 - 3 / 12
2 - 5 / 12
3 - 5 / 12
4 - 1 / 12
5 - 2 / 12
6 - 0 / 12
7 - 3 / 12
8 - 2 / 12
9 - 6 / 12
Total: 28 / 120 (23.33%)

High Resolution Tests

These samples were of a higher resolution, the same as the training samples. As expected, this results in much better recognition. Only 13 of 150, or 8.6%, were misrecognized. Only 2 and 5 had serious problems, resulting in 10 of the 13 errors. Both of these are very similar to other characters resulting in easy misrecognition.

High Resolution Test Errors

As shown by Figure 2, our OCR system does much better with these higher resolution samples.

0 - 0 / 15
1 - 1 / 15
2 - 5 / 15
3 - 1 / 15
4 - 1 / 15
5 - 4 / 15
6 - 0 / 15
7 - 0 / 15
8 - 1 / 15
9 - 0 / 15
Total: 13 / 150 (8.6%)

Results Summary

Training - 1.2%
Low Resolution - 23.33%
High Resolution - 8.6%

Postal Sporks (harton@rice.edu)