In this Section, we present the results of the Gabor filter based script segmentation approach described above. The algorithm was tested on a few textures and multi-script documents. The textures used are the Brodatz textures. The algorithm works well for any placements of the textures within the image. The multi-script documents were obtained by combining different Indian scripts obtained from Indian web sites containing newspaper articles in Indian scripts.
The following table shows the 5 top features for segmenting English and Hindi scripts:
Table 1:
Features used for segmenting English and Hindi
Scale
Orientation
3
0
6
162
2
24
6
12
6
174
The English script has a high response to the 0 degree Gabor filer because of the number of vertical segments in the script. The Hindi script is tilted a bit to its right and has a high response to the 162 and 174 degree filters.
The following table shows the 5 top features for segmenting Tamil and English scripts:
Table 2:
Features used for segmenting Tamil and English
Scale
Orientation
5
150
6
24
6
156
4
12
1
0
The Tamil script is more curved than the English script and hence has a higher response to the 150, 156, 24 and 12 degree Gabor filters than the English script. The English script has higher response to the 0 degree filter.
The following table shows the 5 top features for segmenting Gujarati and Hindi scripts:
Table 3:
Features used for segmenting Gujarati and Hindi
Scale
Orientation
4
90
6
24
6
66
2
60
2
24
The Hindi script has a high response to the 90 degree filter because of its headline feature. Note that this feature cannot be used for differentiating it from English which also has a high response to the 90 degree Gabor filter. The other features get selected because of the more rounded Gujarati script.
The algorithm was tried on a few textures and the following results obtained:
Figure 8:
Result of segmenting two textures
The results obtained on some multi-script images are shown below:
Figure 9:
Result of segmenting a document containing Tamil and English scripts
Figure 10:
Result of segmenting a document containing English and Hindi scripts
Figure 11:
Result of segmenting a document containing Gujarati and Hindi scripts
Figure 12:
Result of segmenting a document containing English and Hindi scripts in a more complex layout
Figure 13:
Result of segmenting a document containing Tamil, English and Hindi scripts