Next: Clustering using the Feature
Up: Document Segmentation
Previous: Feature Selection
  Contents
The K-Means and other standard clustering algorithms use simple euclidean distance between two vectors for segmentation. The euclidean difference between two feature vectors, u and v, is given by
. However, the response of the image to different bands may not be equally strong [22]. To control the impact of each band individually, the difference along each component is weighed differently. The weighted distance between two feature vectors is given by:
 |
(19) |
where
is the weight assigned to the scale/orientation band in feature vector component i. It is clear that
for all i. Without loss in generality, the constraint
 |
(20) |
can also be added, as only relative weights are important.
The weights should be assigned such that the bands having high discriminatory power are assigned higher weights.
The clustering algorithm assigns a label
under the clustering scheme represented by I to each pixel i such that the aggregate clustering error:
 |
(21) |
is minimum. The objective function is a function of the clustering scheme as well as the weight vector W as the clustering error depends on the distance measure given by Equation 19.
J can be optimized in two steps:
- Optimize J with respect to cluster assignments using the K-Means algorithm.
- Optimize J with respect to the weight vector W. The procedure is described below.
- Iterate over the above two steps till the change in J is very low.
The objective function can be written as:
 |
(22) |
where n is the number of components in the feature vector, and
is the clustering error limited to the feature j.
To minimize J with respect to
, we incorporate the constraint given in Equation 20 into the objective function:
 |
(23) |
Differentiating Equation 23 with respect to
, we get
 |
(24) |
Substituting Equation 24 into Equation 20 and equating to zero, we get,
 |
(25) |
Thus, using the above formula the weights can be calculated iteratively.
Next: Clustering using the Feature
Up: Document Segmentation
Previous: Feature Selection
  Contents
2002-06-03