Modeling the Markov PDFs parametrically entails data-driven optimal estimation of the parameters
associated with the GRF potential functions or the Markov PDFs, lest we enforce an ill-fitted model
on the data. Even nonparametric schemes are not free of internal parameters and one would want to
learn these parameters in a data-driven manner. Standard estimation schemes, e.g., maximum
likelihood, are not applicable in a straightforward manner for this task. Consider that we want to
estimate some parameter
in the MRF model. A ML-estimation scheme needs to evaluate the
joint PDF of all the RVs in the MRF, i.e.,
, which is a function of
. We can compute the potential functions
, as functions of
,
in a simple way. The partition function
, however, involves a
-dependent
integral over the entire
-dimensional space of possible realizations of the MRF. This
is virtually intractable for any practical dataset, or image, comprising a reasonable number of
indices
. For instance, a 256
256 pixels image results in a
D space.
Besag [14,15] devised one way to bypass this problem in the following way. Based on
his idea, we first choose a set of indices
such that the neighborhoods for the
indices in
do not overlap, i.e.,
| (80) | |||
| (81) |
![]() |
(82) |
| (83) |
A major drawback of the coding-based parameter estimation is the wastage of
data [14,15] because it utilizes only a small part
(
) of the entire data. Another drawback is that the
partition
is not unique, and different partitions produce potentially different
parameter estimates. There appears no clear way of reconciliation between these different
estimates [99].
To alleviate the drawbacks of the coding scheme, Besag [14,15] invented a simple
approximate scheme called the pseudo-likelihood estimation. This eliminated any coding
strategies and used all the data at hand. The pseudo-likelihood function
is simply the product of the conditional likelihoods at each index
,
i.e.,
![]() |
(84) |
| (85) |
The literature also presents other methods of MRF-parameter estimation such as those based on mean-field approximations and least-squares fitting [99].