Next: Maximum-Likelihood (ML) Estimation
Up: Technical Background
Previous: Random Variables
In practice, we only have access to the data that a physical process generates rather than the
underlying RVs or PDFs. Statistical inference refers to the process of using observed data to
estimate the forms of the PDFs of the RVs, along with any associated parameters, that
model the physical processes fairly accurately. The foundations of modern statistical
analysis were laid down by Sir Ronald A. Fisher in the early 1900s.
In the statistical-inference terminology, a population is the set of elements about which we
want to infer. A sample is a subset of the population that is actually observed. Thus, the
goal is to learn about the statistical characteristics of the population from the sample data. Let
us consider an RV
, with the associated PDF
, that models some physical process and
produces a set of
independent observations
. The goal is to infer
some properties of
from its observations. For instance, knowing that
was of a Gaussian
form, we may want to determine the exact value for its mean and variance parameters such that the
observed data best conform with the specific Gaussian model. We can consider each observation
as the value of an RV
. Such a set of RVs
constitutes a
random sample, and comprises a set of mutually independent RVs that are identically
distributed:
 |
|
|
(24) |
Suppose we want to estimate a particular parameter
associated with the PDF of
. Here we
assume that the data were derived from the PDF
. A statistic
is any deterministic function of the random sample and, hence, an RV itself. An estimator is
a statistic
that is used to estimate the value of some
parameter
. Some properties of an estimator are highly desirable, e.g.,:
As an example, for an RV
, an unbiased and consistent estimator of its mean, or expectation, is
the sample mean [167],
 |
|
|
(28) |
Another interesting example is that of the empirical CDF of a discrete RV, which is a consistent
estimator of the true CDF
[167]. The empirical CDF for a discrete RV is
 |
|
|
(29) |
where
is the Heaviside step (unit step) function.
Subsections
Next: Maximum-Likelihood (ML) Estimation
Up: Technical Background
Previous: Random Variables
Suyash P. Awate
2007-02-21