Sampling and Estimation from Finite Populations. Yves Tille
where
Definition 2.1
A sampling design without replacement
Definition 2.2
A random sample
A random sample can also be defined as a discrete random vector composed of non‐negative integer variables
Often, we try to select the sample as randomly as possible. The usual measure of randomness of a probability distribution is the entropy.
Definition 2.3
The entropy of a sampling design is the quantity
We suppose that
We can search for sampling designs that maximize the entropy, with constraints such as a fixed sample size or given inclusion probabilities (see Section 2.3). A very random sampling design has better asymptotic properties and allows a more reliable inference (Berger, 1996, 1998a; Brewer & Donadio, 2003).
The sample size
When the sample size is not random, we say that the sample is of fixed sample size and we simply denote it by
The variables are observed only on the units selected in the sample. A statistic
The variance operator is defined using the expectation operator:
2.3 Inclusion Probabilities
The inclusion probability
for all
The second‐order inclusion probability (or joint inclusion probability)
for all
The variance of the indicator variable
which is the variance of a Bernoulli variable. The covariances between indicators are
One can also use a matrix notation. Let
be a column vector. The