MATH 313: Survey Design and Samping
\[ P(\text{selection of element } i) = \delta_i \]
Sampling typically involves randomly selecting elements based on their defined probabilities.
Sampling with Replacement: Each element has an equal or specified chance to be chosen again after it is selected.
Sampling without Replacement: Once an element is chosen, it is not returned to the pool for the remainder of the sampling process.
Impact on Variability:
\[\begin{align*} \hat{\tau} = \frac{1}{n} \sum_{i=1}^n \frac{y_i}{\delta_i} \end{align*}\]
\[\begin{align*} \delta_i = \frac{y_i}{\tau} \end{align*}\]
Impact: This approach reduces variance and increases the accuracy of the estimates.
\[\begin{align*} \hat{\tau} = \sum_{i=1}^n \frac{y_i}{\pi_i} \end{align*}\]
# Possible samples and their probabilities
samples <- list(c(1, 2), c(1, 3), c(1, 4), c(2, 3), c(2, 4), c(3, 4),
c(1, 1), c(2, 2), c(3, 3), c(4, 4))
# Compute tau_hat for each sample, ensuring to divide by the sample size
tau_hats <- sapply(samples, function(s) sum(elements[s] / deltas[s]) / length(s))
# Check tau_hats values
tau_hats
[1] 15.00 8.75 10.00 13.75 15.00 8.75 10.00 20.00 7.50 10.00
The estimator \(\hat{\tau}\) for each sample is calculated using the formula:
\[ \hat{\tau} = \frac{1}{n} \sum_{i=1}^n \frac{y_i}{\delta_i} \]
# Calculate pair probabilities
sample_probs <- c(
0.1*0.1 * 2, # (1, 2)
0.1*0.4 * 2, # (1, 3)
0.1*0.4 * 2, # (1, 4)
0.1*0.4 * 2, # (2, 3)
0.1*0.4 * 2, # (2, 4)
0.4*0.4 * 2, # (3, 4)
0.1*0.1, # (1, 1)
0.1*0.1, # (2, 2)
0.4*0.4, # (3, 3)
0.4*0.4 # (4, 4)
)
sample_probs
[1] 0.02 0.08 0.08 0.08 0.08 0.32 0.01 0.01 0.16 0.16