Tony Tsai

May the force of science be with you

Apr 23, 2015 - 2 minute read - Comments - Statistical Computing

Why Does the Jackknife Estimate of Standard Error Have the Factor (n-1)/n?

Today I read 7.2 The Jackknife in Stastical Computing with R and found the explanation for why the jackknife estimate of standard error have the factor (n1)/n is unclear. I refered to An Introduction to the Bootstrap by Bradley Efron and R. J. Tibshirani, and the slides of jackknife by Rozenn Dahyot to figure out the reason. Here is my understanding for the existence of factor (n1)/n.

The jackknife samples are computed by leaving out one observation xi from a sample x=(x1,x2,,xn) at a time: x(i)=(x1,x2,,xi1,xi+1,,xn) for i=1,2,,n. The ith jackknife replication θ^(i) of the statistic θ^=s(x) is θ^(i)=s(x(i)),i=1,2,,n. Thus, for θ^=x¯, we have s(x(i))=x¯(i) =1n1jixj =nx¯xin1

The jackknife estimate of standard error is defined by se^jack=[n1ni=1n(θ^(i)θ^())2]1/2 where θ^()=1ni=1nθ^(i).

The exact form of the factor (n1)/n in the above formula is derived by considering the special case θ^=x¯. For θ^=x¯, it is easy to show that x¯()=1ni=1nx¯(i) =1ni=1nnx¯xin1 =nn1x¯1n11ni=1nxi =nn1x¯1n1x¯ =x¯ Therefore, se^jack=[n1ni=1n(θ^(i)θ^())2]1/2 =[n1ni=1n(x¯(i)x¯())2]1/2 =[n1ni=1n(nx¯xin1x¯)2]1/2 =[1n(n1)i=1n(xix¯)2]1/2 =[1n1i=1n(xix¯)2]1/2n =σ^n =se^(x¯) where se^(x¯) is the unbiased estimate of the standard error of the sample mean, and σ^ is unbiased estimate of the standard deviation of the population.

The derivation suggests that the factor (n1)/n os exactly what is needed to make se^jack equal to the unbiased estimate of the standard error of the sample mean se^(x¯). It is a somewhat arbitrary convention that se^jack uses the factor (n1)/n, as in fact, the factor (n1)/n is derived by considering the special case θ^=x¯.

Tags: Jackknife Standard Error Resampling

Stratified Importance Sampling Resources for Influenza Research