Professional Documents
Culture Documents
DOI 10.1007/s40304-015-0069-7
Abstract This paper presents an efficient class of estimators for estimating the population mean of the variate under study in two-phase sampling using information
on several auxiliary variates. The expressions for bias and mean square error (MSE)
of the proposed class have been obtained using Taylor series method. In addition, the
minimum attainable MSE of the proposed class is obtained to the first order of approximation. The proposed class encompasses a wide range of estimators of the sampling
literature. Efficiency comparison has been made for demonstrating the performance
of the proposed class. An attempt has been made to find optimum sample sizes under
a known fixed cost function. Numerical illustrations are given in support of theoretical
findings.
Keywords
62D05
1 Introduction
Information on auxiliary variates are widely utilized for obtaining precise estimator
of a population parameter such as the mean, the variance, or the distribution function
of a study variate. A great variety of techniques (i.e., ratio, product, regression, ratio-
Gajendra K. Vishwakarma
vishwagk@rediffmail.com
Manish Kumar
manishstats88@gmail.com
Department of Applied Mathematics, Indian School of Mines, Dhanbad, Jharkhand 826004, India
123
G. K. Vishwakarma, M. Kumar
(1.1)
123
(1.2)
where u = x/
X , v = sx2 /Sx2 , t (, ) is a parametric function such that t (1, 1) = 1,
and the function t is continuous and bounded, with continuous and bounded first and
second partial derivatives in R.
Diana and Perri [4] developed a general class of estimators for Y using information
on means and variances of multi-auxiliary variates in SRSWOR as
yg = g( y , x , s) = g( y , x1 , x2 , . . . , x p , s12 , s22 , . . . , s 2p ),
(1.3)
(1.4)
where h is a function such that h(Y , X 1 , X 2 , X 1 , X 2 ) = Y and satisfies certain regularity conditions as given in Srivastava [17].
In this paper, an attempt is made to propose a general class of estimators for the finite
population mean of the study variate Y in two-phase sampling using information on
multi-auxiliary variates. The asymptotic expression for the mean square error (MSE)
of the proposed class is obtained along with its minimum attainable value. Some of the
estimators of the sampling literature are identified as members of the proposed class.
Efficiency comparison is made for judging the merit of the proposed class over the
usual unbiased estimator. In addition, the optimum sample sizes of the first-phase and
second-phase samples along with the optimum MSEs have been obtained for the proposed class and its members. Lastly, an empirical study is carried out for demonstrating
the magnitude of possible gain in efficiency, which is followed by Sect. 7.
(2.1)
where the vector v assumes values in a bounded closed convex subset, R, of pdimensional real space containing the unit vector e. Also, the function m(v) of the
vector v satisfies the following conditions:
(i) It is continuous and bounded in R;
123
G. K. Vishwakarma, M. Kumar
(ii) Its first and second partial derivatives exist and are continuous and bounded in
R;
(iii) m(e) = 1.
Now expanding m(v) at the point v = e in a second-order Taylors series, we have
1
m(v) = m(e) + (v e) m (e) + (v e) m (v )(v e)
2
1
m(v) = 1 + m (e) + m (v ),
2
or
(2.2)
(2.3)
0 =
Then, we have
E( ) = 0, E(02 ) = f 1 C02 ,
E(0 ) = ( f 1 f 2 )b = f 3 b ,
E( ) = ( f 1 f 2 )A = f 3 A,
(2.4)
where f 1 = n1 N1 , f 2 = n1 N1 , f 3 = f 1 f 2 = n1 n1 .
C0 denotes the coefficient of variation of study variate Y , 0 denotes the (1 p) row
vector consisting of all zero elements, and b denotes the (1 p) row vector, which
is given by
b = b1 b2 . . . b p ; bi = 0i C0 Ci , (i = 1, 2, . . . , p).
Also, A denotes the ( p p) positive definite matrix and is given by
A = [ai j ] = [i j Ci C j ] , (i, j = 1, 2, . . . , p),
123
ym Y = Y 0 + m (e) .
(2.7)
Squaring both sides of (2.7), taking expectation and using the results in (2.4), we
obtain the MSE of ym to the terms of order O(n 1 ) as
MSE( ym ) = Y 2 f 1 C02 + f 3 2b m (e) + m (e) Am (e) .
(2.8)
(2.9)
(2.10)
123
G. K. Vishwakarma, M. Kumar
Table 1 Members of the class of estimators ym
Authors
Sukhatme [20]
Srivastava [16]
Singh and Vishwakarma [13]
Kumar and Bahl [6]
Estimators
y Rd = y xx
y Pd = y xx
yds = y xx
Y Md = y xx ++xx
;
y D R = y xx = y (1 + g) g x
x
x = (1 + g)x g x ; g = n nn
x
Y ReMd = y ex p xx
+ x
x
Y PeMd = y ex p xx
+ x
(d)
y R P = y k x + (1 k) xx
x
y Pd R = y x + (1 ) xx
x
= y xx + (1 ) (1 + g) g xx
;
x = (1 + g)x g x ; g = n nn
In sampling literature, several estimators have been suggested in two-phase sampling by following different approaches and techniques. Also, many of them can be
regarded as particular members of the proposed class ym . Any estimator, which is a
member of ym , is optimum if it attains the minimum MSE as given in (2.10).
In Table 1, some of the members of the proposed class have been listed. The
expressions for the MSE of these members are given to the terms of order O(n 1 ). It
has also been verified, with the help of an empirical study, that some of these members
do not attain the minimum variance bound (MVB) as that in (2.10) and hence are not
optimum.
In Table 1, , and k denote the scalars, which are suitably determined so as to
minimize the MSEs of the concerned estimators. Also, the expressions for the MSEs
of various estimators to the terms of order O(n 1 ) are given by
MSE( y Rd ) = Y 2 f 1 C02 +
MSE( y Pd ) = Y 2 f 1 C02 +
MSE( yds ) = Y 2 f 1 C02 +
MSE(Y Md ) = Y 2 f 1 C02 +
MSE( y D
f 1 C02 +
)
=
Y
R
123
f 3 C12 201 C0 C1 ,
f 3 C12 + 201 C0 C1 ,
f 3 2 C12 201 C0 C1 ,
1
,
f 3 F 2 C12 + 2F01 C0 C1 ; F =
1+
f 3 g 2 C12 2g01 C0 C1 ,
(3.1)
(3.2)
(3.3)
(3.4)
(3.5)
MSE(Y ReMd ) = Y 2 f 1 C02 +
MSE(Y PeMd ) = Y 2 f 1 C02 +
2 f 1 C02 +
MSE( y (d)
RP) = Y
MSE( y Pd R ) = Y 2 f 1 C02 +
1 2
C1 01 C0 C1 ,
(3.6)
4
1 2
(3.7)
C1 + 01 C0 C1 ,
f3
4
(3.8)
f 3 (1 2k)2 C12 + 2(1 2k)01 C0 C1 ,
f 3 2 C12 + 201 C0 C1 ; = (1 + g) g,
f3
(3.9)
where the term C1 denotes the coefficient of variation of X , and 01 denotes the
correlation coefficient between Y and X .
Furthermore, the minimum attainable MSEs of the estimators yds , Y Md , y (d)
R P and
2
,
f 3 01
2
,
f 3 01
2
,
f 3 01
2
.
f 3 01
(3.10)
(3.11)
(3.12)
(3.13)
4 Efficiency Comparison
The variance of the sample mean y in SRSWOR is
V ( y ) = f 1 Y 2 C02 .
(4.1)
For making efficiency comparison of the proposed class ym with the sample mean y ,
we have from (2.8) and (4.1),
(i) MSE( ym ) = V ( y ) if
m (e) = 2A1 b .
(ii) MSE( ym ) < V ( y ) if
m (e) = A1 b .
123
G. K. Vishwakarma, M. Kumar
Let c1 and c2 be the costs per unit associated with the first-phase and second-phase
samples, respectively. The total cost of the survey is, therefore, given by
c = c1 n + c2 n .
(5.1)
The optimum values of n and n that minimize the MSE of ym for fixed cost c c0 ,
are obtained by using the function as
= MSE( ym ) + (c1 n + c2 n c0 ),
(5.2)
,
S02
n
N
n
n
(5.3)
where S0 = Y C0 , and = Y 2 2b m (e) m (e) Am (e) .
Substituting (5.3) in (5.2), we have
=
1
1
n
N
S02
1
1
n
n
+ (c1 n + c2 n c0 ).
(5.4)
Differentiating (5.4) with respect to n and n, equating the results to zero, and using
(5.1), we obtain the optimum values of n and n as
c A
n =
,
c2 + c1 A
n
c
n= =
,
A
c2 + c1 A
where A =
(5.5)
(5.6)
c2
.
c1 (S02 )
Hence, substituting the values of n and n from (5.5) and (5.6) in (5.3), the optimum
MSE of the proposed class ym is obtained as
MSE( ym )opt
c
(c2 + c1 A)
1
2
S0 1
.
1
=
c
N (c2 + c1 A)
A
(5.7)
Remark 5.1 On considering the minimum MSE of ym at (2.10), the optimum values
of n and n for fixed cost c c0 are obtained on replacing A in (5.5) and (5.6) by A ,
123
n
y
y Rd
y Pd
yds
Y Md
y D R
Y ReMd
Y PeMd
(d)
y R P
y Pd R
n
c
c2
c A
1
c2 + c1 A1
c A
2
c2 + c1 A2
c A
3
c2 + c1 A3
c A
4
c2 + c1 A4
c
c2 +c1 A1
c A5
c2 + c1 A5
c A
6
c2 + c1 A6
c A
7
c2 + c1 A7
c A
8
c2 + c1 A8
c A
9
c2 + c1 A9
c
c2 + c1 A5
c
c2 + c1 A2
c
c2 + c1 A3
c
c2 + c1 A4
c
c2 + c1 A6
c
c2 + c1 A7
c
c2 + c1 A8
c
c2 + c1 A9
where
A =
Optimum MSEs
c2
1
2
c N S0
(c2 + c1 A1 )
c
1
S02 1 1
1
c
N (c2 + c1 A1 )
A1
(c2 + c1 A2 )
c
1
S02 1 1
2
c
N (c2 + c1 A2 )
A2
(c2 + c1 A3 )
c
1
2
1
S0 1
3
c
N (c2 + c1 A3 )
A3
(c2 + c1 A4 )
c
1
S02 1 1
4
c
N (c2 + c1 A4 )
A4
(c2 + c1 A5 )
c
2 1 1
1
S
5
c
0
N (c2 + c1 A5 )
A5
(c2 + c1 A6 )
c
1
S02 1 1
6
c
N (c2 + c1 A6 )
A6
(c2 + c1 A7 )
c
1
S02 1 1
7
c
N (c2 + c1 A7 )
A7
(c2 + c1 A8 )
c
1
S02 1 1
8
c
N (c2 + c1 A8 )
A8
(c2 + c1 A9 )
c
1
S02 1 1
9
c
N (c2 + c1
A9 )
A9
c2
; = S02 R 2 ,
c1 (S02 )
c
(c2 + c1 A )
1
.
1
S02 1
c
N (c2 + c1 A )
A
The optimum values of n and n along with the optimum MSEs of various estimators
for fixed cost c c0 are listed in Table 2.
The notations used in Table 2 are as follows:
A1 =
A2 =
A3 =
A4 =
A5 =
c2 1
c1 (S02 1 )
c2 2
c1 (S02 2 )
c2 3
c1 (S02 3 )
c2 4
c1 (S02 4 )
c2 5
c1 (S02 5 )
; 1 = Y 2 C12 + 201 C0 C1 ,
; 2 = Y 2 C12 201 C0 C1 ,
; 3 = Y 2 2 C12 + 201 C0 C1 ,
; 4 = Y 2 F 2 C12 2F01 C0 C1 ,
; 5 = Y 2 g 2 C12 + 2g01 C0 C1 ,
123
G. K. Vishwakarma, M. Kumar
c2 6
c1 (S02 6 )
c2 7
A7 =
c1 (S02 7 )
c2 8
A8 =
c1 (S02 8 )
c2 9
A9 =
c1 (S02 9 )
A6 =
1
; 6 = Y 2 C12 + 01 C0 C1 ,
4
1
; 7 = Y 2 C12 01 C0 C1 ,
4
; 8 = Y 2 (1 2k)2 C12 2(1 2k)01 C0 C1 ,
; 9 = Y 2 2 C12 201 C0 C1 .
6 Empirical Study
To examine the merits of the proposed class ym , three natural population datasets
have been considered. The description of the populations and the values of various
parameters are given as follows:
Population ISource: Cochran [1]
Y : Number of placebo children,
X 1 : Number of paralytic polio cases in the placebo group,
X 2 : Number of paralytic polio cases in the not inoculated group,
N = 34, n = 15, n = 10, Y = 4.92, X 1 = 2.59, X 2 = 2.91, 01 = 0.7326,
02 = 0.6430, 12 = 0.6837, C02 = 1.0248, C12 = 1.5175, C22 = 1.1492 .
Population IISource: Srivnstava et al. [19]
Y : The measurement of weight of children,
X 1 : Mid arm circumference of children,
X 2 : Skull circumference of children,
N = 55, n = 30, n = 18, Y = 17.08, X 1 = 16.92, X 2 = 50.44, 01 = 0.54,
02 = 0.51, 12 = 0.08, C02 = 0.0161, C12 = 0.0049, C22 = 0.0007.
Population IIISource: Sahoo and Swain [10]
Y : Yield of rice per plant,
X 1 : Number of tillers,
X 2 : Percentage of sterility,
N = 50, n = 30, n = 15, Y = 12.842, X 1 = 9.04, X 2 = 18.77, 01 =
0.7133,
02 = 0.2509, 12 = 0.0224, C0 = 0.3957, C1 = 0.2627, C2 = 0.0970.
The MSEs along with the percent relative efficiencies (PREs) of various estimators of
Y have been computed and findings are presented in Table 3. The PREs are obtained
for various suggested estimators of Y with respect to the usual unbiased estimator y
using the formula:
PRE(, y ) =
V ( y )
100,
MSE()
(d)
, Y
where = y , y Rd , y Pd , yds , Y Md , y D
ReMd , Y PeMd , y R P , y Pd R , ym(opt) .
R
123
Auxiliary
Population I
variates used
MSE
PRE
MSE
Population II
PRE
MSE
Population III
1.75106
100.00
0.175537
100.00
1.20505
100.00
y Rd
X1
1.50119
116.65
0.145116
120.96
0.769203
156.66
*
PRE
X2
1.55225
112.81
0.157876
111.19
y Pd
X2
1.15089
104.71
yds
X1
1.30727
133.95
0.145102
120.98
0.767102
157.09
X2
1.40919
124.26
0.148389
118.29
Y Md
y D R
Y ReMd
Y PeMd
(d)
y R P
y Pd R
ym(opt)
X1
1.30727
133.95
0.145102
120.98
0.767102
157.09
X2
1.40919
124.26
0.148389
118.29
1.15086
104.71
X1
3.7002
47.32
0.15373
114.19
0.769203
156.66
X2
3.20798
54.58
0.152449
115.14
X1
1.32002
132.66
0.152385
115.19
0.892283
135.05
X2
1.41984
123.33
0.165572
106.02
X2
1.16504
103.43
157.09
X1
1.30727
133.95
0.145102
120.98
0.767102
X2
1.40919
124.26
0.148389
118.29
1.15086
104.71
X1
1.30727
133.95
0.145102
120.98
0.767102
157.09
X2
1.40919
124.26
0.148389
118.29
1.15086
104.71
X1
1.30727
133.95
0.145102
120.98
0.767102
157.09
X2
1.40919
124.26
0.148389
118.29
1.15086
104.71
X 1 and X 2
1.27591
137.24
0.112954
155.41
0.705765
170.74
and y Pd R are same, and are equal to that of the AOE ym(opt) . Hence, these members
are better than the other members and also the usual unbiased estimator y as well.
(ii) In all the three populations, the proposed class ym(opt) , based on two auxiliary
variates, exhibits better performances over the usual unbiased estimator y and the
other estimators, which are based on single auxiliary variates.
123
G. K. Vishwakarma, M. Kumar
References
1. Cochran, W.G.: Sampling Techniques. Wiley, New York (1977)
2. Dash, P.R., Mishra, G.: An improved class of estimators in two-phase sampling using two auxiliary
variables. Commun. Stat. Theory Methods 40(24), 43474352 (2011)
3. Diana, G., Giordan, M., Perri, P.F.: An improved class of estimators for the population mean. Stat.
Methods Appl. 20, 123140 (2011)
4. Diana, G., Perri, P.F.: Estimation of finite population mean using multi-auxiliary information. Metron,
LXV 1, 99112 (2007)
5. Diana, G., Tommasi, C.: Optimal estimation for finite population mean in two-phase sampling. Stat.
Methods Appl. 12, 4148 (2003)
6. Kumar, M., Bahl, S.: Class of dual to ratio estimators for double sampling. Stat. Pap. 47, 319326
(2006)
7. Mukerjee, R., Rao, T.J., Vijayan, K.: Regression type estimators using multiple auxiliary information.
Aust. J. Stat. 29, 244254 (1987)
8. Olkin, I.: Multivariate ratio estimation for finite populations. Biometrika 45, 154165 (1958)
9. Raj, D.: On a method of using multi-auxiliary information in sample surveys. J. Am. Stat. Assoc. 60,
270277 (1965)
10. Sahoo, L.N., Swain, A.K.P.C.: Unbiased ratio-cum-product estimator. Sankhya 42, 5662 (1980)
11. Singh, B.K., Choudhury, S.: Double sampling product-cum-dual to ratio estimators for finite population
mean in sample surveys. Int. J. Sci. Eng. Res. 3(4), 17 (2012)
12. Singh, H.P., Ruiz Espejo, M.: Double sampling ratio-product estimator of a finite population mean in
sample surveys. J. Appl. Stat. 34(1), 7185 (2007)
13. Singh, H.P., Vishwakarma, G.K. An efficient variant of the product and ratio estimators in double
sampling. Model Assist. Stat. Appl. 1, 157167 (2005, 2006)
14. Singh, H.P., Vishwakarma, G.K.: Modified exponential ratio and product estimators for finite population
mean in double sampling. Aust. J. Stat. 36(3), 217225 (2007)
123
123