Professional Documents
Culture Documents
Renewable Energy
journal homepage: www.elsevier.com/locate/renene
a r t i c l e i n f o
a b s t r a c t
Article history:
Received 29 August 2015
Received in revised form
16 May 2016
Accepted 5 August 2016
Wind climate analysis and modelling is of most importance during site selection for offshore wind farm
development. In this regard, reliable long-term wind data are required. Buoy measurements are
considered as a reference source in relevant applications including evaluation and calibration of wind
data obtained from less reliable sources, combined assessment, blending and homogenization of multisource wind data, etc. Most of these applications are based on regression techniques elaborated by using
the principle of ordinary least squares (OLS). However, wind data usually contain several outliers, which
may question the validity of the regression analysis, if not properly considered. This study is focused on
the implementation of the most important robust regression methods, which can identify and reveal
outliers, and retain at the same time their efciency. Long-term reference wind data series obtained from
buoys at six locations in the Mediterranean Sea are used to calibrate hindcast (model) wind data by
applying robust methods and OLS. The obtained results are compared according to several statistical
measures. The effects of the calibration methods are also assessed with respect to the available wind
power potential. The results clearly suggest that least trimmed squares and L1-estimator perform in all
respects better than OLS.
2016 Elsevier Ltd. All rights reserved.
Keywords:
Offshore wind energy
Robust methods
Calibration
Outliers
Mediterranean sea
1. Introduction
1.1. General
The accurate modelling and description of the wind climate in
coastal or offshore areas is highly required either for research
purposes (e.g. modelling of atmospheric circulation patterns) or for
coastal and offshore applications, such as design and construction
of marine structures, coastal environmental management and
protection, site selection and feasibility analysis for offshore wind
farm development, estimation of the available offshore wind
resource, etc. Regarding offshore wind energy aspects, the most
essential requirement for the annual and seasonal climate modelling and the estimation of the inter-annual variability is the availability of long-term time series of wind characteristics. In this
connection, long-term wind data are often acquired by appropriately combining wind data from multiple sources; see e.g. Ref. [43].
Nowadays, long-term offshore wind data are available not only
through oceanographic buoy measurements, satellite sensors and
* Corresponding author.
E-mail address: tsouki@hcmr.gr (T.H. Soukissian).
http://dx.doi.org/10.1016/j.renene.2016.08.009
0960-1481/ 2016 Elsevier Ltd. All rights reserved.
1288
1.2. Outliers
Taking into consideration the position of an observation in a
scatter plot and the variable in which it corresponds, the conspicuous observations can be grouped in three classes; see also [37].
The outliers in the explanatory variables (xoutliers or leverage
points) can tilt the least-squares line due to the large effect on the
corresponding estimator. Leverage points do not necessarily have
to be outliers. The outliers in the response variable (youtliers) can
be considered as such, because they have large standardized residuals and rather large inuence on the least-squares line, since
they increase the magnitude of residuals. Finally, an outlier both in
the response and explanatory variables may be either a point with a
large standardized residual or a point that deviates from the linear
relationship set by the majority of the data or both. In this case, the
relationship between the two variables must be taken into account
for the detection of the outlier in question. Another signicant class
of atypical observations are the so-called inuential points, which
individually or jointly excessively inuence the regression equation
and are characterized by very high or very low values of x, see
Ref. [6].
Since outliers may seriously affect regression analysis outputs
and estimation of the relevant parameters, numerous procedures
have been developed for the detection and investigation of a single
or few outliers in linear regression. A well-known and straightforward statistical approach to deal with outliers is through diagnostic
methods, i.e. statistics generally based on classical estimates aiming
to the detection of outliers from the assumed model. This family of
techniques is often implemented by the same procedure: rst
delete each observation one at a time and in turn, examine if there
is any impact on the various calculated values. Furthermore, scatter
plots may sometimes contain as much information as diagnostics
about residuals and hat elements.2 However, these techniques
perform poorly in the presence of multiple outliers because of
masking and swamping effects.3
In contrast with the role of diagnostic measures, there is another
approach that takes into account deviations from idealized assumptions and attempts to temper the inuence of outliers and
inuential observations on the estimator. This approach is known
as robust regression and aims to produce estimators that are
1
A more general case that can be found in any wind data source is the one of
exceptional phenomena, where there are legitimately outlying data points. In this
situation, it is constructive to determine effectively the extreme values from a set of
outliers through an extreme value analysis, forming in this way the cornerstone of
all outlier detection algorithms as a nal step; see Ref. [1].
2
Hat elements are the diagonal elements of the least-squares projection matrix
or hat matrix.
3
Masking is the inefciency of the procedure to identify a set of outliers because
of the presence of another set, usually neighboring, while swamping is the mistake
of clean observations as outliers because of the presence of another group of
observations, usually distant.
insensitive
in
the
presence
of
these
points;
see
Refs. [11,12,15,16,18,34,37,44,46,47]. Compared with diagnostic
methods, Huber [18] mentions that robust methods are more
reliable and perform better, since they provide a middle ground
between rejecting and accepting a suspicious observation without
the analyst's subjective decision (to keep or remove a suspicious
outlier from the data sample).
Particularly, in wind energy applications, robust estimation
methods have been implemented in wind data ltering for the
optimization (operation and maintenance) of a wind farm with
promising results; see Refs. [26,38]. Moreover, a robust approach
was applied in order to estimate the two parameters of a Weibull
distribution on wind speed data with and without outliers; see
Ref. [21]. This analysis showed that the robust estimator performs
better compared to maximum likelihood estimator in both cases. In
Soukissian et al. [41], robust methods were applied and compared
with the traditional OLS approach through a rational evaluation
methodology for four different scenarios in order to examine the
effects of outliers in wind speed assessment.
In this work, robust regression methods are applied in detail for
the assessment of concurrent wind speed data obtained from buoys
and a high resolution NWP model at four offshore locations of the
Aegean Sea (eastern Mediterranean Sea). In order to assess the
proposed methods in diverse wind climates, the same methods
have been also applied at two locations of the Spanish waters
(western Mediterranean Sea). Specically, the main objective is to
evaluate, validate and correct wind data from a less accurate source,
namely NWP model data, by taking as reference source measured
data from oceanographic buoys located in the eastern and western
Mediterranean Sea. Apart from the classical OLS technique, several
robust regression methods are examined in order to evaluate the
performance of each method. As an additional strict test, the effects
that the different calibration relations have on the estimation of the
mean offshore wind power density compared to the values provided directly by the buoy measurements are also assessed.
This work is structured as follows: In the next section, the
theoretical background governing the most known estimators of
robust regression techniques is presented in detail. In Section 3, an
overview of the wind data sources used herewith is made along
with a primary statistical analysis of the concurrent wind data sets.
In Section 4, the evaluation methodology is described analytically,
and in Section 5, the correction relations of the different methods
for the six examined locations in the Mediterranean Sea are
compared in terms of some statistical measures. Furthermore, the
corresponding results are analytically discussed for each step of the
evaluation methodology. Finally, the concluding remarks are presented in Section 6.
2. Robust regression
2.1. Robust vs linear regression
Simple linear regression is based on the principle of the OLS
method, i.e. the minimization of the sum of squared residuals. The
OLS estimators have some important properties: they are linear,
unbiased, efcient (they have the minimum variance) and consistent. For a detailed review of OLS and simple linear regression
methods see Refs. [30,31]. However, all these properties hold inter
alia under the assumptions that the random error terms are statistically independent, normally distributed with zero mean. In
practice, these conditions are rarely satised or even examined and
normality is usually considered just as a convenient approximation.
Despite the elegant properties of the OLS method, it should be bear
in mind that it is clearly not robust to violations of its assumptions
and especially, deviations from the normality assumption.
1289
1) There is a rather naive trust that large sample sizes make robust
techniques unnecessary.
2) A certainty that the outliers can be either detected by visual
inspection or by identifying unusual OLS residuals.
3) There is lack of expertise as regards the interpretation of results
from a robust analysis and lack of knowledge of the gains
available from such analysis.
2.2.4. Equivariance
Attaining the equivariance properties assures that the results of
the regression analysis will not alter in case of particular transformations of the data. There are two main types of equivariance4
for regression estimators. According to Rousseeuw and Leroy [34],
the specic properties are addressed below:
1. An estimator T is called regression equivariant, if
i 1; 2; ; n;
i 1; 2; ; n;
for any constant c. The above condition practically suggests that the
measurement units of the dependent variable y (with respect to the
measurement units of x) is irrelevant for the regression analysis.
2.3. Robust estimators
2.3.1. L-estimators
Any estimator that is a linear combination of the order statistics
P
Ln ni1 cni Xn:i , where cni are real constants and Xn:1 Xn:n
denote the ordered values of the sample observations
X1 ; ; Xn ; i 1; 2; ; n, is called an L-estimator. In this regard, an
alternative approach for the estimation of the regression parameters is to minimize the sum of the absolute values of the residuals,
i.e.
min
b
b 0 ;b
b1
n
X
i ;
b
i 1; 2; ; n;
(1)
i1
4
There is an additional type of equivariance, called afne equivariance; see Ref
[34]; p. 116.
1290
min
b
b 0 ;b
b1
h
X
b
2
i1
i:n
i 1; 2; ; n;
(2)
n
.
X
j b i s xi 0;
2.3.2. M-estimators
In Huber [17], the use of another approach of robust regression
was introduced, the M-estimators, which are a trade-off between
the efciency of OLS and the resistance of L1-estimators. This class
of estimators can also be regarded as a generalization of maximum
likelihood estimators. An M-estimator is dened by minimizing the
following function of the residuals:
n
X
r b i ;
i 1; 2; ; n;
(3)
i1
.
b
2i 2;
jb
ij k
.
2
kjb
i j k 2; jb
i j > k;
(4)
i 1; 2; ; n;
(5)
i1
(6)
where b
1:n ::: b
n:n are the ordered squared residuals and h
is the number of the remaining observations after the trimming. In
general, h may depend on some trimming proportion a, for instance
h n1 a 1; see Ref. [33]. When h is approximately n/2, then
LTS achieves the best robustness properties. Moreover, LTS satises
the properties of regression and scale equivariance. Although LTS
has a relatively low asymptotic efciency, as stated by Croux et al.
[7], it still plays a role in the estimation of the parameters of other
more robust methods.
Other known L-estimators are the least median of squares (LMS)
that minimizes the median of the squared residuals. Some authors
claim that teregression quantile, LMS and LTS are also called
resistant regression methods. In fact, resistant methods use estimates that are not inuenced by unusual points, such as the median, instead of diminishing their inuence.
i 1; 2; ; n;
i1
s
2
rb i
estimators, [44]. Because they are not scale equivariant, the residuals should be standardized by means of some estimate of scale
s, i.e.:
MAD
;
0:6745
(7)
where
the
MAD
(of
the
residuals)
is
MAD medianfjb
i medianfb
i gjg; i 1; 2; ; n. Since this estimator is based on the median, it is highly resistant to outlying
observations, with BDP 50%. Generally, M-estimators are statistically better than OLS with regard to resistance and robustness to
youtliers. In some cases, their performance is poor compared to
the latter, since they do not consider leverage points.
In order to facilitate the procedure, j is replaced by appropriate
weights that decrease as the size of the residual increases. To this
end, the weight function is wi wb
i jb
i =b
i , i1,2,,n, and
Equation (6) may then be written as:
n
X
.
i s xi 0;
wi b
i 1; 2; ; n:
(8)
i1
2.3.3. MM-estimators
Before proceeding to the MM-estimators, the S-estimators are
described in brief, since they provide an initial estimate in more
complex robust regression methods.
The S-estimators were introduced by Rousseeuw and Yohai [36]
and were developed to improve the efciency of both LMS and LTS,
based on the estimates of scale. Specically, their objective functions are replaced by a more efcient scale estimator that is applied
to the residuals b
i in order to minimize their dispersion, and their
mathematical expression is
minsb
1 b; :::; b
n b:
(9)
The dispersion sb
1 b; :::; b
n b should satisfy the following
constraint:
n
b
1X
r i K;
n i1
s
i 1; 2; ; n;
(10)
(11)
i1
which satises
n
n
X
X
r b i b=sn
r b i b* sn ;
i1
i 1; 2; ; n:
(12)
i1
1291
sensors for the measurement of temperature, atmospheric pressure, wind speed, direction and gust. The wind measurements are
performed at 3 m height above sea surface with recording period
600 s and frequency 1 Hz, and the measurements are performed
every 3 h. In this work, the buoy wind data consist of long-term
wind speed time series of the mean values of wind speed
measured during the 600 s and are obtained from four buoys
located in the Aegean Sea.
In the Spanish Seas, twelve oceanographic buoys located in deep
water depths provide data from measured parameters similar to
the Greek buoys. The monitoring system operates under the responsibility of the Spanish Port Authority (Puertos del Estado). The
measurements are made at 3 m height above sea surface with a
recording interval of 1 h. The corresponding wind speed time series
cover time periods varying between 5 and 18 years. The buoy wind
data used in this work consist of long-term wind speed time series
obtained from two buoys located in the Mediterranean part of the
Spanish waters.
3.2. NWP model data
The physical and dynamical properties of the atmosphere can be
described by numerical models that are implemented at global or
regional scale. One of the most relevant reanalysis procedures,
producing long time series of consistent analyses, is from the European Centre for Medium-Range Weather Forecasts (ECMWF).
ECMWF reanalysis data sets include ERA-40 (1957e2002) and, the
more recent, ERA-Interim global atmospheric reanalysis, producing
wind data since 1979 on a primary spatial resolution of
0.75 0.75 . Since the resolution of global atmospheric models is,
in general, too scant for wind speed elds to be used directly in the
offshore wind resource assessment, they should be downscaled
using either regional models or statistical approaches. In this study,
the results of a version of the Eta-SKIRON mesoscale meteorological
model have been analysed. Specically, Eta-SKIRON model is used
for the dynamical downscaling approach in the Mediterranean Sea,
with a horizontal grid increment of 0.1 0.1 (about
10 km 10 km) and initial and boundary conditions obtained from
the ECMWF analysis elds. The wind data sets from the atmospheric model refer to 10 m above sea surface. A detailed description regarding the dynamical downscaling procedure of the EtaSKIRON model can be found in Ref. [29].
Let us note that data from NWP models contain inherent uncertainties derived from: i) model errors such as parameterization
errors, discretization error of the grid, inadequate physical processes of the model, and ii) boundary and initial errors (e.g. inaccurate wind input), which can be sources of outliers.
3.3. Statistical analysis of wind data
Wind measurements from four buoys deployed in the Aegean
Sea and two buoys in the Spanish waters are used as reference data
source. Their geographic coordinates along with the examined
measurement periods are presented in Table 1.
Table 1
Names, geographical coordinates and concurrent measurement time periods for the
examined buoys.
Buoy name
Location
Athos
Lesvos
Mykonos
Santorini
Cabo Begur
Cabo de Gata
39 580 N
39 100 N
37 310 N
36 160 N
41 550 N
36 340 N
Recording period
24 430 E
25 490 E
25 280 E
25 300 E
3 390 E
2 190 E
5/2000e12/2004
1/2000e12/2004
1/2000e12/2004
1/2000e12/2004
3/2001e12/2004
1/2000e10/2004
1292
P4
ui
i1 r 2
i
u P4
1
i1 r 2
i
i 1; 2; 3; 4;
(13)
where u1, u2, u3 and u4 denote the wind speeds at the four grid
points surrounding the buoy location, r1, r2, r3 and r4 denote the
corresponding distances from that location, and u is the sought-for
(model) wind speed. Regarding the temporal collocation, the
common time frame is 3 h (00:00, 03:00, 06:00, etc. UTC).
In Table 2, the results of a primary statistical analysis regarding
the concurrent wind data sets (buoy measurements and NWP
model results) are presented for the examined locations. The statistical parameters depicted in this table are the following: sample
size N, mean value m, standard deviation s, minimum value min,
maximum value max, coefcient of variation CV and coefcient of
determination r2 between buoy measurements and NWP model
results. It should be noted that, despite the collocation procedure,
the sample size remains adequate for performing a statistically
reliable analysis.
From the results of Table 2 the following remarks can be made:
1) Regarding the Greek Seas, the highest values of mean wind
speed and standard deviation (for both data sources) correspond to Mykonos (8.135 ms-1 and 4.176 ms-1, respectively) and
the maximum wind speed corresponds to Lesvos (34.419 ms-1).
The highest value of coefcient of variation is observed for Athos
(74.347 for buoy measurements and 64.309 for the NWP model)
and the smallest one for Mykonos (51.328 for buoy measurements and 48.462 for the NWP model). The highest coefcient
of determination corresponds to Mykonos (0.707) and the
smallest one to Santorini (0.577).
2) For the Spanish Seas, the highest values of the mean wind speed
and standard deviation correspond to Cabo Begur (8.216 ms-1
and 5.847 ms-1, respectively), which are also the highest for all
the examined locations and wind data sources in this work.
3) For all examined locations, the mean value, standard deviation,
maximum value and coefcient of variation of wind speed are
systematically larger for buoy wind data except for the latter
(14)
where
.
.
0
0
b
b 0 b
b 1 and b
b0 b
b11 b
b1
(15)
Table 2
Basic statistics of wind speed for all the examined sites in the Mediterranean Sea for the concurrent recording periods.
Location
Data source
m (ms1)
s (ms1)
min (ms1)
max (ms1)
CV (%)
r2
Athos
Buoy
Model
Buoy
Model
Buoy
Model
Buoy
Model
Buoy
Model
Buoy
Model
9044
5.379
4.908
7.136
5.749
8.135
6.622
6.779
4.754
8.216
6.704
6.204
5.403
3.999
3.156
4.072
3.053
4.176
3.209
3.650
2.410
5.847
3.969
3.832
3.377
0.118
0.035
0.117
0.098
0.117
0.164
0.118
0.074
0.232
0.175
0.232
0.100
25.472
20.759
34.419
22.922
21.480
17.967
21.342
18.820
27.718
22.501
20.992
18.377
74.347
64.309
57.068
53.111
51.328
48.462
53.841
50.696
71.171
59.198
61.760
62.502
0.670
Lesvos
Mykonos
Santorini
Cabo Begur
Cabo de Gata
11453
8606
11220
2257
8270
0.588
0.707
0.577
0.738
0.704
1293
PN
i1
BIAS
b M;i uB;i
u
(16)
RMSE
v
u
2
uPN
u
b
u
u
B;i
M;i
t i1
N
(17)
MAE
PN
b
u
u
M;i
B;i
i1
N
(18)
5
The analysis in case C.2 resembles the concept of measureecorrelateepredict
(MCP) methods in wind speed forecasting; see Refs. [5,25,32]. In MCP methods, the
available short-term wind data series in a candidate area are extrapolated and
extended in time from the closest meteorological stations that are used as reference
stations. The usual time period for the available reference data is one year.
1294
RMSE
SI
;
uB
(19)
Fig. 2. Scatter plot (with colour-scale indicating density) and regression lines of wind speeds obtained for case C.1 for all the applied methods at: (a) Athos, (b) Lesvos, (c) Mykonos,
(d) Santorini, (e) Cabo Begur and (f) Cabo de Gata. (For interpretation of the references to colour in this gure legend, the reader is referred to the web version of this article.)
Table 3
Estimated calibration parameters obtained from each applied method for cases C.1
and C.2 for all examined locations.
Location
Athos
Lesvos
Mykonos
Santorini
Cabo Begur
Cabo de Gata
Method
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
0
b
b0
0
b
b1
C.1
C.2
C.1
C.2
2.220
1.980
2.035
1.857
1.913
2.865
2.059
2.241
1.806
1.849
2.113
1.867
1.900
1.750
1.888
2.702
2.610
2.644
2.660
2.555
3.285
3.273
3.279
3.300
3.231
1.104
0.837
0.883
0.700
0.676
2.160
1.871
1.949
1.692
1.840
2.087
1.771
1.804
1.634
1.551
2.753
2.497
2.539
2.349
2.473
2.667
2.593
2.611
2.572
2.430
3.338
3.217
3.262
3.177
3.289
0.971
0.738
0.774
0.646
0.579
1.548
1.499
1.512
1.471
1.491
1.740
1.622
1.649
1.587
1.589
1.548
1.511
1.517
1.494
1.516
1.994
1.993
1.997
2.015
1.991
1.715
1.703
1.705
1.697
1.701
1.353
1.304
1.313
1.280
1.278
1.631
1.556
1.577
1.508
1.550
1.657
1.604
1.610
1.581
1.559
1.666
1.627
1.635
1.608
1.631
2.037
2.032
2.035
2.035
1.997
1.776
1.731
1.743
1.712
1.743
1.307
1.265
1.272
1.248
1.242
Table 4
Statistics of calibration equations based on ordinary least squares (OLS), MMestimate (MM), M-estimate by Huber (M-H), least trimmed squares (LTS) and L1estimate for all the examined locations for case C.1.
Location
Method
BIAS
RMSE
MAE
SI
Athos
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
0.297
0.231
0.256
0.191
0.243
0.347
0.334
0.331
0.338
0.315
0.078
0.067
0.073
0.062
0.083
0.190
0.260
0.247
0.321
0.298
0.292
0.214
0.225
0.150
0.246
0.135
0.089
0.100
0.071
0.082
2.744
2.668
2.687
2.630
2.660
3.317
3.152
3.188
3.107
3.108
2.664
2.607
2.615
2.583
2.615
3.046
3.056
3.058
3.091
3.057
3.389
3.362
3.367
3.350
3.362
2.455
2.381
2.394
2.347
2.345
2.081
2.022
2.037
1.992
2.016
2.501
2.361
2.393
2.323
2.325
2.064
2.019
2.025
2.000
2.025
2.300
2.305
2.307
2.329
2.305
2.576
2.554
2.558
2.545
2.555
1.872
1.811
1.822
1.783
1.781
0.480
0.472
0.474
0.467
0.472
0.441
0.427
0.431
0.424
0.423
0.322
0.316
0.317
0.313
0.317
0.437
0.440
0.440
0.445
0.441
0.394
0.390
0.391
0.389
0.390
0.383
0.376
0.378
0.373
0.373
Lesvos
Mykonos
MAE and RMSE (along with SI), which are statistic measures
quantifying the absolute and squared difference between corrected and measured wind speeds, respectively, were systematically lower for LTS and L1 methods for all examined locations
except for Santorini.
From the overall combination of 6 locations with 4 statistical
criteria (i.e. in total 24 outcomes), LTS performed better for 15
out of the 24 outcomes and L1-estimator for 5 outcomes. OLS
performed better 4 times out of 24.
1295
Santorini
Cabo Begur
Cabo de Gata
1296
Table 5
Statistics of calibration equations based on ordinary least squares (OLS), MMestimate (MM), M-estimate by Huber (M-H), least trimmed squares (LTS) and L1estimate for all the examined locations for case C.2
Location
Method
BIAS
RMSE
MAE
SI
Athos
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
0.975
0.818
0.868
0.719
0.814
0.587
0.550
0.559
0.530
0.475
0.347
0.325
0.338
0.325
0.370
0.510
0.556
0.552
0.588
0.533
0.812
0.620
0.661
0.529
0.636
0.061
0.094
0.086
0.099
0.083
3.089
2.918
2.966
2.816
2.909
3.305
3.222
3.234
3.187
3.146
2.865
2.793
2.807
2.758
2.806
3.158
3.162
3.166
3.172
3.107
3.542
3.414
3.445
3.363
3.441
2.358
2.304
2.311
2.283
2.279
2.358
2.229
2.266
2.151
2.223
2.451
2.382
2.392
2.353
2.320
2.226
2.169
2.180
2.141
2.178
2.366
2.368
2.370
2.374
2.329
2.715
2.617
2.640
2.577
2.637
1.786
1.743
1.749
1.726
1.723
0.551
0.526
0.533
0.510
0.525
0.449
0.441
0.442
0.437
0.433
0.350
0.343
0.344
0.339
0.345
0.461
0.462
0.462
0.463
0.455
0.441
0.425
0.429
0.419
0.428
0.379
0.374
0.374
0.372
0.373
Lesvos
Mykonos
Santorini
Cabo Begur
Cabo de Gata
Table 6
Relative errors (%) of mean wind power density based on ordinary least squares
(OLS), MM-estimate (MM), M-estimate by Huber (M-H), least trimmed squares (LTS)
and L1-estimate for all the examined locations for cases C.1 and C.2
Location
Method
Case C.1
Case C.2
Athos
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
OLS
MM
M-H
LTS
L1
40.18
32.21
34.57
27.76
31.84
38.37
29.57
31.53
27.29
26.71
21.87
18.80
19.36
17.50
19.63
45.71
48.41
48.18
52.60
49.67
28.54
25.43
25.95
23.45
25.99
25.84
19.76
21.03
16.91
17.12
82.01
64.61
69.74
54.06
63.78
40.74
35.20
36.09
32.86
29.25
45.39
41.51
42.51
39.98
43.38
65.40
66.86
67.01
68.41
62.58
57.84
47.13
49.64
42.41
48.90
13.21
8.39
9.23
6.66
6.91
Lesvos
N
1 X
ru3 ;
2N i1 i
Mykonos
Santorini
(20)
where N is the sample size and ui, i1,2,,N, is the observed wind
speed time series.
P can be directly estimated by utilizing the wind speed time
series uB,i, i1,2,,N, obtained from buoys. This estimate is denoted
by P B and is considered as the reference value. Moreover, P can be
also estimated by using the calibrated results of the NWP model, i.e.
b M;i , i1,2,,N; this estimate is denoted
the wind speed time series u
Cabo Begur
Cabo de Gata
1297
1298
[42]
[43]
[44]
[45]
[46]
[47]
[48]
[49]
[50]
[51]
and Polar Engineering Conference Busan, Korea, June 15e20, 2014. ISBN 9781 880653 91-3 (Set); ISSN1098-6189 (Set).
T. Soukissian, C. Kechris, About applying linear structural method on ocean
data: adjustment of satellite wave data, Ocean. Eng. 34 (3e4) (2007) 371e389,
http://dx.doi.org/10.1016/j.oceaneng.2006.04.002.
T.. Soukissian, A. Papadopoulos, Effects of different wind data sources in
offshore wind power assessment, Renew. Energy 77 (2015) 101e114, http://
dx.doi.org/10.1016/j.renene.2014.12.009.
R.G. Staudte, S.J. Sheather, Robust Estimation and Testing, Wiley, New York,
1990.
P.K. Taylor, E. Dunlap, F.W. Dobson, R.J. Anderson, V.R. Swail, On the accuracy
of wind and wave measurements from buoys, in: Data Buoy Cooperation
Panel (DBCP) Technical Documents No. 21, 2002.
J.W. Tukey, A Survey of Sampling from Contaminated Distributions, Contributions to Probability and Statistics, Stanford University Press, Stanford (CA),
1960.
J.W. Tukey, The future of data analysis, Ann. Math. Stat. 33 (1) (1962) 1e67,
http://dx.doi.org/10.1214/aoms/1177704711.
W.N. Venables, B.D. Ripley, Modern Applied Statistics with S, Springer, New
York, 2002.
R. Wilcox, Introduction to Robust Estimation and Hypothesis Testing, Elsevier,
2012.
V.J. Yohai, High breakdown-point and high efciency robust estimates for
regression, Ann. Stat. 15 (2) (1987) 642e656, http://dx.doi.org/10.1214/aos/
1176350366.
A. Zaman, P.J. Rousseeuw, M. Orhan, Econometric applications of highbreakdown robust regression techniques, Econ. Lett. 71 (1) (2001) 1e8,
http://dx.doi.org/10.1016/S0165-1765(00)00404-3.