You are on page 1of 10

2013 46th Hawaii International Conference on System Sciences

Estimation of Transmission Line Parameters from Historical Data

Katherine R. Davis Sudipta Dutta Thomas J. Overbye Jim Gronquist


PowerWorld Corporation University of Illinois at University of Illinois at Bonneville Power
kate@powerworld.com Urbana-Champaign Urbana-Champaign Administration
dutta4@illinois.edu overbye@illinois.edu jfgronquist@bpa.gov

Abstract efforts, yet other existing sources tend to be


Power system analysis results are based on a overlooked. One resource which has not been
model of the system. The model is often assumed utilized to its full potential is the enormous volume of
known, yet it may contain errors. Model errors can historical SCADA data which is readily available to
impact the accuracy of the results. This paper many utilities.
utilizes historical data over time to extract the This work addresses the research question of
transmission line model parameters. The focus is on whether the sheer volume of available SCADA data
application of line parameter estimation techniques facilitates acceptable parameter estimates in the
for real data from a North American utility. The presence of data errors. The idea exploited is to use
approach in this paper focuses on the challenges of data over time, and the approach in this paper is
dealing with uncertainty and errors encountered in advantageous over methods which consider a single
the real-world data. Future applications are also snapshot of the system. The parameters of a pi-
discussed. model of a medium length transmission line are
determined from multiple measurements of complex
power and voltage magnitude at both ends of the
lines.
1. Introduction The organization of the paper is as follows. A
brief background of the estimation problem is
Model development and validation is an area of provided in Section 2. The transmission line model
growing importance for power systems. The reason used in the work and an overview of the parameter
for placing high emphasis on model accuracy is that estimation method is presented in Section 3. The
in any discipline, results are only as good as the proposed methodology is described in Section 4.
models upon which the analysis is based. Thus, Results obtained by applying the proposed
identifying model errors is an important problem. methodology on a simulated test case are presented in
This work incorporates large volumes of Section 5 and results obtained on real SCADA data
historical data collected over time to estimate the from a North American utility are presented in
values of transmission line model parameters. Real Section 6. Section 7 presents discussions on the
data often contain significant errors which do not results. Section 8 discusses future applications for
have known probability distributions. Meters used transformer model parameter estimation. Finally,
for measuring data are a common source error. A conclusions are made in Section 9.
key aspect of this work involves being able to
perform analysis such that any errors in the available
data do not corrupt the estimate. 2. Estimation Background
With respect to data collection, phasor
measurement units (PMUs) [1] have received high Many applications rely on correct underlying
visibility. Increasing PMU deployment leads to an model information. In power systems, the area where
increase in the amount of data available and a desire estimation tends to receive the most attention has
to effectively manage, send, and process the data. It been in power system state estimation [2], [3].
remains a challenge to intelligently analyze the data Parameter identification has often been studied in the
and obtain useful information to advance the state-of- context of state estimation literature as state
the-art for power system operations and control. estimators also serve as data filters. Uncertainty in
Data from PMUs is often the focus of data mining data with respect to state estimation is analyzed in
[4].

1530-1605/12 $26.00 2012 IEEE 2149


2151
DOI 10.1109/HICSS.2013.206
Transmission system parameter inaccuracies can Q2, and V2. This is the raw SCADA data, recorded
have adverse effects on state estimation. Thus, every five minutes. Note that what is physically
existing work is aimed at identifying and correcting measured by potential transformers (PTs) and current
transmission model parameter errors using telemetry transformers (CTs) is only voltage and current,
data [3],[5],[6],[7],[8]. While a few methods provide respectively; however, these true raw measurements
a means for updating the estimates online, most may be unavailable. The goal is to use available
existing approaches consider only a single snapshot SCADA measurements to estimate values of r, x, and
in time. Also, typically, results for small simulated b in Figure 1. The assumption is that the angle
systems are presented as opposed to real systems. difference across the line and the line parameters are
This paper circumvents both previous limitations. unknown. If it is possible to know the angle
Historical SCADA data, recorded every five minutes difference across the line (as with PMUs), the
over several months, is available and is presented in problem is considerably simplified.
Section 6 of this paper. A close analogy to the key
concept of this work is introduced in [9], where PMU 3.2. Basic SCADA Data Properties
data collected over a number of different operating
points can be applied to construct a reduced
The pi model equations are now considering the
equivalent model at the PMU buses. The statistical
assumed known or measured quantities. Several
properties of the data are important to the quality of
quantities can always be derived directly from the
our estimate. Background on the statistical analysis
SCADA data. Since P1, Q1, P2, and Q2 are
of data [10] provides insight into the quality of the
measurements, the complex powers S1, S2 at the line
obtained estimate.
ends are directly known. Thus, the apparent powers
|S1|, |S2| are also known. From the apparent power
3. Transmission Line Model and and voltage magnitudes, the current magnitudes, I1,
Available Data I2, can be directly computed.

In this section, the transmission line model is I i  Vi / Si (1)


described and analyzed in the context of the
measured data which is used in this work. This From the current magnitudes, power factor angles are
provides a fundamental expectation of what can (and also known at both line ends, 1,pf, 2,pf. Notably
cannot) be estimated about the model based on the lacking in these quantities, which can be exactly
available data. computed from the SCADA data, are the line
parameters and the angle difference across the line.
3.1. Transmission Line Model
3.3. Pi Line Model
The line model considered is the pi model for
transmission lines, shown in Figure 1. This is a In the pi model, let b/2 denote the value of the
standard model which is often used to model lines of shunt modeled at each bus. The admittance matrix of
medium length [11], of approximately 50 to 150 the transmission line model is Y=G+jB:
miles.
 1 1 
I1 I2  r  jx  jb / 2 
r  jx 
r+jx Y  (2)
P1 + jQ1 P2 + jQ2  1 1 
  r  jx r  jx
 jb / 2 
V1 V2

jb/2 jb/2
The real and reactive components of Y gives matrices
G and B with elements Gij and Bij. The equations for
real and reactive power flow at both ends of the line
are given by the following:
Figure 1. Pi Line Model

Let V1, V2, I1, and I2 denote the magnitudes of the


P1  V12G11  VV G12 cos 12  B12 sin 12 
(3)
1 2 

phasor quantities. The assumption is that the P2  V22G22  V2V1  G21 cos 21  B21 sin 21 
(4)
measurements for this application are P1, Q1, V1, P2,

2152
2150
G12 sin 12  B12 cos 12 
(5)
Q1  V12 B11  VV SCADA estimation problem, some approximation is
1 2 
necessary. The choice of the approximation is
Q2  V22 B22  V2V1  G21 sin 21  B21 cos 21 
(6) therefore important.

From (2)-(6), the equations for the real and reactive 3.4. Estimation Equations
power losses in the line are thus given by P1+P2 and
Q1+Q2 respectively: The equations used to perform the SCADA
estimation are presented in this section. The
Plosses  V12  V22 G11  2VV G12 cos 12 
(7)
1 2 
following two equations for real power losses and
reactive power losses contain the three unknowns (r,
Qlosses  V12  V22 B11  2VV B12 cos 12 
(8)
1 2  x, and b) which are time independent.

Thus, four out of the six equations in (3)-(8) are Plosses  I 2 r (10)
independent. The loss equations are obviously not Qlosses  V 1
2
b / 2  V2 b / 2  I
2 2
x (11)
independent of the first four since they are each the
sum of two other equations. In equations (3)-(6), the
The need for approximation arises since the current
unknowns are G12, B12, b/2, and 12 = (1-2). Thus,
magnitude I in (10) and (11) is not known exactly
there are four equations and four unknowns.
from the measurable quantities. Due to current
This looks similar to a traditional state estimation
injections from the shunt elements, I differs from I1
problem except with different unknown variables.
and I2 which are known from (1). As an
The difficulty is that the unknowns appear in the
approximation, the average of I1 and I2 is taken for I.
equations in a nonlinear manner. Thus, it follows
This avoids the problem of unknowns which depend
that use of a linear technique to estimate these
on time.
quantities will not yield complete success. To see
Other variations of the estimation equations are
this, observe that what appears in the equations is
possible depending upon which approximations are
always a product term of Gij and Bij multiplied with
desirable to make. Future work may investigate the
either the term sin(12) or cos(12). Solution requires
impact of the model approximation on the parameter
a nonlinear iterative approach such as Newtons
estimation. For example, a bias term u can be added
method to find b, x, and r to minimize the difference
to the estimation equation for r,
between the measurements and the calculated
quantities. Essentially, this is state estimation. The
key difficulty is that from one snapshot, there are Plosses  u  I 2 r (12)
more unknowns than equations so traditional state  V  V2 
2 2

estimation is not sufficient. The goal of this work is Qlosses    1


b  I x
2
(13)
 2 
to improve the ability to directly estimate b, x, and r
using data collected over multiple points in time.
The unknowns Gij, Bij, and b/2 are not dependent which represents a Ploss static error in the
on time. This is true at least over some window of measurements. Based on (12) and (13), a system of
time, whereas the angle difference 12 is a function of equations is obtained by appending the contributions
the operating point and changes over time. Thus, the of repeated measurements to the matrix A. This may
terms in (3)-(6) which depend on time are the be expressed as (14) or (15):
following:
1 I 2 0 0  u   P 
G12 cos 12 , B12 sin 12    r   losses 
G12 sin 12 , B12 cos 12
(9) 0 0 I 2 1 2   x    Qgen  (14)
V 2
 V 2
/ 2
 
 
b   

The proposed approach of considering multiple Aey  fp,q (15)


snapshots in time to estimate the transmission line
parameters is only successful if there are fewer
unknowns which depend on time than the number of As additional rows are added corresponding to
linearly independent equations. In this case, there are additional time points, this becomes an
four linearly independent equations. Each additional overdetermined problem. Least squares estimation
time point will produce four more equations but will minimizes the residual,
also produce four more unknowns. Thus, in the

2153
2151
r  Aey  fp,q (16) DataSet = DataSet (keepIndex)
and results in the following estimate (14), Algorithm 1. Procedure for Dataset Selection

In summary, Algorithm 1 returns a random


ey  [AT A]1 ATfp,q (17) consecutive data chunk of size chunkSize which has
undergone initial screening and value filtering.
where ey = [u, r, x, b] denotes a vector of the
estimated quantities. The vector fp,q contains real 4.2. Estimation and Outlier Rejection
power losses Plosses and the negative of reactive
power losses Qgen. It is possible to estimate Plosses
Once an initial data set is selected, regression is
and Qgen independently since the equations are
applied, as outlined in Algorithm 2. An initial fit to
completely decoupled, as evident in (14).
the dataset is obtained from least squares. However,
in this initial fit, all of the data are used
4. Proposed Estimation Approach indiscriminately. Thus, outliers can have a
significant impact on the estimate. By removing
An overview of the proposed methodology is now outliers or problematic data points, the estimate
presented. Initially, a data selection module returns improves. At each iteration, the residuals or
the dataset and also accomplishes data screening deviations are computed using (16) with the current
functions such as value filtering and initial bad data values of the estimated parameters. High residuals
rejection. Then, for a given set of data, regression is indicate a poor match of the particular data point to
applied to detect and reject possibly wrong data the model. The strategy is to identify data points
based on analysis of a fitted model. with residuals which exceed a predetermined
threshold and reject them. The process continues and
iteratively fits the data to the refined data set until all
4.1. Dataset Selection data points have residuals within the specified
threshold. The procedure stops when the number of
The procedure outlined in Algorithm 1 is responsible rows removed from the dataset during any iteration,
for selecting and returning a set of data on which the nR, is equal to zero.
analysis is to be performed. For a specified line, the
sample dataset consists of measurements of P, Q, and Function PerformRegression
Input: DataSet
V at both line ends. This function also provides Output: ey = [u, r, x, b], DataSet
initial screening and filtering of the raw SCADA ey = LeastSquaresEstimate(DataSet)
data. Currently, all data is screened simply by r = Aey-b
removing data points which have all values of zero. While nR > 0 then begin
nR = 0
Additional screening may also be performed. A data [DataSet, nR] = OutlierFilter (DataSet, r, nR)
point or a row refers to one set of SCADA ey = LeastSquaresEstimate(DataSet)
measurements from one point in time. Value filters, End
when applied, allow Algorithm 1 to return a dataset Algorithm 2. Regression
for particular conditions, such as a<x<b, where x is
any of the measured or computed variables. The regression algorithm thus progressively
identifies and removes bad data and computes a new
Function DataSelection estimate from the refined data set. However, any
Input: RawData, whichLine, filter, chunkSize, errors which exist in the initial dataset can still have
Output: DataSet consisting of P, Q, V at line ends
keepIndex = (rand) to (rand+chunkSize) an impact on the quality of the initial fit.
DataSet = RawData(keepIndex) Outlier filtering is performed as indicated in
Foreach row of DataSet do begin Algorithm 3, which is applied as a component of
If row = 0 then begin Algorithm 2. While Algorithm 3 is the currently
Remove row
End implemented, other outlier filtering algorithms can
End easily be substituted in its place.
clear keepIndex
Foreach row of DataSet do begin Function OutlierFilter
If row.MeetsFilter then begin Input: DataSet, r, nR
keepIndex.add(row.Index); Output: DataSet, nR
End Foreach row of DataSet do begin
End [F, xi] = ksdensity(r)
[, , ci, ci] = normfit(r)

2154
2152
threshold = f(r, ) The estimated line is between Bus 3 and Bus 4, with
If residual(row) > threshold then begin model parameters: r=0.01 pu, x=0.03 pu, b=0.02 pu.
Remove row from DataSet
Increment(nR)
End 5.1. Simulated Error-Free Data
End
End
Algorithm 3. Outlier Filtering A simulated SCADA dataset for line (3,4) is
obtained from five different operating points. The
In Algorithm 3, Matlab functions ksdensity and normfit units of P, Q, and V measurements are MW, MVAr
have been used from the Statistics toolbox. The and per unit, respectively. Using only these points
function ksdensity (r) is used to compute a probability with no added error, the estimate matches the model,
distribution estimate for the sample vector r. This as expected. The values of the estimates are given in
function returns a vector of density values F Table 2. Real and reactive power plots of the actual
evaluated at the points in vector xi. The function data, the estimate, and the model are shown in Figure
normfit(r) fits r to a normal distribution and returns the 3 and Figure 4. The per unit base is 100 MVA.
estimates of mean and standard deviation of the
data with a 95% confidence interval. Estimates on Table 1. Study system noise-free estimates
the bounds of the confidence intervals are given by ci Line u (pu) r (pu) x (pu) b (pu)
and ci. The objective of Algorithm 3 is to detect and (3, 4) -1.99e-5 0.010055 0.030161 0.020058
remove bad data characterized by deviation of
0.01
residual from the zero mean. If the error residual for r: 0.010055
0.009
a data point exceeds the threshold, the point is u: -1.9901e-005
0.008
removed.
Ploss(pu) 0.007

0.006
5. Application to Simulated Data 0.005

0.004

In this section, the procedures are applied to 0.003

simulated data from a test system to serve as a 0.002 Data


reference. To perform these simulations, PowerWorld 0.001
Estimate
Model
Simulator is used via SimAuto and script commands. 0
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
A program allows SimAuto to retrieve SCADA data Current(pu)
from the simulated system. Then, the system is Figure 3. Seven-bus error-free data Ploss plot
perturbed to a new operating point. By repeating this
procedure, data for a number of operating points is 0.02
x: 0.030161
obtained. Simulations provide a reference case since b: 0.020058
0.015
both the model and the data are exactly known.
Application to real data is presented in the next 0.01
Qgen(pu)

section. The system used for the simulations is the


seven bus system shown in Figure 2. 0.005

-0.005 Data
Estimate
Model
-0.01
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Current(pu)
Figure 4. Seven-bus error-free data Qgen plot

5.2. Simulated Data with Error

From the data for the five error-free operating


conditions above, 200 perturbations are made of each
point. This reflects the fact that during each quasi-
Figure 2. Seven bus study system steady-state operating point, the SCADA values do
not change much but may change slightly. Thus,

2155
2153
from the five exact operating points, we create 200*5 As may be evident when comparing the results,
= 1000 hypothetical measurement data points. Each some sets of operating points facilitate better
measurement is per-unitized and randomized by a estimates than others. The issue is that when
normal probability distribution of mean zero and simulating data, it is challenging to reproduce data
standard deviation 2 MW, 2 MVAr, and 0.02 per unit with a realistic distribution. The question arises of
voltage. These 1000 points are used to compute the how one should perturb the system to obtain
estimate. The results from the new dataset are shown representative operating points since there are infinite
in Figure 5 and Figure 6, and the corresponding possible ways to change the system and obtain a new
estimates are shown in Table 2. operating point. Thus, rather than speculate on what
representative operating points exist that should be
0.15
r: 0.0063688 Data studied, the most value is obtained from testing the
Estimate procedure on real data.
u: 0.0012165 Model
0.1

6. Application to Real SCADA Data


Ploss(pu)

0.05

0
The proposed procedures are applied to historical
SCADA data from a real North American power
-0.05
system. Identifiers of buses and lines in the real
system have been made anonymous. From a
-0.1
practical point of view, it is extremely valuable to be
0.2 0.3 0.4 0.5 0.6 0.7
Current(pu)
0.8 0.9 1 1.1 1.2
able to present these results for actual SCADA data.
Figure 5. Seven-bus noisy dataset Ploss plot Estimation on artificially constructed data is a
special/trivial case. However, parameter estimation
0.1 using real world data can be extremely challenging as
x: 0.029319 Data
0.08 b: 0.017988 Estimate evident in the results and discussions presented in the
Model
0.06
following sections. Real world data rarely follows a
0.04
Gaussian or known probability distribution.
Qgen(pu)

0.02

0
6.1. Study System - North American Utility
-0.02

-0.04
From a North American utility, SCADA data
collected over several months is available for the
-0.06
three transmission lines illustrated in Figure 7. The
-0.08
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 model values of the line parameters for these lines are
Current(pu)
also available, tabulated in Table 3. Results for these
Figure 6. Seven-bus noisy dataset Qgen plot three lines are presented.
As indicated by the estimates for (3, 4)* in Table 2, Line A Line B Line C
the accuracy of the r estimate is considerably
improved when the static error u is not estimated. Bus 1 Bus 2 Bus 3 Bus 4

Table 2. Study system estimates with noise Figure 7. Three Lines with Available Data
Line u (pu) r (pu) x (pu) b (pu)
(3, 4) 0.0012165 0.006369 0.02932 0.017988 Each line identifier =(i,j) in Table 3 denotes a line
(3, 4)* - 0.010153 0.02931 0.020321 between bus i and bus j, from Figure 7.

It is interesting to note that the five operating Table 3. Existing Model Values for Lines
points result in five distinct clusters of points on the Line r (pu) x (pu) b (pu)
graphs. In real data, the authors have observed the A=(1, 2) 0.00056 0.01054 0.9079
presence of these bands, especially when a relatively B=(2, 3) 0.00052 0.00999 0.75194
small set of points is examined. Thus, multiple noisy C=(3, 4) 0.00025 0.00445 0.32042
measurements of the same operating point seem to
cause this feature. 6.2. Comparison of Model to Data

2156
2154
loss

An initial dataset with a size of 30,000 data points 2


Data
is selected according to Algorithm 2. No value Estimate
Model
filtering is applied. In the next sub-sections, Figures 1.5

8, 10, 12, 14, 16, and 18 plot Ploss and Qgen vs. I for r: 0.00086639

Ploss(pu)
1
u: 0.0067853
each line. The red and blue lines on these plots show
respectively the expected data points when using the
0.5
estimated and model parameter values. Figures 9, 10,
13, 15, 17, and 19 show distributions of the residuals
0
about the parameter estimate as well as Gaussian
distributions of the same mean and standard deviation
-0.5
(red dotted lines). The available datasets for lines A, 0 2 4 6 8 10 12
Current(pu)
B, and C lead to reasonable and consistent parameter Figure 10. Ploss vs. I (Line A)
estimates.
Resistance r and static power offset u are estimated 1
from the real power losses, while x and b are 0.9
estimated from the reactive power losses. These Mean: -3.551e-017

Density Distribution Estimate


0.8
Std: 0.039917
estimates for all of the lines are given in Table 4. 0.7
Each estimate is based on 30,000 data points. 0.6

0.5

6.2.1. Line A plots. 0.4

0.3
g
1.2
0.2
Data
1 Estimate 0.1
Model
0.8 0
-0.3 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 0.2
x: 0.010794 Ploss Deviation (pu)
Qgen(pu)

0.6
b: 0.90628
Figure 11. Ploss residual distribution (Line A)
0.4

0.2

0
6.2.2. Line B plots.
-0.2
g
1.5
-0.4
0 2 4 6 8 10 12 Data
Current(pu) 1 Estimate
Model
Figure 8. Qgen vs. I (Line A) 0.5
x: 0.01163
0
b: 0.7144
1
Qgen(pu)

-0.5
0.9
Density Distribution Estimate

-1
0.8
-1.5
0.7 Mean: 2.0811e-006
0.6
Std: 0.014912 -2

-2.5
0.5

0.4 -3
8 0 2 4 6 8 10 12 14 16 18

0.3 Current(pu)
0.2 Figure 12. Qgen vs. I (Line B)
0.1

0
-0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15
Qgen Deviation (pu)
Figure 9. Qgen residual distribution (Line A)

2157
2155
1 1
x: 0.0044154 Data
0.9 Estimate
0.8 b: 0.35742

Density Distribution Estimate


Model
0.8
Mean: -5.303e-005 0.6
0.7
Std: 0.035339
0.6

Qgen(pu)
0.4

0.5
0.2
0.4

0.3 0

0.2
-0.2
0.1
-0.4
0 4 0 2 4 6 8 10 12 14
-0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 0.2 0.25
Qgen Deviation (pu) Current(pu)

Figure 13. Qgen residual distribution (Line B) Figure 16. Qgen vs. I (Line C)

0.7 1
Data 0.9 Mean: -1.3784e-005

Density Distribution Estimate


0.6 Estimate Std: 0.019167
Model 0.8
0.5
r: 0.00047885 0.7
0.4
u: -0.0086491
0.6
Ploss(pu)

0.3
0.5
0.2
0.4
0.1
0.3

0
0.2

-0.1 0.1

-0.2 0
0 2 4 6 8 10 12 14 16 18 -0.3 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15
Current(pu) Qgen Deviation (pu)
Figure 14. Ploss vs. I (Line B) Figure 17. Qgen residual distribution (Line C)

1.4 0.3
r: -0.00025763 Data
Density Distribution Estimate

0.2 u: -0.010291 Estimate


1.2 Model

1
Mean: 2.7123e-017 0.1
Std: 0.039822
Ploss(pu)

0.8 0

0.6 -0.1

0.4 -0.2

-0.3
0.2

-0.4
0 0 2 4 6 8 10 12 14
-0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5
Ploss Deviation (pu) Current(pu)
Figure 15. Ploss residual distribution (Line B) Figure 18. Ploss vs. I (Line C)
1

6.2.3. Line C plots.


Density Distribution Estimate

0.9 Mean: 1.5721e-017


0.8 Std: 0.065345

0.7

0.6

0.5

0.4

0.3

0.2

0.1

0
-0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 0.2 0.25
Ploss Deviation (pu)
Figure 19. Ploss residual distribution (Line C)

2158
2156
6.2.4. Parameter Estimates for All Lines 0.3

0.2

The estimates of the line parameters based on the


0.1
data above are given in Table 4 below.

Ploss(pu)
0

Table 4. Estimates from 30,000 Data Points -0.1


Line u (pu) r (pu) x (pu) b (pu)
A=(1, 2) 0.0067853 0.0008664 0.010794 0.90628 -0.2

B=(2, 3) -0.008649 0.0004789 0.01163 0.7144 r:0.00030502 Data


-0.3
Estimate
C=(3, 4) -0.010291 -0.000257 0.004415 0.35742 u:0.0094121 Model
-0.4
0 2 4 6 8 10 12 14
Current(pu)
7. Discussion of Results
Figure 20. Ploss vs. I (Line C, corrected data)
As can be seen from the above plots, the estimated Table 5. Estimates after Regression
parameters are close to the available model Line u (pu) r (pu) x (pu) b (pu)
parameters. The estimated resistance for Line A is A=(1, 2) 0.0058326 0.0009012 0.010774 0.90586
slightly higher than its model parameter; this may be B=(2, 3) -0.010256 0.0004928 0.011749 0.71445
due to the fact that the available model resistance C=(3, 4) 0.0077929 0.00030975 0.004401 0.3589
value was computed using the dimensions of the line,
material resistance, etc. for different conditions than An effect which has not been included is how
actually seen in operation. The reason this does not actual line resistance varies with temperature. There
occur for the other two lines may be because Line A is approximately a 0.4% increase in resistance per C
is much longer in comparison to the Lines B and C. rise in temperature [12]. In seasonal planning models,
The estimated resistance of Line C is negative the r values are not adjusted since x values typically
when the algorithms are applied on the available raw dominate. When we use multiple months of SCADA
SCADA data. However, the magnitude of the data to obtain an estimate, any changes in the actual
estimated resistance is close to the model resistance parameters are ignored.
available. This may indicate the presence of
measurement sign errors. The suspected sign errors 8. GIC Model Validation Applications
are corrected, and a corrected dataset is returned after
applying this filtering in A future use of historical data to estimate model
Algorithm 1. The new estimate of the resistance, r parameters is in geomagnetic disturbance (GMD)
= 0.000305 pu, is considerably improved. Figure 20 modeling. GMDs induce geomagnetically induced
shows the new Ploss vs. I plot. currents (GICs) in transmission lines and
Next, estimation is done after applying regression transformers. Background on the problem and its
for all the lines. Since the actual parameters of the effects in large power systems is presented in [13].
transmission lines are not known or rather cannot be GIC-related transformer reactive power losses are
exactly known, there is no way to validate the final observed to vary linearly with terminal voltage [14],
results. However, the values provided in Table 5 are and may be represented, for example, as
expected to be the best parameter estimates. It is
clear that screening the data is a critical stage of the Qloss  Iline 2  xm  Vpu KIGIC (18)
estimation process, as the dataset which is chosen
ultimately impacts the quality of the final estimate.
The data for reactance and shunt estimation for the where K is the constant to be estimated and is
three lines was better than the data for resistance specific to the transformer, Vpu is the terminal voltage
estimation, as evident from the plots for distribution in per unit, and IGIC is the DC GIC. In the simple
of residuals. The latter data is noisy and the Ploss case of a wye-delta step-up transformer, IGIC is the
values are small compared to the value of the neutral current IN. Units of K are MVAr/A. For an
parameter being estimated. Measurements may also autotransformer, IGIC is given by
contain systemic biases which are not accounted for
in the approach. A more thorough coverage of the at I H  I L
I GIC  (19)
detection of data problems is beyond the scope of this at
paper.

2159
2157
from [15] [16] where the two DC coil currents IH and [4] A. K. Al-Othman, M. R. Irving, "Uncertainty
IL may be measured, and at is the transformer turns modelling in power system state estimation," IEE
ratio. Proceedings- Generation, Transmission and
When the K values of transformers are not known, Distribution, vol. 152, no. 2, pp. 233- 239, March
2005.
they are often approximated in the analysis by default [5] G. L. Kusic, D. L. Garrison, "Measurement of
values. Obviously, it is preferable to use actual values transmission line parameters from SCADA
to improve the meaningfulness of the GIC results. data," IEEE PES Power Systems Conference and
The NERC report [13] identifies as an action item the Exposition, 2004, pp. 440- 445 vol. 1, Oct. 2004.
development of tools for GIC flow and subsequent [6] J. Zhu, A. Abur, "Identification of network parameter
reactive power loss modeling. Parameter estimation errors," IEEE Transactions on Power Systems, vol.
can help validate transformer parameters which are 21, no. 2, pp. 586- 592, May 2006.
needed for GIC analysis. [7] J. Zhu, A. Abur, "Improvements in network
parameter error identification via synchronized
phasors," IEEE Transactions on Power Systems, vol.
9. Conclusion 25, no. 1, pp. 44-50, Feb. 2010.
[8] J. F. Dopazo, G. Irisarri, A. M. Sasson, Real-time
external equivalent for on-line contingency analysis,
The value of using data collected over time is
IEEE Transactions on Power Systems, vol. PAS-98,
apparent when estimating transmission line no. 6, pp. 2153-2171, Nov. 1979.
parameters, as shown in this paper. Results for both [9] K. M. Rogers, R. D. Spadoni, T. J. Overbye,
simulated and real data are discussed. It is clear that Identification of power system topology from
estimation based on real measurement data presents synchrophasor data, IEEE Power Systems
challenges which are not encountered in simulation. Conference and Exposition (PSCE), pp. 1-8, March
The presentation of the proposed solution approach 2011.
to real data is thus valuable. The approach is [10] J. Mandel, The Statistical Analysis of Experimental
validated on actual SCADA data from a real North Data, (Reprint) Dover, 1984, Mineola, N.Y.
American utility. In summary, the proposed [11] R. Bergen, V. Vittal, Power System Analysis,
2nd ed., Prentice Hall, 2000.
methodology is a valuable tool which can allow [12] S. Smolleck, J. Sims, Guidelines for the selection
planners to find and correct model errors based on and operation of bare ACSR conductors with regard
data which is already collected. to current carrying capacity, Electric Power Systems
Research, vol. 5, 1982, pp. 179-190.
10. Acknowledgments [13] 2012 Special Reliability Assessment Interim Report:
Effects of Geomagnetic Disturbances on the Bulk
Power System, NERC, Feb. 2012.
The authors would like to acknowledge support from [14] X. Dong, Y. Liu, J. G. Kappenman, "Comparative
the Stanford University Global Climate and Energy analysis of exciting current harmonics and reactive
Project (GCEP), the Power Systems Engineering power consumption from GIC saturated
Research Center (PSERC), Trustworthy Cyber transformers," IEEE Power Engineering Society
Infrastructure for the Power Grid (TCIPG), and Winter Meeting, pp. 318-322, Feb 2001.
[15] V.D. Albertson, J. G. Kappenman, N. Mohan, G. A.
Consortium for Electric Reliability Technology
Skarbakka, "Load-flow studies in the presence of
Solutions (CERTS). geomagnetically-induced currents," IEEE
Transactions on Power Apparatus and Systems, vol.
11. References PAS-100, no.2, pp.594-607, Feb. 1981.
[16] K. Zheng, et. al., Influence of system characteristics
on the amplitudes of geomagnetically induced
[1] A. G. Phadke, "Synchronized phasor measurements in
currents, submitted to IEEE Transactions on Power
power systems," IEEE Computer Applications in
Delivery, March 2012.
Power, vol. 6, no. 2, pp. 10-15, Apr 1993.
[2] A. Monticelli, State estimation in electric power
systems: a generalized approach. Kluwer Academic
Publishers, 1999.
[3] A. S. Debs, "Estimation of steady-state power system
model parameters," IEEE Transactions on Power
Apparatus and Systems, vol. PAS-93, no. 5, pp. 1260-
1268, Sept. 1974.

2160
2158

You might also like