<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>0120-6230</journal-id>
<journal-title><![CDATA[Revista Facultad de Ingeniería Universidad de Antioquia]]></journal-title>
<abbrev-journal-title><![CDATA[Rev.fac.ing.univ. Antioquia]]></abbrev-journal-title>
<issn>0120-6230</issn>
<publisher>
<publisher-name><![CDATA[Facultad de Ingeniería, Universidad de Antioquia]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S0120-62302009000100018</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Faulted zone determination using statistical modeling of voltage sag database in power distribution systems]]></article-title>
<article-title xml:lang="es"><![CDATA[Determinación de la zona en falla usando modelado estadístico de bases de datos de huecos de tensión en sistemas de distribución de energía eléctrica]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ordóñez Plata]]></surname>
<given-names><![CDATA[Gabriel]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Cormane Angarita]]></surname>
<given-names><![CDATA[Jorge]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Mora Flórez]]></surname>
<given-names><![CDATA[Juan]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Universidad Industrial de Santander Escuela de Ingenierías Eléctrica, Electrónica y de Telecomunicaciones Grupo de Investigación en Sistema Eléctricos (GISEL)]]></institution>
<addr-line><![CDATA[Bucaramanga ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Universidad Tecnológica de Pereira Programa de Ingeniería Eléctrica Grupo de Investigación en Calidad de Energía Eléctrica y Estabilidad (ICE3)]]></institution>
<addr-line><![CDATA[Pereira Risaralda]]></addr-line>
<country>Colombia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>03</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>03</month>
<year>2009</year>
</pub-date>
<numero>47</numero>
<fpage>197</fpage>
<lpage>208</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_arttext&amp;pid=S0120-62302009000100018&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_abstract&amp;pid=S0120-62302009000100018&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_pdf&amp;pid=S0120-62302009000100018&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[An alternative solution to the problem of power service continuity associated to fault location is presented in this paper, by using a methodology of statistical nature based on finite mixtures. A statistical model which helps to locate the faulted zone, is obtained from the extraction of the magnitude of the voltage sag registered during a fault event, along with the network parameters and topology. The objective is to offer an economic alternative of easy implementation for the development of strategies oriented to improve the reliability from the reduction of the restoration times in power distribution systems. As results presented for an application example in a 25kV system, the faulted zones were identified, having low error rates.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[En este artículo, se presenta una solución alternativa para el problema de continuidad del servicio asociada a la localización de fallas. La metodología propuesta es de naturaleza estadística y basada en las mezclas finitas. El modelo estadístico es obtenido a partir de la extracción de la magnitud del hueco de tensión registrado durante un evento de falla y de los parámetros de la red y de su topología. El objetivo esta asociado a ofrecer una alternativa económica y de fácil implementación para el desarrollo de estrategias orientadas a mejorar la confiabilidad a partir de la reducción de los tiempos de restauración de los sistemas de distribución. Como resultados más importantes, se presentan los obtenidos en un ejemplo de aplicación en un sistema de 25 kV, en el cual las zonas en falla fueron localizadas con un bajo error en el desempeño del localizador.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Power Quality]]></kwd>
<kwd lng="en"><![CDATA[Fault Location]]></kwd>
<kwd lng="en"><![CDATA[Finite Mixtures]]></kwd>
<kwd lng="en"><![CDATA[Statistical Model]]></kwd>
<kwd lng="en"><![CDATA[density mixture models]]></kwd>
<kwd lng="es"><![CDATA[Calidad de potencia]]></kwd>
<kwd lng="es"><![CDATA[localización de fallas]]></kwd>
<kwd lng="es"><![CDATA[mezclas finitas]]></kwd>
<kwd lng="es"><![CDATA[modelos estadísticos]]></kwd>
<kwd lng="es"><![CDATA[modelos de mezclas de densidad]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="center"><font face="Verdana" size="4"> <b>Faulted zone determination using statistical modeling of voltage sag database in power distribution systems</b></font></p>     <p align="center"><font face="Verdana" size="4"> <b>Determinaci&oacute;n de la zona en falla usando modelado estad&iacute;stico de bases de datos de huecos de tensi&oacute;n en sistemas de distribuci&oacute;n de energ&iacute;a el&eacute;ctrica</b></font></p>     <p> <font face="Verdana" size="2"> <i>Gabriel Ord&oacute;&ntilde;ez Plata<sup>1*</sup> , Jorge Cormane Angarita<sup>1</sup>, Juan Mora Fl&oacute;rez<sup>2</sup></i></font>     <p> <font face="Verdana" size="2"> <sup>1</sup>Grupo de Investigaci&oacute;n en Sistema El&eacute;ctricos &#40;GISEL&#41;. Escuela de Ingenier&iacute;as El&eacute;ctrica, Electr&oacute;nica y de Telecomunicaciones. Universidad Industrial de Santander. Calle 9, Cra 27 Ciudad Universitaria. Bucaramanga, Colombia</font></p>     <p> <font face="Verdana" size="2"> <sup>2</sup>Grupo de Investigaci&oacute;n en Calidad de Energ&iacute;a El&eacute;ctrica y Estabilidad &#40;ICE3&#41;. Programa de Ingenier&iacute;a El&eacute;ctrica. Universidad Tecnol&oacute;gica de Pereira. La Julita, Ciudad Universitaria. El&eacute;ctrica 227. Pereira, Risaralda, Colombia</font></p>      <p><font face="Verdana" size="2">&nbsp;</font></p> <hr noshade size="1">      <p><font face="Verdana" size="3"> <b>Abstract</b></font></p>     <p><font face="Verdana" size="2">An alternative solution to the problem of power service continuity associated to fault location is presented in this paper, by using a methodology of statistical nature based on finite mixtures. A statistical model which helps to locate the faulted zone, is obtained from the extraction of the magnitude of the voltage sag registered during a fault event, along with the network parameters and topology. The objective is to offer an economic alternative of easy implementation for the development of strategies oriented to improve the reliability from the reduction of the restoration times in power distribution systems. As results presented for an application example in a 25kV system, the faulted zones were identified, having low error rates. </font></p>      <p><font face="Verdana" size="2"><b>Keywords:</b> Power Quality, Fault Location, Finite Mixtures, Statistical Model, density mixture models.</font></p>     <p><font face="Verdana" size="2">&nbsp;</font></p> <hr noshade size="1">     ]]></body>
<body><![CDATA[<p><font face="Verdana" size="3"><b>Resumen</b></font></p>     <p><font face="Verdana" size="2">En este art&iacute;culo, se presenta una soluci&oacute;n alternativa para el problema de continuidad del servicio asociada a la localizaci&oacute;n de fallas. La metodolog&iacute;a propuesta es de naturaleza estad&iacute;stica y basada en las mezclas finitas. El modelo estad&iacute;stico es obtenido a partir de la extracci&oacute;n de la magnitud del hueco de tensi&oacute;n registrado durante un evento de falla y de los par&aacute;metros de la red y de su topolog&iacute;a. El objetivo esta asociado a ofrecer una alternativa econ&oacute;mica y de f&aacute;cil implementaci&oacute;n para el desarrollo de estrategias orientadas a mejorar la confiabilidad a partir de la reducci&oacute;n de los tiempos de restauraci&oacute;n de los sistemas de distribuci&oacute;n. Como resultados m&aacute;s importantes, se presentan los obtenidos en un ejemplo de aplicaci&oacute;n en un sistema de 25 kV, en el cual las zonas en falla fueron localizadas con un bajo error en el desempe&ntilde;o del localizador.</font></p>      <p><font face="Verdana" size="2"><b>Palabras clave:</b> Calidad de potencia, localizaci&oacute;n de fallas, mezclas finitas, modelos estad&iacute;sticos, modelos de mezclas de densidad.</font></p>       <p><font face="Verdana" size="2">&nbsp;</font></p> <hr noshade size="1">        <p><font face="Verdana" size="3"><b>Introduction</b></font></p>      <p><font face="Verdana" size="2">The interest in improving quality of supplied power is due to the deregulation in the electrical industry where quality is not only an indicator of the participation in the open power market but also one of the most relevant aspects regarding the requirements imposed to utilities. In most of the countries, and as a consequence of the new regulation, it is intended to strengthen the business of electricity distribution and the market from the viewpoint of power quality [1, 2]. The dependency of human activities in electricity demands that energy be supplied under several criteria of security, reliability and quality [3]. Such criteria have been promoted by fixed prices charged to final customers and by standards imposed to the service provided by utilities [4]. The continuity of supply is one of the most important aspects for the customer. This importance emerges from the social and economic impacts of interruptions [5, 6]. Although it is not economically feasible to reach a 100&#37; of reliability, utilities are making efforts to mitigate the problem of interruptions with an adequate planning and operation of the power system [7].</font></p>       <p><font face="Verdana" size="2">According to the statistics data, about 80&#37; of interruptions are caused by faults in the distribution system. The application of transmission system fault location algorithms to distribution networks is not a easy task due to the topology and operating principles of the latter &#40;i.e. non homogeneous feeders, load taps, laterals, radial operation and the available measuring equipment&#41; [8]. There exists a variety of methods for locating faults in power distribution systems. These methods may be classified in three broad categories. The first one comprises methods that detect components of high frequency in travelling waves, the second includes methods that compute fault impedance from the rms values of current and voltages measured at the fundamental frequency, and the last one is based on methods of visual inspection that consist of patrolling and checking the faulted feeder [8, 9].</font></p>       <p><font face="Verdana" size="2">This paper is aimed to propose an alternative solution to the problems associated with interruptions by means of a statistical model of voltage sags database applied to determine the fault location in power distribution systems to reduce the time wasted in system restoration [10]. The achievement of this goal enables the improvement of reliability from the establishment of strategies which are both economic and easily applicable by utilities [11]. This paper consists of six sections. In section 2, the theory related to the method which the proposed approach is based and the requirements for obtaining the statistical model are introduced. In section 3, the methodology proposed to fault location in distribution systems is presented. Section 4 describes the algorithm. Section 5 presents an application example by using a 25 kV distribution system, where the results obtained with the application of the proposed methodology are shown. Finally, the conclusions are highlighted in section 6.</font></p>       <p><font face="Verdana" size="2"><b>Basics of the multivariate analysis</b></font></p>      <p><font face="Verdana" size="2">The multivariable analysis is used in the proposed approach as kernel to obtain a representation of the distribution system. Multivariate analysis is here decomposed in two main levels.</font></p>       ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">The first one consists of the extraction of information from available data, called Exploratory Data Analysis-EDA. In this case, the available data is composed by the fault registers of voltaje measured at the power substation. These registers are characterized and as a result the voltage sag magnitude is obtained [12].</font></p>       <p><font face="Verdana" size="2">Second level is intended to represent knowledge from using the characteristics obtained at the first stage and relate it to the fault location.</font></p>       <p><font face="Verdana" size="2">Considering the above proposed, two different techniques are used to develop the fault location model. The first one is the application k-means algorithm and the second one is the mixture of distributions-MD [13]. With the application of the first technique data exploration and definition of variables are achieved while in the last one the probability density function is estimated.</font></p>       <p><font face="Verdana" size="2">Visual exploration is a powerful tool that serves as a first step in the understanding of multivariate data and enables the information analysis. This process helps with the comprehension despite the complexity and volume of data [14, 15]. </font></p>       <p><font face="Verdana" size="2">Clustering data allows the conformation of meaningful groups in an analytical way, with the objective of classifying data in a population according to similarities or affinities [16]. Moreover, the use of relatively simple models for each local structure makes the implementation, analysis and computational simplification less difficult [17, 18]. The clustering algorithms of are based in the use of metric differences for the calculation of the distance. The metrics are subjected to the constraints in <a href="#Ecuaci&oacute;n1,2y3">&#40;1&#41;</a>, where <b>A</b>, <b>B</b> and <b>C</b> are individuals of a group of data and <i>d</i>&#40;<b>A</b>,<b>B</b>&#41; is the distance between individuals <b>A</b> and <b>B</b>.</font></p>       <p><font face="Verdana" size="2">There exists a great variety of metrics associated to the quantification of data variability. In this approach the Euclidean &#40;2&#41; and Mahalanobis &#40;3&#41; metrics are used for cluster analysis since the best results have been achieved with them [19]. <a href="#Ecuaci&oacute;n1,2y3">Equations 2 and 3</a> are written in matrix format, where <b>x</b><sub>i</sub> is the data vector which corresponds to observation i, <img src="/img/revistas/rfiua/n47/n47a18i01.gif"> is the mean magnitude vector, T indicates the transpose of a matrix and <b>V</b> is the covariance matrix.</font></p>       <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i02.gif" ><a name="Ecuaci&oacute;n1,2y3"></a></p>        <p><font face="Verdana" size="2">Two important aspects are evaluated by using the previous mentioned metrics. The first is concerned with the proximity of between elements of the same group which indicates the how compact is a group of data &#40;Internal homogeneity&#41;. The second is related to the distance between elements of different groups in order to guarantee that no overlap exists due to the relative proximity of groups &#40;external heterogeneity&#41; [19].</font></p>       <p><font face="Verdana" size="2">MD is a technique of statistical modelling which allows an estimation of the Probability Density Function – PDF of data in a random sample, represented as a finite weighted sum of multivariate density components [20]. This technique has been applied with several purposes such as the modelling of heterogeneity in a population &#40;biology&#41;, management of outliers, PDF estimation and clustering &#40;statistics&#41;, pattern recognition &#40;image processing&#41; and fraud detection &#40;utilities&#41;. Within the principal features of MD, it is worth to mention that a smoothing parameter for PDF estimation is not required. The finite number of terms in the mixture can be defined according to the needs of the analyst. MD have less computational burden in comparison with other methods such as Kernel Estimation and Histograms since less amount of information is stored for PDF estimation [19].</font></p>       <p><font face="Verdana" size="2">The main advantage of MD is due to the capability on analysis and modelling of clusters. The flexibility of mixture models allows its application to the fault location problem in distribution systems [21]. In this particular application, concepts about multivariate data were used. These concepts are a characterization of a multidimensional random phenomenon. The multivariate density mixtures are expressed as shown in &#40;4&#41;.</font></p>      ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i03.gif" ><a name="Ecuaci&oacute;n4"></a></p>        <p><font face="Verdana" size="2">MD taken from a random sample x of n observations of dimension d is comprised of G components &fnof;<sub>g</sub> &#40;x,&theta;&#41; related to the selected multivariate density function. Each component &fnof;<sub> g</sub> describes the behaviour of a group within the sample in which data related to such group have similar characteristics established in the estimation vector &theta; that corresponds to the parameters of each distribution &#40;homogeneous&#41;. An estimator is a parameter that defines the behaviour of data in a group, so it also describes the shape. For example, estimators for a normal PDF are: the mean vector &mu; and the covariance matrix <b>V</b> which define the central point of the distribution and how data are concentrated. The quantities <i>p</i><sub>g</sub>, called weights or coefficients of mixture, provide information about the importance of the group within the mixture. The conditions that coefficients must satisfy are given in &#40;5&#41;.</font></p>      <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i04.gif" ><a name="Ecuaci&oacute;n5"></a></p>         <p><font face="Verdana" size="2">The aim of MD is to identify an unknown quantity of groups in which data of a given population are clustered. That is to say, MD seeks the homogeneity within an initially heterogeneous sample. To achieve this goal MD utilizes the Expectation- Maximization-EM algorithm for parameter estimation. </font></p>       <p><font face="Verdana" size="2">The EM algorithm is an application of the Maximum Likelihood Method-MLM to find missing parameters. In this sense, it allows the determination of maximum likelihood estimators &theta; for each distribution &fnof;<sub>g</sub> &#40;x, &theta;&#41;from initial values <img src="/img/revistas/rfiua/n47/n47a18i05.gif">  [13]. L in the algorithm is the likelihood function of the sample, finding the expected value of the functions of missing values <b>Z</b> from the calculation of its density function with initial values <img src="/img/revistas/rfiua/n47/n47a18i05.gif"> and observed values <b>Y</b>. The result of this operation is called the E-step &#40;Expectation&#41; given in &#40;6&#41;.</font></p>      <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i06.gif" ><a name="Ecuaci&oacute;n6"></a></p>        <p><font face="Verdana" size="2">In the M-step &#40;Maximization&#41; the function <i>L </i>&#40;<b>&theta;/ Y</b>&#41; is maximized in &theta; to find the m&aacute;ximum likelihood estimators from the replacement of missing with estimated values. </font></p>       <p><font face="Verdana" size="2">Let <img src="/img/revistas/rfiua/n47/n47a18i07.gif" >  be the value of the estimator obtained in the M-step. Then we return to the E-step in an iterative procedure until convergence is reached as presented in &#40;7&#41;.</font></p>        <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i08.gif" ><a name="Ecuaci&oacute;n7"></a></p>        <p><font face="Verdana" size="2">Where &epsilon; is the accepted tolerance, and k is the iteration number [22].</font></p>      ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">The MLM selects a good value of the estimator the one that maximizes the probability of generating the observed sample from the model to est&iacute;mate [13, 21].</font></p>      <p><font face="Verdana" size="3"><b>Proposed methodology</b></font></p>     <p><font face="Verdana" size="2">To obtain the fault location, a characterization of the system response under fault conditions is proposed. The response of the system is reflected in voltage signals measured at the distribution substation &#40;single end measurements&#41;. In what follows, a description of the research done in five stages proposed methodology will be given.</font></p>      <p><font face="Verdana" size="2">In the first stage, a power distribution system is considered and voltage waveforms from faults are recorded at the substation. The most common type of faults in distribution systems are short circuits. This work takes into account single lineto-ground, line-to-line, double line-to-ground, three phase and three phase-to-ground faults with different values of fault resistance between 0 and 50 Ohms [23].</font></p>      <p><font face="Verdana" size="2">In the second stage, signals are pre-processed to obtain rms and per-unit values.</font></p>      <p><font face="Verdana" size="2">In the third stage the system was characterized in two different ways. The first is a deterministic one based on the calculation of single-phase features or&ldquo;descriptors&rdquo; of voltage signals [12]. The second of statistical nature is based on an application of EDA which results in descriptors that characterize system behaviour.</font></p>      <p><font face="Verdana" size="2">In the fourth stage the information of included descriptors is analyzed by setting rules and conditions for creating characteristic zones and relations of homogeneity between groups. This is achieved with the application of techniques for the analysis of clusters.</font></p>      <p><font face="Verdana" size="2">Finally, in the fifth stage, the model of mixtures is conformed and fault data are classified obtaining the most probable fault location zone.</font></p>      <p><font face="Verdana" size="2"><b>Structure of the fault locator</b></font></p>     <p><font face="Verdana" size="2">The description of the algorithms that allow a step by step construction of the model, which is based on clustering and MD theory, is presented in this section. The idea for the implementation of the algorithm is to progressively adjust the model of each cluster extracted from the information of the system under fault conditions. The initial values of model parameters are calculated in an iterative procedure. A summary-type scheme of the algorithms known as k-means is next presented:</font></p>     ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">a. Specify the number of groups by a preliminary analysis or considering suggestions of the maintenance crew.</font></p>     <p><font face="Verdana" size="2">b. Determine the centres of these groups. This can be done a priori or in a random way.</font></p>     <p><font face="Verdana" size="2">c. Take each data and calculate the distance from each cluster by using Euclidean or Mahalanobis approaches as presented in <a href="#Ecuaci&oacute;n1,2y3">equations &#40;2&#41; and &#40;3&#41;</a>.</font></p>     <p><font face="Verdana" size="2">d. Aggregate each data into the cluster whose distance is a minimum and compute the new centres. e. Repeat steps b, c and d until no further changes occur in groups.</font></p>      <p><font face="Verdana" size="2">The idea is to minimize the sum of squared distance from points to centres within each group. Once the clustering is done, the method procedes with the identification of predominant characteristics in each group with the aim of inferring and relating new data.</font></p>      <p><font face="Verdana" size="2">From the information about the groups from the k-means algorithm, initial values for the centres are calculated. The initial value of covariance matrix is taken as the identity matrix and the mixture coefficients are calculated with the proportion of data in each group, in relation to the sample. Once initial parameters have been obtained, the estimation of the mixture model parameters is initiated by the EM algorithm in an iterative procedure until desired convergence is reached. The results are the final values of parameters <b>&mu;</b>, <b>V</b> and <i>p</i> of each group. These parameters are used by the multivariable density of mixtures presented in <a href="#Ecuaci&oacute;n4">equation &#40;4&#41;</a>, in order to classify other posible observations. The steps of the EM algorithm are as follows:</font></p>     <p><font face="Verdana" size="2">a. Determine the number of components of the mixture by using the k-mean algorithm.</font></p>     <p><font face="Verdana" size="2">b. Determine initial values of parameters of each component <img src="/img/revistas/rfiua/n47/n47a18i09.gif" >.</font></p>     <p><font face="Verdana" size="2">c. Calculate the posterior probability for each observation &#40;E-step&#41; as shown in &#40;8&#41; and &#40;9&#41;.</font></p>       <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i10.gif"><a name="Ecuaci&oacute;n8y9"></a></p>        ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2"> <img src="/img/revistas/rfiua/n47/n47a18i11.gif"> represents the posterior probability of x<sub>j</sub> corresponding to the i term,  <img src="/img/revistas/rfiua/n47/n47a18i12.gif"> is the normal multivariate density and <img src="/img/revistas/rfiua/n47/n47a18i13.gif">  corresponds to the estimated MD for the i terms evaluated in x<sub>j</sub>. <i>j</i> is a index which indicate the total amount of data.</font></p>     <p><font face="Verdana" size="2"> d. Update   <img src="/img/revistas/rfiua/n47/n47a18i14.gif"> and   <img src="/img/revistas/rfiua/n47/n47a18i15.gif"> of each component &#40;Mstep&#41; by using <a href="#Ecuaci&oacute;n10,11y12">&#40;10&#41;, &#40;11&#41; &#40;12&#41;</a>. &#40; <img src="/img/revistas/rfiua/n47/n47a18i16.gif"> the updated estimations of the parameters&#41;.</font></p>      <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i17.gif"><a name="Ecuaci&oacute;n10,11y12"></a></p>        <p><font face="Verdana" size="2">e. Repeat steps c and d until desired convergence is obtained.</font></p>      <p><font face="Verdana" size="2">Subsequently, the organization of groups in clases associated to faults is based in the probability of appearance in each group as given by the mixture model &#40;13&#41;.</font></p>         <p><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i18.gif"><a name="Ecuaci&oacute;n13"></a></p>       <p><font face="Verdana" size="2"><b>Proposed tests for locating faults</b></font></p>     <p><font face="Verdana" size="2">From the statistical model as presented before, fault location within the system according to its response is expected. Recorded voltage waveforms are the basis to find the solution. Each recorded event has relevant information that enables data classification within certain type of class established in the model. Each class corresponds to a zone within the distribution network.</font></p>      <p><font face="Verdana" size="2">All the information used in this approach corresponds to magnitudes of voltage &#40;not angles were used&#41;. That is because this approach is aimed to obtain a low economical cost tool due to constrains imposed in several distribution utilities.</font></p>      <p><font face="Verdana" size="2">A 25 kV power distribution system is proposed for tests. This system is taken from Saskatown Power and Light, Canada, and it is presented in figure 1 [8, 24].</font></p>      ]]></body>
<body><![CDATA[<p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i19.gif"><a name="Figura1"></a></p>        <p><font face="Verdana" size="2"><b>Figure</b> 1 25 kV power distribution system used to test</font></p>      <p><font face="Verdana" size="2">For the discrimination of zone visually, detectable groups were taken into account in a preliminary data analysis. Also, the zone division of the power system has to consider the suggestions of the maintenance crew, according to their experience in fault recovering. The goal to be achieved by associating groups to zones is to establish the correspondence between fault location and data classification within groups. Three descriptors were used to represent distribution system information: maximum sag magnitude in each phase of a three phase system [12].</font></p>     <p><font face="Verdana" size="2">In this case, data for training and validation are short-circuit faults simulated in each bus of the power distribution system by using different fault resistances from 0,05O to 50O according to [23]. In <a href="#Tabla1">table 1</a>, fault resistance values used in the training &#40;T&#41; and validating &#40;V&#41; processes are presented. </font></p>      <p><font face="Verdana" size="2">In<a href="#Tabla2"> table 2</a> data used in training &#40;T&#41; and validation &#40;V&#41;, for each fault type are presented. </font></p>      <p><font face="Verdana" size="2">Before classification process, a verification of training data is performed to previously identify and divide the distribution network in several zones. The objective is to estimate the initial centres of each group.<a href="#Figura2"> Figure 2</a> shows each of the power system zones obtained previous grouping of voltaje sag descriptors for single-phase faults.</font></p>      <p><font face="Verdana" size="2"><b>Table 1</b> Fault resistance values</font></p>      <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i20.gif"><a name="Tabla1"></a></p>        <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i21.gif"><a name="Figura2"></a></p>        <p><font face="Verdana" size="2"><b>Figure 2</b> Zones previously determined by visual analysis of data</font></p>      ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">The proposed model is ordered in several steps of classification, the first one &#40;SC1&#41;, determines the faulted phase; the second &#40;SC2&#41;, finds fault resistance value and the third &#40;SC3&#41;, finds fault location. SC1 step is only applicable to singlephase and phase-to-phase faults while SC2 and SC3 steps are applicable to all fault types. <a href="#Figura3">Figures 3</a>, <a href="#Figura4">4</a> and<a href="#Figura5"> 5 </a>show the distribution of training data corresponding to three types of faults. Besides, three clearly defined groups which correspond to each one of the three phases are observed.</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i22.gif"><a name="Figura3"></a></p>        <p><font face="Verdana" size="2"><b>Figure 3</b> Distribution of training sag data for singlephase fault &#40;SC1&#41;</font></p>      <p><font face="Verdana" size="2"><b>Table 2</b> Data used in training and validating processes</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i23.gif"><a name="Tabla2"></a></p>       <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i24.gif"><a name="Figura4"></a></p>       <p><font face="Verdana" size="2"><b>Figure 4</b> Distribution of training sag data for phaseto-phase fault &#40;SC1&#41;</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i25.gif"><a name="Figura5"></a></p>       <p><font face="Verdana" size="2"><b>Figure 5</b> Distribution of training sag data for doublephase-to-ground fault &#40;SC1&#41;</font></p>      <p><font face="Verdana" size="2">Having the initial definition of groups and by applying the k-means algorithm, centers of groups are estimated to allow initialization of the method. Then, the shape and final proportion of groups within the distribution is defined utilizing EM algorithm and the initial estimation. In this step, a heterocedastic model is used to determine the shape of the covariance matrices for each distribution and also the shape of each group. Moreover, by using the same initial values for mixture coefficients of each group, it assumes that the occurrence of a fault within the group is equally probable.</font></p>      ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">In step SC2, the information about fault resistance allows the creation of possible scenarios where groups representing intervals of fault resistance values are conformed. In this particular case, five representative groups were established, as shown in <a href="#Figura6">figure 6</a>. The idea presented above is related to the capability of predicting what caused the fault. This information can be relevant to establishing the procedure to solve the problem [23].</font></p>      <p><font face="Verdana" size="2">The shape and final size of the five groups obtained at this application is shown in <a href="#Figura6">figure 6</a> &#40;ellipses&#41;. Centres and mixture coefficients are represented by stars. Fault resistance value grows advancing to the right in the figure. The elliptic shape of each group is related to the values of the covariance matrix [16]. At step SC3 all data are grouped, and each one represents a probable faulted zone.</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i26.gif"><a name="Figura6"></a></p>        <p><font face="Verdana" size="2"><b>Figure 6</b> Distribution of groups in training sag data in case of single-phase faults &#40;SC2&#41;</font></p>      <p><font face="Verdana" size="2">In SC3 step, data are clustered and used as zones of fault occurrence. Each defined zone is associated to a given number of buses within the distribution network as shown in<a href="#Figura2"> figure 2</a>. These zones are established with the information about the number of groups obtained in SC2 step. Hence in step SC3 we have&ldquo;r&rdquo; groups corresponding to fault resistance groups determined in SC2 and these contain&ldquo;z&rdquo; groups corresponding to each one of system zones. In the case of the prototype system, there is a MD that consists of five groups &#40;SC2&#41; and each group contains four subgroups that represent system zones. <a href="#Figura7">Figure 7</a> illustrates the construction of groups associated to zones. </font></p>      <p><font face="Verdana" size="2">The EM algorithm is able to modify the elements of the covariance matrix in each one of the iterations until the best estimation is obtained. If the covariance matrices remained without change from the initial estimation, circular shaped groups would be obtained [19]. Finally, the mixture model is applied with the aim of comparing initially assumed zones &#40;see <a href="#Figura2">figure 2</a>&#41;, with those determined by the algorithm. In the current application case, the initial estimate of zones corresponds to the result obtained with the algorithms. An aspect to remark is the incidence of the initial values, assumed by the analyst, in the improvement of the model during its construction. Once established the model, validation was done with sag data not used in the training process.</font></p>       <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i27.gif"><a name="Figura7"></a></p>        <p><font face="Verdana" size="2"><b>Figure 7</b> Distribution of zones associated to group 1 in SC2 step</font></p>      <p><font face="Verdana" size="2"><b>Table 3</b> Results of SC1 Validation Step</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i28.gif"><a name="Tabla3"></a></p>       ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">The observations contain voltage sags with fault resistance values of 10, 20, 30 and 40 O. Having established the data model, validation stage is performed by using data not previously used in training. The validation results are presented as the ratio between the well classified data and all the testing data. To evaluate the fist classification stage &#40;SC1&#41;, 468 faults were used. In <a href="#Tabla2">table 2</a> all data used in validation are presented. In <a href="#Tabla3">table 3</a> the results in case of single and double phase faults are presented.</font></p>      <p><font face="Verdana" size="2">In <a href="#tabla4">Table 4</a>, validation results obtained with SC2 classifier are presented &#40;564 observations&#41;. </font></p>     <p><font face="Verdana" size="2">The results in <a href="#Tabla5">table 5</a> reflect the good performance of SC3 classifier &#40;564 observations&#41;.</font></p>     <p><font face="Verdana" size="2"> Classification problems are found in further points along the distribution feeder. In these cases there exists a wrong assignation of the zone due to a classification error in the previous step SC2. Another possible cause of this inadequate behaviour is the location of data in the intersection of two or more groups. In this situation, the zone is assigned depending on its higher probability a posteriori.</font></p>      <p><font face="Verdana" size="2">Each observation is evaluated respect to each one of the groups by computing the probability of pertaining to the same groups through all the steps. The observation is classified within the group in which it has the higher probability.</font></p>      <p><font face="Verdana" size="2"><b>Table 4</b> Results of SC2 Validation Step</font></p>     <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i29.gif"><a name="tabla4"></a></p>        <p><font face="Verdana" size="2"><b>Table 5</b> Validation results in stage SC3</font></p>        <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i30.gif"><a name="Tabla5"></a></p>       <p><font face="Verdana" size="2"><a href="#Tabla6">Table 6</a> presents results of data classification according to calculated probabilities in each group. Eight data are presented as an example due to the great amount of processed information &#40;468 observations&#41;. Data shown corresponds to eight single-phase faults located in the four zones of the system in <a href="#Figura2">figure 2</a>. Each fault belongs to a real zone of occurrence and presents an estimated zone which is determined from its probability of occurrence calculated in the classification performed in stage SC3. No information is included about three phase and phase-to-phase faults because only the main three phase feeder is shown and divided in three zones. On the other hand, single phase feeders pertaining to the fourth zone of the system are not included.</font></p>     ]]></body>
<body><![CDATA[<p><font face="Verdana" size="3"><b>Conclusions</b></font></p>     <p><font face="Verdana" size="2">The proposed statistic based methodology for fault location in distribution systems has been presented and tested. This approach is based in the statistical modelling and extraction of the sag magnitude from voltage measurements stored in fault data bases. The fault locator here proposed contributes to satisfy actual needs of utilities in preserving and improving service quality, promoting the consolidation of strategies oriented to decrease the number and duration of interruptions. The total time of interruption can be associated to the time taken in several actions during system restoration such as alarm time, detection time, Access time and sectioning time among others.</font></p>      <p><font face="Verdana" size="2">Potential limitations for the proposed methodology are the selection of the number of groups, the proportion of samples in each group and initial values required by algorithms. These difficulties overcome by introducing theoretical and heuristic criteria, the latter being more influential in the structure of the model.</font></p>      <p><font face="Verdana" size="2">One of the advantages in the construction of the model is the determination of groups of well defined characteristics which allow an optimization in the classification of data thus ensuring good model accuracy. Moreover, the initial approximation of system zones is useful in the estimation of the real number and size of zones in the distribution system. Besides, detailed characterization of the system, model performance depends on the quality of information extraction and processing.</font></p>      <p><font face="Verdana" size="2"><b>Table 6</b> Examples of results of data classification</font></p>      <p align="center"><font face="Verdana" size="2"><img src="/img/revistas/rfiua/n47/n47a18i31.gif"><a name="Tabla6"></a></p>       <p><font face="Verdana" size="2">Software requirements for the implementation of a tool with the proposed methodology are supplied by statistical packages of commercial use and low license costs. This fact enables the easy and simple implementation of the model. Hardware requirements are basic and economical since they only depend of a data acquisition system with the capability of continuously monitoring power system magnitudes at the power distribution substation.</font></p>      <p><font face="Verdana" size="2">The proposed model takes into account technical, economical and operational issues existing in power distribution networks. A remarkable aspect is the low investment cost for the implementation of the fault detection system based in the proposed method.</font></p>      <p><font face="Verdana" size="3"><b>References</b></font></p>     <p><font face="Verdana" size="2">1. J. Mora, G. Carrillo, B. Barrera.&ldquo;Fault Location in Power distribution Systems using a Learning Algorithm for Multivariable Data Analysis&rdquo;. IEEE Trans. on Power Delivery. Vol. 22. 2007. pp. 1715- 1721.</font></p>     ]]></body>
<body><![CDATA[<p><font face="Verdana" size="2">2. J. Driesen, T. Green, T. Van Craenenbroeck, R. Belmans.&ldquo;The Development of Power Quality Markets&rdquo;. IEEE Power Engineering Society General Meeting. Vol. 1. 2004. pp. 963-967.</font></p>     <p><font face="Verdana" size="2">3. J. Martinez, J. Martin.&ldquo;Voltage Sag Stochastic Prediction Using an Electromagnetic Transients Program&rdquo;. IEEE Transactions on power delivery. Vol. 19. 2004. pp. 596- 602.</font></p>     <p><font face="Verdana" size="2">4. C. Crozier, W. Wisdom.&ldquo;A power quality and reliability index based on customer interruption costs&rdquo;. Power Engineering Review, IEEE. Vol. 19. 1999. pp. 59 – 61.</font></p>     <!-- ref --><p><font face="Verdana" size="2">5. M. Bollen. Understanding power quality problems: voltages sags and interruptions. Ed. IEEE Press. New York,. 2000. pp. 35 – 116.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000126&pid=S0120-6230200900010001800005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p><font face="Verdana" size="2">6. A. Girgis, C. Fallon, D. Lubkeman.&ldquo;A fault location technique for rural distribution feeders&rdquo;. IEEE Trans. Industry applications. Vol. 29. 1993. pp. 1170-1175.</font></p>     <!-- ref --><p><font face="Verdana" size="2">7. R. Brown. Electric power distribution reliability. New York. Marcel Dekker. 2002. pp. 24-42.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000128&pid=S0120-6230200900010001800007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p><font face="Verdana" size="2">8. R. Das.&ldquo;Determining the Locations of Faults in Distribution Systems&rdquo;. Ph.D dissertation. Saskatchewan Univ. Canada. 1998. pp. 15 – 48.</font></p>     <p><font face="Verdana" size="2">9. J. Zhu, D. Lubkeman, A. Girgis.&ldquo;Automated fault location and diagnosis on electric power distribution feeders&rdquo;. IEEE Trans. Power delivery. Vol. 12. 1997. pp. 801-809.</font></p>     <p><font face="Verdana" size="2">10. A. Girgis, C. Fallon, D. Lubkeman.&ldquo;A fault location technique for rural distribution feeders&rdquo;. IEEE Trans. Industry applications. Vol. 29. 1993. pp. 1170-1175.</font></p>     <!-- ref --><p><font face="Verdana" size="2">11. H. Willis. Power distribution planning reference book. New York. Marcel Dekker. 2004. pp. 47 – 59.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000132&pid=S0120-6230200900010001800011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p><font face="Verdana" size="2">12. L. D. Zhang, M. H. J. Bollen.&ldquo;Characteristics of voltaje dips &#40;sags&#41; in power systems&rdquo;. IEEE Transactions on Power Delivery. Vol. 15. 2000. pp.827-832.</font></p>     <p><font face="Verdana" size="2">13. A. Rencher.&ldquo;Methods of Multivariable Analysis&rdquo;. Ed. John Wiley and Sons. New York. Brigham Young University. Utath. 1995. Chapter 12. pp. 415-443.</font></p>     <p><font face="Verdana" size="2">14. Y. Wang, L. Luo, M. Freedman, S. Kung.&ldquo;Probabilistic principal component subspaces: A hierarchical finite mixture model for data visualization&rdquo;. IEEE Trans. Neural Networks. Vol. 11. 2000. pp. 625-636.</font></p>     <!-- ref --><p><font face="Verdana" size="2">15. R. Johnson, D. Wichern. Applied Multivariate Statistical Analysis. Ed. Prentice Hall, New York. 1998. pp. 124-168.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000136&pid=S0120-6230200900010001800015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p><font face="Verdana" size="2">16. J. Hair, R. Anderson, R. Tatham, W. Black. Multivariable Data Analysis. Ed. Prentice Hall. Madrid.1999. pp. 85-97.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000137&pid=S0120-6230200900010001800016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p><font face="Verdana" size="2">17. Y. Wang, S. Lin, H. Li, S. Kung.&ldquo;Data mapping by probabilistic modular networks and information theoretic criteria&rdquo;. IEEE Trans. Signal processing. Vol. 46. 1998. pp. 3378-3397.</font></p>     <p><font face="Verdana" size="2">18. M. Jordan, R. Jacobs.&ldquo;Hierarchical mixture of experts and the EM algorithm&rdquo;. IEEE Trans. Neural Computing. Vol. 6. 1994. pp. 181-214.</font></p>     <!-- ref --><p><font face="Verdana" size="2">19. W. Mart&iacute;nez, A. Mart&iacute;nez. Computational statistics Handbook whit MatLab. Ed. Chapman &amp; Hall New York. 2002. pp. 90-124.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000140&pid=S0120-6230200900010001800019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p><font face="Verdana" size="2">20. E. Dalla Jonson. M&eacute;todos multivariados aplicados al an&aacute;lisis de datos. Ed. Thompson. M&eacute;xico. 2000. pp. 53-86.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000141&pid=S0120-6230200900010001800020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p><font face="Verdana" size="2">21. G. Mclachlan, D. Peel. Finite mixture models. Ed. Wiley and Sons. Montreal. 2000. pp. 46-72.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=000142&pid=S0120-6230200900010001800021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p><font face="Verdana" size="2">22 M. Figueiredo.&ldquo;Unsupervised Learning of Finite Mixture Models&rdquo;. IEEE Trans. Pattern analysis and Machine intelligence. Vol. 24. 2002. pp. 135-143.</font></p>     <p><font face="Verdana" size="2">23. J. B. Dagenhart.&ldquo;The 40-Ground-Fault Phenomenon&rdquo;. IEEE Trans. on Industry Applications. Vol. 36. 2000. pp. 30-32.</font></p>     <p><font face="Verdana" size="2">24. S. Lee, M. Choi, S. Kang, B. Jin, D. Lee, B. Ahn, N. Yoon, H. Kim, S. Wee.&ldquo;An intelligent and efficient fault location and diagnosis scheme for radial distribution systems&rdquo;. IEEE Trans. Power Delivery. Vol. 19. 2004. pp. 524-532.</font></p>     <p><font face="Verdana" size="2">&#40;Recibido el 14 de marzo de 2008. Aceptado el 6 de noviembre de 2008&#41;</font></p>       <p><font face="Verdana" size="2"><sup>*</sup>Autor de correspondencia: tel&eacute;fono: + 57 + 6 + 334 40 00, fax: + 57 + 6 + 335 96 21, correo electr&oacute;nico: <a href="mailto:gaby@uis.edu.co">gaby@uis.edu.co</a> &#40;G. Ord&oacute;&ntilde;ez&#41;.</font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mora]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Carrillo]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Barrera]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Fault Location in Power distribution Systems using a Learning Algorithm for Multivariable Data Analysis]]></article-title>
<source><![CDATA[IEEE Trans. on Power Delivery]]></source>
<year>2007</year>
<volume>22</volume>
<page-range>1715- 1721</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Driesen]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Green]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Van Craenenbroeck]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Belmans]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The Development of Power Quality Markets]]></article-title>
<source><![CDATA[IEEE Power Engineering Society General Meeting]]></source>
<year>2004</year>
<volume>1</volume>
<page-range>963-967</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Martinez]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Martin]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Voltage Sag Stochastic Prediction Using an Electromagnetic Transients Program]]></article-title>
<source><![CDATA[IEEE Transactions on power delivery]]></source>
<year>2004</year>
<volume>19</volume>
<page-range>596- 602</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Crozier]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Wisdom]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A power quality and reliability index based on customer interruption costs]]></article-title>
<source><![CDATA[Power Engineering Review, IEEE]]></source>
<year>1999</year>
<volume>19</volume>
<page-range>59 - 61</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bollen]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Understanding power quality problems: voltages sags and interruptions]]></source>
<year>2000</year>
<page-range>35 - 116</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Ed. IEEE Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Girgis]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Fallon]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Lubkeman]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A fault location technique for rural distribution feeders]]></article-title>
<source><![CDATA[IEEE Trans. Industry applications]]></source>
<year>1993</year>
<volume>29</volume>
<page-range>1170-1175</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<source><![CDATA[Electric power distribution reliability]]></source>
<year>2002</year>
<page-range>24-42</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Marcel Dekker]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<source><![CDATA[Determining the Locations of Faults in Distribution Systems]]></source>
<year></year>
<page-range>15 - 48</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Lubkeman]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Girgis]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automated fault location and diagnosis on electric power distribution feeders]]></article-title>
<source><![CDATA[IEEE Trans. Power delivery]]></source>
<year>1997</year>
<volume>12</volume>
<page-range>801-809</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Girgis]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Fallon]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Lubkeman]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A fault location technique for rural distribution feeders]]></article-title>
<source><![CDATA[IEEE Trans. Industry applications]]></source>
<year>1993</year>
<volume>29</volume>
<page-range>1170-1175</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Willis]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[Power distribution planning reference book]]></source>
<year>2004</year>
<page-range>47 - 59</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Marcel Dekker]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[L. D]]></given-names>
</name>
<name>
<surname><![CDATA[J. Bollen]]></surname>
<given-names><![CDATA[M. H]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Characteristics of voltaje dips (sags) in power systems]]></article-title>
<source><![CDATA[IEEE Transactions on Power Delivery]]></source>
<year>2000</year>
<volume>15</volume>
<page-range>827-832</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rencher]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Methods of Multivariable Analysis]]></source>
<year>1995</year>
<page-range>415-443</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Ed. John Wiley and Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Luo]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Freedman]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Kung]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Probabilistic principal component subspaces: A hierarchical finite mixture model for data visualization]]></article-title>
<source><![CDATA[IEEE Trans. Neural Networks]]></source>
<year>2000</year>
<volume>11</volume>
<page-range>625-636</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Johnson]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Wichern]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Applied Multivariate Statistical Analysis]]></source>
<year>1998</year>
<page-range>124-168</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Ed. Prentice Hall]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hair]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Anderson]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Tatham]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Black]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
</person-group>
<source><![CDATA[Multivariable Data Analysis]]></source>
<year>1999</year>
<page-range>85-97</page-range><publisher-loc><![CDATA[Madrid ]]></publisher-loc>
<publisher-name><![CDATA[Ed. Prentice Hall]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
<name>
<surname><![CDATA[Kung]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Data mapping by probabilistic modular networks and information theoretic criteria]]></article-title>
<source><![CDATA[IEEE Trans. Signal processing]]></source>
<year>1998</year>
<volume>46</volume>
<page-range>3378-3397</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jordan]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Jacobs]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Hierarchical mixture of experts and the EM algorithm]]></article-title>
<source><![CDATA[IEEE Trans. Neural Computing]]></source>
<year>1994</year>
<volume>6</volume>
<page-range>181-214</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Martínez]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
<name>
<surname><![CDATA[Martínez]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational statistics Handbook whit MatLab]]></source>
<year>2002</year>
<page-range>90-124</page-range><publisher-name><![CDATA[Ed. Chapman & HallNew York]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dalla Jonson]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
</person-group>
<source><![CDATA[Métodos multivariados aplicados al análisis de datos]]></source>
<year>2000</year>
<page-range>53-86</page-range><publisher-loc><![CDATA[México ]]></publisher-loc>
<publisher-name><![CDATA[Thompson]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mclachlan]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Peel]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Finite mixture models]]></source>
<year>2000</year>
<page-range>46-72</page-range><publisher-loc><![CDATA[Montreal ]]></publisher-loc>
<publisher-name><![CDATA[Ed. Wiley and Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Figueiredo]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Unsupervised Learning of Finite Mixture Models]]></article-title>
<source><![CDATA[IEEE Trans. Pattern analysis and Machine intelligence]]></source>
<year>2002</year>
<volume>24</volume>
<page-range>135-143</page-range></nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dagenhart]]></surname>
<given-names><![CDATA[J. B]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The 40-Ground-Fault Phenomenon]]></article-title>
<source><![CDATA[IEEE Trans. on Industry Applications]]></source>
<year>2000</year>
<volume>36</volume>
<page-range>30-32</page-range></nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Choi]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Kang]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Jin]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Ahn]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Yoon]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An intelligent and efficient fault location and diagnosis scheme for radial distribution systems]]></article-title>
<source><![CDATA[IEEE Trans. Power Delivery]]></source>
<year>2004</year>
<volume>19</volume>
<page-range>524-532</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
