<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>0123-921X</journal-id>
<journal-title><![CDATA[Tecnura]]></journal-title>
<abbrev-journal-title><![CDATA[Tecnura]]></abbrev-journal-title>
<issn>0123-921X</issn>
<publisher>
<publisher-name><![CDATA[Universidad Distrital Francisco José de Caldas]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S0123-921X2023000100014</article-id>
<article-id pub-id-type="doi">10.14483/22487638.18268</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Revisión y perspectivas para la construcción de bases de datos robustas con datos faltantes: caso aplicado a información financiera]]></article-title>
<article-title xml:lang="en"><![CDATA[Review and perspectives for the construction of robust databases with missing data: case applied to financial information]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Romero Duque]]></surname>
<given-names><![CDATA[Gustavo Andrés]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[González Prieto]]></surname>
<given-names><![CDATA[Cristian Andrés]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Díaz Barriosnuevos]]></surname>
<given-names><![CDATA[María Angélica]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Rueda Menjura]]></surname>
<given-names><![CDATA[Nataly Alejandra]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Fundación Universitaria Los Libertadores  ]]></institution>
<addr-line><![CDATA[Bogotá ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Fundación Universitaria Los Libertadores  ]]></institution>
<addr-line><![CDATA[Bogotá ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Fundación Universitaria Los Libertadores  ]]></institution>
<addr-line><![CDATA[Bogotá ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af4">
<institution><![CDATA[,Fundación Universitaria Los Libertadores  ]]></institution>
<addr-line><![CDATA[Bogotá ]]></addr-line>
<country>Colombia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>03</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>03</month>
<year>2023</year>
</pub-date>
<volume>27</volume>
<numero>75</numero>
<fpage>14</fpage>
<lpage>37</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_arttext&amp;pid=S0123-921X2023000100014&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_abstract&amp;pid=S0123-921X2023000100014&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_pdf&amp;pid=S0123-921X2023000100014&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[Resumen  Contexto: Se propone un conjunto de opciones que ayudan a determinar el método más adecuado para subsanar en bases de datos de tamaño apreciable, condiciones iniciales de datos faltantes y que serán utilizadas en procesos de investigación.  Metodología: El presente artículo aborda una propuesta para el desarrollo y manejo de bases de datos robustas como el caso de registros financieros, enfocándose desde el proceso knowledge discovery in databases (KDD).  Resultados: Se desarrolla y prueba una metodología utilizando tres técnicas de imputación en una base de datos construida a partir de 1 253 280 registros financieros de 2238 empresas y que representan siete años de su actividad económica en la localidad de Chapinero, en la ciudad de Bogotá D. C.  Conclusiones: Se realiza un comparativo de los métodos de imputación como factor determinante para la elección del método de imputación y consolidación de la base para su posterior uso.  Financiamiento: Fundación Universitaria Los Libertadores.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[ABSTRACT  Context:  A set of options is proposed to help determine the most appropriate method to correct in databases of appreciable size, initial conditions of missing data and that will be used in research processes.  Methodology: This article addresses a proposal for the development and management of robust databases such as financial records, focusing from the Knowledge Discovery in Data bases (KDD) process.  Results: A methodology is developed and tested using three imputation techniques in a database built from 1,253,280 financial records of 2,238 companies that represent seven years of their economic activity in the town of Chapinero in the city of Bogotá D.C.  Conclusions: A comparison of the imputation methods is carried out as a determining factor for the choice of the imputation method and consolidation of the base for later use.  Financing: Fundación universitaria Los Libertadores]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[base de datos]]></kwd>
<kwd lng="es"><![CDATA[métodos de imputación]]></kwd>
<kwd lng="es"><![CDATA[KDD]]></kwd>
<kwd lng="es"><![CDATA[valores faltantes]]></kwd>
<kwd lng="en"><![CDATA[database]]></kwd>
<kwd lng="en"><![CDATA[imputation methods]]></kwd>
<kwd lng="en"><![CDATA[KDD]]></kwd>
<kwd lng="en"><![CDATA[missing values]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="">
<collab>Alcaldía de Bogotá</collab>
<source><![CDATA[Infraestructura de datos espaciales para el distrito capital]]></source>
<year>2021</year>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Allison]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
</person-group>
<source><![CDATA[Missing data]]></source>
<year>2002</year>
<publisher-name><![CDATA[Sage]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Altman]]></surname>
<given-names><![CDATA[D. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Bland]]></surname>
<given-names><![CDATA[J. M]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Missing data]]></article-title>
<source><![CDATA[British Medical Journal]]></source>
<year>2007</year>
<volume>334</volume>
<numero>7590</numero>
<issue>7590</issue>
<page-range>424</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Benítez]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Álvarez]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Reconstrucción de series temporales en ciencias ambientales]]></article-title>
<source><![CDATA[Revista Latinoamericana de Recursos Naturales]]></source>
<year>2008</year>
<volume>4</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>326-35</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Booth]]></surname>
<given-names><![CDATA[B. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Keijsers]]></surname>
<given-names><![CDATA[N. L. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Sijbers]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Huysmans]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[An assessment of the information lost when applying data reduction techniques to dynamic plantar pressure measurements]]></article-title>
<source><![CDATA[Journal of Biomechanics]]></source>
<year>2019</year>
<numero>87</numero>
<issue>87</issue>
<page-range>161-6</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brintha Rajakumari]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Nalini]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[An efficient data mining dataset preparation using aggregation in relational database]]></article-title>
<source><![CDATA[Indian Journal of Science and Technology]]></source>
<year>2014</year>
<numero>7</numero>
<issue>7</issue>
<page-range>44-6</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cañizares]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Barroso]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Alfonso]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Datos incompletos: una mirada crítica para su manejo en estudios sanitarios]]></article-title>
<source><![CDATA[Gaceta Sanitaria]]></source>
<year>2003</year>
<volume>18</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>58-63</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carpenter]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Kenward]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Multiple imputation and its application]]></source>
<year>2013</year>
<publisher-name><![CDATA[Wiley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dagnino]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Bioestadística y epidemiología. Datos faltantes (missing values)]]></article-title>
<source><![CDATA[Revista Chilena de Anestesia]]></source>
<year>2014</year>
<volume>43</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>332-4</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="">
<collab>Departamento Nacional de Estadística (DANE)</collab>
<source><![CDATA[Estadísticas por tema]]></source>
<year>2020</year>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Detours]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Dumont]]></surname>
<given-names><![CDATA[J. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Bersini]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Maenhaut]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Integration and cross-validation of high-throughput gene expression data: Comparing heterogeneous data sets]]></article-title>
<source><![CDATA[FEBS Letters]]></source>
<year>2003</year>
<volume>546</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>98-102</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dong]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Peng]]></surname>
<given-names><![CDATA[C. Y. J]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Principled missing data methods for researchers]]></article-title>
<source><![CDATA[Springer Plus]]></source>
<year>2013</year>
<volume>2</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-17</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Enders]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<source><![CDATA[Applied missing data analysis]]></source>
<year>2010</year>
<publisher-name><![CDATA[Guilford Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[García Reinoso]]></surname>
<given-names><![CDATA[P. L]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Imputación de datos en series de precipitación diaria caso de estudio cuenca del río Quindío]]></article-title>
<source><![CDATA[Ingeniare]]></source>
<year>2015</year>
<numero>5</numero>
<issue>5</issue>
<page-range>73-86</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ge]]></surname>
<given-names><![CDATA[Z]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Process data analytics via probabilistic latent variable models: A tutorial review]]></article-title>
<source><![CDATA[Industrial and Engineering Chemistry Research]]></source>
<year>2018</year>
<volume>57</volume>
<numero>38</numero>
<issue>38</issue>
<page-range>12646-61</page-range></nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ge]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Song]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Non-gaussian process monitoring]]></article-title>
<source><![CDATA[Multivariate statistical process control process monitoring methods and applications]]></source>
<year>2013</year>
<page-range>13-27</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Geng]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Factorization of posteriors and partial imputation algorithm for graphical models with missing data]]></article-title>
<source><![CDATA[Statistics and Probability Letters]]></source>
<year>2003</year>
<numero>64</numero>
<issue>64</issue>
<page-range>369-79</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Giraldo]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[León]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Gómez]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Caracterización de flujos de datos usando algoritmos de agrupamiento]]></article-title>
<source><![CDATA[Tecnura]]></source>
<year>2013</year>
<volume>17</volume>
<numero>37</numero>
<issue>37</issue>
<page-range>153-66</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gleason]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Staelin]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A proposal for handling missing data]]></article-title>
<source><![CDATA[Psychometrika]]></source>
<year>1975</year>
<volume>40</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>229-52</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Graham]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<source><![CDATA[Missing data: Analysis and design]]></source>
<year>2012</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hemel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Van der Voet]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Hindriks]]></surname>
<given-names><![CDATA[F. R.]]></given-names>
</name>
<name>
<surname><![CDATA[Van der Slik]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Stepwise deletion: A technique for missing data handling in multivariate analysis]]></article-title>
<source><![CDATA[Analytical Chemical Acta]]></source>
<year>1987</year>
<numero>193</numero>
<issue>193</issue>
<page-range>255-68</page-range></nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Herrera]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Campos]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Carrillo]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Estimación de datos faltantes de precipitación por el método de regresión lineal: caso de estudio Cuenca Guadalupe, Baja California, México]]></article-title>
<source><![CDATA[Redalyc]]></source>
<year>2017</year>
<volume>25</volume>
<numero>71</numero>
<issue>71</issue>
<page-range>34-44</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Imtiaz]]></surname>
<given-names><![CDATA[S. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Shah]]></surname>
<given-names><![CDATA[S. L]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Treatment of missing values in process data analysis]]></article-title>
<source><![CDATA[Canadian Journal of Chemical Engineering]]></source>
<year>2008</year>
<volume>86</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>838-58</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ingsrisawang]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Potawee]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Multiple imputation for missing data in repeated measurements using MCMC and Copulas]]></source>
<year>2012</year>
<conf-name><![CDATA[ Proceedings of the Internacional Multiconference of Engineers and Computer Scientists, II]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-5</page-range></nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jarrett]]></surname>
<given-names><![CDATA[R. G]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The analysis of designed experiments with missing observations]]></article-title>
<source><![CDATA[Journal of the Royal Statistical Society. Series C (Applied Statistics)]]></source>
<year>1978</year>
<volume>27</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>38-46</page-range></nlm-citation>
</ref>
<ref id="B26">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jelicic]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Phelps]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Lerner]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Use of missing data methods in longitudinal studies: The persistence of bad practices in developmental psychology]]></article-title>
<source><![CDATA[Developmental Psychology]]></source>
<year>2009</year>
<volume>45</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>1195-9</page-range></nlm-citation>
</ref>
<ref id="B27">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kadlec]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Gabrys]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Strandt]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Data-driven soft sensors in the process industry]]></article-title>
<source><![CDATA[Computers and Chemical Engineering]]></source>
<year>2009</year>
<volume>33</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>795-814</page-range></nlm-citation>
</ref>
<ref id="B28">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kalton]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Kasprzyk]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Imputing for Missing Survey Responses]]></source>
<year>1982</year>
<publisher-name><![CDATA[American Statistical Association. Proceeding of the Section on Survey Research Methods]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B29">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Choi]]></surname>
<given-names><![CDATA[B. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Hong]]></surname>
<given-names><![CDATA[E. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[S. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A taxonomy of dirty data]]></article-title>
<source><![CDATA[Data Mining and Knowledge Discovery]]></source>
<year>2003</year>
<volume>7</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>81-99</page-range></nlm-citation>
</ref>
<ref id="B30">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kodamana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Huang]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Ranjan]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhao]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Tan]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Sammaknejad]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Approaches to robust process identification: A review and tutorial of probabilistic methods]]></article-title>
<source><![CDATA[Journal of Process Control]]></source>
<year>2018</year>
<numero>66</numero>
<issue>66</issue>
<page-range>68-83</page-range></nlm-citation>
</ref>
<ref id="B31">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koikkalainen]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
</person-group>
<source><![CDATA[Neural network for editing and imputation]]></source>
<year>2002</year>
<publisher-name><![CDATA[University of Jyvâskylâ]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B32">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[T. Y]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Attribute transformations for data mining I: Theoretical explorations]]></article-title>
<source><![CDATA[International Journal of Intelligent Systems]]></source>
<year>2002</year>
<volume>17</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>213-22</page-range></nlm-citation>
</ref>
<ref id="B33">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Little]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Statistical analysis with missing data. Series in Probability and Mathematical Statistics]]></source>
<year>1987</year>
<publisher-name><![CDATA[John Wiley &amp; Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B34">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Little]]></surname>
<given-names><![CDATA[R. J. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D. B]]></given-names>
</name>
</person-group>
<source><![CDATA[Statistical analysis with missing data]]></source>
<year>2002</year>
<publisher-name><![CDATA[Wiley &amp; Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B35">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Little]]></surname>
<given-names><![CDATA[R. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Statistical analysis with missing data]]></source>
<year>2019</year>
<publisher-name><![CDATA[John Wiley &amp; Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B36">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Zou]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Xia]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Pang]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Spatial imputation for air pollutants data sets via low rank matrix completion algorithm]]></article-title>
<source><![CDATA[Environment International]]></source>
<year>2020</year>
<numero>139</numero>
<issue>139</issue>
<page-range>105713</page-range></nlm-citation>
</ref>
<ref id="B37">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Manterola]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Otzen]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Por qué investigar y cómo conducir una investigación]]></article-title>
<source><![CDATA[International Journal of Morphology]]></source>
<year>2013</year>
<volume>31</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>1498-504</page-range></nlm-citation>
</ref>
<ref id="B38">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Medina]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Galván]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Imputación de datos: teoría y práctica. Serie &#8220;Estudios estadísticos y prospectivos&#8221;]]></source>
<year>2007</year>
<publisher-name><![CDATA[Comisión Económica para América Latina y el Caribe (Cepal)]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B39">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mesa]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Tsai]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Chambers]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<source><![CDATA[Using tree-based models for missing data imputation: An evaluation using Uk Census Data. Reporte técnico]]></source>
<year>2000</year>
<publisher-name><![CDATA[Universidad de Southampton]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B40">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moncada-Hernández]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Cómo realizar una búsqueda de información eficiente. Foco en estudiantes, profesores e investigadores en el área educativa]]></article-title>
<source><![CDATA[Investigación en Educación Médica]]></source>
<year>2014</year>
<volume>3</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>106-15</page-range></nlm-citation>
</ref>
<ref id="B41">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Olinsky]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Harlow]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The comparative efficacy of imputation methods for missing data in structural equation modeling]]></article-title>
<source><![CDATA[European Journal of Operational Research]]></source>
<year>2003</year>
<volume>151</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>53-79</page-range></nlm-citation>
</ref>
<ref id="B42">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Peugh]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Enders]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Missing data in educational research: A review of reporting practices and suggestions for improvement]]></article-title>
<source><![CDATA[Review of Educational Research]]></source>
<year>2004</year>
<numero>74</numero>
<issue>74</issue>
<page-range>525e556</page-range></nlm-citation>
</ref>
<ref id="B43">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Puerta Goicoechea]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Imputación basada en árboles de clasificación]]></source>
<year>2002</year>
<publisher-name><![CDATA[Eustat]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B44">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Timaran]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Yépez]]></surname>
<given-names><![CDATA[M. C]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[La minería de datos aplicada al descubrimiento de patrones de supervivencia en mujeres con cáncer invasivo de cuello uterino]]></article-title>
<source><![CDATA[Universidad y Salud]]></source>
<year>2012</year>
<volume>14</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>117-29</page-range></nlm-citation>
</ref>
<ref id="B45">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D.B]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Inference and missing data]]></article-title>
<source><![CDATA[Biometrika]]></source>
<year>1976</year>
<numero>63</numero>
<issue>63</issue>
<page-range>581-92</page-range></nlm-citation>
</ref>
<ref id="B46">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D. B]]></given-names>
</name>
</person-group>
<source><![CDATA[Multiple imputation for nonresponse in surveys]]></source>
<year>2004</year>
<publisher-name><![CDATA[John Wiley &amp; Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B47">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sande]]></surname>
<given-names><![CDATA[I. G]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Imputation in Surveys: Coping with reality]]></article-title>
<source><![CDATA[The American Statistician]]></source>
<year>1982</year>
<volume>36</volume>
<numero>3a</numero>
<issue>3a</issue>
<page-range>145-52</page-range></nlm-citation>
</ref>
<ref id="B48">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schafer]]></surname>
<given-names><![CDATA[J. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Graham]]></surname>
<given-names><![CDATA[J. W]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Missing data: Our view of the state of the art]]></article-title>
<source><![CDATA[Psychological Methods]]></source>
<year>2002</year>
<volume>7</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>147-77</page-range></nlm-citation>
</ref>
<ref id="B49">
<nlm-citation citation-type="">
<collab>Superintendencia de Sociedades</collab>
<source><![CDATA[Asuntos económicos y societarios]]></source>
<year>2020</year>
</nlm-citation>
</ref>
<ref id="B50">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Timarán-Pereira]]></surname>
<given-names><![CDATA[S. R.]]></given-names>
</name>
<name>
<surname><![CDATA[Hernández-Arteaga]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Caicedo-Zambrano]]></surname>
<given-names><![CDATA[S. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Hidalgo-Troya]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Alvarado-Pérez]]></surname>
<given-names><![CDATA[J. C]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[El proceso de descubrimiento de conocimiento en bases de datos]]></article-title>
<source><![CDATA[Ingenierías]]></source>
<year>2016</year>
<volume>8</volume>
<numero>26</numero>
<issue>26</issue>
<page-range>63-86</page-range></nlm-citation>
</ref>
<ref id="B51">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Todeschini]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Weighted k-nearest neighbour method for the calculation of missing values]]></article-title>
<source><![CDATA[Chenometrics and Intelligent Laboratory Systems]]></source>
<year>1990</year>
<numero>9</numero>
<issue>9</issue>
<page-range>201-5</page-range></nlm-citation>
</ref>
<ref id="B52">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Torres]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Paz]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Salazar]]></surname>
<given-names><![CDATA[F. G]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Métodos de recolección de datos para una investigación]]></article-title>
<source><![CDATA[Boletín electrónico]]></source>
<year>2014</year>
<numero>3</numero>
<issue>3</issue>
<page-range>1-21</page-range></nlm-citation>
</ref>
<ref id="B53">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Useche]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Mesa]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Una introducción a la imputación de valores perdidos]]></article-title>
<source><![CDATA[Terra Nueva Etapa]]></source>
<year>2006</year>
<volume>12</volume>
<numero>31</numero>
<issue>31</issue>
<page-range>127-51</page-range></nlm-citation>
</ref>
<ref id="B54">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Van Buuren]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Brand]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Groothuis-Oudshoorn]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Rubin]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Fully conditional specification in multivariate imputation]]></article-title>
<source><![CDATA[Journal of Statistical Computation and Simulation]]></source>
<year>2006</year>
<numero>76</numero>
<issue>76</issue>
<page-range>1049e1064</page-range></nlm-citation>
</ref>
<ref id="B55">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vásquez]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Aportación al análisis biplot: un enfoque algebraico]]></source>
<year>1995</year>
<publisher-name><![CDATA[Universidad de Salamanca]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B56">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wilks]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Moments and distributions of estimates of population parameters from fragmentary simple]]></article-title>
<source><![CDATA[Annals of Mathematical Statistics]]></source>
<year>1932</year>
<page-range>163-95</page-range></nlm-citation>
</ref>
<ref id="B57">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Witten]]></surname>
<given-names><![CDATA[I. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Frank]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Hall]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pal]]></surname>
<given-names><![CDATA[C. J]]></given-names>
</name>
</person-group>
<source><![CDATA[Data mining: Practical machine learning tools and techniques]]></source>
<year>2016</year>
<edition>4</edition>
<publisher-name><![CDATA[Morgan Kaufmann]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B58">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wood]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[White]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Thompson]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Are missing outcome data adequately handled? A review of published randomized controlled trials in major medical journals]]></article-title>
<source><![CDATA[Clinical Trials]]></source>
<year>2004</year>
<numero>1</numero>
<issue>1</issue>
<page-range>368e376</page-range></nlm-citation>
</ref>
<ref id="B59">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Xu]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Lu]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Baldea]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Edgar]]></surname>
<given-names><![CDATA[T. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Wojsznis]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Blevins]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nixon]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Data cleaning in the process industries]]></article-title>
<source><![CDATA[Reviews in Chemical Engineering]]></source>
<year>2015</year>
<volume>31</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>453-90</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
