<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>0120-338X</journal-id>
<journal-title><![CDATA[Forma y Función]]></journal-title>
<abbrev-journal-title><![CDATA[Forma funcion, Santaf, de Bogot, D.C.]]></abbrev-journal-title>
<issn>0120-338X</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional de Colombia.]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S0120-338X2021000100006</article-id>
<article-id pub-id-type="doi">10.15446/fyf.v34n1.80581</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Una propuesta de herramientas informáticas para el tratamiento estadístico del índice de disponibilidad léxica en estudios correlacionales de educación y movilidad social]]></article-title>
<article-title xml:lang="en"><![CDATA[A Proposal of Computer Tools for the Statistical Treatment of the Lexical Availability Index in Education and Social Mobility Correlation Studies]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Reyes Valdés]]></surname>
<given-names><![CDATA[Dalia]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Reyes Valdés]]></surname>
<given-names><![CDATA[José R.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Flores Treviño]]></surname>
<given-names><![CDATA[María Eugenia]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ojeda Castañeda]]></surname>
<given-names><![CDATA[Rina B.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Universidad Autónoma de Nuevo León  ]]></institution>
<addr-line><![CDATA[Nuevo León ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Universidad Autónoma de Coahuila  ]]></institution>
<addr-line><![CDATA[Saltillo ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Universidad Autónoma de Nuevo León  ]]></institution>
<addr-line><![CDATA[Nuevo León ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af4">
<institution><![CDATA[,Universidad Autónoma de Coahuila  ]]></institution>
<addr-line><![CDATA[Saltillo ]]></addr-line>
<country>Mexico</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2021</year>
</pub-date>
<volume>34</volume>
<numero>1</numero>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_arttext&amp;pid=S0120-338X2021000100006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_abstract&amp;pid=S0120-338X2021000100006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_pdf&amp;pid=S0120-338X2021000100006&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[Resumen El trabajo plantea una propuesta para el procesamiento de índices de disponibilidad léxica (ámbito educativo) mediante herramientas estadísticas más eficientes que las usadas en la última década. El planteamiento asociado deriva una ruta de ejercicio interdisciplinar entre lingüistas y estadísticos que capitaliza los corpus lingüísticos, tratándolos como estudios correlacionales dentro del marco de la minería de datos. Se muestran resultados iniciales de la fase cuantitativa del estudio La escuela secundaria como reguladora de los factores discursivos correlativos entre disponibilidad léxica y movilidad social, procesados con metodología de lingüística de corpus y software estadístico libre, correlacionando, con alta eficiencia, índices de disponibilidad léxica y perspectiva de movilidad social en alumnos de secundaria en Saltillo, Coahuila, México, y como un corpus viable de ser correlacionado con bases de datos parciales o censuales para la toma de decisiones en el aula y la política pública por la posibilidad de correlación entre bases de datos.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract The work presents a proposal for the processing of lexical availability indexes (educational field) through the employment of more efficient statistical tools than the ones used in the last decade. The associated approach derives from an interdisciplinary exercise route between linguists and statisticians that capitalizes on a linguistic corpus, treating them as correlational studies within the framework of data mining. Initial results of the quantitative phase of the study The high school as a regulator of the correlative discursive factors between lexical availability and social mobility are shown, processed with corpus linguistics methodology and free statistical software, correlating, with high efficiency, indices of lexical availability, and perspective of social mobility in high school students in Saltillo, Coahuila, Mexico, and as a viable corpus to be correlated with partial or census databases for decision-making in the classroom and public policy due to the possibility of a correlation between databases.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[educación]]></kwd>
<kwd lng="es"><![CDATA[estudios correlacionales]]></kwd>
<kwd lng="es"><![CDATA[herramientas tecnológicas]]></kwd>
<kwd lng="es"><![CDATA[lingüística de corpus]]></kwd>
<kwd lng="es"><![CDATA[minería de datos]]></kwd>
<kwd lng="en"><![CDATA[corpus linguistics]]></kwd>
<kwd lng="en"><![CDATA[correlational studies]]></kwd>
<kwd lng="en"><![CDATA[data mining]]></kwd>
<kwd lng="en"><![CDATA[education]]></kwd>
<kwd lng="en"><![CDATA[technological tools]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Adler]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[R in Nutshell]]></source>
<year>2009</year>
<publisher-name><![CDATA[O´Reilly]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Arriaga]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Memorias del Encuentro sobre problemas para la enseñanza del español]]></article-title>
<source><![CDATA[Educación e involución de la complejidad lingüística]]></source>
<year>2003</year>
<page-range>33-46</page-range><publisher-name><![CDATA[UAZ]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ayres]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Information, Entropy, and Progress]]></source>
<year>1994</year>
<publisher-name><![CDATA[AIP Press American Institute of Physics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bauman]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<source><![CDATA[Tiempos líquidos]]></source>
<year>2005</year>
<publisher-name><![CDATA[Tusquets]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Baayen]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Analyzing Linguistic Data]]></source>
<year>2008</year>
<publisher-name><![CDATA[Cambridge University Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bolaños]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[La Lingüística de Corpus, perspectivas para la investigación lingüistica contemporánea]]></article-title>
<source><![CDATA[Forma y Función]]></source>
<year>2015</year>
<volume>28</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>31-54</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bourdieu]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Capital cultural, escuela y espacio social]]></source>
<year>2011</year>
<publisher-name><![CDATA[Siglo XXI]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="book">
<collab>Conapo.</collab>
<source><![CDATA[Informe sobre el consumo de drogas en México y su atención integral]]></source>
<year>2019</year>
<publisher-name><![CDATA[Conadic]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cortez]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Una aplicación de la disponibilidad léxica]]></source>
<year>2016</year>
<publisher-name><![CDATA[UAZ]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Du Bois]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[MySQL]]></source>
<year>2009</year>
<publisher-name><![CDATA[Addison-Wesley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Echeverría]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Parada]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[DispoLex. Programa de cómputo]]></source>
<year>1990</year>
<publisher-name><![CDATA[Universidad de la Concepción]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Graser]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning QGIS 2.0]]></source>
<year>2013</year>
<publisher-name><![CDATA[Packt Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Goffman]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[La presentación de la persona en la vida cotidiana]]></source>
<year>1997</year>
<publisher-name><![CDATA[Amorrotu]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="book">
<collab>INEGI.</collab>
<source><![CDATA[Censo 2010]]></source>
<year>2010</year>
<publisher-name><![CDATA[Instituto Nacional de Estadística, Geografía e Informática]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Janert]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data Analysis with Open Source Tools]]></source>
<year>2010</year>
<publisher-name><![CDATA[O'Reilly]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kelleher]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Tierney]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data science]]></source>
<year>2018</year>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Konishi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Kitagawa]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Information Criteria and Statitical Modeling]]></source>
<year>2008</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sentiment Analysis. Mining Opinions, Sentiment, and Emotions]]></source>
<year>2015</year>
<publisher-name><![CDATA[Cambridge University Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[López]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Los estudios de disponibilidad léxica, pasado y presente]]></article-title>
<source><![CDATA[Boletín de Filología]]></source>
<year>1996</year>
<volume>35</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>245-59</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[López]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[¿Qué te viene a la memoria?]]></source>
<year>2003</year>
<publisher-name><![CDATA[UAZ]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[López]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Estudio de disponibilidad léxica en 43 estudiantes de ELE]]></source>
<year>2008</year>
<publisher-name><![CDATA[Universidad de Nebrija]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Michea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Mots fréquents et mots disponibles. Un aspecto Nouveau de la statistique du language]]></article-title>
<source><![CDATA[Les Langues Modernes]]></source>
<year>1953</year>
<volume>47</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>338-44</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moreno]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Cálculo de disponibilidad léxica. El programa LexiDisp]]></article-title>
<source><![CDATA[Lingüística]]></source>
<year>1995</year>
<volume>51</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>243-50</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pacheco]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Incidencia de la variable «sexo» en la disponibilidad léxica de estudiantes preuniversitarios en Pinar del Río, Cuba]]></article-title>
<source><![CDATA[Íkala]]></source>
<year>2016</year>
<volume>22</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>237-53</page-range></nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pérez]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Análisis del léxico disponible del centro de interés del insulto en estudiantes de secundaria de San Luis Potosí, México]]></article-title>
<source><![CDATA[Revista de Filología y Lingüística de la Universidad de Costa Rica]]></source>
<year>2020</year>
<volume>46</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>261-78</page-range></nlm-citation>
</ref>
<ref id="B26">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Perkins]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Python 3 Text Processing with NLTK3 Cookbook]]></source>
<year>2014</year>
<publisher-name><![CDATA[Packt Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B27">
<nlm-citation citation-type="book">
<collab>R Core Team.</collab>
<source><![CDATA[Language and Enviroment for Statistical Computing]]></source>
<year>2018</year>
<publisher-name><![CDATA[R Foundation for Statistical Computing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B28">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Reyes]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Flores]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[La movilidad social en los implícitos discursivos de estudiantes de secundaria en México. De la escuela pública a la privada]]></article-title>
<source><![CDATA[Oxímora]]></source>
<year>2018</year>
<volume>13</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>58-80</page-range></nlm-citation>
</ref>
<ref id="B29">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rojas]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Metodología de análisis de disponibilidad léxica en alumnos de Pedagogía a través de la comparación jerárquica de lexicones]]></article-title>
<source><![CDATA[Formación universitaria]]></source>
<year>2017</year>
<volume>10</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>3-14</page-range></nlm-citation>
</ref>
<ref id="B30">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Russell]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mining the Social Web]]></source>
<year>2014</year>
<publisher-name><![CDATA[O'Reilly]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B31">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shmueli]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data Mining for Bussines Analytics. Concepts, Techniques and Applications in R]]></source>
<year>2018</year>
<publisher-name><![CDATA[Wiley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B32">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Silge]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Robinson]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Text Mining with R]]></source>
<year>2017</year>
<publisher-name><![CDATA[O'Reilly]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B33">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Torgo]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mining with R]]></source>
<year>2011</year>
<publisher-name><![CDATA[Chapman and Hall/CRC]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B34">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Urbizagástegui]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Restrepo]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[La ley de Zipf y el punto de transición de Goffman en la indización automática]]></article-title>
<source><![CDATA[Revista de Investigación Bibliotecnológica]]></source>
<year>2011</year>
<volume>25</volume>
<numero>54</numero>
<issue>54</issue>
<page-range>25-32</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
