<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>0120-6230</journal-id>
<journal-title><![CDATA[Revista Facultad de Ingeniería Universidad de Antioquia]]></journal-title>
<abbrev-journal-title><![CDATA[Rev.fac.ing.univ. Antioquia]]></abbrev-journal-title>
<issn>0120-6230</issn>
<publisher>
<publisher-name><![CDATA[Facultad de Ingeniería, Universidad de Antioquia]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S0120-62302016000200003</article-id>
<article-id pub-id-type="doi">10.17533/udea.redin.n79a03</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Applicability of semi-supervised learning assumptions for gene ontology terms prediction]]></article-title>
<article-title xml:lang="es"><![CDATA[Aplicabilidad de las suposiciones del aprendizaje semi-supervisado para la predicción de términos de la ontología genética]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Jaramillo-Garzón]]></surname>
<given-names><![CDATA[Jorge Alberto]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
<xref ref-type="aff" rid="A02"/>
<xref ref-type="aff" rid="A04"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Castellanos-Domínguez]]></surname>
<given-names><![CDATA[César Germán]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Perera-Lluna]]></surname>
<given-names><![CDATA[Alexandre]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Universidad Nacional de Colombia Facultad de Ingeniería y Arquitectura ]]></institution>
<addr-line><![CDATA[Manizales ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Instituto Tecnológico Metropolitano Facultad de Ingenierías ]]></institution>
<addr-line><![CDATA[Medellín ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="A03">
<institution><![CDATA[,Universidad Politécnica de Cataluña  ]]></institution>
<addr-line><![CDATA[Barcelona ]]></addr-line>
<country>España</country>
</aff>
<aff id="A04">
<institution><![CDATA[,Universidad Nacional de Colombia  ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2016</year>
</pub-date>
<numero>79</numero>
<fpage>19</fpage>
<lpage>32</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_arttext&amp;pid=S0120-62302016000200003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_abstract&amp;pid=S0120-62302016000200003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.co/scielo.php?script=sci_pdf&amp;pid=S0120-62302016000200003&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Gene Ontology (GO) is one of the most important resources in bioinformatics, aiming to provide a unified framework for the biological annotation of genes and proteins across all species. Predicting GO terms is an essential task for bioinformatics, but the number of available labelled proteins is in several cases insufficient for training reliable machine learning classifiers. Semi-supervised learning methods arise as a powerful solution that explodes the information contained in unlabelled data in order to improve the estimations of traditional supervised approaches. However, semi-supervised learning methods have to make strong assumptions about the nature of the training data and thus, the performance of the predictor is highly dependent on these assumptions. This paper presents an analysis of the applicability of semi-supervised learning assumptions over the specific task of GO terms prediction, focused on providing judgment elements that allow choosing the most suitable tools for specific GO terms. The results show that semi-supervised approaches significantly outperform the traditional supervised methods and that the highest performances are reached when applying the cluster assumption. Besides, it is experimentally demonstrated that cluster and manifold assumptions are complementary to each other and an analysis of which GO terms can be more prone to be correctly predicted with each assumption, is provided.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[La Ontología Genética (GO) es uno de los recursos más importantes en la bioinformática, el cual busca proporcionar un marco de trabajo unificado para la anotación biológica de genes y proteínas de todas las especies. La predicción de términos GO es una tarea esencial en bioinformática, pero el número de secuencias etiquetadas que se encuentran disponibles es insuficiente en muchos casos para entrenar sistemas confiables de aprendizaje de máquina. El aprendizaje semi-supervisado aparece entonces como una poderosa solución que explota la información contenida en los datos no etiquetados, con el fin de mejorar las estimaciones de las aplicaciones supervisadas tradicionales. Sin embargo, los métodos semi-supervisados deben hacer suposiciones fuertes sobre la naturaleza de los datos de entrenamiento y, por lo tanto, el desempeño de los predictores es altamente dependiente de estas suposiciones. En este artículo se presenta un análisis de la aplicabilidad de las diferentes suposiciones del aprendizaje semi-supervisado en la tarea específica de predicción de términos GO, con el fin de proveer elementos de juicio que permitan escoger las herramientas más adecuadas para términos GO específicos. Los resultados muestran que los métodos semi-supervisados superan significativamente a los métodos tradicionales supervisados y que los desempeños más altos son alcanzados cuando se implementa la suposición de cluster. Además se comprueba experimentalmente que las suposiciones de cluster y manifold son complementarias entre sí y se realiza un análisis de cuáles términos GO pueden ser más susceptibles de ser correctamente predichos usando cada una de éstas.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Semi-supervised learning]]></kwd>
<kwd lng="en"><![CDATA[gene ontology]]></kwd>
<kwd lng="en"><![CDATA[support vector machines]]></kwd>
<kwd lng="en"><![CDATA[protein function prediction]]></kwd>
<kwd lng="es"><![CDATA[Aprendizaje semi-supervisado]]></kwd>
<kwd lng="es"><![CDATA[ontología a genética]]></kwd>
<kwd lng="es"><![CDATA[máquinas de vectores de soporte]]></kwd>
<kwd lng="es"><![CDATA[predicción de funciones proteicas]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  <font face= "Verdana" size="2">     <p align="right"><b>ART&Iacute;CULO ORIGINAL</b></p>     <p align="right">DOI: <a href="http://dx.doi.org/10.17533/udea.redin.n79a03">10.17533/udea.redin.n79a03</a></p>      <p align="right">&nbsp;</p>     <p align="center"><font size="4"><b>Applicability of semi-supervised learning assumptions for gene ontology terms prediction</b></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font size="3"><b>Aplicabilidad   de las suposiciones del aprendizaje semi-supervisado para la predicci&oacute;n de   t&eacute;rminos de la ontolog&iacute;a gen&eacute;tica</b></font></p>     <p align="center">&nbsp;</p>     <p align="center">&nbsp;</p>     <p><i><b>Jorge Alberto Jaramillo-Garz&oacute;n<sup>1<i>,</i>2</sup>*, C&eacute;sar Germ&aacute;n   Castellanos-Dom&iacute;nguez<sup>1</sup>, Alexandre Perera-Lluna<sup>3 </sup></b></i></p>     ]]></body>
<body><![CDATA[<p><sup>1</sup>Departamento de Ingenier&iacute;a El&eacute;ctrica, Electr&oacute;nica y   Computaci&oacute;n, Facultad de Ingenier&iacute;a y Arquitectura, Universidad Nacional de   Colombia. Cra. 27 #   64-60. A. A. 127. Manizales, Colombia. </p>     <p><sup>2</sup>Grupo de Autom&aacute;tica, Electr&oacute;nica y Ciencias   Computacionales, Facultad de Ingenier&iacute;as, Instituto   Tecnol&oacute;gico Metropolitano. Calle 73 # 76A - 354 V&iacute;a al Volador. C. P. 050013.   Medell&iacute;n, Colombia. </p>     <p><sup>3</sup>Centro de Investigaci&oacute;n en Ingenier&iacute;a Biom&eacute;dica, Universidad   Polit&eacute;cnica de Catalu&ntilde;a. Edifici U.   c/ Pau Gargallo, 5. C. P. 08028. Barcelona, Espa&ntilde;a. </p>     <p>* Corresponding author: Jorge Alberto Jaramillo Garz&oacute;n, e-mail: <a href="mailto:: jajaramillog@gmail.com">jajaramillog@gmail.com </a></p>     <p>DOI: 10.17533/udea.redin.n79a03</p>     <p>&nbsp;</p>     <p align="center">(Received May 30, 2015; accepted January 26, 2016)</p>     <p align="center">&nbsp;</p>     <p align="center">&nbsp;</p> <hr noshade size="1">     <p><font size="3"><b>ABSTRACT</b></font></p>     ]]></body>
<body><![CDATA[<p>Gene Ontology (GO) is one   of the most important resources in bioinformatics, aiming to provide a unified   framework for the biological annotation of genes and proteins across all   species. Predicting GO terms is an essential task for bioinformatics, but the   number of available labelled proteins is in several cases insufficient for   training reliable machine learning classifiers. Semi-supervised learning   methods arise as a powerful solution that explodes the information contained in   unlabelled data in order to improve the estimations of traditional supervised   approaches. However, semi-supervised learning methods have to make strong   assumptions about the nature of the training data and thus, the performance of   the predictor is highly dependent on these assumptions. This paper presents an   analysis of the applicability of semi-supervised learning assumptions over the   specific task of GO terms prediction, focused on providing judgment elements   that allow choosing the most suitable tools for specific GO terms. The results   show that semi-supervised approaches significantly outperform the traditional   supervised methods and that the highest performances are reached when applying   the cluster assumption. Besides, it is experimentally demonstrated that cluster   and manifold assumptions are complementary to each other and an analysis of   which GO terms can be more prone to be correctly predicted with each   assumption, is provided.</p>     <p><i>Keywords:</i><b> </b> Semi-supervised learning, gene ontology, support vector machines, protein function prediction</p> <hr noshade size="1">     <p><font size="3"><b>RESUMEN</b></font></p>     <p>La Ontolog&iacute;a Gen&eacute;tica (GO) es uno de los recursos   m&aacute;s importantes en la bioinform&aacute;tica, el cual busca proporcionar un marco de   trabajo unificado para la anotaci&oacute;n biol&oacute;gica de genes y prote&iacute;nas de todas las   especies. La predicci&oacute;n de t&eacute;rminos GO es una tarea esencial en bioinform&aacute;tica,   pero el n&uacute;mero de secuencias etiquetadas que se encuentran disponibles es   insuficiente en muchos casos para entrenar sistemas confiables de aprendizaje   de m&aacute;quina. El aprendizaje semi-supervisado aparece entonces como una poderosa   soluci&oacute;n que explota la informaci&oacute;n contenida en los datos no etiquetados, con   el fin de mejorar las estimaciones de las aplicaciones supervisadas   tradicionales. Sin embargo, los m&eacute;todos semi-supervisados deben hacer   suposiciones fuertes sobre la naturaleza de los datos de entrenamiento y, por   lo tanto, el desempe&ntilde;o de los predictores es altamente dependiente de estas   suposiciones. En este art&iacute;culo se presenta un an&aacute;lisis de la aplicabilidad de   las diferentes suposiciones del aprendizaje semi-supervisado en la tarea   espec&iacute;fica de predicci&oacute;n de t&eacute;rminos GO, con el fin de proveer elementos de   juicio que permitan escoger las herramientas m&aacute;s adecuadas para t&eacute;rminos GO   espec&iacute;ficos. Los resultados muestran que los m&eacute;todos semi-supervisados superan   significativamente a los m&eacute;todos tradicionales supervisados y que los desempe&ntilde;os   m&aacute;s altos son alcanzados cuando se implementa la suposici&oacute;n de <i>cluster.</i> Adem&aacute;s se comprueba   experimentalmente que las suposiciones de <i>cluster</i> y <i>manifold</i> son complementarias entre   s&iacute; y se realiza un an&aacute;lisis de cu&aacute;les t&eacute;rminos GO pueden ser m&aacute;s susceptibles   de ser correctamente predichos usando cada una de &eacute;stas.</p>     <p><i>Palabras clave:</i> Aprendizaje semi-supervisado, ontolog&iacute;a a gen&eacute;tica, m&aacute;quinas de vectores de soporte, predicci&oacute;n de funciones proteicas</p> <hr noshade size="1">     <p><font size="3"><b>1. Introduction</b></font> </p>     <p>Proteins are essential for   living organisms due to the diversity of molecular functions they perform,   which are also related to processes at cellular and phenotypical levels. At   molecular level, for instance, binding proteins are capable of creating a wide   variety of structurally and chemically deferent surfaces, allowing for   recognizing other molecules and performing regulation functions; enzymes use   binding plus specific chemical reactivity for speeding up molecular reactions;   structural proteins constitute some of the main morphological components of   living organisms, building resistant structures and being sources of   biomaterials. At the cellular level, proteins perform the majority of functions   of the organelles. Structural proteins in the cytoskeleton are responsible for   maintaining the shape of the cell and keeping organelles in place; in the   endoplasmatic reticulum, binding proteins transport molecules between and   within cells; in the lysosome, catalytic proteins break large molecules into   small ones for carrying out digestion (for a deeper description of subcellular   locations of proteins, see &#91;1&#93;). Phenotypical roles of proteins are harder to   determine, since phenotype is the result of many cellular function assemblies   and their response under environmental stimuli. However, by the comparison of   genes descended from the same ancestor across many different organisms, or by   studying the effects of modifying individual genes in model organisms, several   thousands of gene products have been associated with phenotypes &#91;2&#93;.</p>     <p>The Gene Ontology (GO) project aims to cover the whole   universe of protein functions by constructing controlled and structured   vocabularies known as ontologies, and applying them in the annotation of gene   products in biological databases &#91;3&#93;. The project comprises three ontologies: <i>Molecular function</i> (biochemical   activities at the molecular level), <i>cellular   component</i> (specific sub-cellular location where a gene product is active)   and <i>biological process</i> (events at   phenotypical level to which the protein contributes). Recent methods for   predicting GO terms employ machine learning techniques trained over   physical-chemical and statistical attributes for predicting functional labels   that later can be subjected to experimental verification &#91;4&#93;. However, the   succesfullness of supervised machine learning strategies relies on the amount   and quality of a labelled set of instances needed to train the classifier.   Labelled instances are often difficult, expensive, or time consuming to obtain,   as they require the e orts of experienced human annotators. Meanwhile   unlabelled data may be relatively easy to collect, but there has been few ways   to use them &#91;5&#93;. In the particular case of protein function prediction, it is   also a known fact that only a small number of proteins have actually been   annotated for certain functions. Therefore, it is di cult to obtain sufficient   training data for the supervised learning algorithms and, consequently, the   tools for protein function prediction have very limited scopes &#91;6&#93;. Besides, it   is particularly hard to find the representative negative samples because the   available information in the annotation databases, such as GO &#91;3&#93;, only   provides information about which protein belongs to which functional class but   there is no certainty about which protein does not belong to the class &#91;7&#93;.   Under such circumstances, semi-supervised learning methods provide an   alternative approach to protein annotation &#91;6&#93;. Semi-supervised learning (SSL)   is halfway between supervised and unsupervised learning: in addition to   labelled data, the algorithm is provided with an amount of unlabelled samples   that can be used to improve the estimations.</p>     <p>One significant difference between supervised and   semi-supervised methods is that, unlike supervised learning, in which a good   generic learning algorithm can perform well on a lot of real-world data sets   without specific domain knowledge, in semi-supervised learning it is commonly   accepted that there is no ''black box'' solution and a good understanding of the nature   of the data is required to achieve successful performance &#91;8&#93;. There are   several different semi-supervised learning methods and all of them have to make   strong assumptions about the relation of the probability of the feature space   and the joint probability of the feature space and the label set. These methods   include generative models, graph-based models, semi-supervised support vector   machines, and soon &#91;9&#93;.</p>     <p>A few semi-supervised methods have been applied for   both gene function prediction (over the DNA sequence) and protein function   prediction (over the amino acids sequence). &#91;10&#93; used a S<sup>3</sup>VMs for   promoter recognition, improving predictive performance by 55% over the standard inductive   SVM results. &#91;11&#93; used a ''co-updating'' schema of two SVMs, each one trained   over a different source of data, for discriminating among five functional   classes in the yeast genome. For the problem of predicting the functional   properties of proteins, &#91;12&#93; conducted an extensive study on the caveats of   incorporating semi-supervised learning and transduction for predicting various   functional properties of proteins corresponding to genes in the yeast genome,   founding that S<sup>3</sup>VMs significantly decrease performance compared to   inductive SVMs. &#91;13&#93; used graph-based semi-supervised learning for functional   class prediction of yeast proteins, using protein interaction networks for   obtaining the graphs. </p>     ]]></body>
<body><![CDATA[<p>More recently, &#91;14&#93; proposes a generative   semi-supervised method for protein functional classification and provide   experimental results of classifying a set of eukaryotic proteins into seven   subcellular locations from the Cellular Component ontology of GO. &#91;6&#93; proposed   a new algorithm to de ne the negative samples in protein function prediction.   In detail, the one-class SVMs and two-class SVMs are used as the core learning   algorithm in order to identify the representative negative samples so that the   positive samples hidden in the unlabelled data can be recovered. &#91;15&#93; proposes   a method for integrating multiple graphs within a framework of semi-supervised   learning and applies the method to the task of protein functional class   prediction in yeast. The proposed method performs significantly better than the   same algorithm trained on any single graph. </p>     <p>In &#91;16&#93;, we presented the prediction of protein   sub-cellular localizations with a semi-supervised support vector machine over a database of   over 108 <i>Embryophiyta</i> plants, showing that semi-supervised learning significantly outperforms   the supervised learning approach in several cases. However, since only one   semi-supervised assumption was employed, those results could be subjected to   further improvement when several assumptions are considered. Moreover, our   previous work only considered the molecular function ontology and, if the other   two ontologies are included, the high diversity of data may need diverse tools   to be accurately classified. </p>     <p>The present work expands our previous results, presenting an analysis of   the applicability of semi-supervised learning assumptions over the three ontologies   of Gene Ontology: molecular function, cellular component and biological   process. The analysis aims to demonstrate that one semi-supervised assumption   is insufficient to classify the whole set of Gene Ontology terms and to provide   judgment elements that allow choosing the most suitable tool for protein   function prediction among the existing semi-supervised alternatives. The   results show that semi-supervised approaches significantly outperform the   traditional supervised methods and that the highest performances are reached   when applying the cluster assumption. Besides, it is experimentally   demonstrated that cluster and manifold assumptions are complimentary to each   other and an analysis of which GO terms can be more prone to be correctly   predicted with each assumption, is provided.</p>   &nbsp;&nbsp;&nbsp;     <p><font size="3"><b>2. Theoretical background</b></font></p>     <p>The main assumption made by   semi-supervised learning algorithms is the ''semi-supervised smoothness   assumption'' &#91;8&#93;.</p>     <p> - <b>Semi-supervised smoothness assumption: </b>If two points <em>x<sub>1</sub></em>,and <em>x<sub>2 </sub></em>in a high-density region   are close, then so should be their corresponding label sets <em>y<sub>1</sub></em>, <em>y<sub>2</sub></em>. Note that by   transitivity, this assumption implies that if two points are linked by a path   of high density (e.g., if they belong to the same cluster), then their outputs   are likely to be close. If, on the other hand, they are separated by a   low-density region, then their outputs need not be close.<b> </b></p>     <p>Such assumption originates   the two common assumptions used in semi-supervised learning:</p>     <p> - <b>Cluster assumption:</b> If points are in the same cluster, they are likely to be of the same   class. This assumption does not imply that each class forms a single, compact   cluster, it only means that there are no instances of two distinct classes in   the same cluster. The cluster assumption can be formulated in an equivalent   way: </p>     <p> - <b>Low density separation</b>: The decision boundary should lie in a low-density region. </p>     <p> - <b>Manifold assumption:</b> The (high-dimensional) data lie (roughly) on a low-dimensional   manifold. Instances that are close according to the manifold geodesic distance   are likely to be of the same class. </p>     ]]></body>
<body><![CDATA[<p>According to each   assumption, there are three main families of semi-supervised methods:   generative methods (cluster assumption), density-based methods (low density   separation), and graph-based methods (manifold assumption). In the following   sub-sections, each family of methods will be reviewed, emphasizing on the   assumptions made by each one. It should be pointed out that, since   semi-supervised learning is a rapidly evolving field, the review is necessarily   incomplete. A wider review in this matter can also be found on &#91;9&#93;.</p>     <p><b>2.1. Generative methods</b></p>     <p>Generative methods follow a   common strategy of augmenting the set of labelled samples with a large set of   unlabelled data and combining the two sets with the Expectation-Maximization   algorithm, in order to improve the parameter estimates &#91;17&#93;. They assume a   probabilistic model <em>p(x, y) = p( y) p(x | y)</em>, where <em>p(x | y)</em>is an identifiable   mixture distribution. The most commonly employed distributions are the Gaussian   Mixture Models shown in Eq. (1). </p>     <p><img src="img/revistas/rfiua/n79/n79a03e01.jpg"></p>     <p>where <sub><img src="img/revistas/rfiua/n79/n79a03ea01.jpg"></sub>is the   Gaussian distribution with parameters <img src="img/revistas/rfiua/n79/n79a03ea02.jpg">, being <i><sub><img src="img/revistas/rfiua/n79/n79a03ea03.jpg"></sub></i>the mean vector and <b>&#931;</b><i><sub>k </sub></i>the covariance matrix of the <em>k</em> &#8722;th Gaussian component, and <i>&#960;</i><i><sub>k </sub></i>the mixing components such that<img src="img/revistas/rfiua/n79/n79a03ea04.jpg">for <em>k</em> = 1,2,&hellip;,<em>K</em>.</p>     <p>Ideally, only one labelled   example per component is needed to fully determine the mixture distribution. In   this setting, any additional information on <em>p(x)</em> is useful and the EM algorithm can be used for estimating  <img src="img/revistas/rfiua/n79/n79a03ea05.jpg"> A strength of the generative approach is that   knowledge of the structure of the problem or the data can naturally be   incorporated by modelling it &#91;8&#93;. However, generative techniques provide an estimate   of <em>p(x)</em>along the way,   although this is not required for classification, and in general this proves   wasteful given limited data. For example, maximizing the joint likelihood of a   finite sample need not lead to a small classification error, because depending   on the model, it may be possible to increase the likelihood more by improving   the t of <em>p(x)</em> than the t of <img src="img/revistas/rfiua/n79/n79a03ea06.jpg">&#91;8&#93;. </p>     <p>The works of &#91;18, 19&#93;,   among others, showed to be strong methods for classifying text data.   Furthermore, &#91;20&#93; have applied the EM algorithm on mixture of multinomial for   the task of text classification, showing better performance than those trained   only from the supervised set. &#91;21&#93; extend generative mixture models by   including a ''bias correction'' term and discriminative training using the   maximum entropy principle. However, anecdotal evidence is that many more   studies were not published because they obtained negative results, showing that   learning a mixture model will often degrade the performance of a model fit using   only the labelled data &#91;22&#93;; one published study with these conclusions is &#91;18&#93;.   This is due to the strong assumption done by generative methods: the data   actually comes from the mixture model, where the number of components, prior <em>p(x)</em>, and conditional <em>p(x | y)</em> are all correct &#91;9&#93;.</p>     <p><b>2.2. Density-based methods</b></p>     <p>With the rising popularity   of support vector machines (SVMs), Semi-Supervised SVMs (S<sup>3</sup>VMs)   emerged as an extension to standard SVMs for semi-supervised learning. S<sup>3</sup>VMs   find a labelling for all the unlabelled data, and a separating hyperplane, such   that maximum margin is achieved on both the labelled data and the (now   labelled) unlabelled data. As a result, unlabelled data guide the decision   boundary away from dense regions. The assumption of S<sup>3</sup>VMs is that   the classes are well-separated, such that the decision boundary falls into a   low density region in the feature space, and does not cut through dense   unlabelled data &#91;9&#93;.</p>     <p>In a similar way to the conventional SVMs, the optimization problem for an S<sup>3</sup>VMs   can be stated as follows shown in Eq. (2). </p>     ]]></body>
<body><![CDATA[<p><img src="img/revistas/rfiua/n79/n79a03e02.jpg"></p>     <p>where <i>&#8467;</i>(<i>t</i>) = max(0<i>,</i>1 &#8722; <em>t</em> ) is the hinge loss function, C <i></i>is   the trade-o parameter and <i>&#955;</i><i> </i>is a new regularization parameter. The first two terms   in the above equation correspond to the traditional solution for the standard   supervised SVM, while the last term puts<img src="img/revistas/rfiua/n79/n79a03ea07.jpg">of the unlabelled points x<i><sub>i </sub></i>away from 0 (thereby implementing the low density   assumption) &#91;24&#93;. </p>     <p>Again, as in the supervised case, the kernel trick can be used for   constructing nonlinear S<sup>3</sup>VMs. While the optimization in SVM is   convex and can be solved with QP-hard complexity, optimization in S<sup>3</sup>VM   is a non-convex combinatorial task with NP-Hard complexity. Most of the recent   work in S<sup>3</sup>VM has been focused on   the optimization procedure (a full survey in this matter can be found in &#91;24&#93;).   Among the proposed methods for solving the non-convex optimization problem   associated with S<sup>3</sup>VMs, one of the first implementations is the S<sup>3</sup>VM<i><sup>light </sup></i>by &#91;25&#93;, which is based on local combinatorial search   guided by a label switching procedure. &#91;26&#93; presented a method based on   gradient descent on the primal, that performs significantly better than the   optimization strategy pursued in S<sup>3</sup>VM<i><sup>light</sup></i>; the work by &#91;22&#93; proposes the use of a global optimization technique   known as ''continuation'', often leading to lower test errors than other   optimization algorithms; &#91;27&#93; uses the Concave-Convex procedure, providing a   highly scalable algorithm in the non-linear case. </p>     <p>Other recent proposals include &#91;28&#93; which focuses on   the class-imbalance problem and proposes a cost-sensitive S<sup>3</sup>VM; &#91;29&#93;   which describes Laplacian twin support vector machines; and several approaches   to adaptive regularizations like &#91;30, 31&#93;.</p>     <p><b>2.3. Graph-based methods</b></p>     <p>Graph-based methods start   with a graph where the nodes are the labelled and unlabelled data points, and   (weighted) edges reflect the similarity of nodes. The assumption is that nodes   connected by a large-weight edge tend to have the same label, and labels can   propagate throughout the graph. In other words, graph-based methods do the   assumption that labels are smooth with respect to the graph, such that they vary   slowly on the graph. That is, if two instances are connected by a strong edge,   their labels tend to be the same &#91;9&#93;.</p>     <p>This family of methods   enjoy nice properties from spectral graph theory. They commonly use an energy   function as objective in the optimization problem, ensuring that the labels   will change slowly through the graph (consequently implementing the manifold   assumption) &#91;32&#93;.</p>     <p>A graph is represented by the (<i>L </i>+ <i>U</i>)   &times; (<i>L </i>+ <i>U</i>) weight matrix <b>W</b>, <i>W<sub>ij </sub></i>= 0 if there is no edge between instances <em>x<sub>i </sub></em>and <em>x<sub>j</sub></em>. Once the graph has been defined, a real function over the   nodes can be defined <img src="img/revistas/rfiua/n79/n79a03ea08.jpg">. In order to achieve that unlabelled points that are similar   (as determined by the edges of the graph) to have similar labels, the quadratic   energy function shown in Eq. (3) can be used as objective: </p>     <p><img src="img/revistas/rfiua/n79/n79a03e03.jpg"></p>     <p>Since this objective   function is minimized by constant functions, it is necessary to constrain <em>f<sub>&theta;</sub></em><sub></sub>to take values <em>f<sub>&theta;</sub> (x<sub>i</sub>) = y<sub>i</sub></em> , for all the labelled data x<sub>i </sub>&isin;<em>X</em><sub>L</sub>. Finally, let <em>D</em> be the diagonal degree matrix, where <i><sub><img src="img/revistas/rfiua/n79/n79a03ea09.jpg"></sub></i>is the degree of node <em>x<sub>i</sub></em>. The combinatorial Laplacian <b>&#8710; </b>is   defined as in Eq. (4). </p>     ]]></body>
<body><![CDATA[<p><img src="img/revistas/rfiua/n79/n79a03e04.jpg"></p>     <p>and it is easy to verify Eq. (5):</p>     <p><img src="img/revistas/rfiua/n79/n79a03e05.jpg"></p>     <p>Most graph-based methods are inherently transductive,   giving predictions for only those points in the unlabelled set, and not for an   arbitrary test point. The simplest strategy for extending the method for unseen   data is by dividing the input space into Voronoi cells centered on the labelled   instances. From an algorithmic point of view, this strategy is equal to   classify instances by its 1-nearest-neighbour. &#91;21&#93; proposed an approach that   combines generative mixture models and discriminative regularization using the   graph Laplacian in order to provide an inductive model. Laplacian SVMs,   proposed by &#91;33&#93;, provide a natural inductive algorithm since they use a   modified SVM for classification. The optimization problem in this case is   regularized by the introduction of a term for controlling the complexity of the   model according to Eq. (6):</p>     <p><img src="img/revistas/rfiua/n79/n79a03e06.jpg"></p>     <p>where <i>W<sub>ij </sub></i>is the weight between the <em>i</em> &minus; th and<em>j</em> &minus; th instances in the graph and <i>&#955;</i><i> </i>is again a regularization parameter. A lot of experiments   show that Laplacian SVM achieves state of the art performance in graph-based   semi-supervised classification &#91;29&#93;. </p>   &nbsp;&nbsp;&nbsp;   &nbsp;&nbsp;&nbsp;     <p><font size="3"><b>3. Proposed methodology: semi-supervised learning for predicting gene ontology terms in<i> Embryophyta</i> plants</b></font></p>     <p><b>3.1. Selected semi-supervised algorithms</b></p>     <p>In order to test the efficiency of semi-supervised learning in the task   of predicting protein functions, two state of the art methods were chosen, each   one implementing a different semi-supervised assumption: S<sup>3</sup>VM   following the concave-convex optimization procedure (CCP) &#91;27&#93; (implementing   the low-density separation assumption and consequently the cluster assumption)   and Laplacian-SVM &#91;34&#93; (implementing the manifold assumption).</p>     <p> - <b>CCP S<sup>3</sup>VM:</b> The S<sup>3</sup>VM proposed by &#91;27, 34&#93; was chosen since it provides   high scalability in the non-linear case, making it the most suitable choice for   the amounts of <i>Embryophyta</i> proteins   in the databases used in this work. Consider the set of labelled points<img src="img/revistas/rfiua/n79/n79a03ea10.jpg"> for which labels<img src="img/revistas/rfiua/n79/n79a03ea11.jpg">are provided, and the   points<img src="img/revistas/rfiua/n79/n79a03ea12.jpg">the labels of which   are not known. The objective function to be optimized in this case, corresponds   to Eq. (7): </p>     ]]></body>
<body><![CDATA[<p><img src="img/revistas/rfiua/n79/n79a03e07.jpg"></p>     <p>where the function <i>&#8467;</i>(<i>t</i>) = max(0<i>,</i>1&#8722;|<i>t</i>|) is the hinge loss   function. The main problem with this objective function, in contrast to the   classical SVM objective, is that the additional term is non-convex and gives   rise to local minima. Additionally, it has been experimentally observed that   the objective function tends to give unbalanced solutions, classifying all the   unlabelled points in the same class. A constraint should be imposed on the data   to avoid this problem &#91;26&#93;, as shown in Eq. (8): </p>     <p><img src="img/revistas/rfiua/n79/n79a03e08.jpg"></p>     <p>which ensures that the number of unlabelled samples assigned to each   class will be the same fraction as in the labelled data. CCP decomposes a   non-convex function <em>J</em> into a convex component <em>J<sub>vex</sub></em> and a concave component <em>J<sub>cave</sub></em>. At each iteration, the concave part is replaced by the   tangential approximation at the current point and the sum of this linear   function and the convex part is minimized to get the next iterate. The first   two terms in Eq. (7) are convex, while the third term can be decomposed into   the sum of a convex function (Eq. 9) plus a concave one (Eq. 10):</p>     <p><img src="img/revistas/rfiua/n79/n79a03e09.jpg"></p>     <p><img src="img/revistas/rfiua/n79/n79a03e10.jpg"> </p>     <p>If an unlabelled point is currently classified positive, then at the   next iteration, the convex loss on this point will be determined by Eq. (11):</p>     <p><img src="img/revistas/rfiua/n79/n79a03e11.jpg"></p>     <p>The CCP algorithm for the semi-supervised support vector machines is presented   in Algorithm 1 (<a href="#Table1">Table 1</a>).</p>     <p align="center"><b><a name="Table1"></a></b><img src="img/revistas/rfiua/n79/n79a03t01.jpg"></p>     ]]></body>
<body><![CDATA[<p> - <b>Laplacian SVM:</b> Regarding the graph-based algorithms, Laplacian support vector machines   (Lap-SVM) were chosen since, according to &#91;29&#93;, many experiments show that   Lap-SVM achieves state of the art performance among graph-based semi-supervised   classification methods. This method, as proposed in &#91;33&#93;, uses an objective   function that is slightly different to Eq. (6) and can be seen in Eq. (12): </p>     <p><img src="img/revistas/rfiua/n79/n79a03e12.jpg"></p>     <p>where <i>&#955;</i><i><sub>A </sub></i>and <i>&#955;</i><i><sub>I </sub></i>are two regularizing constants that must be set by the user. &#91;32&#93;, also   demonstrated a modified version of the Representer Theorem that ensures that   the solution function can be given again by linear combination of kernel   functions and the Lap-SVMs can be implemented by using a standard SVM quadratic   solver. </p>     <p>The S<sup>3</sup>VM and Lap-SVM were used as base   classifiers, both of them with the Gaussian kernel. For the Lap-SVM, the K-NN   graph was selected for implementing the manifold regularization term, since   there is some empirical evidence that suggests that fully connect graphs   performs worse than sparse graphs &#91;9&#93;.</p>     <p>All the parameters of the algorithms, including the   dispersion of the kernels, the trade-o parameters of the SVMs, the   regularization constants of both methods and the number of neighbours for   constructing the graph, were tuned with a particle swarm optimization   meta-heuristic. The decision making was implemented following the one   against-all strategy with SMOTE oversampling for avoiding class-imbalance.   Also, the 5-fold cross-validation strategy was implemented for assessing the   performance of the predictors.</p>     <p><b>3.2. Database</b></p>     <p>The database designed in   &#91;4&#93; was used as the set of labeled instances. This database is conformed by all   the available <i>Embryophyta</i> proteins at   UniProtKB/Swiss-Prot database &#91;35&#93; (file version: 10<i>/</i>01<i>/</i>2013), with at least one annotation in the Gene Ontology   Annotation (GOA) project &#91;36&#93; (file version: 7<i>/</i>01<i>/</i>2013). In order to avoid the presence of protein families   that could bias the results, the dataset was filtered at several 30% of sequence identity using   the Cd-Hit software &#91;37&#93;. The set of labelled instances is then conformed by 3368 protein sequences, from which 1973 sequences   are annotated with molecular functions, 2210 with cellular components and 2798 with biological processes &#91;4&#93;. </p>     <p>Classes are defined by the plants GO slim developed by   The Arabidopsis Information Resource - TAIR &#91;38&#93;, (file version: 14<i>/</i>03<i>/</i>2012). Positive samples   associated to each GO term are selected by considering the propagation   principle of GO: if a protein is predicted to be associated to any given GO   term, it must be automatically associated to all the ancestors of that category   and thus, it is enough to predict only the lowest level entries. As in &#91;4&#93;, in   order to explicitly note that some GO terms are not including their descendants   categories, such incomplete GO terms are marked with an asterisk throughout the   paper. The resulting set comprises 14 GO terms in the molecular function ontology, 20 GO terms in the cellular   component ontology and 41 GO terms in the biological process ontology. <a href="#Table2">Table 2</a>  shows the final list of categories, as well as the acronyms used to cite them   throughout this paper. </p>     <p align="center"><b><a name="Table2"></a></b><img src="img/revistas/rfiua/n79/n79a03t02.jpg"></p>     <p>Regarding unlabeled instances, all the available   Embryophyta proteins at UniProtKB/SwissProt database that has no entries in the   GOA project were added as the core set of unlabeled samples. Also, proteins   associated to the nodes in the functional path of a GO term that were not   associated to the node itself, were left as unlabeled instances regarding that   classifier. Finally, 30000 unlabeled instances were randomly chosen in order to accomplish an approximate relation of ten unlabeled instances per each labeled one. </p>     ]]></body>
<body><![CDATA[<p>Both labeled and unlabeled sequences were characterized according to the   procedure described in section &#91;4&#93; obtaining three types of attributes:   physical-chemical features, primary structure composition statistics and   secondary structure composition statistics (see <a href="#Table3">Table 3</a>).</p>     <p align="center"><b><a name="Table3"></a></b><img src="img/revistas/rfiua/n79/n79a03t03.jpg"></p>   &nbsp;&nbsp;&nbsp;     <p><font size="3"><b>4. Results and discussion</b></font></p>     <p><a href="#Figure1">Figure 1</a> shows a comparison   between the results with the S<sup>3</sup>VM (orange line) and the SVM method   presented in &#91;4&#93; (green line). Classes are ordered according to the performance   of the SS<sup>3</sup>VM method from top to bottom. Left plots show sensitivity,   specificity and geometric mean achieved with the five-fold cross-validation   procedure, while right plots depicts the corresponding p-values obtained from a   paired t-test at a 95% significance   level. Orange bars show the cases when the S<sup>3</sup>VM significantly   outperforms the supervised SVM and green bars show the opposite case. </p>     <p align="center"><b><a name="Figure1"></a></b><img src="img/revistas/rfiua/n79/n79a03i01.jpg"></p>     <p>The main purpose of this comparison is to verify   whether or not the inclusion of the additional cluster-based semi-supervised   term in the training of the SVM improves the performance of the system, thus   providing information about the accomplishment of the cluster assumption when   the unlabelled data is incorporated to the training process. <a href="#Figure1">Figure 1(a)</a> shows   that six out of the fourteen molecular functions considered in this ontology   were significantly improved. In particular, <i>Receptor   binding</i>, <i>Transcription factor   activity</i> and <i>Enzyme regulator   activity</i> have a special importance, considering that the SVM method was   outperformed by BLASTp in those three GO terms when using the supervised model   (see &#91;4&#93;). The inclusion of the cluster assumption also improved the   performance on <i>Hydrolase activity*</i>, <i>Binding*</i> and <i>Protein binding*</i>. Regarding the Cellular Component ontology (<a href="#Figure1">Figure   1(b)</a>), eight cellular components were significantly improved, while other two (<i>Mitochondria and Cytoplasm*</i>) also   reached high p-values over 0<i>.</i>9. Finally, sixteen   biological processes presented statistically significant improvements when   including the unlabelled data with the semi-supervised cluster assumption. Only   one biological process, <i>Lipid metabolic   process</i>, suffered a statistically significant deterioration, which   indicates that the unlabelled data is presenting a misleading cluster structure   regarding this GO term. </p>     <p>In order to analyse how this improvements affect the   system when compared to conventionally used prediction tools, <a href="#Figure2">Figure 2</a> shows a   comparison between the results with the S<sup>3</sup>VM (orange line) and the   traditional BLASTp method (blue line). It can be seen from <a href="#Figure2">figure 2(a)</a> that the   S<sup>3</sup>VM significantly outperforms BLASTp in five molecular functions,   while BLASTp remains better than the S<sup>3</sup>VM only for <i>Transcription factor activity.</i></p>     <p align="center"><b><a name="Figure2"></a></b><img src="img/revistas/rfiua/n79/n79a03i02.jpg"></p>     <p>Regarding the cellular component ontology, there are   only two cellular components for which there is no statistically significant   difference between BLASTp and the S<sup>3</sup>VM: <i>Perixosome and Endosome</i>. For all the remaining eighteen cellular   components, the semi-supervised method   obtained superior performance. A similar behaviour is shown at <a href="#Figure2">figure 2(c)</a>,   where the S<sup>3</sup>VM significantly outperforms BLASTp in 35 out of the 41 biological processes, while   the remaining six process showed no statistical difference between the methods. </p>     <p>On the other hand,<a href="#Figure3"> Figure 3</a> shows the comparison between the supervised   SVM and the Laplacian-SVM. This analysis provides information about the impact   of incorporating unlabelled data on the training set but, this time, by   implementing the semi-supervised manifold assumption. This time, it is possible   to see that there are less GO terms that have been improved by the inclusion of   the unlabelled data. For the molecular function ontology (<a href="#Figure3">Figure 3(a)</a>), only   the <i>Nucleotide binding</i> and <i>Enzyme regulator activity</i> GO terms were   significantly improved respecting the supervised SVM; in turn the   implementation of the manifold assumption significantly degraded the   performance for the GO term <i>Transcription   factor activity</i>. Regarding the cellular component ontology (<a href="#Figure3">Figure 3(b)</a>),   improvements are present for <i>Perixosome</i>, <i>Vacuole</i> and the root node of the   ontology, while a decrease is evinced for the <i>Nucleus*</i> GO term. As for the biological process ontology, seven GO   terms enhanced their prediction performance (<i>Embryonic development, Response to extracellular stimulus, Response to   external stimulus*, Metabolic process*, Response to biotic stimulus, Cell   communication</i> and the root node of the ontology), while other two were   worsened (<i>Cell cycle</i> and DNA <i>metabolic process</i>).</p>     ]]></body>
<body><![CDATA[<p align="center"><b><a name="Figure3"></a></b><img src="img/revistas/rfiua/n79/n79a03i03.jpg"></p>     <p><a href="#Figure4">Figure 4</a> depicts a comparison between the results   obtained with BLASTp and the LapSVM method. The first important result that can   be inferred from the present analysis is that, in general terms, for the   problem of protein function prediction, the semi-supervised cluster assumption   holds for many more cases than the semi-supervised manifold assumption.   However, the most important aspect to be analyzed here, is how the results in   <a href="#Figure4">Figure 4</a> complement the results from <a href="#Figure2">Figure 2</a>. Only two molecular functions presented   an statistically significant superior performance with the Lap-SVM over BLASTp.   One of them, <i>RNA binding</i>, did not   show statistical significance when comparing BLASTp and S<sup>3</sup>VM. The   same behaviour is present for the <i>Perixosome</i> cellular component and for the biological processes <i>Transport</i> and <i>Lipid metabolic   process</i>. These results indicate that the manifold assumption is best suited   than the cluster assumption for this particular GO terms. A few GO terms were   not improved by any of the assumptions.</p>     <p align="center"><b><a name="Figure4"></a></b><img src="img/revistas/rfiua/n79/n79a03i04.jpg"></p>   &nbsp;&nbsp;&nbsp;     <p><font size="3"><b>5. Conclusions</b></font></p>     <p>In this paper, an analysis of the suitability of semi-supervised methods   for the prediction of protein functions in <i>Embryophyta</i> plants was performed. A review of the state of the art of semi-supervised   classifiers was presented, highlighting the different assumptions that each   method does about the underlying distribution of the data. Two semi-supervised   methods were chosen to perform the tests, each representing one of the main   semi-supervised assumptions: cluster assumption and manifold assumption. The   results show that semi-supervised learning applied to the prediction of GO terms in <i>Embryophyta</i> organisms, significantly outperforms the supervised   learning approach, at the same time outperforming the commonly used sequence   alignment strategy in most cases. In general terms, the highest performance   were reached when applying the cluster assumption. However, several GO terms   that were not significantly improved with the cluster assumption, achieved   higher performance with the manifold based semi-supervised method,   demonstrating that a single assumption is not enough for improving the learning   process by the exploitation of the additional unlabelled data. As future work,   it is desirable to implement a unified strategy exploiting both assumptions at   the same time, in order to achieve high performances in most applications.   Also, classifiers devoted to hierarchical classification, such as decision   trees, could be used to improve classification performance. </p>   &nbsp;&nbsp;&nbsp;     <p><font size="3"><b>6. References</b></font></p>     <!-- ref --><p> 1. K. Chou and H. Shen, ''Recent progress   in protein subcellular location prediction'', <i>Analytical Biochemistry</i>, vol. 370, no. 1, pp. 1-16, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145563&pid=S0120-6230201600020000300001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 2. P. Benfey and T. Mitchell, ''From   Genotype to Phenotype: Systems Biology Meets Natural Variation'', <i>Science</i>, vol. 320, no. 5875, pp.   495-497, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145565&pid=S0120-6230201600020000300002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 3. M. Harris <i>et al.</i>, ''The gene ontology (GO) database and informatics resource'', <i>Nucleic Acids Res</i>., vol.   32, pp. 258-261, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145567&pid=S0120-6230201600020000300003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 4. J. Jaramillo, J. Gallardo, C.   Castellanos and A. Perera, ''Predictability of gene ontology slim-terms from   primary structure information in <i>Embryophyta</i> plant proteins'', <i>BMC Bioinformatics</i>, vol.   14, no. 68, pp. 1-11, 2013.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145569&pid=S0120-6230201600020000300004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 5. X. Zhu, ''Semi-Supervised Learning Literature   Survey'', University of Wisconsin&#8211;Madison,   Madison, USA, Tech. Rep.   TR-1530, Jul. 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145571&pid=S0120-6230201600020000300005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 6. X. Zhao, L. Chen and K. Aihara,   ''Protein function prediction with high-throughput data'', <i>Amino Acids</i>, vol.   35, no. 3, pp. 517-530, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145573&pid=S0120-6230201600020000300006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 7. X. Zhao, Y. Wang, L. Chen and K.   Aihara, ''Gene function prediction using labeled and unlabeled data'', <i>BMC Bioinformatics</i>, vol.   9, no. 57, pp. 1-14, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145575&pid=S0120-6230201600020000300007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 8. O. Chapelle, B. Sch&ouml;lkopf and A.   Zien, <i>Semi-supervised learning</i>, 1<i><sup>st </sup></i>ed. Cambridge,   USA: MIT Press, 2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145577&pid=S0120-6230201600020000300008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 9. X. Zhu and A. Goldberg, <i>Introduction to semi-supervised learning</i>,   1<i><sup>st </sup></i>ed. Madison, USA:   Morgan &amp; Claypool, 2009.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145579&pid=S0120-6230201600020000300009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 10. N. Kasabov and S. Pang, ''Transductive   support vector machines and applications in bioinformatics for promoter   recognition'', in <i>Int. Conf. on</i><i> </i><i>Neural Networks and Signal Processing</i>, Nanjing, China, 2003, pp.   1-6.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145581&pid=S0120-6230201600020000300010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 11. T. Li, S. Zhu, Q. Li and M. Ogihara,   ''Gene functional classification by semisupervised learning from heterogeneous   data'', in <i>ACM Symposium</i><i> </i><i>on Applied Computing </i>(SAC), Melbourne, USA, 2003, pp.   78-82.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145583&pid=S0120-6230201600020000300011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 12. M. Krogel and T. Scheffer,   ''Multi-relational learning, text mining, and semisupervised learning for   functional genomics'', <i>Machine   Learning</i>, vol. 57, no. 1, pp. 61-81, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145585&pid=S0120-6230201600020000300012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 13. H. Shin and K. Tsuda, ''Prediction   of protein function from networks'', in <i>Semi-supervised learning</i>, 1<i><sup>st</sup></i> ed., O. Chapelle, B. Sch&ouml;lkopf and A. Zien (eds). Cambridge,   USA: MIT Press, 2006, pp. 339-352.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145587&pid=S0120-6230201600020000300013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 14. B. King and C. Guda, ''Semi-supervised   learning for classification of protein sequence data'', <i>Scientific Programming</i>, vol. 16,   no. 1, pp. 5-29, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145589&pid=S0120-6230201600020000300014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 15. H. Shin, K. Tsuda and B. Scholkopf,   ''Protein functional class prediction with a combined graph'', <i>Expert Systems with Applications</i>, vol.   36, no. 2, pp. 3284-3292, 2009.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145591&pid=S0120-6230201600020000300015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 16. J. Jaramillo and C. Castellanos, ''Improving   protein sub-cellular localization prediction through semi-supervised learning'',   in <i>BIOTECHNO: 6<sup>th</sup> International</i><i> </i><i>Conference on Bioinformatics,   Biocomputational Systems and Biotechnologies</i>, Chamonix, France, 2014,   pp. 99-103.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145593&pid=S0120-6230201600020000300016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 17. F. Cozman, I. Cohen and M. Cirelo,   ''Semi-supervised learning of mixture models'', in <i>20<sup>th</sup> International Conference on Machine Learning </i>(ICML), Washington D.C., USA, 2003, pp. 99-106.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145595&pid=S0120-6230201600020000300017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 18. D. Miller and H. Uyar, ''A generalized   gaussian mixture classifier with learning based on both labelled and unlabelled   data'', in <i>Conference on</i><i> </i><i>Information Science and Systems</i>, Princeton, USA, 1996, pp.   783-787.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145597&pid=S0120-6230201600020000300018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 19. G. McLachlan and T. Krishnan, <i>The EM algorithm and extensions</i>, 2<i><sup>nd</sup></i> ed. St. Lucia, Australia:   John Wiley &amp; Sons, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145599&pid=S0120-6230201600020000300019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 20. K. Nigam, A. McCallum, S. Thrun and   T. Mitchell, ''Text classification from labeled and unlabeled documents using   EM'', <i>Machine learning</i>, vol.   39, no. 2, pp. 103-134, 2000.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145601&pid=S0120-6230201600020000300020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 21. A. Fujino, N. Ueda and K. Saito, ''A   hybrid generative/discriminative approach to semi-supervised classifier   design'', in <i>20<sup>th</sup> National   Conference on Artificial Intelligence </i>(AAAI), Pittsburgh, USA, 2005, pp.   764-769.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145603&pid=S0120-6230201600020000300021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 22. X. Zhu and J. Lafferty, ''Harmonic   mixtures: combining mixture models and graph-based methods for inductive and   scalable semi-supervised learning'', in <i>22<sup>nd</sup> International Conference on Machine Learning </i>(ICML), Bonn, Germany, 2005, pp. 1052-1059.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145605&pid=S0120-6230201600020000300022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 23. O. Chapelle, M. Chi and A. Zien, ''A   continuation method for semi-supervised SVMs'', in <i>23<sup>rd</sup> international conference on Machine learning (ICML)</i>,   Pittsburgh, USA, 2006, pp. 185-192.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145607&pid=S0120-6230201600020000300023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 24. O. Chapelle, V. Sindhwani and S.   Keerthi, ''Optimization techniques for semi-supervised support vector machines'', <i>Journal of Machine Learning Research</i>, vol.   9, pp. 203-233, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145609&pid=S0120-6230201600020000300024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 25. T. Joachims, ''Transductive inference   for text classification using support vector machines'', in <i>16<sup>th </sup>International Conference on Machine Learning </i>(ICML), Bled, Slovenia, 1999, pp. 200-209.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145611&pid=S0120-6230201600020000300025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 26. O. Chapelle and A. Zien,   ''Semi-supervised classification by low density separation'', in <i>10<sup>th</sup> Int. Workshop on </i><i>Artificial Intelligence and Statistics</i> (AISTATS), Bridgetown,   Barbados, 2005, pp. 57-64.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145613&pid=S0120-6230201600020000300026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 27. R. Collobert, F. Sinz, J. Weston and   L. Bottou, ''Large scale transductive SVMs'', <i>Journal of Machine Learning Research</i>, vol. 7, pp. 1687-1712, 2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145615&pid=S0120-6230201600020000300027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 28. Y. Li, J. Kwok and Z. Zhou,   ''Cost-Sensitive Semi-Supervised Support Vector Machine'', in <i>24<sup>th</sup> Conference on Artificial Intelligence </i>(AAAI), Atlanta, USA, 2010, pp.   500-505.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145617&pid=S0120-6230201600020000300028&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 29. Z. Qi, Y. Tian and Y. Shi, ''Laplacian   twin support vector machine for semi-supervised classification'', <i>Neural networks</i>, vol.   35, pp. 46-53, 2012.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145619&pid=S0120-6230201600020000300029&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 30. Z. Xu <i>et al.</i>, ''Adaptive regularization for transductive support vector   machine'', in <i>Advances in Neural Information Processing Systems 22 </i>(NIPS), Vancouver, Canada, 2009, pp.   2125-2133.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145621&pid=S0120-6230201600020000300030&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 31. Z. Wang, S. Yan and C. Zhang, ''Active   learning with adaptive regularization'', <i>Pattern Recognition</i>, vol. 44, no. 10-11, pp. 2375-2383,   2011.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145623&pid=S0120-6230201600020000300031&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 32. M. Hein, J. Audibert and U. Luxburg,   ''From graphs to manifolds-weak and strong pointwise consistency of graph   Laplacians'', in<i> 18<sup>th </sup></i><i>Annual Conference on Learning Theory</i> (COLT),   Bertinoro, Italy, 2005, pp. 470-485.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145625&pid=S0120-6230201600020000300032&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 33. M. Belkin, P. Niyogi and V.   Sindhwani, ''Manifold regularization: A geometric framework for learning from   labeled and unlabeled examples'', <i>Journal   of Machine</i><i> </i><i>Learning Research</i>, vol. 7,   pp. 2399-2434, 2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145627&pid=S0120-6230201600020000300033&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 34. F. Sinz, O. Chapelle, A. Agarwal and   B. Sch&ouml;lkopf, ''An analysis of inference with the universum'', in   Conference on <i>Advances in   Neural Information Processing Systems 20 </i>(NIPS),   Vancouver, Canada, 2007, pp. 1369-1376.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145629&pid=S0120-6230201600020000300034&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 35. E. Jain <i>et al., </i>''Infrastructure for the life sciences: design and   implementation of the UniProt website'', <i>BMC Bioinformatics</i>, vol. 10, no. 136, pp. 1-19, 2009.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145631&pid=S0120-6230201600020000300035&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 36. D. Barrell <i>et al</i>., ''The GOA database in 2009-an integrated Gene Ontology   Annotation resource'', <i>Nucleic</i><i> </i><i>Acids Research</i>, vol. 37, pp. 396-403, 2009.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145633&pid=S0120-6230201600020000300036&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     <!-- ref --><p> 37. W. Li and A. Godzik, ''Cd-hit: a fast   program for clustering and comparing large sets of protein or nucleotide   sequences'', <i>Bioinformatics</i>, vol.   22, no. 13, pp. 1658-1659, 2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145635&pid=S0120-6230201600020000300037&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p>     ]]></body>
<body><![CDATA[<!-- ref --><p> 38. T. Berardini <i>et al</i>., ''Functional annotation of   the Arabidopsis genome using controlled vocabularies'', vol. 135, no. 2, pp.   745-755, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=3145637&pid=S0120-6230201600020000300038&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </p> </font>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chou]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Shen]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Recent progress in protein subcellular location prediction]]></article-title>
<source><![CDATA[Analytical Biochemistry]]></source>
<year>2007</year>
<volume>370</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-16</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Benfey]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Mitchell]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[From Genotype to Phenotype: Systems Biology Meets Natural Variation]]></article-title>
<source><![CDATA[Science]]></source>
<year>2008</year>
<volume>320</volume>
<numero>5875</numero>
<issue>5875</issue>
<page-range>495-497</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Harris et al.]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The gene ontology (GO) database and informatics resource]]></article-title>
<source><![CDATA[Nucleic Acids Res.]]></source>
<year>2004</year>
<volume>32</volume>
<page-range>258-261</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jaramillo]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Gallardo]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Castellanos]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Perera]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Predictability of gene ontology slim-terms from primary structure information in Embryophyta plant proteins]]></article-title>
<source><![CDATA[BMC Bioinformatics]]></source>
<year>2013</year>
<volume>14</volume>
<numero>68</numero>
<issue>68</issue>
<page-range>1-11</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[X]]></given-names>
</name>
</person-group>
<source><![CDATA[Semi-Supervised Learning Literature Survey]]></source>
<year>Jul.</year>
<month> 2</month>
<day>00</day>
<publisher-loc><![CDATA[Madison ]]></publisher-loc>
<publisher-name><![CDATA[University of Wisconsin-Madison]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhao]]></surname>
<given-names><![CDATA[X]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Aihara]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Protein function prediction with high-throughput data]]></article-title>
<source><![CDATA[Amino Acids]]></source>
<year>2008</year>
<volume>35</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>517-530</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhao]]></surname>
<given-names><![CDATA[X]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Aihara]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Gene function prediction using labeled and unlabeled data]]></article-title>
<source><![CDATA[BMC Bioinformatics]]></source>
<year>2008</year>
<volume>9</volume>
<numero>57</numero>
<issue>57</issue>
<page-range>1-14</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chapelle]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
<name>
<surname><![CDATA[Schölkopf]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Zien]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Semi-supervised learning]]></source>
<year>2006</year>
<publisher-loc><![CDATA[Cambridge ]]></publisher-loc>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[X]]></given-names>
</name>
<name>
<surname><![CDATA[Goldberg]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to semi-supervised learning]]></source>
<year>2009</year>
<publisher-loc><![CDATA[Madison ]]></publisher-loc>
<publisher-name><![CDATA[Morgan & Claypool]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kasabov]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
<name>
<surname><![CDATA[Pang]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<source><![CDATA[Transductive support vector machines and applications in bioinformatics for promoter recognition]]></source>
<year>2003</year>
<conf-name><![CDATA[ Int. Conf. on Neural Networks and Signal Processing]]></conf-name>
<conf-loc> </conf-loc>
<publisher-loc><![CDATA[Nanjing ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Q]]></given-names>
</name>
<name>
<surname><![CDATA[Ogihara]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Gene functional classification by semisupervised learning from heterogeneous data]]></source>
<year>2003</year>
<publisher-loc><![CDATA[Melbourne ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Krogel]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Scheffer]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Multi-relational learning, text mining, and semisupervised learning for functional genomics]]></article-title>
<source><![CDATA[Machine Learning]]></source>
<year>2004</year>
<volume>57</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>61-81</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shin]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
<name>
<surname><![CDATA[Tsuda]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<source><![CDATA[Prediction of protein function from networks]]></source>
<year>2006</year>
<page-range>339-352</page-range><publisher-loc><![CDATA[Cambridge ]]></publisher-loc>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[King]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Guda]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Semi-supervised learning for classification of protein sequence data]]></article-title>
<source><![CDATA[Scientific Programming]]></source>
<year>2008</year>
<volume>16</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>5-29</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shin]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
<name>
<surname><![CDATA[Tsuda]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Scholkopf]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Protein functional class prediction with a combined graph]]></article-title>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>2009</year>
<volume>36</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>3284-3292</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jaramillo]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Castellanos]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<source><![CDATA[Improving protein sub-cellular localization prediction through semi-supervised learning]]></source>
<year>2014</year>
<conf-name><![CDATA[ BIOTECHNO: 6th International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies]]></conf-name>
<conf-loc> </conf-loc>
<publisher-loc><![CDATA[Chamonix ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cozman]]></surname>
<given-names><![CDATA[F]]></given-names>
</name>
<name>
<surname><![CDATA[Cohen]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Cirelo]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Semi-supervised learning of mixture models]]></source>
<year>2003</year>
<conf-name><![CDATA[ 20th International Conference on Machine Learning]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Miller]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Uyar]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[A generalized gaussian mixture classifier with learning based on both labelled and unlabelled data]]></source>
<year>1996</year>
<publisher-loc><![CDATA[Princeton ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McLachlan]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Krishnan]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<source><![CDATA[The EM algorithm and extensions]]></source>
<year>2007</year>
<publisher-loc><![CDATA[Lucia ]]></publisher-loc>
<publisher-name><![CDATA[John Wiley & Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nigam]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[McCallum]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Thrun]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Mitchell]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text classification from labeled and unlabeled documents using EM]]></article-title>
<source><![CDATA[Machine learning]]></source>
<year>2000</year>
<volume>39</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>103-134</page-range></nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fujino]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Ueda]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
<name>
<surname><![CDATA[Saito]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
</person-group>
<source><![CDATA[A hybrid generative/discriminative approach to semi-supervised classifier design]]></source>
<year>2005</year>
<conf-name><![CDATA[ 20th National Conference on Artificial Intelligenc]]></conf-name>
<conf-loc> </conf-loc>
<page-range>764-769</page-range><publisher-loc><![CDATA[Pittsburgh ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[X]]></given-names>
</name>
<name>
<surname><![CDATA[Lafferty]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<source><![CDATA[Harmonic mixtures: combining mixture models and graph-based methods for inductive and scalable semi-supervised learning]]></source>
<year>2005</year>
<page-range>1052-1059</page-range></nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chapelle]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
<name>
<surname><![CDATA[Chi]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Zien]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[A continuation method for semi-supervised SVMs]]></source>
<year>2006</year>
<conf-name><![CDATA[ 23rd international conference on Machine learning (ICML)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>185-192</page-range><publisher-loc><![CDATA[Pittsburgh ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chapelle]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
<name>
<surname><![CDATA[Sindhwani]]></surname>
<given-names><![CDATA[V]]></given-names>
</name>
<name>
<surname><![CDATA[Keerthi]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Optimization techniques for semi-supervised support vector machines]]></article-title>
<source><![CDATA[Journal of Machine Learning Research]]></source>
<year>2008</year>
<volume>9</volume>
<page-range>203-233</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Joachims]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<source><![CDATA[Transductive inference for text classification using support vector machines]]></source>
<year>1999</year>
<page-range>200-209</page-range><publisher-loc><![CDATA[Bled ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chapelle]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
<name>
<surname><![CDATA[Zien]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Semi-supervised classification by low density separation]]></source>
<year>2005</year>
<page-range>57-64</page-range><publisher-loc><![CDATA[Bridgetown ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Collobert]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Sinz]]></surname>
<given-names><![CDATA[F]]></given-names>
</name>
<name>
<surname><![CDATA[Weston]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Bottou]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Large scale transductive SVMs]]></article-title>
<source><![CDATA[Journal of Machine Learning Research]]></source>
<year>2006</year>
<volume>7</volume>
<page-range>1687-1712</page-range></nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Kwok]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Zhou]]></surname>
<given-names><![CDATA[Z]]></given-names>
</name>
</person-group>
<source><![CDATA[Cost-Sensitive Semi-Supervised Support Vector Machine]]></source>
<year>2010</year>
<conf-name><![CDATA[ 24th Conference on Artificial Intelligence]]></conf-name>
<conf-loc> </conf-loc>
<publisher-loc><![CDATA[Atlanta ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B29">
<label>29</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Qi]]></surname>
<given-names><![CDATA[Z]]></given-names>
</name>
<name>
<surname><![CDATA[Tian]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Shi]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Laplacian twin support vector machine for semi-supervised classification]]></article-title>
<source><![CDATA[Neural networks]]></source>
<year>2012</year>
<volume>35</volume>
<page-range>46-53</page-range></nlm-citation>
</ref>
<ref id="B30">
<label>30</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Xu]]></surname>
<given-names><![CDATA[Z]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Adaptive regularization for transductive support vector machine]]></article-title>
<source><![CDATA[Advances in Neural Information Processing Systems 22]]></source>
<year>2009</year>
<page-range>2125-2133</page-range><publisher-loc><![CDATA[Vancouver ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B31">
<label>31</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[Z]]></given-names>
</name>
<name>
<surname><![CDATA[Yan]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Active learning with adaptive regularization]]></article-title>
<source><![CDATA[Pattern Recognition]]></source>
<year>2011</year>
<volume>44</volume>
<numero>10-11</numero>
<issue>10-11</issue>
<page-range>2375-2383</page-range></nlm-citation>
</ref>
<ref id="B32">
<label>32</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hein]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Audibert]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Luxburg]]></surname>
<given-names><![CDATA[U]]></given-names>
</name>
</person-group>
<source><![CDATA[From graphs to manifolds-weak and strong pointwise consistency of graph Laplacians]]></source>
<year>2005</year>
<page-range>470-485</page-range><publisher-loc><![CDATA[Bertinoro ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B33">
<label>33</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Belkin]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Niyogi]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Sindhwani]]></surname>
<given-names><![CDATA[V]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Manifold regularization: A geometric framework for learning from labeled and unlabeled examples]]></article-title>
<source><![CDATA[Journal of Machine Learning Research]]></source>
<year>2006</year>
<volume>7</volume>
<page-range>2399-2434</page-range></nlm-citation>
</ref>
<ref id="B34">
<label>34</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sinz]]></surname>
<given-names><![CDATA[F]]></given-names>
</name>
<name>
<surname><![CDATA[Chapelle]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
<name>
<surname><![CDATA[Agarwal]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Schölkopf]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<source><![CDATA[An analysis of inference with the universum]]></source>
<year>2007</year>
<publisher-loc><![CDATA[Vancouver ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B35">
<label>35</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jain]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Infrastructure for the life sciences: design and implementation of the UniProt website]]></article-title>
<source><![CDATA[BMC Bioinformatics]]></source>
<year>2009</year>
<volume>10</volume>
<numero>136</numero>
<issue>136</issue>
<page-range>1-19</page-range></nlm-citation>
</ref>
<ref id="B36">
<label>36</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Barrell et al.]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The GOA database in 2009-an integrated Gene Ontology Annotation resource]]></article-title>
<source><![CDATA[Nucleic Acids Research]]></source>
<year>2009</year>
<volume>37</volume>
<page-range>396-403</page-range></nlm-citation>
</ref>
<ref id="B37">
<label>37</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
<name>
<surname><![CDATA[Godzik]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences]]></article-title>
<source><![CDATA[Bioinformatics]]></source>
<year>2006</year>
<volume>22</volume>
<numero>13</numero>
<issue>13</issue>
<page-range>1658-1659</page-range></nlm-citation>
</ref>
<ref id="B38">
<label>38</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Berardini]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<source><![CDATA[Functional annotation of the Arabidopsis genome using controlled vocabularies]]></source>
<year>2004</year>
<volume>135</volume>
<page-range>745-755</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
