SciELO - Scientific Electronic Library Online

 


Revista Colombiana de Química

Print version ISSN 0120-2804; On-line version ISSN 2357-3791

Abstract

VALENCIA-COLMAN, Laura S. and DAZA C., Edgar E. Recognition of biosynthetic pathways for semiochemicals using machine learning techniques. Rev.Colomb.Quim. [online]. 2022, vol.51, n.2, pp.35-40. Epub Jan 12, 2024. ISSN 0120-2804. https://doi.org/10.15446/rev.colomb.quim.v51n2.101546.

In this work we consider 148 semiochemicals reported for the family Scarabaeidae, whose chemical structures were characterized using a set of 200 molecular descriptors from five different classes. The most discriminating descriptors were selected with three different techniques: Principal Component Analysis, applied to each class of descriptors, and Random Forests and Boruta-Shap, applied to the full set of descriptors. Although the three techniques are conceptually different, they select a similar number of descriptors from each class. We propose a combination of machine learning techniques to search for a structural pattern in the set of semiochemicals and then classify them. The pattern was established from the high degree of membership of a subset of these metabolites in the groups obtained by a clustering method based on fuzzy C-means; the discovered pattern corresponds to the biosynthetic pathway by which the metabolites are produced biologically. This first classification was corroborated with Kohonen's self-organizing maps. To classify those semiochemicals whose membership in a biosynthetic pathway was not clearly defined, we built two Multilayer Perceptron models, which showed acceptable performance.

Keywords: Random forests; C-means; molecular descriptors; family Scarabaeidae; multilayer perceptron; neural networks.
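The fuzzy C-means step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the fuzzifier `m = 2`, the membership threshold of 0.8, the function names `fuzzy_c_means` and `split_by_membership`, and the synthetic three-column data standing in for molecular descriptors are all assumptions made for illustration. The sketch shows how each sample receives a degree of membership in every cluster, and how samples with a high membership could be separated from those whose assignment is not clearly defined.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Plain fuzzy C-means; returns memberships U (n_samples x n_clusters) and centers."""
    rng = np.random.default_rng(seed)
    # Random initial memberships, each row normalized to sum to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Cluster centers are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance from every sample to every center.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.maximum(d, 1e-12)  # avoid division by zero
        # Standard membership update: u_ik proportional to d_ik^(-2/(m-1)).
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, centers


def split_by_membership(U, threshold=0.8):
    """Hard label per sample plus a flag for memberships above the (assumed) threshold."""
    labels = U.argmax(axis=1)
    confident = U.max(axis=1) >= threshold
    return labels, confident


# Synthetic stand-in for a descriptor matrix: two tight groups and one midpoint.
rng = np.random.default_rng(1)
X = np.vstack([
    rng.normal(0.0, 0.1, (20, 3)),
    rng.normal(5.0, 0.1, (20, 3)),
    [[2.5, 2.5, 2.5]],
])
U, centers = fuzzy_c_means(X, n_clusters=2)
labels, confident = split_by_membership(U)
```

In this toy setting, the samples flagged in `confident` play the role of the metabolites with high membership, while the remaining ones correspond to those that would need a separate classifier.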
