SciELO - Scientific Electronic Library Online

 

Revista Lasallista de Investigación

Print version ISSN 1794-4449

Abstract

GIRALDO MEJIA, Juan Camilo; MONTOYA QUINTERO, Diana María  and  JIMENEZ BUILES, Jovani Alberto. Knowledge-based model to support decision-making when choosing between two association data mining techniques. Rev. Lasallista Investig. [online]. 2017, vol.14, n.2, pp.41-50. ISSN 1794-4449.  https://doi.org/10.22507/rli.v14n2a4.

Introduction.

This paper presents the functionality and characterization of two data mining (DM) techniques: logistic regression and association rules (Apriori algorithm). It does so through a conceptual model that helps select the appropriate technique for a data mining project, based on criteria that describe the specific project to be developed.

Objective.

To support decision-making when choosing the most appropriate technique for the development of a data mining project.

Materials and methods.

The association rules and logistic regression techniques are characterized in this study, showing how their algorithms work.
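As an illustration of the association side, the following is a minimal sketch of the Apriori idea: build frequent itemsets level by level, then derive rules from them. It is not the authors' implementation; the function names `apriori` and `rules`, the thresholds, and the toy transactions are assumptions made for this example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every frequent itemset (illustrative sketch)."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {s for s in current if support(s) >= min_support}

    frequent = {}
    k = 1
    while current:
        for s in current:
            frequent[s] = support(s)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


def rules(frequent, min_confidence):
    """Derive rules (antecedent, consequent, support, confidence) from frequent itemsets."""
    out = []
    for itemset, sup in frequent.items():
        for r in range(1, len(itemset)):
            for antecedent in combinations(itemset, r):
                a = frozenset(antecedent)
                conf = sup / frequent[a]  # every subset of a frequent itemset is frequent
                if conf >= min_confidence:
                    out.append((a, itemset - a, sup, conf))
    return out


# Hypothetical market-basket data, used only to exercise the sketch.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
frequent_itemsets = apriori(baskets, min_support=0.5)
strong_rules = rules(frequent_itemsets, min_confidence=0.9)
```

With these toy baskets, the pair {milk, butter} falls below the support threshold, so the three-item candidate is pruned without being counted, which is the step that gives Apriori its efficiency.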

Results.

The proposed model serves as the input for implementing a knowledge-based system that emulates a human expert's knowledge when deciding which data mining technique to apply to a specific problem in a data mining project. It facilitates verification of the business processes of each technique and measures the correspondence between a project's objectives and the components provided by the logistic regression and association rules techniques.
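One way to picture the correspondence measure described above is a simple overlap score between a project's criteria and a profile of each technique. The sketch below is purely hypothetical: the criterion labels, the `TECHNIQUE_PROFILES` table, and the `rank_techniques` function are assumptions for illustration and are not taken from the paper's model.

```python
# Hypothetical profiles: what each technique typically needs or provides.
TECHNIQUE_PROFILES = {
    "logistic regression": {
        "labeled_target",        # supervised: a known outcome variable exists
        "binary_outcome",        # the outcome has two classes
        "probability_estimate",  # the project needs class probabilities
    },
    "association rules (Apriori)": {
        "transactional_data",      # records are sets of items
        "co_occurrence_patterns",  # the goal is to find items that appear together
        "no_target_variable",      # unsupervised: no outcome to predict
    },
}


def rank_techniques(project_criteria):
    """Rank techniques by the fraction of project criteria each profile covers."""
    criteria = set(project_criteria)
    if not criteria:
        raise ValueError("at least one project criterion is required")
    scores = {
        name: len(criteria & profile) / len(criteria)
        for name, profile in TECHNIQUE_PROFILES.items()
    }
    # Highest coverage first.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Example: a project that looks for products bought together.
ranking = rank_techniques({"transactional_data", "co_occurrence_patterns"})
```

A real knowledge-based system would encode an expert's rules rather than a flat overlap count; the score here only shows how project objectives can be compared against each technique's components.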

Conclusion.

Current and historical information is available for decision-making through the generated data mining models. Data for the models are taken from Data Warehouses, which are informational environments that provide an integrated and total view of the organization.

Keywords: association rules; Apriori algorithm; data mining; logistic regression.
