SciELO - Scientific Electronic Library Online

 
vol.27 número3Alternativas para tratar lodos originados en sistemas de tratamiento de aguas residuales: una revisiónSobre la determinación del umbral del rango del factor de intensidad de tensiones y los factores que lo afectan índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • En proceso de indezaciónCitado por Google
  • No hay articulos similaresSimilares en SciELO
  • En proceso de indezaciónSimilares en Google

Compartir


Ingeniería

versión impresa ISSN 0121-750X

Resumen

ROSSO-MATEUS, Andrés E.; MONTILLA-MONTILLA, Yeimy. M.  y  GARZON-MARTINEZ, Sonia C.. Methodology for the Collection and Analysis of Real Estate Data Using Alternative Sources: Case Study in Three Medium-Sized Cities of Colombia. ing. [online]. 2022, vol.27, n.3, e400.  Epub 08-Nov-2022. ISSN 0121-750X.  https://doi.org/10.14483/23448393.17952.

Context:

The Multipurpose Cadastre public policy needs to consolidate real estate information from different sources for analysis, such as offers, transactions, and construction costs, among others. Real estate websites are part of these sources of information, although they have not yet been included in commercial analysis. In light of the above, it is necessary to review a methodology that allows optimal access to these web platforms and facilitates the analysis of the variables provided therein, which are crucial to a property’s commercial value. A study case was carried out in three Colombian cities: Fusagasugá, Manizales, and Villavicencio.

Method:

The method is implemented in two stages: (i) web scraping, which allows obtaining the information links from real estate web pages and downloading their data, and (ii) analyzing real estate data by developing a workflow that starts with data exploration and cleaning, continues with pre-modeling, and ends by modeling the crucial variables in the determination of real estate value using machine learning techniques.

Results:

By applying machine learning techniques, it was possible to automate the collection, cleaning, storage, and analysis of real estate data from web platforms, as well as to outline two models (Ridge Regression and Random Forest), which, according to their mean absolute percentage error (0,34 and 0,35, respectively), allow predicting the commercial value of a property while considering internal and external explanatory variables.

Conclusions:

Obtaining and analyzing real estate data from alternative sources such as web platforms through machine learning techniques contributes significantly to addressing the high information de-mand of the country’s cadastre. However, it is necessary to expand the supply of this information to rural areas, which have less access and availability to it.

Palabras clave : Multipurpose Cadastre; real estate dynamics; Real Estate Market; Commercial Value; web scraping..

        · resumen en Español     · texto en Español     · Español ( pdf )