Analysis of Satellite Images Using Deep Learning Techniques and Remotely Piloted Aircraft for a Detailed Description of Tertiary Roads

Moreno-Vergara, María-Camila; Sarmiento-Iscala, Brayan-Daniel; Casares-Pavia, Fabián-Enrique; Angulo-Rodríguez, Yerson-Duvan; Morales-Arenales, Danilo-José; Moreno-Vergara, María-Camila; Sarmiento-Iscala, Brayan-Daniel; Casares-Pavia, Fabián-Enrique; Angulo-Rodríguez, Yerson-Duvan; Morales-Arenales, Danilo-José

doi:10.19053/01211129.v30.n58.2021.13816

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Revista Facultad de Ingeniería

Print version ISSN 0121-1129On-line version ISSN 2357-5328

Rev. Fac. ing. vol.30 no.58 Tunja Out./Dec. 2021 Epub Dec 22, 2021

https://doi.org/10.19053/01211129.v30.n58.2021.13816

Artículos

Analysis of Satellite Images Using Deep Learning Techniques and Remotely Piloted Aircraft for a Detailed Description of Tertiary Roads

Análisis de imágenes satelitales usando técnicas de aprendizaje profundo y aeronaves remotamente pilotadas para la descripción a detalle de las vías terciarias

Análise de imagens de satélite usando técnicas de aprendizado profundo e aeronaves pilotadas remotamente para a descrição detalhada de estradas terciárias

María-Camila Moreno-Vergara¹
http://orcid.org/0000-0002-9732-1622

Brayan-Daniel Sarmiento-Iscala²
http://orcid.org/0000-0002-0447-0902

Fabián-Enrique Casares-Pavia³
http://orcid.org/0000-0001-6593-8807

Yerson-Duvan Angulo-Rodríguez⁴
http://orcid.org/0000-0002-9037-2283

Danilo-José Morales-Arenales⁵
http://orcid.org/0000-0001-8650-7889

^¹ Universidad de Pamplona (Pamplona-Norte de Santander, Colombia). maria.moreno6@unipamplona.edu.co.

^² Universidad de Pamplona (Pamplona-Norte de Santander, Colombia). brayan.sarmiento@unipamplona.edu.co.

^³Universidad de Pamplona (Pamplona-Norte de Santander, Colombia). fabian.casares@unipamplona.edu.co.

^⁴ Universidad de Pamplona (Pamplona-Norte de Santander, Colombia). duvan.angulo@unipamplona.edu.co.

^⁵Universidad de Pamplona (Pamplona-Norte de Santander, Colombia). danilo.morales@unipamplona.edu.co.

Abstract

This document presents the results of a proof of concept for describing with more detail the social and complementary infrastructure around the tertiary roads of the Taminango region in the department of Nariño, Colombia. A dataset with samples of free satellite images from Google Maps and OpenStreetMaps was obtained. Then, a supervised deep learning algorithm with FCN (Fully Convolutional Network) topology is applied for the points of interest labeling process and the identification of the state of the roads using Keras and TensorFlow. Subsequently, a system consisting of a desktop application and a mobile application that integrates the functionalities of the trained algorithm through an intuitive interface and simple logic that stimulates interaction with the consultant is proposed. The desktop application includes a GUI designed in Python for tagging points of interest. The mobile application was developed with Flutter and comprises a database with documentation of the routes and road network in the region. It includes an augmented reality system in Vuforia Engine and Unity with virtual content developed in Blender and SolidWorks; A 3D model of the map of the region has been recreated for easier interaction and visualization of the points of interest and the status of the studied roads. In addition, complementary information was collected through remotely piloted aircraft for data acquisition in environments difficult to access, and through the community participation for the description and identification of areas not visible on official maps or statistics. This study addresses a method for the classification and identification of state of tertiary road network of the studied region, as well as labeling points of interest for the efficient management of resources for the development of new infrastructure there.

Keywords: augmented reality; community participation; deep learning; remotely piloted aircraft; satellite images; tertiary roads

Resumen

Este documento presenta los resultados de una prueba de concepto para la descripción con mayor detalle de la infraestructura social y complementaria alrededor de las vías terciarias de la región de Taminango, en el departamento de Nariño. Inicialmente, se obtuvo un conjunto de datos con muestras de imágenes satelitales de información libre de Google Maps y OpenStreetMaps. Seguidamente, se aplicaron algoritmos de aprendizaje profundo supervisado con topología de red FCN (Fully Convolutional Network) para el proceso de etiquetado de los puntos de interés y la identificación del estado de las vías mediante el uso de Keras y TensorFlow. Posteriormente, se propone un sistema compuesto por una aplicación de escritorio y una aplicación móvil que integre las funcionalidades del algoritmo entrenado a través de una interfaz intuitiva y de lógica simple que estimule la interacción con el consultor. La aplicación de escritorio contempla una GUI diseñada en Python para el etiquetado de puntos de interés. Por su parte, la aplicación móvil fue desarrollada con Flutter y comprende una base de datos con documentación de las rutas y red vial de la región. Incluye un sistema de realidad aumentada en Vuforia Engine y Unity con contenido virtual desarrollado en Blender y SolidWorks; se ha recreado un modelo 3D del mapa de la región para la interacción y visualización con mayor facilidad de los puntos de interés y el estado de las vías de estudio. Además, se recolectó información complementaria a través de aeronaves remotamente pilotadas, para la adquisición de datos en entornos de difícil acceso, y de la participación comunitaria para la descripción e identificación de áreas no visibles en mapas oficiales o estadísticas. En este estudio se aborda un método para la clasificación e identificación del estado de la red vial terciaria de la región, así como también se presenta el etiquetado de puntos de interés para el manejo eficiente de los recursos destinados al desarrollo de nueva infraestructura en la región.

Palabras clave: aeronaves remotamente pilotadas; aprendizaje profundo; imágenes satelitales; participación comunitaria; realidad aumentada; vías terciarias

Resumo

Este documento apresenta os resultados de uma prova de conceito para uma descrição mais detalhada da infraestrutura social e complementar no entorno das estradas terciárias da região de Taminango, no departamento de Nariño. Inicialmente, um conjunto de dados foi obtido com amostras de imagens gratuitas de imagens de satélite do Google Maps e OpenStreetMaps. Posteriormente, algoritmos de aprendizado profundo supervisionado com topologia de rede FCN (Fully Convolutional Network) foram aplicados para o processo de rotulagem dos pontos de interesse e identificação do estado das estradas usando Keras e TensorFlow. Posteriormente, é proposto um sistema composto por um aplicativo desktop e um aplicativo móvel que integra as funcionalidades do algoritmo treinado por meio de uma interface intuitiva e lógica simples que estimula a interação com o consultor. O aplicativo de desktop inclui uma GUI projetada em Python para a rotulagem de pontos de interesse. Por seu turno, a aplicação móvel foi desenvolvida com Flutter e inclui uma base de dados com documentação das rotas e rede viária da região. Inclui um sistema de realidade aumentada em Vuforia Engine e Unity com conteúdo virtual desenvolvido em Blender e SolidWorks; Um modelo 3D do mapa da região foi recriado para facilitar a interação e visualização dos pontos de interesse e do estado das estradas de estudo. Além disso, foram coletadas informações complementares por meio de aeronaves pilotadas remotamente, para aquisição de dados em ambientes de difícil acesso, e da participação da comunidade para descrição e identificação de áreas não visíveis em mapas oficiais ou estatísticas. Este estudo aborda um método de classificação e identificação da situação da malha rodoviária terciária na região, bem como a marcação de pontos de interesse para a gestão eficiente de recursos para o desenvolvimento de novas infraestruturas na região.

Palavras-chave: aeronave pilotada remotamente; aprendizagem profunda; imagens de satélite; participação da comunidade; realidade aumentada; rotas terciárias

I. INTRODUCTION

The tertiary road network in Colombia plays an important role in national, regional, and local integration. This approach is rooted in the interconnections of roads between footpaths and access to the national highway by isolated communities. Tertiary roads occupy a percentage of 67% of the total road network in Colombia, representing the largest transport infrastructure in the national territory. The relevance of this road network is also reflected in the policies and efforts carried out by the national government to generate development and connectivity in rural areas affected by the armed conflict. Given that the existing documentation on tertiary roads is scarce, incomplete, and in most cases very outdated, it is necessary to develop tools that allow the roads and points of interest to be easily identified to the people who travel them. One solution for this is through the use of deep learning techniques that make it possible to detect entities in satellite images with several layers in neural networks to carry out the analysis of points of interest in a versatile way. [¹], [²], [³], [⁴].

Recently, studies have been conducted using Deep Learning [⁵] - [⁸]. In [⁵], deep learning techniques are presented to predict the consumer spending of a village from satellite imagery and perform object detection and regression. Furthermore, in [⁶], deep learning is used for pathway state analysis and anomaly detection. On the other hand, in [⁷], deep learning techniques based on single class detection are used to identify road safety attributes, that is, to train a model for each attribute. In the case of [⁸], there is evidence of a monitoring and surveillance system based on the analysis of images captured with low-altitude drones using deep learning techniques.

Comparing the previous contributions, the purpose of this article is the development of a method for the classification and labeling of points of interest such as park infrastructures and sports spaces. In addition to the identification of tertiary roads using supervised deep learning techniques with the topology of FCN network in satellite images from Google Maps and OpenStreetMaps. The collection of complementary and more detailed information is carried out through the implementation of remotely piloted aircraft and the contribution of data by the local community. Consequently, a mobile application in Flutter and a desktop application in Python are developed to facilitate the actions of the consultants. Finally, an augmented reality system developed with Vuforia Engine in Unity is intended for the distribution of content from the road network of the study region. This research project aims to be a support instrument for decision-making in the execution of viable inventories with improvements in the action times of private and state entities to contribute to the connectivity and competitiveness of the region by applying artificial intelligence techniques and augmented reality systems for the visualization of the most relevant road documentation in the region in immersive and interactive scenarios.

The article is organized as follows: The study and planning phase of the proof of concept is presented in Section 2. This section includes data collection, artificial intelligence concepts, mobile and desktop application development, and the design of the augmented reality system for the distribution of virtual content related to the study. In Section 3, the results obtained for the classification and labeling of points of interest and identification of the most distinctive tertiary roads in the region are presented. In addition, the results of the implementation of the mobile application and the visualization of virtual content through the augmented reality system are described. Finally, in Section 4, the final conclusions of the study are presented.

II. METHODOLOGY

For the study and planning phase of the proof of concept, the Logical Framework Approach (LFA) was used to systematically and logically define the objectives of this research, facilitating the coordination and concertation of long-term strategic actions.

The case study comprises the Taminango region in Nariño. In this context, the groups involved have been analyzed and characterized, i.e., the National Planning Department; the Government; the Municipal Council; transportation, energy, communication, and construction companies; and the Taminango community. Thus identifying problems related to the dispersion of official data consulted, lack of road inventories, high financing costs, and lack of academic research in the area. For this reason, a series of alternatives that make up the structure of this article have been proposed.

A. Dataset

Initially, a map of the area of interest was downloaded through SAS.Planet - a free software for acquiring high resolution georeferenced satellite images from multiple free access sources (freeware under the GNU license), for example: Google Earth, Google Maps, OpenStreetMaps, Bing Maps, and GeoHub. In Figure 1, the sample taken from the SAS.Planet program can be observed.

Fig. 1 Satellite image of the Taminango region on Google Maps.

For the registration of points of interest, satellite images have been extracted from Google Maps considering the identification of parks that contain a playing field. By applying image segmentation algorithms in MATLAB with restrictive conditions of color and area, it was possible to optimize the labeling process for the creation of the image bank. Figure 2. illustrates the binarization segmentation process for park labeling in Taminango.

Fig. 2 (A) Decomposition of channels in the satellite image; (B) Identification of the park using segmentation techniques; (C) Final labeling of the point of interest.

During the data acquisition process, there were difficulties in identifying the Taminango road network due to the outdated data and resolution of the images. To correct this situation, satellite images were collected through OpenStreetMaps to later carry out the preprocessing and keep only the information corresponding to the roads. In addition, the routes that were not indicated in the database have been manually marked. In Figure 3. the extraction of images for the recognition of roads can be observed.

Fig. 3 Image of the Taminango region road network in OpenStreetMaps.

In order to facilitate the training of a neural network with the ability to identify the road network of the region, the satellite and pre-processed images of the roads have been divided into equal parts to obtain an image bank, assigning an output to each input desired. This article presents a dataset that contains 441 inputs and 441 outputs in a resolution of 256x256 pixels. Figure 4. illustrates the process of dividing images.

Fig. 4 (A) Satellite image division process; (B) Dividing the processed satellite image.

By examining Figure 5. it can be seen how the image bank provides a satisfactory output to the various inputs from the raw satellite images.

Fig. 5 Examples of inputs and outputs of the proposed dataset.

B. Artificial Intelligence

For the classification of tertiary roads in satellite images, an FCN-type network architecture with a binary classification model as a problem is proposed, ruling out the use of the transfer learning technique. For the case study, the satellite images will be the training data and the binary images will be the target images, the entire database for training the model had a preprocessing in order to facilitate the training of the neural network. Due to all the tools offered by the TensorFlow library in Python, it was decided to use it for model training. The model receives as input 256x256x3 preprocessed images through the 3 RGB channels of the image, with a loss function binary cross-entropy and Adam as optimizer. Having the model already trained, it is validated with the previously assigned validation data, in this way it is possible to check its performance before being implemented. Figure 6. illustrates an example application of the algorithm.

Fig. 6 Identification process of a tertiary road.

C. Desktop Application

The design of the graphical interface was carried out in Python with the Tkinter library for the development of GUIs. The interface includes a button that allows loading an image from the user files. Then, there are two markers to select an action: detection of tertiary roads or labeling of points of interest. Figure 7 presents a screenshot of the desktop application development.

Fig. 7 Desktop application in Spyder (Python 3.8).

D. Mobile Application

The development of a platform called "Atlas-AR" is proposed to host the most relevant contents of the study to reduce the degree of uncertainty and the dispersion of data regarding the number and status of the tertiary road network of Taminango. The design and implementation stages of the application were carried out in the Flutter framework, this is an open-source SDK developed by Google that allows multiplatform mobile applications with a single code base, considerably reducing coding times. In Figure 8 the programming interface where the mobile application was made can be observed.

Fig. 8 Functional test of the mobile application in Flutter.

The identification of areas that are not visible on official maps or statistics required the process of gathering information through a system of remotely piloted aircraft in areas difficult to access. On the other hand, the participation of the local community was required to capture infrastructure images that are not registered in the database and the detailed description of the most relevant roads in the region. This information is stored on an Amazon Web Services server.

E. Augmented Reality

Considering previously processed images, an augmented reality interface was developed using Vuforia Engine and Unity to stimulate user interaction with virtual content generated in Blender and SolidWorks [⁹], [¹⁰]. For the above, multiple layers have been arranged: the first layer illustrates a satellite image, the second layer represents a relief of the region, and the third layer corresponds to the road network. This last layer contains icons and descriptive boxes about the state of the road or labeling of points of interest. The result of this configuration is seen in Figure 9.

Fig. 9 Augmented reality system preview in Unity.

III. RESULTS

In Figure 10. the results of the classification and labeling of a park are presented according to the algorithm shown in previous sections. Likewise, it is possible to identify the most distinctive tertiary roads in the region. In Figure 11., the final result of the mobile application is observed; it is composed of 2 stages that bring together both the visualization of the content in the database and the contribution of images through mobile phones by the local community.

Fig. 10 Desktop application running.

Fig. 11 (A) Presentation tab of the mobile application; (B) Tab for the contribution of infrastructure images by the community; (C) Gratitude tab for users.

Finally, the visualization of the virtual content was carried out through hyperlinks in the physical world [¹¹]. The mobile application generates QR (Quick Response) codes that link to the different models and documentation stored in the database, as shown in Figure 12.

Fig. 12 Augmented reality system in operation.

IV. CONCLUSIONS

The development of a method for the classification of tertiary roads and the labeling of points of interest was achieved with deep learning techniques, remotely piloted aircraft, and community participation.

It is essential to define the limit of images for the creation of the dataset. This is because a greater number of images required greater processing and storage capabilities. Therefore, the strategy proposed in this document may have implications due to the angle of incidence of the sun (shadows or overexposure), atmospheric conditions, or location of the satellite.

The augmented reality system facilitated the processes of exploration, understanding, and retention of the most relevant information about the road network and points of interest in Taminango.

Currently, various studies are being carried out to scale this proof of concept to other regions using the databases developed. In addition, the implementation of drones for photogrammetry is suggested for the study and for obtaining measurements with precision of dimensions and position of the study places.

REFERENCES

[1] Y. Nachmany, H. Alemohammad, "Detecting roads from satellite imagery in the developing world," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019, pp. 83-89 [ Links ]

[2] S. G. Kanakaraddi, A. K. Chikaraddi, B. L. Pooja, T. Preeti, “Detection of Roads in Satellite Images Using Deep Learning Technique,” ICT Analysis and Applications, vol. 154, pp. 441-451, 2021. https://doi.org/10.1007/978-981-15-8354-4_44 [ Links ]

[3] A. Courtial, A. El Ayedi , G. Touya, X. Zhang, “Exploring the Potential of Deep Learning Segmentation for Mountain Roads Generalisation,” ISPRS International Journal of Geo-Information, vol. 9, no. 5, p. 338, 2020. https://doi.org/10.3390/ijgi9050338 [ Links ]

[4] N. Choquehuayta, “Detección de embarcaciones utilizando Deep Learning e imágenes satelitales ópticas,” Grade Thesis, Universidad Nacional San Agustin de Arequipa, Arequipa, Perú, 2020 [ Links ]

[5] K. Ayush, B. Uzkent, M. Burke, D. Lobell, S. Ermon, “Generating Interpretable Poverty Maps using Object Detection in Satellite Images,” in IJCAI International Joint Conference on Artificial Intelligence, 2020, pp. 4410-4416. https://doi.org/10.24963/ijcai.2020/608 [ Links ]

[6] B. Varona, A. Monteserin, A. Teyseyre, “A deep learning approach to automatic road surface monitoring and pothole detection,” Personal and Ubiquitous Computing, vol. 24, no. 4, pp. 519-534, 2020. https://doi.org/10.1007/s00779-019-01234-z [ Links ]

[7] P. Sanjeewani, B. Verma, “Single class detection-based deep learning approach for identification of road safety attributes,” Neural Computing and Applications, vol. 33, pp. 9691-9702, 2021. https://doi.org/10.1007/s00521-021-05734-z [ Links ]

[8] H. Gupta, O. P. Verma, “Monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach,” Multimedia Tools and Applications, vol. 2021, pp. 1-21, 2021. https://doi.org/10.1007/s11042-021-11146-x [ Links ]

[9] X. Zhou, L. Tang, D. Lin, W. Han, “Virtual & augmented reality for biological microscope in experiment education,” Virtual Reality & Intelligent Hardware, vol. 2, no. 4, pp. 316-329, 2020. https://doi.org/10.1016/j.vrih.2020.07.004 [ Links ]

[10] H. A. Bautista, M. A. Mendoza Pérez, R. G. Cruz Flores, "Diseño de una aplicación en realidad aumentada para la enseñanza de un seguidor de línea," RILCO: Revista de Investigación Latinoamericana en Competitividad Organizacional, vol. 2, no. 7, pp. 9, 2020 [ Links ]

[11] I. M. Melo Bohórquez, "Realidad aumentada y aplicaciones," Tecnología Investigación y Academia, vol. 6, no. 1, p. 28-35, 2018 [ Links ]

Citation: M.-C. Moreno-Vergara, B.-D. Sarmiento-Iscala, F.-E. Casares-Pavia, Y.-D. Angulo-Rodríguez, D.-J. Morales-Arenales, “Analysis of Satellite Images Using Deep Learning Techniques and Remotely Piloted Aircraft for a Detailed Description of Tertiary Roads,” Revista Facultad de Ingeniería, vol. 30 (58), e13816, 2021. https://doi.org/10.19053/01211129.v30.n58.2021.13816

AUTHORS’ CONTRIBUTION

María-Camila Moreno-Vergara: Supervision, Formal Analysis, Investigation, Methodology, Writing - Review and editing.

Brayan-Daniel Sarmiento-Iscala: Conceptualization, Investigation, Validation, Writing - Review and editing.

Fabián-Enrique Casares-Pavia: Methodology, Conceptualization, Writing - Review and editing.

Yerson-Duvan Angulo-Rodríguez: Investigation, Writing - Review and editing.

Danilo-José Morales-Arenales: Validation, Writing - Review and editing.

Received: October 11, 2021; Accepted: December 02, 2021; Published: December 08, 2021

This is an open-access article distributed under the terms of the Creative Commons Attribution License