SciELO - Scientific Electronic Library Online

 
Revista Facultad de Ingeniería

Print version ISSN 0121-1129

Abstract

RICO-SULAYES, Antonio. Towards a supervised rescoring system for unstructured data bases used to build specialized dictionaries. Rev. Fac. ing. [online]. 2015, vol.24, n.38, pp.97-106. ISSN 0121-1129.

This article proposes the architecture for a system that uses previously learned weights to sort query results from unstructured data bases when building specialized dictionaries. A common resource in the construction of dictionaries, unstructured data bases have been especially useful in providing information about lexical item frequencies and examples of use. However, when building specialized dictionaries, whose selection of lexical items does not rely on frequency, the role of these data bases is restricted to that of a simple provider of examples. Even in this task, the information that unstructured data bases provide may not be very useful when looking for specialized uses of lexical items that have various meanings and return very long lists of results. To address this problem, long lists of hits can be rescored with a supervised learning model that relies on previously helpful results. The allocation of a vast set of high-quality training data for this rescoring system is reported here. Finally, the architecture of such a system, an unprecedented tool in specialized lexicography, is proposed.

Keywords: unstructured data bases; supervised rescoring; specialized lexicography; dictionary making.
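
The rescoring idea described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the model, the features, or the weight values, so everything named below (the `Hit` class, the feature names, the linear scoring function, and the numbers in `WEIGHTS`) is a hypothetical assumption chosen only to show how previously learned weights might reorder a list of hits.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Hit:
    """One query result from an unstructured data base (hypothetical structure)."""
    text: str
    features: Dict[str, float]


def score(hit: Hit, weights: Dict[str, float]) -> float:
    # Linear score: weighted sum of the hit's features.
    # Features with no learned weight contribute nothing.
    return sum(weights.get(name, 0.0) * value
               for name, value in hit.features.items())


def rescore(hits: List[Hit], weights: Dict[str, float]) -> List[Hit]:
    # Return the hits sorted from highest to lowest score.
    return sorted(hits, key=lambda h: score(h, weights), reverse=True)


# Assumed weights, standing in for values learned from previously helpful results.
WEIGHTS = {
    "has_domain_term": 2.0,
    "is_definition_like": 1.5,
    "length_penalty": -0.5,
}

# Invented hits for the ambiguous lexical item "cell".
HITS = [
    Hit("The cell divides by mitosis.",
        {"has_domain_term": 1.0, "length_penalty": 0.2}),
    Hit("He was locked in a cell.",
        {"has_domain_term": 0.0, "length_penalty": 0.2}),
    Hit("A cell is the basic unit of life.",
        {"has_domain_term": 1.0, "is_definition_like": 1.0,
         "length_penalty": 0.3}),
]

RANKED = rescore(HITS, WEIGHTS)

if __name__ == "__main__":
    for hit in RANKED:
        print(f"{score(hit, WEIGHTS):6.2f}  {hit.text}")
```

In this sketch the specialized, definition-like example rises to the top of the list and the general-language use of the word sinks to the bottom, which is the kind of reordering a rescoring system of this type would aim for.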
