Automatic Extraction of Geographic Context from Textual Data
Proceedings of the 6th Conference „Applied Information and Communication Technologies” 2013
Jurijs Nikolajevs, Gints Jēkabsons

The amount of information on the internet grows exponentially. It is not enough anymore just to have a general access to this huge amount of data, instead it is becoming a necessity to be able to use different kinds of automatic filters to retrieve just the information you actually want. One solution for the information filtering and retrieval is context analysis in which one of the contexts of interest is the geographic context. This paper studies the problem and methodology of geoparsing – recognition of geographic names in unstructured textual content for the aim of extracting geographic context. A prototype implementation of a geoparsing system, capable of automatically analyzing unstructured text, recognizing geographic information and marking geographic names, is developed. Empirical evaluation of the system using articles from real-world news showed that the average quality of its geographic name recognition varies around 75-100%. Possible applications of the developed prototype include automated grouping of any texts by their geographic contexts (e.g., in news portals) and location-based search. Preliminary results of empirical evaluation showed that the average rate of its geographic name recognition varies around 75-100%.


Keywords
geoparsing, geocoding, geographic information retrieval, natural language processing
Hyperlink
http://aict.itf.llu.lv/files/rakstkraj/2013/Nikolajevs_AICT2013.pdf

Nikolajevs, J., Jēkabsons, G. Automatic Extraction of Geographic Context from Textual Data. In: Proceedings of the 6th Conference „Applied Information and Communication Technologies”, Latvia, Jelgava, 25-26 April, 2013. Jelgava: Latvia University of Agriculture, 2013, pp.18-22. ISSN 2255-8586.

Publication language
English (en)
The Scientific Library of the Riga Technical University.
E-mail: uzzinas@rtu.lv; Phone: +371 28399196