
stedsans is a package capable of doing geospatial analyses from text.

stedsans

This repository is for an exam project for the course Spatial Analytics at Aarhus University during the spring of 2021.

It is made by Jakob Grøhn Damgaard and Malte Højmark-Bertelsen.

For a demonstration, we strongly suggest using Google Colab.

The purpose of the project is to build a PyPI package capable of plotting a map of any location mentioned in a Danish sentence. To do so, we employ the Natural Language Processing (NLP) technique Named Entity Recognition (NER).

NER is the task of finding words in a text that constitute specific entities and tagging them with the corresponding labels. The most common entity types are person names (PER), locations (LOC) and organizations (ORG) (Ruder, 2019). The named entities are tagged according to the BIO tagging scheme, in which each word is marked as the beginning (B) of an entity, inside (I) an entity, or other (O), meaning that the word is not part of any of the defined entities. The resulting tags are listed in Table 1.

Table 1:

| NER-tag | Meaning |
| --- | --- |
| B-PER | Beginning of person name |
| I-PER | Inside a person name |
| B-LOC | Beginning of location |
| I-LOC | Inside a location |
| B-ORG | Beginning of organization |
| I-ORG | Inside an organization |
| O | Other |
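To make the BIO scheme concrete, the following is a minimal sketch of how a BIO-tagged sentence can be collapsed into whole entities, from which the locations can then be picked out. It is not the stedsans implementation: the function name `extract_entities`, the example sentence and its hand-written tags are assumptions made purely for illustration, and a real NER model would supply the tags.

```python
from typing import List, Optional, Tuple


def extract_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, label) pairs.

    Hypothetical helper for illustration; not part of the stedsans API.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label: Optional[str] = None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity begins; close any entity that is still open.
            if current_tokens:
                entities.append((" ".join(current_tokens), current_label))
            current_tokens = [token]
            current_label = tag[2:]
        elif tag.startswith("I-") and current_label == tag[2:]:
            # Continuation of the entity that is currently open.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity:
            # close the open entity and skip this token.
            if current_tokens:
                entities.append((" ".join(current_tokens), current_label))
            current_tokens = []
            current_label = None

    # Close an entity that runs to the end of the sentence.
    if current_tokens:
        entities.append((" ".join(current_tokens), current_label))
    return entities


# Hand-tagged example sentence (the tags are assumed, not model output).
tokens = ["Jakob", "Grøhn", "rejser", "fra", "Aarhus", "til", "Store", "Heddinge"]
tags = ["B-PER", "I-PER", "O", "O", "B-LOC", "O", "B-LOC", "I-LOC"]

entities = extract_entities(tokens, tags)
locations = [text for text, label in entities if label == "LOC"]
print(locations)
```

Only the entities labelled LOC are relevant for mapping; multi-word place names are kept together because their I-LOC tokens are joined to the preceding B-LOC token.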

Instructions

To use the code locally, start by cloning the repository and installing Anaconda for your OS. Then create a conda environment and install the requirements:

# From the directory of this repository
conda create -n [env_name] python=3.8  # Create conda environment
conda activate [env_name]  # Activate conda environment
pip install -r requirements.txt  # Install required packages

Afterwards, install geopandas using the pre-built binaries from Anaconda:

conda install geopandas

References

Ruder, S. (2019). Neural transfer learning for natural language processing (Doctoral dissertation, NUI Galway).


Contact

For help or further information, feel free to reach out to Jakob Grøhn Damgaard at bokajgd@gmail.com or Malte Højmark-Bertelsen at hjb@kmd.dk.

