Skip to main content

PERDIDO Geoparser python library

Project description

Perdido Geoparser Python library

Open In Colab PyPI PyPI - License PyPI - Python Version

http://erig.univ-pau.fr/PERDIDO/

Installation

To install the latest stable version, you can use:

pip install --upgrade perdido

Quick start

Geoparsing

Open In Colab

Import

from perdido.geoparser import Geoparser

Run geoparser

geoparser = Geoparser(lang='fr')
doc = geoparser('Je visite la ville de Lyon, Annecy et le Mont-Blanc.')

Get tokens

for token in doc:
    print(f'token: {token.text}\tlemma: {token.lemma}\tpos: {token.pos}')

Print the XML-TEI output

print(doc.tei)

Print the GeoJSON output

print(doc.geojson)

Get the list of named entities

for entity in doc.ne:
    print(f'entity: {entity.text}\ttag: {entity.tag}')
    if entity.tag == 'place':
        for t in entity.toponyms:
            print(f' latitude: {t.lat}\tlongitude: {t.lng}\tsource {t.source}')

Get the list of nested named entities

for nestedEntity in doc.nne:
    print(f'entity: {nestedEntity.text}\ttag: {nestedEntity.tag}')
    if nestedEntity.tag == 'place':
        for t in nestedEntity.toponyms:
            print(f' latitude: {t.lat}\tlongitude: {t.lng}\tsource {t.source}')

Perdido Geoparser REST APIs

http://choucas.univ-pau.fr/docs#

Example: call REST API in Python

import requests

url = 'http://choucas.univ-pau.fr/PERDIDO/api/'
service = 'geoparsing'
content = 'Je visite la ville de Lyon, Annecy et le Mont-Blanc.'
parameters = {'api_key': 'demo', 'content': content}

r = requests.post(url+service, params=parameters)

print(r.text)

Acknowledgements

Perdido is an active project still under developpement.

This work was partially supported by the following projects:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

perdido-0.0.9.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

perdido-0.0.9-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file perdido-0.0.9.tar.gz.

File metadata

  • Download URL: perdido-0.0.9.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for perdido-0.0.9.tar.gz
Algorithm Hash digest
SHA256 8274f242c0274076895625430346d470ff42ac96b932376a220e4e962a2aac37
MD5 f32e3be89d8c30509b29a0ebb1d1dc8d
BLAKE2b-256 9c49733b2ff6a0249feeb408935a3ebe967f84c2f0058b633701ed02907c90b3

See more details on using hashes here.

File details

Details for the file perdido-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: perdido-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 8.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for perdido-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 fb21f307be4d110593fc9d4094dd8d1f58c2ad3606191ebe389feedb0ebc413d
MD5 c3ac51c65b2ebbd762efaa16075d06de
BLAKE2b-256 93393bbe9656f26dc0b7d505b6d950e3e84bff89e49630b12153bdfb7036a054

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page