entifyfishing-client

A client library for accessing Entity-fishing - Entity Recognition and Disambiguation

Usage

First, create a client:

from entifyfishing_client import Client

client = Client(base_url="http://nerd.huma-num.fr/nerd/service")

Now call your endpoint and use your models:

from entifyfishing_client.api.knowledge_base import get_concept, term_lookup
from entifyfishing_client.api.query_processing import disambiguate
from entifyfishing_client.models import (
    Concept,
    DisambiguateForm,
    Language,
    QueryParameters,
    QueryResultFile,
    QueryResultTermVector,
    QueryResultText,
    TermSenses,
)
from entifyfishing_client.types import File

form = DisambiguateForm(
    query=QueryParameters(
        text="""Austria invaded and fought the Serbian army at the Battle of Cer and Battle of Kolubara beginning on 12 August. 
            The army, led by general Paul von Hindenburg defeated Russia in a series of battles collectively known as the First Battle of Tannenberg (17 August – 2 September). 
            But the failed Russian invasion, causing the fresh German troops to move to the east, allowed the tactical Allied victory at the First Battle of the Marne. 
            Unfortunately for the Allies, the pro-German King Constantine I dismissed the pro-Allied government of E. Venizelos before the Allied expeditionary force could arrive.
            """,
        language=Language(lang="en"),
        mentions=["ner", "wikipedia"],
        nbest=False,
        customisation="generic",
        min_selector_score=0.2,
    )
)
r = disambiguate.sync_detailed(client=client, multipart_data=form)
if r.is_success:
    result: QueryResultText = r.parsed
    assert result is not None
    assert len(result.entities) > 0
    assert result.entities[0].raw_name == "Austria"
    assert result.entities[0].wikidata_id == "Q40"
    
r = get_concept.sync_detailed(id="Q40", client=client)
if r.is_success:
    # Read the parsed payload only after the request is known to have succeeded.
    concept: Concept = r.parsed
    assert concept is not None
    assert concept.raw_name == "Austria"
    assert concept.wikidata_id == "Q40"
    assert len(concept.statements) > 0
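
Once a call has returned, the entities can be post-processed with ordinary Python. The following is a minimal sketch, not part of the client: wikidata_ids_by_name is a hypothetical helper that relies only on the raw_name and wikidata_id attributes used in the assertions above, and the SimpleNamespace objects stand in for real entities so the snippet runs without a server.

```python
from types import SimpleNamespace
from typing import Any, Dict, Iterable


def wikidata_ids_by_name(entities: Iterable[Any]) -> Dict[str, str]:
    """Map each mention's raw_name to its wikidata_id, keeping the first match.

    Hypothetical helper: entities whose wikidata_id is not a non-empty
    string (for example an unset value) are skipped.
    """
    ids: Dict[str, str] = {}
    for entity in entities:
        wikidata_id = getattr(entity, "wikidata_id", None)
        if isinstance(wikidata_id, str) and wikidata_id and entity.raw_name not in ids:
            ids[entity.raw_name] = wikidata_id
    return ids


# Stand-in objects shaped like the entities in the example above.
sample_entities = [
    SimpleNamespace(raw_name="Austria", wikidata_id="Q40"),
    SimpleNamespace(raw_name="Austria", wikidata_id="Q40"),
    SimpleNamespace(raw_name="the army", wikidata_id=None),
]
ids = wikidata_ids_by_name(sample_entities)
print(ids)
```

With a real response, the same helper would presumably be called as wikidata_ids_by_name(result.entities).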

Or process a PDF file with the async version:

import asyncio
from pathlib import Path


async def main():
    # Path (not a plain str) is needed for .open() and .name below.
    pdf_file = Path("MyPDFFile.pdf")
    with pdf_file.open("rb") as fin:
        form = DisambiguateForm(
            query=QueryParameters(
                language=Language(lang="en"),
                mentions=["wikipedia"],
                nbest=False,
                customisation="generic",
                min_selector_score=0.2,
                sentence=True,
                structure="grobid",
            ),
            file=File(file_name=pdf_file.name, payload=fin, mime_type="application/pdf"),
        )
        # await is only valid inside a coroutine, hence the enclosing main().
        r = await disambiguate.asyncio_detailed(client=client, multipart_data=form)
        if r.is_success:
            result: QueryResultFile = r.parsed
            assert result is not None
            assert len(result.entities) > 0
            assert len(result.pages) > 0
            assert len(result.entities[0].pos) > 0


asyncio.run(main())
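
Because asyncio_detailed is a coroutine, several requests can be awaited concurrently. The sketch below shows one way to cap the number of requests in flight. It is an illustration under assumptions: run_bounded is a hypothetical helper, the limit of 2 is arbitrary, and fake_disambiguate is a stand-in coroutine (not the client's API) so the snippet runs offline.

```python
import asyncio
from typing import Awaitable, Callable, List


async def run_bounded(
    texts: List[str],
    call: Callable[[str], Awaitable[dict]],
    max_concurrent: int = 2,
) -> List[dict]:
    """Await call(text) for every text, at most max_concurrent at a time."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(text: str) -> dict:
        async with semaphore:
            return await call(text)

    # gather returns results in the same order as its arguments.
    return await asyncio.gather(*(guarded(t) for t in texts))


async def fake_disambiguate(text: str) -> dict:
    """Stand-in for a real request; NOT the client's API."""
    await asyncio.sleep(0)  # placeholder for network I/O
    return {"text": text, "n_chars": len(text)}


results = asyncio.run(run_bounded(["Austria", "Battle of Cer"], fake_disambiguate))
print(results)
```

In real use, the stand-in would be replaced by a small coroutine that builds a DisambiguateForm and awaits disambiguate.asyncio_detailed.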

By default, when you call an HTTPS API, the client verifies the server's SSL certificate. Certificate verification is highly recommended most of the time, but sometimes you may need to authenticate to a server (especially an internal server) using a custom certificate bundle.

client = Client(
    base_url="http://nerd.huma-num.fr/nerd/service", 
    verify_ssl="/path/to/certificate_bundle.pem",
)

You can also disable certificate validation altogether, but beware that this is a security risk.

client = Client(
    base_url="http://nerd.huma-num.fr/nerd/service",
    verify_ssl=False,
)
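
If the choice between these settings depends on the deployment, it can be made in one place. The following is a sketch only: verify_ssl_from_env and the environment variable names NERD_CA_BUNDLE and NERD_INSECURE are hypothetical and are not read by the client itself.

```python
import os
from typing import Mapping, Union


def verify_ssl_from_env(env: Mapping[str, str]) -> Union[bool, str]:
    """Pick a verify_ssl value from (hypothetical) environment variables.

    NERD_CA_BUNDLE  -> path to a custom certificate bundle
    NERD_INSECURE=1 -> disable verification (a security risk)
    otherwise       -> True, i.e. default verification
    """
    bundle = env.get("NERD_CA_BUNDLE")
    if bundle:
        return bundle
    if env.get("NERD_INSECURE") == "1":
        return False
    return True


verify_ssl = verify_ssl_from_env(os.environ)
print(verify_ssl)
```

The resulting value would then be passed as the verify_ssl argument shown in the examples above.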

Things to know:

  1. Every path/method combo becomes a Python module with four functions:

    1. sync: Blocking request that returns parsed data (if successful) or None
    2. sync_detailed: Blocking request that always returns a Response, optionally with parsed set if the request was successful.
    3. asyncio: Like sync but async instead of blocking
    4. asyncio_detailed: Like sync_detailed but async instead of blocking
  2. All path/query params, and bodies become method arguments.

  3. If your endpoint had any tags on it, the first tag will be used as a module name for the function (knowledge_base and query_processing above)

  4. Any endpoint which did not have a tag will be in entifyfishing_client.api.default
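
The relationship between the plain and the detailed variants can be illustrated with a self-contained mock. This is not the generated code: FakeResponse and the three functions below are simplified, hypothetical stand-ins that only mirror the contract described in the list above (the detailed variants always return a response object, while sync returns the parsed data or None).

```python
import asyncio
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeResponse:
    """Stand-in for the client's response wrapper (hypothetical, simplified)."""
    status_code: int
    parsed: Optional[dict]


def sync_detailed(concept_id: str) -> FakeResponse:
    """Always returns a response object; parsed is set only on success."""
    if concept_id == "Q40":
        return FakeResponse(status_code=200, parsed={"raw_name": "Austria"})
    return FakeResponse(status_code=404, parsed=None)


def sync(concept_id: str) -> Optional[dict]:
    """Returns just the parsed data, or None if the request failed."""
    return sync_detailed(concept_id).parsed


async def asyncio_detailed(concept_id: str) -> FakeResponse:
    """Same contract as sync_detailed, but awaitable."""
    await asyncio.sleep(0)  # placeholder for non-blocking I/O
    return sync_detailed(concept_id)


print(sync("Q40"))
print(sync_detailed("Q0").status_code)
print(asyncio.run(asyncio_detailed("Q40")).parsed)
```

Use the detailed variants when the status code matters, and the plain ones when only the parsed result is needed.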

Building / publishing this Client

This project uses Poetry to manage dependencies and packaging. Here are the basics:

  1. Update the metadata in pyproject.toml (e.g. authors, version)
  2. If you're using a private repository, configure it with Poetry
    1. poetry config repositories.<your-repository-name> <url-to-your-repository>
    2. poetry config http-basic.<your-repository-name> <username> <password>
  3. Publish the client with poetry publish --build -r <your-repository-name> or, if for public PyPI, just poetry publish --build

If you want to install this client into another project without publishing it (e.g. for development) then:

  1. If that project is using Poetry, you can simply do poetry add <path-to-this-client> from that project
  2. If that project is not using Poetry:
    1. Build a wheel with poetry build -f wheel
    2. Install that wheel from the other project with pip install <path-to-wheel>
