Text analysis API

This is a Python client for the Bookworm natural language API, made available by Crawlica.

You can register to receive an API key at https://bookworm.crawlica.com/request_api_key/
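
Once you have a key, you may prefer not to hard-code it in your scripts. The following is a minimal sketch that reads the key from an environment variable; the variable name BOOKWORM_API_KEY and the helper load_api_key are assumptions made for illustration and are not part of the client.

```python
import os


def load_api_key(env_var='BOOKWORM_API_KEY'):
    """Return the API key stored in the given environment variable.

    The default variable name is a hypothetical convention, not something
    the bookworm client itself reads.
    """
    key = os.environ.get(env_var, '').strip()
    if not key:
        raise RuntimeError('Set the %s environment variable to your API key' % env_var)
    return key
```

The returned string can then be passed wherever the examples use API_KEY.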

Example usage

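import tabulate
from bookworm import Bookworm


API_KEY = '...'  # the key you received when registering

# Two Swedish example documents. In English:
#   "The two largest cities in Sweden are Stockholm and Gothenburg"
#   "Donald Trump becomes Barack Obama's successor"
documents = ['De två största städerna i Sverige är Stockholm och Göteborg',
             'Donald Trump blir Barack Obamas efterträdare']

# Create a client; 'sv' is the language code for Swedish, matching the documents.
b = Bookworm(API_KEY, 'sv')
# The result holds one list of entities per input document.
res = b.entities(documents)

# Print one table per document.
for ents in res:
    data = [[ent['entity'], ent['type'], ', '.join(ent['variants'])] for ent in ents]
    print(tabulate.tabulate(data, headers=['Entity', 'Type', 'Variants']))
    print('')

Output:

Entity     Type    Variants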
---------  ------  ----------
Sverige    geo     Sverige
Stockholm  geo     Stockholm
Göteborg   geo     Göteborg

Entity        Type    Variants
------------  ------  -------------
Donald Trump  person  Donald Trump
Barack Obama  person  Barack Obamas
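
The tables above suggest that each entity is a dictionary with the keys 'entity', 'type' and 'variants'. As a sketch of how such a result could be post-processed, the following groups entity names by type across all documents. The sample data is copied by hand from the output above rather than fetched from the API, and entities_by_type is a hypothetical helper, not part of the client.

```python
from collections import defaultdict

# Hand-written sample mirroring the output above. A real call to
# b.entities(documents) is assumed to return data of the same shape.
res = [
    [
        {'entity': 'Sverige', 'type': 'geo', 'variants': ['Sverige']},
        {'entity': 'Stockholm', 'type': 'geo', 'variants': ['Stockholm']},
        {'entity': 'Göteborg', 'type': 'geo', 'variants': ['Göteborg']},
    ],
    [
        {'entity': 'Donald Trump', 'type': 'person', 'variants': ['Donald Trump']},
        {'entity': 'Barack Obama', 'type': 'person', 'variants': ['Barack Obamas']},
    ],
]


def entities_by_type(results):
    """Collect the distinct entity names of each type over all documents."""
    grouped = defaultdict(set)
    for ents in results:
        for ent in ents:
            grouped[ent['type']].add(ent['entity'])
    # Sort the names so the result is deterministic.
    return {etype: sorted(names) for etype, names in grouped.items()}


by_type = entities_by_type(res)
print(by_type['geo'])     # ['Göteborg', 'Stockholm', 'Sverige']
print(by_type['person'])  # ['Barack Obama', 'Donald Trump']
```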

Installation

$ pip install bookworm

The example above also uses the tabulate package to print its tables:

$ pip install tabulate

Functionality currently available

  • autotag - Automatically tag documents with the best matching tags from our large database of tags.

  • wordsmash - Compare document sets to find what makes a subset distinctive: its defining words and phrases.

  • entities - Extract named entities (people, organisations, and places) from each document.

  • sentiment - Get a numerical representation of how positive or negative each document is.

  • wordcount - Get a count of the distinct words in the documents. Inflected forms are grouped under their base form, and stop words can be removed.

  • cluster - Automatically collect documents into groups, based on their topics, and describe those groups.

  • categorize - Automatically sort documents into a set of categories that you choose. The categories are defined by topic words, but a document need not contain any of those words, only words that are “close” to them.
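
To make the wordcount description above concrete, here is a small local sketch of the idea: count words over a set of documents while skipping stop words. It is an illustration only, not the service's implementation and not a call to the client. The function word_count and the stop word list are hypothetical, and grouping inflected forms under a base form is left out because it would need a lemmatizer.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny Swedish stop word list for the example.
STOP_WORDS = {'i', 'är', 'och'}


def word_count(documents, stop_words=frozenset()):
    """Count lower-cased words over all documents, skipping stop words."""
    counts = Counter()
    for doc in documents:
        # \w+ matches runs of letters and digits, including å, ä and ö.
        words = re.findall(r'\w+', doc.lower())
        counts.update(w for w in words if w not in stop_words)
    return counts


docs = ['Stockholm och Göteborg', 'Stockholm är störst']
counts = word_count(docs, STOP_WORDS)
print(counts['stockholm'])  # 2
print('och' in counts)      # False
```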
