Bedrock is a high-level text pre-processing API written in Python that can run on NLTK or spaCy as its backend.

You have discovered bedrock

Bedrock is a high-level text pre-processing API written in Python that can run on NLTK or spaCy as its backend. It lets you quickly perform the text-processing groundwork. It does the menial work, so you don’t have to.

Use this library if you find the following highlights useful:

* Fast prototyping
* Switching between different backends
* Working in batches rather than writing loops
* Support for DataFrame inputs and outputs

Install bedrock in a jiffy:

pip install bedrock
bedrock download all

From zero to bedrock hero in 10 seconds

Now you can run

import bedrock
bedrock.process.pipeline('Hallo Welt')

Congrats! 🎉

Engines and Languages

Currently bedrock supports the following engines:

* spacy
* nltk
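
To illustrate what switching between engines and working in batches can look like, here is a minimal sketch of a name-to-function registry. It does not use bedrock's API: `ENGINES`, `tokenize_batch`, and the two toy tokenizers are hypothetical stand-ins, and neither of them calls spaCy or NLTK.

```python
from typing import Callable, Dict, List


def _whitespace_tokens(text: str) -> List[str]:
    """Toy tokenizer: split on whitespace, keep case."""
    return text.split()


def _lowercase_tokens(text: str) -> List[str]:
    """Toy tokenizer: lowercase, then split on whitespace."""
    return text.lower().split()


# Hypothetical registry. The keys reuse the engine names listed above,
# but the values are placeholders, not real spaCy or NLTK calls.
ENGINES: Dict[str, Callable[[str], List[str]]] = {
    "spacy": _whitespace_tokens,
    "nltk": _lowercase_tokens,
}


def tokenize_batch(texts: List[str], engine: str = "spacy") -> List[List[str]]:
    """Apply the selected engine to every text in one call."""
    if engine not in ENGINES:
        raise ValueError(f"Unknown engine: {engine!r}")
    tokenizer = ENGINES[engine]
    return [tokenizer(text) for text in texts]


batch = ["Hallo Welt", "Hello World"]
print(tokenize_batch(batch, engine="spacy"))
print(tokenize_batch(batch, engine="nltk"))
```

The caller changes only the `engine` argument; the loop over the batch lives inside the helper.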

And the following languages and corresponding download arguments:

* English (‘en’ or ‘english’)
* German (‘de’, ‘german’ or ‘deutsch’)

Installation and usage

Package installation

pip install bedrock

Install support for all languages:

bedrock download all

Install support only for English:

bedrock download en

Install support only for German:

bedrock download de
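
If you script the download step, it can help to validate the language argument first. The sketch below maps the documented aliases to a short code and builds the command as an argument list. `LANGUAGE_ALIASES` and `download_command` are hypothetical helper names, not part of bedrock, and the sketch only builds the command; it does not run it.

```python
from typing import Dict, List

# Aliases taken from the language list above; "all" is handled separately.
LANGUAGE_ALIASES: Dict[str, str] = {
    "en": "en",
    "english": "en",
    "de": "de",
    "german": "de",
    "deutsch": "de",
}


def download_command(language: str) -> List[str]:
    """Return the argument list for `bedrock download <language>`."""
    key = language.strip().lower()
    if key == "all":
        code = "all"
    elif key in LANGUAGE_ALIASES:
        code = LANGUAGE_ALIASES[key]
    else:
        raise ValueError(f"Unsupported language: {language!r}")
    return ["bedrock", "download", code]


print(" ".join(download_command("Deutsch")))  # prints "bedrock download de"
```

The returned list can be passed to `subprocess.run(..., check=True)` once bedrock is installed.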

Import modules from the package in your code:

from bedrock import process    # Processing texts
from bedrock import collection # Loading data collections
from bedrock import common     # Some common functions
from bedrock import feature    # Feature extraction
from bedrock import viz        # Visualizations
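
To check that the installation exposes these modules before relying on them, a small standard-library check can be used. This is a sketch: `available_submodules` is a hypothetical helper, and it reports only whether each module can be located, not whether it works.

```python
import importlib.util
from typing import Dict

# Module names taken from the import lines above.
SUBMODULES = ("process", "collection", "common", "feature", "viz")


def available_submodules(package: str = "bedrock") -> Dict[str, bool]:
    """Map each expected submodule name to True if it can be located."""
    status = {}
    for name in SUBMODULES:
        try:
            spec = importlib.util.find_spec(f"{package}.{name}")
        except (ImportError, AttributeError, ValueError):
            # Raised when the parent package is missing or is not a package.
            spec = None
        status[name] = spec is not None
    return status


for module, found in available_submodules().items():
    print(f"bedrock.{module}: {'found' if found else 'missing'}")
```

Note that `find_spec` imports the parent package in order to locate its submodules.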
