Natural language processing support for Pandas dataframes.
Project description
Natural language processing support for Pandas DataFrames.
Text Extensions for Pandas adds extension types to Pandas DataFrames for representing natural language data, plus a library of functions for working with these extension types.
Features
SpanArray: A Pandas extension type for spans of text
- Connect features with regions of a document
- Visualize the internal data of your NLP application
- Analyze the accuracy of your models
- Combine the results of multiple models
TensorArray: A Pandas extension type for tensors
- Represent BERT embeddings in a Pandas series
- Store logits and other feature vectors in a Pandas series
- Store an entire time series in each cell of a Pandas series
Pandas front-ends for popular NLP toolkits
- SpaCy
- Transformers
- IBM Watson Natural Language Understanding
- IBM Watson Discovry Table Understanding
Documentation
For examples of how to use the library, take a look at the notebooks in this directory.
API documentation can be found at https://text-extensions-for-pandas.readthedocs.io/en/latest/
Source Code
The source code for Text Extensions for Pandas is available at https://github.com/CODAIT/text-extensions-for-pandas.
We welcome code and documentation contributions! See the README file for more information on contributing.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for text_extensions_for_pandas-0.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5a929d5c7404fa50200b1d66983c2407204f6a24d6517c889654c65433960dd9 |
|
MD5 | 4c480e8f9c8a8d2baf42817f05890e09 |
|
BLAKE2b-256 | cd149b7c799cbe66df12f08e5c665d0cdeb7fa62cb6a4325978fa43f414ce729 |
Hashes for text_extensions_for_pandas-0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 739bfa4b2feeba9e099ba97203c720d9c45a8b1b9b2d2e12577d99d82102b84f |
|
MD5 | d233e0602ab7f89d908873eaf15ffe68 |
|
BLAKE2b-256 | 24ba2197686a831de2e6b80b2da34a84e38abe7de6b9ba8b59744afa8f3f8983 |