Skip to main content

Industry-strength Natural Language Processing extensions for Keras.

Project description

KerasNLP: Modular NLP Workflows for Keras

Python contributions welcome

KerasNLP is a natural language processing library that works natively with TensorFlow, JAX, or PyTorch. Built on multi-backend Keras (Keras 3), these models, layers, metrics, and tokenizers can be trained and serialized in any framework and re-used in another without costly migrations.

KerasNLP supports users through their entire development cycle. Our workflows are built from modular components that have state-of-the-art preset weights when used out-of-the-box and are easily customizable when more control is needed.

This library is an extension of the core Keras API; all high-level modules are Layers or Models that receive that same level of polish as core Keras. If you are familiar with Keras, congratulations! You already understand most of KerasNLP.

See our Getting Started guide to start learning our API. We welcome contributions.

Quick Links

For everyone

For contributors

Installation

To install the latest official release:

pip install keras-nlp --upgrade

To install the latest unreleased changes to the library, we recommend using pip to install directly from the master branch on github:

pip install git+https://github.com/keras-team/keras-nlp.git --upgrade

Quickstart

Fine-tune BERT on a small sentiment analysis task using the keras_nlp.models API:

import os
os.environ["KERAS_BACKEND"] = "jax"  # Or "tensorflow", or "torch".

import keras_nlp
import tensorflow_datasets as tfds

imdb_train, imdb_test = tfds.load(
    "imdb_reviews",
    split=["train", "test"],
    as_supervised=True,
    batch_size=16,
)
# Load a BERT model.
classifier = keras_nlp.models.BertClassifier.from_preset(
    "bert_base_en_uncased", 
    num_classes=2,
    activation="softmax",
)
# Fine-tune on IMDb movie reviews.
classifier.fit(imdb_train, validation_data=imdb_test)
# Predict two new examples.
classifier.predict(["What an amazing movie!", "A total waste of my time."])

For more in depth guides and examples, visit https://keras.io/keras_nlp/.

Configuring your backend

Keras 3 is an upcoming release of the Keras library which supports TensorFlow, Jax or Torch as backends. This is supported today in KerasNLP, but will not be enabled by default until the official release of Keras 3. If you pip install keras-nlp and run a script or notebook without changes, you will be using TensorFlow and Keras 2.

If you would like to enable a preview of the Keras 3 behavior, you can do so by setting the KERAS_BACKEND environment variable. For example:

export KERAS_BACKEND=jax

Or in Colab, with:

import os
os.environ["KERAS_BACKEND"] = "jax"

import keras_nlp

[!IMPORTANT] Make sure to set the KERAS_BACKEND before import any Keras libraries, it will be used to set up Keras when it is first imported.

Until the Keras 3 release, KerasNLP will use a preview of Keras 3 on PyPI named keras-core.

[!IMPORTANT] If you set KERAS_BACKEND variable, you should import keras_core as keras instead of import keras. This is a temporary step until Keras 3 is out!

To restore the default Keras 2 behavior, unset KERAS_BACKEND before importing Keras and KerasNLP.

Compatibility

We follow Semantic Versioning, and plan to provide backwards compatibility guarantees both for code and saved models built with our components. While we continue with pre-release 0.y.z development, we may break compatibility at any time and APIs should not be consider stable.

Disclaimer

KerasNLP provides access to pre-trained models via the keras_nlp.models API. These pre-trained models are provided on an "as is" basis, without warranties or conditions of any kind. The following underlying models are provided by third parties, and subject to separate licenses: BART, DeBERTa, DistilBERT, GPT-2, OPT, RoBERTa, Whisper, and XLM-RoBERTa.

Citing KerasNLP

If KerasNLP helps your research, we appreciate your citations. Here is the BibTeX entry:

@misc{kerasnlp2022,
  title={KerasNLP},
  author={Watson, Matthew, and Qian, Chen, and Bischof, Jonathan and Chollet, 
  Fran\c{c}ois and others},
  year={2022},
  howpublished={\url{https://github.com/keras-team/keras-nlp}},
}

Acknowledgements

Thank you to all of our wonderful contributors!

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keras-nlp-nightly-0.7.0.dev2023112003.tar.gz (286.5 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file keras-nlp-nightly-0.7.0.dev2023112003.tar.gz.

File metadata

File hashes

Hashes for keras-nlp-nightly-0.7.0.dev2023112003.tar.gz
Algorithm Hash digest
SHA256 888106fdc6ba74627270e2cfac3a4d65802a4db7fbf1f36370a2b7138f55d81d
MD5 28d02bcacf0973fec55c0a3980ec8697
BLAKE2b-256 82e502e9207ad2577eb066cb1dd9d2f56da2994f0115a9e8c6d795700edf26db

See more details on using hashes here.

File details

Details for the file keras_nlp_nightly-0.7.0.dev2023112003-py3-none-any.whl.

File metadata

File hashes

Hashes for keras_nlp_nightly-0.7.0.dev2023112003-py3-none-any.whl
Algorithm Hash digest
SHA256 7403fbad6f62821ec58c167783bdca280c0365f1abf78486f842cab737861291
MD5 de694aea7e0d973a1d5c6a9ebfff1bee
BLAKE2b-256 eb52d1a16e1f04b40a540e4f2231037d2d302d63df49499391f6fdaa5f8c1ff8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page