A SpaCy pipeline and models for NLP on indian legal text.
Project description
Opennyai : An efficient NLP Pipeline for Indian Legal documents
Opennyai is a python library for natural language preprocessing on Indian legal texts.
This library provides unified access to the following 3 pre-trained AI models developed by OpenNyAI which focus on Indian court judgments:
- Named Entity Recognition (NER): GitHub , paper, Huggingface Datasets
- Judgment Structuring using Sentence Rhetorical Roles: GitHub , paper, Huggingface Datasets
- Extractive Summarizer: GitHub
This library is mainly for running the pretrained models on your custom input judgments text. For more details about data and model training, please refer to individual git repo links.
🔧 1. Installation
To get started using opennyai first create a new python virtual environment using conda:
Supports python 3.8, 3.9, 3.10
conda create -n opennyai python=3.8
conda activate opennyai
Install it using pip by running the following line in your terminal
pip install -U opennyai
For GPU support
If you want to utilize spacy with GPU please install Cupy and cudatoolkit dependency of appropriate version.
conda install cudatoolkit==<your_cuda_version> #### E.g. cudatoolkit==11.2
pip install cupy-cuda<your_cuda_version> ##### E.g. cupy-cuda112
In case of any issue with installation please refer to spacy installation with cupy
Remember you need spacy of 3.2.4 version for models to work perfectly.
📖 2. Documentation
Please refer to the Documentation for more details.
👩💻 3. Usage
To run the 3 OpenNyAI models on judgment texts of your choice please run following python code
from opennyai import Pipeline
from opennyai.utils import Data
import urllib
# Get court judgment texts on which to run the AI models
text1 = urllib.request.urlopen(
'https://raw.githubusercontent.com/OpenNyAI/Opennyai/master/samples/sample_judgment1.txt').read().decode()
text2 = urllib.request.urlopen(
'https://raw.githubusercontent.com/OpenNyAI/Opennyai/master/samples/sample_judgment2.txt').read().decode()
# you can also load your text files directly into this
texts_to_process = [text1, text2]
# create Data object for data preprocessing before running ML models
data = Data(texts_to_process)
# If you have access to GPU then set this to True else False
use_gpu = True
# Choose which of the AI models you want to run from the 3 models 'NER', 'Rhetorical_Role','Summarizer'. E.g. If just Named Entity is of interest then just select 'NER'
pipeline = Pipeline(components=['NER', 'Rhetorical_Role', 'Summarizer'], use_gpu=use_gpu, verbose=True)
results = pipeline(data)
The output of each model is present in following keys of each element of the output
results[0][
'annotations'] ## shows the result of model at sentence level, each entry will have entities, rhetorical role, and other details
results[0]['summary'] ## shows Summary for each of the Rheorical Role for first judgment text
For more details on usage please refer to the documentation
Google Colab Notebook
We encourage you to use the local machine installation of opennyai library instead of Google colab as it works with python 3.8. Since Google colab works with higher versions of python, opennyai installation does not work.
Description | Link |
---|---|
Run Inference |
Visualization of outputs
We encourage users to use our webapp for visualizing the results for a judgment of your choice.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file opennyai-0.0.13.tar.gz
.
File metadata
- Download URL: opennyai-0.0.13.tar.gz
- Upload date:
- Size: 426.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 52f6195d94b744cc6e926a22152059aa2197d82353a9e30ac97f18ad977528ae |
|
MD5 | b8718fd2c03d09cea801aae229536619 |
|
BLAKE2b-256 | ab755a6a1f1a1d1f1ce748938c5cd8a817f1f7a0eb278f1b3b664ec03c8426e6 |
File details
Details for the file opennyai-0.0.13-py3-none-any.whl
.
File metadata
- Download URL: opennyai-0.0.13-py3-none-any.whl
- Upload date:
- Size: 170.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 59c4c2784c872660712f75525b4709e99c89fcaf8a95b5cdc9500f91e6266659 |
|
MD5 | c41ee0095b55a3b7b4973f47dd3222ff |
|
BLAKE2b-256 | 1a01a420a1c8861b74d3fd1dbff70b5f04fa9cbeaa61b473468dbafcb4ef1cd9 |