A dependency tree visualizer for Stanford CoreNLP
Project description
Convert Stanford CoreNLP's Dependency Tree to Spacy's for Visualization
This module provides a format converter from Stanford CoreNLP's dependency trees to Spacy's such that the visualation can be done using Spacy's visualizer.
Motivation
I just found that Spacy has an amazing visualizer that we should explore more and this project bridges the gap between the CoreNLP parsing outputs and it.
Pre-requisite
Install Spacy and stanfordnlp. The versions we have tested are Spacy 2.2.2 and stanfordnlp 0.2.0
pip install spacy stanfordnlp
Download Stanford CoreNLP (tested on 3.9.2) Java library: https://stanfordnlp.github.io/CoreNLP/
Set the env variable for stanfordnlp:
export CORENLP_HOME=/yourhome/stanford-corenlp-full-2018-10-05
Installation
pip install corenlp-vdep
Examples
import spacy
from spacy import displacy
from stanfordnlp.server import CoreNLPClient
from corenlp_dtree_visualizer.converters import _corenlp_dep_tree_to_spacy_dep_tree
# Input text
text = 'Jim killed John with a joke.'
# Get a dependency tree from a Stanford CoreNLP pipeline
with CoreNLPClient(annotators=['tokenize','ssplit','pos','lemma','ner','parse','depparse'],
timeout=60000, memory='8G', output_format='json') as client:
# submit the request to the server
ann = client.annotate(text)
# Convert dependency tree formats
sent = ann['sentences'][0]
tree = _corenlp_dep_tree_to_spacy_dep_tree(sent['tokens'], sent['enhancedPlusPlusDependencies'])
# Visualize with Spacy
nlp = spacy.load("en_core_web_sm")
displacy.render(tree, style="dep", manual=True)
# could also save to a file
# svg = displacy.render(tree, style="dep", manual=True)
# with open('tmp.svg', 'w', encoding='utf-8') as fw:
# fw.write(svg)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
corenlp-vdep-0.1.0.tar.gz
(2.4 kB
view hashes)
Built Distribution
Close
Hashes for corenlp_vdep-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58e04079c4cb57d3112a7bcc734dcaee0ef98beb2bed1d17a51fa84132fe4d45 |
|
MD5 | a565bbf57dfcb43dce0cb4b59bf69024 |
|
BLAKE2b-256 | cfa04d4aac04ba8845ee1068e6032c8429d7a0ae9b4a47682bb6acac3dd47a77 |