Python library to work with ConceptNet offline
ConceptNet Rocks!
Work is in progress.
The library is released under the Apache License 2.0 and is separate from ConceptNet itself, although it uses some parts of its code. ConceptNet is available under the CC-BY-SA-4.0 license. See here for the list of conditions for using ConceptNet data.
This is the official citation for ConceptNet if you use it in research:
Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. "ConceptNet 5.5: An Open Multilingual Graph of General Knowledge." In proceedings of AAAI 31.
Installation
pip install conceptnet-rocks
Usage
Install ArangoDB
ConceptNet Rocks uses ArangoDB for storage, managed by Graph Garden, a companion Python library that is installed automatically with ConceptNet Rocks.
Graph Garden can manage the ArangoDB installation for you. To download the latest version of ArangoDB from the official website and install it into the ~/.arangodb folder, simply run:
graph-garden arangodb install
For more options execute:
graph-garden arangodb install --help
Load CSV dump into database
Next, load the ConceptNet CSV dump into the database. The dump can be downloaded from https://github.com/commonsense/conceptnet5/wiki/Downloads
Let's assume you've downloaded the dump to ~/conceptnet-data/assertions.csv.
To load the dump, execute:
conceptnet-rocks-load ~/conceptnet-data/assertions.csv
This command creates the database in ~/.arangodb/data. For more options, execute:
conceptnet-rocks-load --help
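Before loading a large dump, it can help to check that a line parses as expected. The following is a minimal sketch, assuming the dump uses the five tab-separated columns described on the ConceptNet Downloads wiki page (edge URI, relation, start node, end node, JSON metadata). The function name, the column labels, and the sample line are illustrative and are not part of ConceptNet Rocks.

```python
import json

# Column labels chosen for this sketch; they are not identifiers from the library.
COLUMNS = ("uri", "rel", "start", "end", "info")


def parse_assertion_line(line):
    """Split one dump line into its five tab-separated fields."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    row = dict(zip(COLUMNS, fields))
    # The last column is assumed to hold a JSON object with edge metadata.
    row["info"] = json.loads(row["info"])
    return row


# Illustrative line modeled on an edge shown in this README; not copied from a real dump.
sample = "\t".join([
    "/a/[/r/AtLocation/,/c/en/blow_dryer/,/c/en/beauty_salon/]",
    "/r/AtLocation",
    "/c/en/blow_dryer",
    "/c/en/beauty_salon",
    json.dumps({"dataset": "/d/conceptnet/4/en", "weight": 1.0}),
])
row = parse_assertion_line(sample)
print(row["start"], row["rel"], row["end"], row["info"]["weight"])
```

A line with the wrong number of fields raises ValueError, which makes a truncated or mis-delimited file easy to spot before a long load starts.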
Run queries
Now you can query ConceptNet. ConceptNet Rocks uses the same simple API as ConceptNet5 for querying:
from conceptnet_rocks import AssertionFinder
af = AssertionFinder()
# Look up assertions by concept
print(af.lookup("/c/en/test"))
# ... by relation
print(af.lookup("/r/Antonym"))
# ... by source
print(af.lookup("/s/process/wikiparsec/2"))
# ... by dataset
print(af.lookup("/d/wiktionary/en"))
# ... or by the URI of a single assertion
print(af.lookup("/a/[/r/Antonym/,/c/ang/gecyndelic/a/,/c/ang/ungecynde/]"))
ConceptNet Rocks uses the same JSON-LD format as the original ConceptNet5:
from conceptnet_rocks import AssertionFinder
from pprint import pprint
af = AssertionFinder()
pprint(af.lookup("/c/en/blow_dryer"))
# [
# ...
# {'@id': '/a/[/r/AtLocation/,/c/en/blow_dryer/,/c/en/beauty_salon/]',
# '@type': 'Edge',
# 'dataset': '/d/conceptnet/4/en',
# 'end': {'@id': '/c/en/beauty_salon',
# '@type': 'Node',
# 'label': 'a beauty salon',
# 'language': 'en',
# 'term': '/c/en/beauty_salon'},
# 'license': 'cc:by/4.0',
# 'rel': {'@id': '/r/AtLocation', '@type': 'Relation', 'label': 'AtLocation'},
# 'sources': [{'@id': '/and/[/s/activity/omcs/omcs1_possibly_free_text/,/s/contributor/omcs/bedume/]',
# '@type': 'Source',
# 'activity': '/s/activity/omcs/omcs1_possibly_free_text',
# 'contributor': '/s/contributor/omcs/bedume'}],
# 'start': {'@id': '/c/en/blow_dryer',
# '@type': 'Node',
# 'label': 'a blow dryer',
# 'language': 'en',
# 'term': '/c/en/blow_dryer'},
# 'surfaceText': 'You are likely to find [[a blow dryer]] in [[a beauty salon]]',
# 'weight': 1.0}
# ...
# ]
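Because each edge is a nested dictionary, a small helper can reduce it to the fields most often needed. This is a minimal sketch, assuming lookup results are a list of dictionaries shaped like the output above. The helper name summarize_edge is hypothetical, and the edge literal is a trimmed copy of that output, so the snippet runs without a database.

```python
def summarize_edge(edge):
    """Reduce one edge dict to a (start, relation, end, weight) tuple."""
    return (
        edge["start"]["label"],
        edge["rel"]["label"],
        edge["end"]["label"],
        edge["weight"],
    )


# Trimmed copy of the edge shown above; only the fields the sketch reads.
edge = {
    "@id": "/a/[/r/AtLocation/,/c/en/blow_dryer/,/c/en/beauty_salon/]",
    "start": {"@id": "/c/en/blow_dryer", "label": "a blow dryer"},
    "rel": {"@id": "/r/AtLocation", "label": "AtLocation"},
    "end": {"@id": "/c/en/beauty_salon", "label": "a beauty salon"},
    "weight": 1.0,
}

print(summarize_edge(edge))
# ('a blow dryer', 'AtLocation', 'a beauty salon', 1.0)
```

With a loaded database, the same helper could presumably be applied to each item returned by af.lookup("/c/en/blow_dryer").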
FAQ
Why did you create yet another library if the original ConceptNet5 exists?
- Performance. Our benchmark (https://github.com/ldtoolkit/conceptnet-benchmark) has shown that ConceptNet Rocks is almost 5 times faster than ConceptNet5 for querying assertions by concepts.
- The original ConceptNet5 library requires PostgreSQL. PostgreSQL does not support graphs as a primary data model, whereas ArangoDB is a multi-model database with native graph support.
- PostgreSQL generally requires either root permissions to install it with a package manager or a compilation step. Not everyone has root permissions on their machine or a compiler installed. ConceptNet Rocks uses ArangoDB, which can be installed without root permissions using a simple command.
Why is the library called ConceptNet Rocks?
- Under the hood, ArangoDB uses a key-value storage engine called RocksDB.
- Performance-wise, it does rock!