A library for NLP-style preprocessing of source code in different programming languages
sourCy
A library for NLP-like preprocessing of source code in different programming languages. Its architecture follows that of spaCy.
Free software: GNU General Public License v3
Documentation: https://sourcy.readthedocs.io.
Features
Grammar-based parsing
Multi-language support
Usage
To use sourCy in a project:
import sourcy

code = """
# This function prints the factors of the argument passed
def print_factors(x):
    print("The factors of", x, "are:")
    for i in range(1, x + 1):
        if x % i == 0:
            print(i)

num = 320
print_factors(num)
"""

# Create a pipeline to process source code
scp = sourcy.load("python")

# Process the code and create a document with the tokens and their annotations
doc = scp(code)
for token in doc:
    print(token.token, token.annotation, token.block)
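The annotation labels sourCy assigns are not listed here. As a rough point of comparison only, the sketch below uses Python's standard-library tokenize module (not sourCy) to pair each token of a snippet with a coarse type label. The helper name annotate is hypothetical and exists only for this illustration; sourCy's own tokens, annotations, and block information may look quite different.

```python
import io
import tokenize

# Token types that carry layout only and no visible text of interest.
STRUCTURAL = {
    tokenize.NEWLINE,
    tokenize.NL,
    tokenize.INDENT,
    tokenize.DEDENT,
    tokenize.ENDMARKER,
}


def annotate(source):
    """Return (text, type_name) pairs for a Python source string.

    Hypothetical helper built on the stdlib tokenizer; it does not
    reproduce sourCy's annotation scheme.
    """
    pairs = []
    readline = io.StringIO(source).readline
    for tok in tokenize.generate_tokens(readline):
        if tok.type in STRUCTURAL:
            continue
        pairs.append((tok.string, tokenize.tok_name[tok.type]))
    return pairs


sample = "num = 320\nprint_factors(num)\n"
for text, kind in annotate(sample):
    print(text, kind)
```

Each printed line holds a token's text followed by a label such as NAME, OP, or NUMBER, which is similar in spirit to printing token.token and token.annotation in the loop above.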
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
History
0.1.0 (2020-08-11)
First release on PyPI.