Inference and training for multiple languages of code2seq

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

pycode2seq

Training and inference with multiple languages of PyTorch's implementation of code2seq model.

Installation

python setup.py install

Inference

Minimal code example:

import sys
from pycode2seq import DefaultModelRunner

def main(argv):
    runner = DefaultModelRunner(
        save_path = "./tmp",
    )

    #List of embeddings for each method
    method_embeddings = runner.run_embeddings_on_file(argv[1], "kt") 

    #Code2seq predictions
    predictions = runner.run_on_file(argv[1], "kt")

    #Predicted method names
    names = [runner.prediction_to_text(prediction) for prediction in predictions]

if __name__ == "__main__":
    main(sys.argv)

Training

Download astminer and run:

./gradelw shadowJar

Mine projects for paths:

python training/mine_projects.py <data folder> <output folder> <path to astminer's cli.sh>

Combine mined paths:

python training/astminer_to_code2seq.py <data folder/holdout> <output folder> <holdout>

Build vocabulary with build_vocabulary.py from code2seq module

Combine vocabularies:

python training/combine_vocabularies.py

Expand weights:

python training/expand_weights.py

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.0.6

Aug 26, 2021

0.0.5

Aug 26, 2021

0.0.4

Jun 17, 2021

0.0.3

Jun 17, 2021

0.0.2

Jun 17, 2021

This version

0.0.1

Jun 17, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycode2seq-0.0.1.tar.gz (162.9 kB view hashes)

Uploaded Jun 17, 2021 Source

Built Distribution

pycode2seq-0.0.1-py3-none-any.whl (173.7 kB view hashes)

Uploaded Jun 17, 2021 Python 3

Hashes for pycode2seq-0.0.1.tar.gz

Hashes for pycode2seq-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`526d29eb461057613c5380e51859cdb0f34adc063d9a52d96cfea319f634663a`
MD5	`173e55edbd9947d098a114f7bc6ec4fc`
BLAKE2b-256	`d2278df846cd810084a664700f107e64e0f69fdf70a24264901adb1d8c0278de`

Hashes for pycode2seq-0.0.1-py3-none-any.whl

Hashes for pycode2seq-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc2089836ffc0e99d44fe05e8d123b9b7cea2fcd33e696654d2738b7d807a252`
MD5	`bcae6f54fc7e405c001e0d6432c3510a`
BLAKE2b-256	`0c837e2af00462308bbfd775d6c180a926392250b0da0a409bb8b7a365942415`