Tokenize code snippets.

Tokenize All Code

This is a clone of the original code. The original repository has since disappeared, so the code is cloned here for archival and development purposes.

A Python library that tokenizes blocks of code. Used by manim-code-blocks to syntax-highlight blocks of code.

Example Usage

from tokenize_all import Java

tokens = Java.tokenize(
    """
    public class Main {
        public static void main(String[] args) {
            System.out.println("Hello world!");
        }
    }
    """
)

for token in tokens:
    print(token)

Output:

>> Token[ type = keyword, value = public, start = 4, end = 10 ]
>> Token[ type = keyword, value = class, start = 11, end = 16 ]
>> Token[ type = class name, value = Main, start = 17, end = 21 ]
>> Token[ type = left brace, value = {, start = 22, end = 23 ]
>> Token[ type = keyword, value = public, start = 31, end = 37 ]
>> Token[ type = keyword, value = static, start = 38, end = 44 ]
>> Token[ type = keyword, value = void, start = 45, end = 49 ]
>> Token[ type = function, value = main, start = 50, end = 54 ]
>> Token[ type = left parentheses, value = (, start = 54, end = 55 ]
>> Token[ type = class name, value = String, start = 55, end = 61 ]
>> Token[ type = left bracket, value = [, start = 61, end = 62 ]
>> Token[ type = right bracket, value = ], start = 62, end = 63 ]
>> Token[ type = identifier, value = args, start = 64, end = 68 ]
>> Token[ type = right parentheses, value = ), start = 68, end = 69 ]
>> Token[ type = left brace, value = {, start = 70, end = 71 ]
>> Token[ type = class name, value = System, start = 83, end = 89 ]
>> Token[ type = dot, value = ., start = 89, end = 90 ]
>> Token[ type = identifier, value = out, start = 90, end = 93 ]
>> Token[ type = dot, value = ., start = 93, end = 94 ]
>> Token[ type = function, value = println, start = 94, end = 101 ]
>> Token[ type = left parentheses, value = (, start = 101, end = 102 ]
>> Token[ type = string, value = "Hello world!", start = 102, end = 116 ]
>> Token[ type = right parentheses, value = ), start = 116, end = 117 ]
>> Token[ type = semicolon, value = ;, start = 117, end = 118 ]
>> Token[ type = right brace, value = }, start = 126, end = 127 ]
>> Token[ type = right brace, value = }, start = 131, end = 132 ]
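Each token printed above carries a type, a value, and start/end offsets. The following is a minimal sketch of how a consumer such as a syntax highlighter might map token types to colors. The `Token` class here is a hypothetical stand-in that mirrors the printed fields; the real library's attribute names are not shown in this README and may differ. The `highlight` helper and the color names are likewise illustrative assumptions, not part of tokenize_all.

```python
from typing import Dict, List, NamedTuple, Tuple


class Token(NamedTuple):
    """Hypothetical stand-in mirroring the fields printed in the output above."""
    type: str
    value: str
    start: int
    end: int


def highlight(tokens: List[Token], colors: Dict[str, str],
              default: str = "white") -> List[Tuple[str, str]]:
    """Pair each token's text with the color chosen for its type."""
    return [(t.value, colors.get(t.type, default)) for t in tokens]


# The first four tokens from the example output, re-created by hand.
tokens = [
    Token("keyword", "public", 4, 10),
    Token("keyword", "class", 11, 16),
    Token("class name", "Main", 17, 21),
    Token("left brace", "{", 22, 23),
]

# Illustrative color scheme; token types without an entry use the default.
colors = {"keyword": "orange", "class name": "yellow"}

for value, color in highlight(tokens, colors):
    print(f"{value!r} -> {color}")
```

In the output shown, end minus start equals the length of each token's value, so the offsets can also be used to slice the original source string.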

Building and Publishing

Requires Python 3.9 or later.

pip install twine build
python -m build
twine upload dist/*

Download files

Source Distribution

tokenize_all_code-1.0.18.tar.gz (7.8 kB view details)

Uploaded Source

Built Distribution

tokenize_all_code-1.0.18-py3-none-any.whl (7.5 kB view details)

Uploaded Python 3

File details

Details for the file tokenize_all_code-1.0.18.tar.gz.

File metadata

  • Download URL: tokenize_all_code-1.0.18.tar.gz
  • Upload date:
  • Size: 7.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.4

File hashes

Hashes for tokenize_all_code-1.0.18.tar.gz
Algorithm Hash digest
SHA256 1af1913a8e5cdb3d7d5b2e03e3e77947b86ae2396a9d08ddeb851edac8ef5f6d
MD5 e4ce479e45f2b3b2be4f3dfbc145fcce
BLAKE2b-256 fd36e1dea71eded238a6c52777432331ae80cde22789376878e35d2d9b0c8f51
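A downloaded archive can be checked against the digests listed above. This is a minimal sketch using only the standard library's `hashlib`; the helper names `file_digest` and `matches` are illustrative, and the local file path assumes the archive was saved to the current directory under its published name.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="sha256"):
    """Return the hex digest of a file, reading it in chunks."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return h.hexdigest()


def matches(path, expected_hex, algorithm="sha256"):
    """True if the file's digest equals the expected hex string."""
    return file_digest(path, algorithm) == expected_hex.lower()


# SHA256 digest copied from the listing above.
EXPECTED_SHA256 = "1af1913a8e5cdb3d7d5b2e03e3e77947b86ae2396a9d08ddeb851edac8ef5f6d"

# Assumed local path; adjust to wherever the archive was downloaded.
archive = Path("tokenize_all_code-1.0.18.tar.gz")
if archive.exists():
    print("SHA256 OK" if matches(archive, EXPECTED_SHA256) else "SHA256 mismatch")
```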

File details

Details for the file tokenize_all_code-1.0.18-py3-none-any.whl.

File hashes

Hashes for tokenize_all_code-1.0.18-py3-none-any.whl
Algorithm Hash digest
SHA256 dcd05740d11db934c371a292f83f667cd5d5458006812f436a8592d3320ef1e3
MD5 96cc52b9c45ec4e64cd6fdf096910fab
BLAKE2b-256 0c06b104e99ce5e76a8060f87022542253e15e55cdafe592396d1551eec14d8e
