Skip to main content

Alternative to the `jq` command that's a bit optimized for LLM eval datasets in jsonl format.

Project description

pprint_problems

Alternative to the jq command that's a bit optimized for LLM eval datasets in jsonl format.

Installation

pip install pprint_problems

Development

This is still a work in progress. If you have any suggestions or improvements, please feel free to open an issue or a pull request, or contact the author directly.

Usage

Here are some recommended ways to use this script:

1. Search for particular problems:
    pprint_problems problems.jsonl -r --search "keyword" -b

2. Load a local file:
    pprint_problems test_problems.jsonl --randomize -n 1 --parts code tests

3. Load a local file with "cat":
    cat problems.jsonl | grep "search_term" | pprint_problems -n 1 -p code

4. Load and randomize problems:
    pprint_problems -r -n 1 problems.jsonl

5. Use some arguments to only load a subset:
    pprint_problems my_problems.jsonl --n 3 --width 100 --line-numbers --randomize

6. Print out the structure:
    pprint_problems --structure test_data.jsonl

7. Print out the raw JSON:
    pprint_problems --n 1 --raw problems.jsonl

8. Manually filter problems with y/n on the keyboard:
    pprint_problems problems.jsonl --manual-filter -p code broken_diff

License

This project is licensed under the terms of the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pprint_problems-0.1.1.tar.gz (13.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pprint_problems-0.1.1-py3-none-any.whl (14.6 kB view details)

Uploaded Python 3

File details

Details for the file pprint_problems-0.1.1.tar.gz.

File metadata

  • Download URL: pprint_problems-0.1.1.tar.gz
  • Upload date:
  • Size: 13.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for pprint_problems-0.1.1.tar.gz
Algorithm Hash digest
SHA256 7d8c0095ccd9808d9e025bbae9ea4695e5da1592041318fadeeca32c52c487d6
MD5 077b25e6ca41e6a1351a3dc1bf92adac
BLAKE2b-256 3c6c8593ba0b294b3276a1c317c717e546980de09517e0a8542427ece5512aa6

See more details on using hashes here.

File details

Details for the file pprint_problems-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for pprint_problems-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e488c7c64110b1d99596688c689eb0de52303c548ea89ff84ecacf05a0ef8c5b
MD5 b73ca65327e959e1a752b9559c0d4ef0
BLAKE2b-256 4d668405432048528a00cc009fe9b554300fedd05aca46cab55d8e0e44c55c20

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page