pprint_problems

An alternative to the `jq` command, tuned for browsing LLM evaluation datasets stored in JSONL format.

Installation

    pip install pprint_problems

Development

This is still a work in progress. If you have any suggestions or improvements, please feel free to open an issue or a pull request, or contact the author directly.

Usage

Here are some recommended ways to use this script:

1. Search for particular problems:
    pprint_problems problems.jsonl -r --search "keyword" -b

2. Load a local file and print selected parts of one randomly chosen problem:
    pprint_problems test_problems.jsonl --randomize -n 1 --parts code tests

3. Read problems from standard input, for example after filtering with `grep`:
    cat problems.jsonl | grep "search_term" | pprint_problems -n 1 -p code

4. Load and randomize problems:
    pprint_problems -r -n 1 problems.jsonl

5. Limit the number of problems and adjust the display:
    pprint_problems my_problems.jsonl -n 3 --width 100 --line-numbers --randomize

6. Print out the structure:
    pprint_problems --structure test_data.jsonl

7. Print out the raw JSON:
    pprint_problems -n 1 --raw problems.jsonl

8. Manually filter problems with y/n on the keyboard:
    pprint_problems problems.jsonl --manual-filter -p code broken_diff
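
To try the commands above without a real dataset, a small JSONL file can be generated first. The following is a minimal sketch, not part of the package: the field names `code`, `tests` and `broken_diff` are borrowed from the `--parts`/`-p` examples above, while the record contents and the helper names `write_jsonl` and `read_jsonl` are hypothetical. The tool may well accept other fields.

```python
import json
from pathlib import Path

# Hypothetical records: the field names "code", "tests" and "broken_diff"
# come from the --parts/-p examples above; the values are made up.
SAMPLE_PROBLEMS = [
    {
        "code": "def add(a, b):\n    return a + b\n",
        "tests": "assert add(1, 2) == 3\n",
        "broken_diff": "-    return a + b\n+    return a - b\n",
    },
    {
        "code": "def is_even(n):\n    return n % 2 == 0\n",
        "tests": "assert is_even(4)\n",
        "broken_diff": "-    return n % 2 == 0\n+    return n % 2 == 1\n",
    },
]


def write_jsonl(path, records):
    """Write one JSON object per line, which is what JSONL requires."""
    with Path(path).open("w", encoding="utf-8") as f:
        for record in records:
            # json.dumps escapes embedded newlines, so each record stays on one line.
            f.write(json.dumps(record) + "\n")


def read_jsonl(path):
    """Read the file back into a list of dicts, skipping blank lines."""
    with Path(path).open(encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


if __name__ == "__main__":
    write_jsonl("problems.jsonl", SAMPLE_PROBLEMS)
    print(f"wrote {len(read_jsonl('problems.jsonl'))} problems")
```

Running the script writes `problems.jsonl` to the current directory, which can then be passed to any of the commands listed above.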

License

This project is licensed under the terms of the MIT license.
