Skip to main content

TULP: A command line tool, in the best essence of POSIX tooling, that will help you to **process**, **filter**, and **create** data in this new Artificial Intelligence world.

Project description

TULP: TULP Understands Language Promptly

A command line tool, in the best essence of POSIX tooling, that will help you to process, filter, and create data in this new Artificial Intelligence world, backed by chatGPT.

TULP allows you to harness the power of chatGPT by piping standard input content directly to chatGPT, getting the answer back on the shell.

tulp demo video

Installation:

pip install tulp

Usage:

TULP has 3 main operation modes:

  1. request: Process the user request:
tulp [A written request or question]
  1. stdin processing: Process or filter all the stdin input according to the user instructions, writing the processed output to stdout.
cat [MYFILE] | tulp [Processing instructions written in natural language]
  1. Code Interpretation: Add a -x to any of the previous operations, and Tulp will try to create, debug, and execute a program that gets the request done!.
cat examples/titanics.csv | tulp -x how many persons survived

In both cases, TULP will write to the standard output the answers and will write any other information to the standard error. You can safely pipe the output to your file or next piping command and will still get all the information and errors on stderr.

It is important to note that if your input is larger than 5000 characters, the input will be split into multiple chunks and processed by chatGPT in multiple requests. In this case, the result quality will really depend on the task (e.g., will work fine for translations or grammatical corrections, it will work terribly for summarizing). Anyway, tulp works great when the input is less than 5000 chars.

By default, tulp uses gpt-3.5-turbo, because it is cheaper and faster, but for complex tasks, it is always a good idea to force the gpt-4 model: tulp --model gpt-4 {a complex task}

Options:

usage: tulp [-h] [-x] [-w W] [--model {gpt-3.5-turbo,gpt-4}] [--max-chars MAX_CHARS] [-v] [-q] ...

TULP Understands Language Promptly: A command line tool, in the best essence of POSIX tooling, that will help you to **process**, **filter**, and **create** data in this new Artificial Intelligence world, backed by chatGPT.

positional arguments:
  request               User request, instructions written in natural language

optional arguments:
  -h, --help            show this help message and exit
  -x                    Allow tulp to create a program and execute it to fulfill the task (code interpret)
  -w W                  Write the output (or the created program for execution) to the file. If the file exists, a backup will be created before overwriting it.
  --model {gpt-3.5-turbo,gpt-4}
                        Select the LLM model to use, currently gpt-3.5-turbo or gpt-4
  --max-chars MAX_CHARS
                        Number of chars per message chunk per request
  -v                    Be verbose!
  -q                    Be quiet! Only print the answer and errors.

Configuration

The configuration file is located at ~/.tulp.conf.

The following are the parameters that can be configured:

  • LOG_LEVEL: The log level of Tulp. Valid options are DEBUG, INFO, WARNING, ERROR, and CRITICAL. The default value is INFO.
  • OPENAI_API_KEY: The API key for OpenAI. The default value is an empty string.
  • MAX_CHARS: The maximum number of characters processed in one chunk. The default value is 5000.
  • MODEL: The OpenAI model to be used by Tulp. The default value is gpt-3.5-turbo, but gpt-4 is also available.

All these settings could be overridden by an environment variable using the prefix TULP_ or by the different command line arguments described above. As environment variables, they will become: TULP_LOG_LEVEL, TULP_OPENAI_API_KEY, TULP_MAX_CHARS, or TULP_MODEL. Command line arguments will override environmental variables and the configuration file.

Here is an example configuration file with the default values:

[DEFAULT]
LOG_LEVEL = INFO
OPENAI_API_KEY = <<<YOUR API KEY >>>>
MAX_CHARS = 10000
MODEL = gpt-3.5-turbo

Examples:

The usage is endless, but anyway, here you have some ideas as inspiration:

Random

Create a plot directly from raw memory output printed by gdb:

Command:

cat <<EOF | tulp convert this to a python list of 2 element tuples |  tulp write a python function to scatter plot these points using matplotlib | python 
(gdb) p *polygon._points._M_ptr._M_impl._M_start@4
$21 = {{x = 0.441429973, y = -0.176619753}, {x = 0.476210177, y = -0.104575738}, {x = 0.674865067, y = -0.0814191923}, {x = 0.640084863, y = -0.199776307}}
EOF

Result:

matplotlib @rela

Typical Unix tooling replacement:

Sed

cat README.md | tulp replace all the occurrences of TULP for **TULP**

Awk

cat README.md | tulp print the second word of each line

grep, but advanced

cat tulp.py | tulp print the name of the functions and also the return line 

Grammatical and syntax corrections:

cat README.md | tulp fix all the typos, syntax and grammatical errors > README.fix.md

Or even better:

cat README.md | TULP_MAX_CHARS=10000 TULP_MODEL=gpt-4 tulp fix all the typos, syntax and grammatical errors > README.fix.md

Translations

cat README.md | tulp translate to Spanish > README.es.md

Data filtering from formatted input

csv

cat list.csv | tulp print only the second column
Count
3
1
2

csv

cat persons.json | tulp 'list the names and ages of each person in a csv table, using ; as separator'

Data creation and extraction from unstructured data (a story of oranges and friends):

fede@liebre:~/repos/tulp$ tulp write a poem that names 3 persons \(given each a name\) and list how they shared 10 oranges | tee examples/oranges_poem.txt
Roses are red,
Violets are blue,
Here's a poem,
About sharing oranges too.

There were three friends,
Whose names were Ann, Ben, and Sue,
They had 10 oranges,
And didn't know what to do.

Ann suggested they split them,
Equally, three each,
But Ben said that wasn't fair,
As Sue was too weak.

So they decided to give Sue,
An extra orange or two,
And split the rest evenly,
So everyone had a fair view.

And that's how Ann, Ben, and Sue,
Shared their 10 oranges,
With kindness and fairness,
And no one had any grudges.

fede@liebre:~/repos/tulp$ cat examples/oranges_poem.txt | python3 ./tulp.py write a list of persons and the number of oranges that they have as csv
Ann,3
Ben,3
Sue,4

Origin of the name

I used tulp.py to create "TULP". In some way, everything is recursive in "TULP", so it makes sense to use a recursive acronym.

Therefore, after several iterations with tulp.py, "TULP" and I decided that the best name would be "TULP", and this is how we decided what "TULP" stands for:

fede@liebre:~/repos/openai/tulp$ python3 ./tulp.py "TULP is a recursive acronym naming an opensource posix tool that processes stdin input according to natural language instructions, processing the input by instructing an artificial intelligence. Write some options of what TULP could stand for as recursive acronym"
TULP could stand for:
- TULP Understands Language Perfectly
- TULP Uses Language to Process
- TULP Understands Language Promptly
- TULP Utilizes Language for Processing
- TULP Unravels Language Precisely

Why?

I am a heavy user of Unix tooling (e.g: awk, jq, sed, grep, and so on), I have been using them since my early days and I used to think that I couldn't survive without them. But then, ChatGPT appeared, and I started to use more and more GPT for things that I used to use Unix tooling for. Somehow I feel the pain of cut & paste, and I was missing a way to do it faster and from within the terminal itself, so I came up with tulp.

Changelog

v1.0 | 2024-02-14

  • Changed to use gpt-4-0125-preview model by default
  • Updated to use openapi v1.0
  • Changes default max-chars to 40000

v07 | 2023-05-23

  • Adds Code Interpretation, -x option

v0.6 | 2023-05-11

  • Adds all the settings as command line arguments

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tulp-1.0.tar.gz (19.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tulp-1.0-py3-none-any.whl (18.8 kB view details)

Uploaded Python 3

File details

Details for the file tulp-1.0.tar.gz.

File metadata

  • Download URL: tulp-1.0.tar.gz
  • Upload date:
  • Size: 19.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.12

File hashes

Hashes for tulp-1.0.tar.gz
Algorithm Hash digest
SHA256 a371d6327d0996deeecd90d11cb2888cfad410e7ee1a1c5fe1d6821bc7eefa82
MD5 4160c12f3643a14f7216dafdc5ec57b9
BLAKE2b-256 50570a49d34589f994cc760aad67cb3a94b020aa75928be6309f59c0205345de

See more details on using hashes here.

File details

Details for the file tulp-1.0-py3-none-any.whl.

File metadata

  • Download URL: tulp-1.0-py3-none-any.whl
  • Upload date:
  • Size: 18.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.12

File hashes

Hashes for tulp-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d9a4f2d4d86b1334d5431ae96e8d2dfc4a0de737bd86362f31b3407ee4c5d34a
MD5 c2e1bb95908d49941b2cc5853f3cbe54
BLAKE2b-256 7d862509e79e828533408588f469749572f9d1efb2c9873cd259fec0670eb985

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page