A CLI tool and library for managing and analyzing chat logs.

Project description

`ctk`: Conversation Tree Toolkit

ctk (Conversation Tree Toolkit) is a powerful command-line tool designed to manage, analyze, and engage with conversation logs exported from OpenAI's platforms (e.g., ChatGPT). Whether you're looking to filter conversations, perform advanced queries, merge multiple conversation libraries, or leverage Large Language Models (LLMs) for deeper insights, ctk provides a comprehensive suite of tools to streamline your workflow.

Features
Installation
Configuration
Usage
- Available Commands
  - list
  - search
  - jmespath
  - conversation
  - merge
  - export
  - dash
  - llm
Examples
Structure of conversations.json
Response Format for LLM Queries
Notes
Getting Help
Contributing
License

Features

List Conversations: Display a list of all conversations with selected fields.
Search with Regex: Filter conversations based on regex patterns applied to specific fields.
Advanced Queries with JMESPath: Perform complex queries for data retrieval.
Conversation Details: View detailed information about specific conversations.
Merge Libraries: Combine multiple conversation libraries using set operations.
Export Conversations: Export conversations in various formats like JSON, Markdown, or Hugo.
Interactive Dashboard: Launch a Streamlit-based dashboard for visual exploration.
LLM Integration: Engage with Large Language Models to perform tasks like summarization or analysis on your conversation data.

Installation

Prerequisites

Python 3.7+
pip (Python package installer)

Local Development Installation

Clone the Repository

git clone https://github.com/queelius/ctk.git
cd ctk

Create a Virtual Environment (Optional but Recommended)

Using venv:

python3 -m venv ctk-env
source ctk-env/bin/activate

Using conda:

 conda create --name ctk-env python=3.8
 conda activate ctk-env

Install Dependencies
```
pip install -r requirements.txt
```
Make the ctk Command Accessible

Ensure that the ctk script is executable and added to your PATH. You can achieve this by installing the package or setting up an alias.
```
chmod +x ctk/cli.py
ln -s $(pwd)/ctk/cli.py /usr/local/bin/ctk
```
Alternatively, you can install ctk as a package if a setup.py is provided.

End-User Installation Using `pypi`:

Install the Package
```
pip install ctk
```

Configuration

Before using the llm command, you need to configure the LLM settings.

Create Configuration File

Create a file named .ctkrc in your home directory:
```
touch ~/.ctkrc
```
Add LLM Configuration

Open .ctkrc with your preferred text editor and add the following:
```
[llm]
endpoint = https://api.openai.com/v1/engines/davinci/completions
api_key = YOUR_API_KEY
model = gpt-3.5-turbo
```
- endpoint: The API endpoint for the language model service.
- api_key: Your API key for authenticating with the language model service.
- model: The specific language model to use (e.g., gpt-3.5-turbo).
Note: Replace YOUR_API_KEY with your actual API key.

Usage

The ctk tool offers various subcommands to perform different operations on your conversation libraries. The general syntax is:

ctk <command> [options] <arguments>

Available Commands

1. `list`

Description:
Lists all conversations in the specified library directory.

Usage:

ctk list <libdir> [--indices <indices>] [--fields <fields>]

Arguments:

<libdir>: Path to the conversation library directory.

Options:

--indices: Specify the indices of conversations to list. If omitted, all conversations are listed.
--fields: Specify which fields to include in the output (default: title, update_time).

Example:

ctk list ./conversations --fields title update_time model

2. `search`

Description:
Runs a regex query on the conversations to filter results based on specified patterns.

Usage:

ctk search <libdir> <expression> --fields <fields>

Arguments:

<libdir>: Path to the conversation library directory.
<expression>: The regex pattern to search for.

Options:

--fields: One or more JMESPath expressions specifying the fields to apply the regex to (default: title).
--json: Output the results in JSON format.

Example:

ctk search ./conversations "C\+\+" --fields title --json

Note: To search for the literal string "C++", ensure you escape the plus signs as shown.

3. `jmespath`

Description:
Executes a JMESPath query on the conversations for advanced data retrieval.

Usage:

ctk jmespath <libdir> <query>

Arguments:

<libdir>: Path to the conversation library directory.
<query>: The JMESPath expression to execute.

Example:

ctk jmespath ./conversations "conversations[?status=='active']"

4. `conversation`

Description:
Prints detailed conversation information based on conversation indices or specific node IDs.

Usage:

ctk conversation <libdir> <indices> [--node <node_id>] [--json]

Arguments:

<libdir>: Path to the conversation library directory.
<indices>: One or more indices of conversations to display.

Options:

--node: Specify the node ID to indicate the terminal node of a conversation path.
--json: Output the conversation in JSON format instead of a formatted table.

Example:

ctk conversation ./conversations 0 1 2 --node node123 --json

5. `merge`

Description:
Merges multiple ctk libraries into a single library using specified operations.

Usage:

ctk merge <operation> <libdirs> -o <output_dir>

Arguments:

<operation>: Type of merge operation (union, intersection, difference).
<libdirs>: List of library directories to merge.

Options:

-o, --output: Specify the output library directory.

Example:

ctk merge union ./lib1 ./lib2 -o ./merged_lib

6. `export`

Description:
Exports conversations from the library in specified formats.

Usage:

ctk export <libdir> <indices> [--format <format>]

Arguments:

<libdir>: Path to the conversation library directory.
<indices>: One or more indices of conversations to export. If omitted, all conversations are exported.

Options:

--format: Output format (json, markdown, hugo). Default is json.

Example:

ctk export ./conversations 0 1 --format markdown

7. `dash`

Description:
Launches a Streamlit-based dashboard for interactive exploration of the conversation library.

Usage:

ctk dash <libdir>

Arguments:

<libdir>: Path to the conversation library directory.

Example:

ctk dash ./conversations

8. `llm`

Description:
Runs a language model query on the conversation library to perform tasks like summarization, analysis, or generating insights.

Usage:

ctk llm <libdir> <query> [--json]

Arguments:

<libdir>: Path to the conversation library directory.
<query>: The query or prompt to send to the language model.

Options:

--json: Output the results in JSON format.

Example:

ctk llm ./conversations "Provide a summary of conversation 0."

Note: Ensure that the .ctkrc configuration file is properly set up with your LLM API credentials.

Examples

Listing All Conversations:
```
ctk list ./conversations
```

Listing Specific Fields:

ctk list ./conversations --fields title update_time model

Filtering Conversations with Regex Search:

ctk search ./conversations "C\+\+" --fields title --json

Running a JMESPath Query:

ctk jmespath ./conversations "conversations[?status=='active']"

Merging Two Libraries with Union Operation:

ctk merge union ./lib1 ./lib2 -o ./merged_lib

Exporting Conversations to Markdown:

ctk export ./conversations 0 1 --format markdown

Launching the Dashboard:
```
ctk dash ./conversations
```

Running a Language Model Query:

ctk llm ./conversations "Provide a summary of conversation 0."

Structure of `conversations.json`

The ctk library stores conversation data in a JSON file named conversations.json located within your specified library directory (libdir). This file contains structured data representing ChatGPT chat sessions, organized as conversation trees.

Example `conversations.json`:

[
  {
    "id": "conversation_1",
    "title": "Project Discussion",
    "create_time": 1633072800,
    "update_time": 1633076400,
    "default_model_slug": "gpt-3.5-turbo",
    "safe_urls": ["https://example.com"],
    "mapping": {
      "node_1": {
        "text": "Hello, how can I assist you today?",
        "payload": {
          "message": {
            "content": {
              "content_type": "text",
              "parts": ["Hello, how can I assist you today?"]
            },
            "author": {
              "role": "assistant",
              "name": "ChatGPT"
            },
            "create_time": 1633072800
          }
        }
      },
      "node_2": {
        "text": "I need help with my project.",
        "payload": {
          "message": {
            "content": {
              "content_type": "text",
              "parts": ["I need help with my project."]
            },
            "author": {
              "role": "user",
              "name": "Alice"
            },
            "create_time": 1633072860
          }
        }
      }
    },
    "current_node": "node_2"
  }
]

This is a simplified example. The actual structure may vary based on your specific data.

Response Format for LLM Queries

When using the llm command to interact with a Large Language Model, the expected response format is JSON. This structured format ensures that the ctk tool can parse and execute the appropriate commands based on your query.

General Format:

{
  "command": "command_name",
  "args": ["<libdir>", "<args>"]
}

Examples

Example 1: Finding Starred Conversations

Query: "Find conversations that are starred."

Response:

{
  "command": "jmespath",
  "args": ["./conversations", "conversations[?starred]"]
}

Example 2: Listing Titles and URLs of Starred Conversations

Query: "Find conversations that are starred and only show me the title and URL."

Response:

{
  "command": "jmespath",
  "args": ["./conversations", "conversations[?starred].[title, url]"]
}

Notes

Library Directory (libdir): Ensure that the specified library directory exists and contains a valid conversations.json file before performing operations.
Indices: Conversation indices start at 0. Use the list command to view available indices before performing operations on specific conversations.
Regex Patterns: When using regex patterns, escape special characters as needed. For example, to search for "C++", use C\+\+.
Conflict Resolution in Merges: When merging libraries, duplicate conversation IDs can be handled using strategies like skip, overwrite-old, or error based on your requirements.
JSON Output: Utilize the --json flag in commands like list and conversation for machine-readable output, which is useful for further processing or integration with other tools.
Error Handling: The tool provides informative error messages. Ensure to read them carefully to troubleshoot issues related to missing files, incorrect indices, or invalid configurations.
Performance: For large libraries, some operations might take longer. Consider optimizing your queries and using efficient patterns to enhance performance.

Getting Help

For more information on using the ctk tool, access the help documentation for each command using the --help flag. For example:

ctk list --help

This command will display detailed information about the list command, including its usage, arguments, and options.

Contributing

Contributions are welcome! If you'd like to contribute to the ctk project, please follow these steps:

Fork the Repository

Click the "Fork" button at the top right of the repository page to create your own fork.

Clone Your Fork

git clone https://github.com/yourusername/ctk.git
cd ctk

Create a New Branch

git checkout -b feature/YourFeatureName

Make Your Changes

Implement your feature or bug fix.

Commit Your Changes

git commit -m "Add feature: YourFeatureName"

Push to Your Fork

git push origin feature/YourFeatureName

Create a Pull Request

Navigate to the original repository and click the "New Pull Request" button. Provide a clear description of your changes.

Please ensure that your contributions adhere to the project's coding standards and include appropriate tests where applicable.

License

This project is licensed under the MIT License.

Acknowledgements

Developed using Python and leveraging powerful libraries like argparse, jmespath, rich, requests, networkx, and pyvis.
Inspired by the need to efficiently manage and analyze conversation logs from AI platforms.

Contact

For questions, suggestions, or support, please open an issue on the GitHub repository or contact the maintainer at your.email@example.com.

Project details

Release history Release notifications | RSS feed

2.14.0

Apr 30, 2026

2.13.3

Apr 27, 2026

2.13.2

Apr 27, 2026

2.13.1

Apr 27, 2026

2.13.0

Apr 27, 2026

2.12.0

Apr 27, 2026

2.11.0

Apr 27, 2026

2.10.0

Apr 24, 2026

2.9.0

Apr 24, 2026

2.8.0

Mar 4, 2026

2.6.1

Feb 5, 2026

2.6.0

Jan 29, 2026

2.5.0

Dec 20, 2025

2.4.1

Dec 19, 2025

2.4.0

Dec 17, 2025

2.3.0

Dec 16, 2025

2.2.0

Nov 29, 2025

0.6.1

May 1, 2025

0.6.0

May 1, 2025

0.5.0

May 1, 2025

This version

0.4.0

Jan 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

conversation_tk-0.4.0.tar.gz (22.9 kB view details)

Uploaded Jan 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

conversation_tk-0.4.0-py3-none-any.whl (25.7 kB view details)

Uploaded Jan 25, 2025 Python 3

File details

Details for the file conversation_tk-0.4.0.tar.gz.

File metadata

Download URL: conversation_tk-0.4.0.tar.gz
Upload date: Jan 25, 2025
Size: 22.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for conversation_tk-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`8661a5139e99299be269a436fdc425c06a26a9c792a437b9c2c823985785dd63`
MD5	`ffe450d1f717a3eff3fa2271735bec4f`
BLAKE2b-256	`7c7705caf3d5f128d281adf89e779def19e8cadc785df085bbd14588f593e326`

See more details on using hashes here.

File details

Details for the file conversation_tk-0.4.0-py3-none-any.whl.

File metadata

Download URL: conversation_tk-0.4.0-py3-none-any.whl
Upload date: Jan 25, 2025
Size: 25.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for conversation_tk-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9f122d736e50956c92d963cce235021a39aacc665f07166035da2e16190474bb`
MD5	`b5f1505a9260cf22f298822e9230c8db`
BLAKE2b-256	`10abc66f11da0d0728761a8fa3d4a575c01506a2c5ca6ee0f5140f21ee76f194`

See more details on using hashes here.

conversation-tk 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

ctk: Conversation Tree Toolkit

Table of Contents

Features

Installation

Prerequisites

Local Development Installation

End-User Installation Using pypi:

Configuration

Usage

Available Commands

1. list

2. search

3. jmespath

4. conversation

5. merge

6. export

7. dash

8. llm

Examples

Structure of conversations.json

Example conversations.json:

Response Format for LLM Queries

General Format:

Examples

Notes

Getting Help

Contributing

License

Acknowledgements

Contact

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`ctk`: Conversation Tree Toolkit

End-User Installation Using `pypi`:

1. `list`

2. `search`

3. `jmespath`

4. `conversation`

5. `merge`

6. `export`

7. `dash`

8. `llm`

Structure of `conversations.json`

Example `conversations.json`: