This package helps data scientists and data analysts capture notes while they work through data science tasks. The captured notes can then be searched and analyzed.

Project description

Knowledge Management for Data Science

What is the tool and why do you need it?

This tool is used for knowledge management in data science. For data scientists, incremental experimentation is a way of life. The problem is that we run many experiments, and even small projects accumulate context, decisions, and rationale over time. This would not be a problem if both the need for the experimentation (the design question or issue) and its results were documented as the work happened, but this tends to be done in an ad hoc manner, so when it is time to rebuild or revisit a particular question, we cannot find the research and the results related to it. This is the need this tool fulfills.

Please see knowledge application development context for a description of a typical knowledge application development setting. Please see the video for a quick overview.

How is it related to process guidelines and vocabularies for machine learning?

Initiatives such as CRISP-DM provide guidelines and processes for developing data science projects. Projects such as OpenML provide semantic vocabulary standardization for machine learning tasks. These are excellent tools, but the guidelines they provide are task focused. The gap between a conceptual idea and the final, or even candidate, data science tasks for a project is filled with many assumptions and experimental evaluations. This tool aims to capture the information in those assumptions and experimental evaluations, as well as the ordering among them.

Who would use this tool?

This tool is for data scientists and data analysts.

How do you use this tool?

This version of the tool takes recent advances (as of early 2026) into consideration in how it is used. It is a Python package. It is assumed that you have an API key for a provider. The basic usage scenario is as follows:

  1. Install the Python package in the environment where you intend to experiment and do your data science analysis.
  2. Work through your analysis plan for your model development or experiment. The tool does not offer help with how your analysis or experiment is done; it assumes you are the expert and know how to do this. Of course, you can use Jupyternaut or a similar generative AI tool to generate your code for you.
  3. As you work through your exploratory data analysis, data representation, and modeling phases, log your findings to kmds.
  4. Run a report to fetch the details of your design rationale as needed. To communicate your findings to your team or management, simply export your knowledge base, point a generative AI tool such as NotebookLM at the export, and generate your report, video, or other documentation artifact.
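The logging-and-export workflow above can be sketched with a minimal stand-in. This is illustrative Python only: the class and method names here are hypothetical and are not the actual kmds API, so consult the package documentation for the real calls.

```python
import json
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical stand-in for a kmds-style knowledge log; the real
# package's API may differ -- see the kmds documentation.
@dataclass
class ObservationLog:
    observations: list = field(default_factory=list)

    def log(self, phase: str, finding: str) -> None:
        # Record one finding, tagged with its analysis phase.
        self.observations.append({
            "phase": phase,  # e.g. "eda", "representation", "modeling"
            "finding": finding,
            "timestamp": datetime.now(timezone.utc).isoformat(),
        })

    def export(self, path: str) -> None:
        # Export the knowledge base for reporting, e.g. as input
        # to a generative AI tool such as NotebookLM.
        with open(path, "w") as f:
            json.dump(self.observations, f, indent=2)

log = ObservationLog()
log.log("eda", "Ticket volume is heavily skewed toward a few categories.")
log.log("modeling", "Two principal components explain most of the variance.")
log.export("knowledge_base.json")
```

The key design point the sketch illustrates is that findings are captured per phase as the work happens, so the exported knowledge base preserves both the rationale and its ordering.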

For a video excerpt of the design considerations, see this video.

Examples of use

The repository contains two examples of use: one from analytics and one from machine learning. The notebooks for analytics are in the analytics example, and the notebooks for machine learning are in the machine learning example. The analytics example evaluates the ticket-resolution effectiveness of a help desk. Using ticket resolution data for a particular quarter, Q2 2016, the example illustrates how the effectiveness of the organization can be evaluated. The reader can explore the notebooks to see the details of the implementation and of how findings in each phase of the model building cycle are logged. The findings in the resulting knowledge base can be exported to create materials that communicate the details of the project to team members and management; see this video and this infographic.

The machine learning example illustrates how Principal Components Analysis can be used to summarize the sales activity in an online store for a particular quarter. The reader can view the notebooks under the machine learning example for details of the implementation. As with the analytics example, generative AI tools (NotebookLM in this case) can be used to communicate the findings and results from the knowledge base; see this infographic.
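To make the PCA idea concrete, here is a minimal sketch of summarizing activity data with principal components. The data here is synthetic and the shapes are made up for illustration; the real pipeline and data live in the repository's machine learning notebooks.

```python
import numpy as np

# Illustrative only: a synthetic "sales activity" matrix
# (rows = products, columns = weekly sales features for one quarter).
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 13))  # 50 products x 13 weeks

# Center the data, then eigendecompose the covariance matrix --
# the classic PCA recipe.
Xc = X - X.mean(axis=0)
cov = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)

# eigh returns eigenvalues in ascending order; reverse so components
# come out in order of explained variance.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Project onto the top two components: a two-number summary per product.
scores = Xc @ eigvecs[:, :2]
explained = eigvals[:2].sum() / eigvals.sum()
print(f"Top 2 components explain {explained:.0%} of the variance")
```

On real quarterly sales data, the leading components typically capture dominant patterns such as overall volume and seasonal trend, which is what makes PCA useful as a summary.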

Licensing and Feature Questions

  1. The tool is open source under the Apache 2.0 license.

  2. If you are interested in the following features, please set up a meeting with me:

    1. Help with a data analysis task for your use case.
    2. Developing an ontology-based solution, similar to the above, for your specific use case.
    3. Customizing this tool with other extensions, for example to integrate a feature store or metadata management tool that you use in your data science tool stack.
  3. If this problem resonates with you as a developer and you would like to contribute, submit an issue; if the feature makes sense, we can discuss the possibility of a PR for it. Of course, you can also fork this repository and use it per the licensing information. Thank you for your interest.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kmds-0.3.0.tar.gz (5.7 MB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kmds-0.3.0-py3-none-any.whl (5.9 MB)

Uploaded Python 3

File details

Details for the file kmds-0.3.0.tar.gz.

File metadata

  • Download URL: kmds-0.3.0.tar.gz
  • Upload date:
  • Size: 5.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for kmds-0.3.0.tar.gz
Algorithm Hash digest
SHA256 ce782703e779948e6ea69b59d90762e2c62e845b1784efed0cdc5b153be371d3
MD5 268a90298b466f8f74aceb157ba86f6d
BLAKE2b-256 a5899cf3846837021d6248b0657aa2dcd3f38974c8bc3b0e1568622432995316

See more details on using hashes here.

File details

Details for the file kmds-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: kmds-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 5.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for kmds-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6ec968935835cf4b735d0653af83b9f1a2a9f89fe8340afc7586a8d784163055
MD5 4b9504bd404bc9143e1ac30c3f21da92
BLAKE2b-256 236995cc30cb40d75f25f19fe0a05e45171dea058142b67ce8adbd1dcc1fb41f

See more details on using hashes here.
