Skip to main content

A comprehensive collection of data science, analysis, and engineering skills and scripts.

Project description

Claude Data Skills 🐍📊

A professional-grade collection of data science, analysis, and engineering skills and scripts for AI-assisted development.

PyPI Version License: MIT Python 3.9+

Overview

claude-data-skills is a comprehensive library designed to enhance AI-assisted data workflows. It provides a structured collection of "skills"—reusable, idiomatic patterns and scripts for everything from advanced standard library usage to complex machine learning pipelines and professional development workflows.

Key Features

  • 🚀 Professional Python Core: Unified expert guide for PEP-8, Pydantic, Pytest, and high-performance parallelism.
  • 📊 Data Analysis Pro: Consolidated power-user guide for NumPy, Pandas, and Polars. Unified strategy for scaling from KB to 100GB+.
  • 🕸️ Full Spectrum Graph Sieve (GraphRAG): Advanced agentic workflow for extracting relationship-aware domain knowledge from internal docs (.docx, .one, .msg, .pdf). Uses a 4-gate verifiable pipeline with local models (Ollama).
  • ⚡ Superpowers Workflow: Integrated skills for brainstorming, TDD, systematic debugging, and plan execution.
  • 🛡️ Data Safety First: Built-in guardrails to prevent accidental data loss or corruption during autonomous execution.
  • 📈 Visualization Pro: Expert guide for Plotly (interactive), Dash (dashboards), and Seaborn (static stats).
  • 🗄️ Database Pro: Unified access for SQL (Postgres), SQLAlchemy (ORM), Elasticsearch, and S3.
  • 📁 Document Processing Pro: Consolidated expert guide for PDF, Word (DOCX), Excel (XLSX), and PowerPoint (PPTX).
  • 🔬 Scientific Research Suite: Unified guide for the entire scientific lifecycle: brainstorming, writing (IMRAD), and peer review.
  • 📚 AI-Ready Dictionary Agent: Automated extraction of technical terms from PDF/PPTX into a structured knowledge base with fuzzy-search lookup tools.
  • 🔄 Legacy Migration Suite: Specialized patterns for migrating C#, MATLAB, and Python 2 code to modern Python ( 3.9+).

Installation

Install the package directly from PyPI:

pip install claude-data-skills

Quick Start

Post-Installation Setup

After installing the package, run the following command to copy the necessary skills files to your user's Claude home directory (~/.claude/skills):

setup-claude-skills

Using the CLI

The package includes several built-in commands. For example, to run the standard library demonstration:

stdlib-demo

Importing Skills

You can import advanced utility patterns directly into your own scripts:

from skills.python_dev.python_stdlib_pro.scripts.stdlib_demo import test_pathlib

# Run a verified pathlib pattern
test_pathlib()

Core Principles

  • Resource Aware: Every intensive task starts with hardware resource validation.
  • LLM Optimized: Scripts are dense, idiomatic, and contain strict guardrails for local/open-source LLMs.
  • Atomic Operations: Prevents file corruption by using temp-and-replace patterns for all writes.

Project Structure

skills/
├── core-workflow/          # Brainstorming, TDD, Debugging, Plans
├── data-analysis/          # Data Analysis Pro (NumPy, Pandas, Polars), Geopandas
├── data-sources/           # Database Pro (Postgres, SQLAlchemy, ES, S3)
├── machine-learning/       # ML-Classical, ML-Deep-Learning, PyMC
├── python-dev/             # Python Core Pro, Legacy Migration Suite, Logic Recovery
├── scientific-workflow/    # Scientific Research Suite
└── unstructured-data/      # Document Processing Pro, Binary Data Parsing

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Created and maintained by Yoni Kremer.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

claude_data_skills-3.5.0.tar.gz (988.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

claude_data_skills-3.5.0-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file claude_data_skills-3.5.0.tar.gz.

File metadata

  • Download URL: claude_data_skills-3.5.0.tar.gz
  • Upload date:
  • Size: 988.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for claude_data_skills-3.5.0.tar.gz
Algorithm Hash digest
SHA256 5ea5ea63de4df69169e70f07da881b5b4ec02bbebae1826df95a5365c7df9ddf
MD5 defb197d0ae387ae3aab454afe1db5ab
BLAKE2b-256 29a47c8e85eff4cd624c4c7cc94e1a8560cf2a824a77e8c673f55117554dff34

See more details on using hashes here.

File details

Details for the file claude_data_skills-3.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for claude_data_skills-3.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9dfa734130dfa89fbc817de38b1990a35d11568c7e66e63b86a2413e30467614
MD5 f213eff9ca0ed2b6f9357af2f1471367
BLAKE2b-256 c5a2d13e3a278327178ae0cb6f6eabbbf89cf71040af4ddbafedcb39cf49a7e9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page