Professional spreadsheet wrangling utilities for parsing, splitting, and expanding schedule data.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

drkhlick

These details have not been verified by PyPI

Project description

ScheduleTools

A Python library for parsing, splitting, and expanding schedule data from various formats.

Features

ScheduleParser: Parse tab-delimited schedule files with configurable date column names
ScheduleSplitter: Split schedule data by groups and apply filters
ScheduleExpander: Expand data to include required columns with mappings and defaults
CLI Interface: Command-line tools for batch processing
Flexible Configuration: JSON-based configuration with inheritance and validation

Installation

pip install scheduletools

Workflow Quick Start

from scheduletools import ScheduleParser, ScheduleSplitter, ScheduleExpander

# 1. Parse schedule data
parser = ScheduleParser("schedule.txt")
parsed_data = parser.parse()

# 2. Split by team
splitter = ScheduleSplitter(parsed_data, "Team")
team_schedules = splitter.split()

# 3. Expand with additional columns
expander = ScheduleExpander(team_schedules["E"], config.json)
expanded_data = expander.expand()

Complete Workflow Example

This example demonstrates the full transformation from wide blocked schedules to long schedules, then expansion and splitting.

Step 1: Parse Block Schedule

Start with a wide blocked schedule format:


Date	Time	Date	Time
	6 pm - 7:15 pm		6:00 pm - 7:00 pm	7:00 pm - 8:00 pm	8:15 pm - 9:15 pm
7/21/2025	E / F	7/22/2025	C / D	F	E
7/28/2025	E / F	7/29/2025	A / B	F	E

from scheduletools import ScheduleParser

# Parse with default "Date" column and reference date
parser = ScheduleParser("schedule.txt", reference_date="2025-07-21")
parsed_data = parser.parse()

Output - Long Format Schedule:

Index	Week	Day	Date	Start Time	Duration	Team
0	0	Monday	7/21/2025	6:00 PM	1:15	E
1	0	Monday	7/21/2025	6:00 PM	1:15	F
2	0	Tuesday	7/22/2025	6:00 PM	1:00	C
3	0	Tuesday	7/22/2025	6:00 PM	1:00	D
4	0	Tuesday	7/22/2025	7:00 PM	1:00	F
5	0	Tuesday	7/22/2025	8:15 PM	1:00	E
6	1	Monday	7/28/2025	6:00 PM	1:15	E
7	1	Monday	7/28/2025	6:00 PM	1:15	F
8	1	Tuesday	7/29/2025	6:00 PM	1:00	A
9	1	Tuesday	7/29/2025	6:00 PM	1:00	B
10	1	Tuesday	7/29/2025	7:00 PM	1:00	F
11	1	Tuesday	7/29/2025	8:15 PM	1:00	E

Step 2: Expand with Required Fields

from scheduletools import ScheduleExpander

# Configure expansion with required fields, defaults, and mappings
config = {
    "Required": [
        "Date",
        "Time", 
        "Duration",
        "Arrival Time",
        "Name",
        "Location Name",
        "Notes"
    ],
    "defaults": {
        "Name": "On-Ice Practice",
        "Location Name": "PISC",
        "Arrival Time": 15
    },
    "Mapping": {
        "Start Time": "Time",
        "Team": "Notes"
    }
}

expander = ScheduleExpander(parsed_data, config)
expanded_data = expander.expand()

Output - Expanded Schedule:

Date	Time	Duration	Arrival Time	Name	Location Name	Notes
7/21/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	E
7/21/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	F
7/22/2025	6:00 PM	1:00	15	On-Ice Practice	PISC	C
7/22/2025	6:00 PM	1:00	15	On-Ice Practice	PISC	D
7/22/2025	7:00 PM	1:00	15	On-Ice Practice	PISC	F
7/22/2025	8:15 PM	1:00	15	On-Ice Practice	PISC	E
7/28/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	E
7/28/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	F
7/29/2025	6:00 PM	1:00	15	On-Ice Practice	PISC	A
7/29/2025	6:00 PM	1:00	15	On-Ice Practice	PISC	B
7/29/2025	7:00 PM	1:00	15	On-Ice Practice	PISC	F
7/29/2025	8:15 PM	1:00	15	On-Ice Practice	PISC	E

Step 3: Split by Team

from scheduletools import ScheduleSplitter

# Split by the Notes column (which contains team names)
splitter = ScheduleSplitter(expanded_data, "Notes")
team_schedules = splitter.split()

# Show available team keys
print("Available teams:", list(team_schedules.keys()))

Output:

Available teams:'A', 'B', 'C', 'D', 'E', 'F'

Example - Team E Schedule: print(team_schedules['E'])

Date	Time	Duration	Arrival Time	Name	Location Name	Notes
7/21/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	E
7/22/2025	8:15 PM	1:00	15	On-Ice Practice	PISC	E
7/28/2025	6:00 PM	1:15	15	On-Ice Practice	PISC	E
7/29/2025	8:15 PM	1:00	15	On-Ice Practice	PISC	E

ScheduleParser

Parse tab-delimited schedule files with flexible date column detection.

Input Format

ScheduleParser expects tab-delimited files with blocks starting at rows containing your specified date column name (default: "Date"):

Contents of schedule.txt:

1|Monday      → Tuesday     →                    
2|Date        → Time        → Date         → Time        →             →            
3|            → 6:00–7:15pm →              → 6:00–7:00pm → 7:00–8:00pm → 8:15–9:15pm
4|7/21/2025   → E / F       → 7/22/2025    → C / D       → F           → E
5|7/28/2025   → E / F       → 7/29/2025    → A / B       → F           → E

Note: → indicates an inserted tab.

Usage

from scheduletools import ScheduleParser

# Basic usage with default "Date" column name
parser = ScheduleParser("schedule.txt")
data = parser.parse()

# Custom date column name
parser = ScheduleParser("schedule.txt", date_column_name="Day")
data = parser.parse()

# With configuration file
parser = ScheduleParser("schedule.txt", config_path="config.json")
data = parser.parse()

# With config object
config = {"Format": {"Date": "%Y-%m-%d"}}
parser = ScheduleParser("schedule.txt", config=config)
data = parser.parse()

# With custom output column name
config = {"Output": {"value_column_name": "Player"}}
parser = ScheduleParser("schedule.txt", config=config)
data = parser.parse()

Configuration

1|{
2|  "Format": {
3|    "Date": "%m/%d/%Y",
4|    "Time": "%I:%M %p",
5|    "Duration": "H:MM"
6|  },
7|  "Block Detection": {
8|    "date_column_name": "Date"
9|  },
10|  "Missing Values": {
11|    "Omit": true,
12|    "Replacement": "TBD"
13|  },
14|  "Split": {
15|    "Skip": false,
16|    "Separator": ","
17|  },
18|  "Output": {
19|    "value_column_name": "Team"
20|  }
21|}

Configuration Sections

Format: Date, time, and duration format specifications
Block Detection: Date column name for identifying schedule blocks
Missing Values: How to handle empty or missing team entries
Split: Team entry splitting configuration (separator, skip options)
Output: Output column naming (e.g., "Team", "Player", "Group")

ScheduleSplitter

Split schedule data into multiple DataFrames based on grouping criteria. ScheduleSplitter creates separate DataFrames for each unique combination of values in the specified grouping columns, making it easy to work with subsets of your data.

Basic Usage

from scheduletools import ScheduleSplitter

# Split by single column
splitter = ScheduleSplitter(df, "Team")
team_schedules = splitter.split()

# Split by multiple columns
splitter = ScheduleSplitter(df, ["Team", "Week"])
schedules = splitter.split()

Advanced Usage

from scheduletools import ScheduleSplitter

# With filtering
splitter = ScheduleSplitter(
    df, 
    "Team", 
    include_values=["Team_A", "Team_B"],
    exclude_values=["Team_C"]
)
filtered_schedules = splitter.split()

ScheduleExpander

Expand schedule data to include required columns with mappings and defaults.

Usage

from scheduletools import ScheduleExpander

config = {
    "Required": ["Date", "Time", "Team", "Location", "Status"],
    "defaults": {
        "Location": "Main Arena",
        "Status": "Scheduled"
    },
    "Mapping": {
        "Start Time": "Time"
    }
}

expander = ScheduleExpander(data, config)
expanded_data = expander.expand()

CLI Usage

# Parse schedule
scheduletools parse schedule.txt -o output.csv

# Split data
scheduletools split data.csv --groupby Team -o split/

# Expand data
scheduletools expand data.csv config.json -o expanded.csv

Splitting Data

ScheduleSplitter provides powerful data splitting capabilities:

Dictionary Output: Returns a dictionary where keys are group identifiers and values are DataFrames
Filtering: Include or exclude specific values using include_values and exclude_values parameters
Multi-column Grouping: Split by multiple columns simultaneously for complex data organization

Changelog

0.4.0

Added multiple output column mapping support to ScheduleExpander
Enhanced ScheduleExpander with comprehensive validation for input/output columns
Improved error messages with detailed context for missing columns
Refactored expand() method into modular, focused functions
Updated Python compatibility to require Python 3.12+ (supports 3.12 and 3.13)
Updated development tools (Black, MyPy) to target Python 3.12

0.3.3

Added configurable output column name to ScheduleParser (default: "Team")
Updated README with comprehensive workflow examples using new team values (A-F)
Enhanced documentation with step-by-step transformation examples
Improved configuration options with new Output section
Maintained backward compatibility with default "Team" column name
Implemented dynamic versioning using setuptools-scm

0.3.2

Renamed CSVSplitter to ScheduleSplitter for better clarity
Updated documentation to reflect the new class name
Improved class descriptions to emphasize schedule data processing

0.3.0

Added configurable date column names (default: "Date")
Improved block detection and parsing logic
Added config object support for ScheduleParser
Removed meta pattern validation, now only validates date column
Combined block extraction and processing loops for better performance
Enhanced error handling and validation

0.2.0

Added configurable block start markers
Enhanced block detection strategies
Added config object support
Improved CLI integration
Added comprehensive test coverage

0.1.0

Initial release
Basic schedule parsing functionality
CSV splitting capabilities
Data expansion features

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

drkhlick

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.4.0

Sep 26, 2025

0.3.3

Jul 10, 2025

0.3.2

Jul 10, 2025

0.3.0

Jul 10, 2025

0.2.0

Jul 10, 2025

0.1.0

Jul 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scheduletools-0.4.0.tar.gz (79.7 kB view details)

Uploaded Sep 26, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

scheduletools-0.4.0-py3-none-any.whl (49.9 kB view details)

Uploaded Sep 26, 2025 Python 3

File details

Details for the file scheduletools-0.4.0.tar.gz.

File metadata

Download URL: scheduletools-0.4.0.tar.gz
Upload date: Sep 26, 2025
Size: 79.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scheduletools-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`1620fc91d7f94a0b387c4228ce54e03db046d8674cc672842a8ca4632fe29754`
MD5	`3f45189587ac44f47ad62f04399b6999`
BLAKE2b-256	`002a03d4356437f9333b2c56a19314543706694b92e517e6547e1d55353fa24d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for scheduletools-0.4.0.tar.gz:

Publisher: publish.yml on Khlick/scheduletools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scheduletools-0.4.0.tar.gz
- Subject digest: 1620fc91d7f94a0b387c4228ce54e03db046d8674cc672842a8ca4632fe29754
- Sigstore transparency entry: 563917011
- Sigstore integration time: Sep 26, 2025
Source repository:
- Permalink: Khlick/scheduletools@05cad211add58258e98ee2feb175390e39582e40
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/Khlick
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@05cad211add58258e98ee2feb175390e39582e40
- Trigger Event: push

File details

Details for the file scheduletools-0.4.0-py3-none-any.whl.

File metadata

Download URL: scheduletools-0.4.0-py3-none-any.whl
Upload date: Sep 26, 2025
Size: 49.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scheduletools-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`60127efbe519b7be35286980514cce28f9be9ecbc72672da5a582e49e590121f`
MD5	`16e585edef2e6ef2e7522afdfca401ed`
BLAKE2b-256	`a6f0b0ca18f308570798a0abe58704484463cdeb1d71fcbb9ed47ccd30175c65`

See more details on using hashes here.

Provenance

The following attestation bundles were made for scheduletools-0.4.0-py3-none-any.whl:

Publisher: publish.yml on Khlick/scheduletools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scheduletools-0.4.0-py3-none-any.whl
- Subject digest: 60127efbe519b7be35286980514cce28f9be9ecbc72672da5a582e49e590121f
- Sigstore transparency entry: 563917016
- Sigstore integration time: Sep 26, 2025
Source repository:
- Permalink: Khlick/scheduletools@05cad211add58258e98ee2feb175390e39582e40
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/Khlick
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@05cad211add58258e98ee2feb175390e39582e40
- Trigger Event: push

scheduletools 0.4.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ScheduleTools

Features

Installation

Workflow Quick Start

Complete Workflow Example

Step 1: Parse Block Schedule

Step 2: Expand with Required Fields

Step 3: Split by Team

ScheduleParser

Input Format

Usage

Configuration

Configuration Sections

ScheduleSplitter

Basic Usage

Advanced Usage

ScheduleExpander

Usage

CLI Usage

Splitting Data

Changelog

0.4.0

0.3.3

0.3.2

0.3.0

0.2.0

0.1.0

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance