SVG-first slide generation toolkit with intelligent content planning and domain-specific layouts

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

icgma

These details have not been verified by PyPI

Project description

Slide Skill

Source material in. Polished, editable PowerPoint out.

An SVG-first slide generation toolkit for AI agents and the command line. Convert PDFs, Markdown, DOCX, or URLs into editable PPTX decks through a clean, inspectable SVG intermediate.

English README · 中文 README

Live preview — actual PPTX rendered through LibreOffice

Your browser does not support the video tag — download the MP4 or view the animated GIF.

_{Real screen recording of the generated .pptx: rendered via LibreOffice → PDF → 1920×1080 PNG frames → H.264 MP4 (16 s, 8 slides @ 2 s each, 348 KB). This is the actual file PowerPoint opens — no SVG approximation. Reproduce locally with slide-skill quickstart examples/sample.md --theme dark-tech: ~2 seconds, no API key, no LLM. Then open the .pptx in PowerPoint to edit every shape, text, and gradient natively.}

What it produces

Real example slides — five themes, six layouts. GitHub renders SVG inline, so what you see below is the actual artifact, not a screenshot.


_{Cover — dark-tech}	_{Bullet list — light-corporate}

_{Metric highlight — data-forward}	_{Two-column — warm-editorial}

_{Section divider — vibrant-startup}	_{Closing — dark-tech}

Source SVGs and per-file breakdown live in examples/. Drop them into any project's svg_output/ and run slide-skill export <project> to get a PPTX.

Why this exists

PowerPoint is the lingua franca of business and academia, but generating it from raw text has always been painful. Most tools either spit out a binary blob you can't inspect, or render slides as bitmaps that no one can edit.

Common pain	Typical tools	Slide Skill
Hard to inspect intermediate output	Binary `.pptx` only	Hand-readable SVG you can `cat` and diff
LLMs produce a wall of bullets	Fixed prompt → flat list	Layout-aware: cover, metric, two-column, quote…
Themes all look the same	One template	5 themes with full palette + type + layout specs
Visual QA is "open and look"	Manual	`check-svg`, `validate-pptx`, machine-readable `QA.md`
Gradients become flat colours in PPTX	Bitmap fallback	True `<a:gradFill>` rendered natively (v2.1)
AI authoring is one giant prompt	All-in-one	Strategist + Executor multi-role workflow

Quick start

pip install slide-skill

# Source → PPTX in one command
slide-skill quickstart your-notes.md --theme dark-tech

# Open exports/*.pptx in PowerPoint, Keynote, or Google Slides — fully editable.

Want more control? Use the multi-step pipeline:

slide-skill init my-deck --theme light-corporate
slide-skill source-to-md content.pdf -o projects/my-deck/sources/source.md
slide-skill spec projects/my-deck --source projects/my-deck/sources/source.md
slide-skill generate-guide projects/my-deck --source projects/my-deck/sources/source.md
# ...Executor (you or an LLM) writes svg_output/slide_NN.svg guided by design_guide.md
slide-skill check-svg projects/my-deck
slide-skill finalize-svg projects/my-deck
slide-skill export projects/my-deck

What's new in v3.0

🧠 Intelligent content planning — New content_planner module understands your source material and automatically picks the best layout per slide. Detects vocabulary lists, dialogues, metrics, process flows, and more from plain markdown.
🎓 Teaching domain — Specialized layouts for language education: vocab-card (1-4 large Chinese words with pinyin + translation), sentence-example (with annotations), dialogue (A/B conversation bubbles). Auto-sizing Chinese characters (shorter words get larger fonts).
📚 Course domain — Academic presentation layouts: learning-objectives, key-concept, case-study, discussion. Structured for classroom delivery.
🏆 Competition domain — Pitch deck layouts for student competitions (互联网+, 挑战杯): team-grid, metrics-dashboard, timeline, comparison-matrix. Maps to standard competition judging criteria.
🌐 Domain-aware configuration — Each domain has sensible defaults: teaching caps at 4 items/slide for clarity, competitions enforce time limits, courses balance density. Configure via ContentConfig(domain="teaching").
📊 460 passing tests — Comprehensive test coverage including 61 new tests for the v3.0 content planning layer.

What landed in v2.1 and still ships:

🎨 Native PowerPoint gradients — <linearGradient> and <radialGradient> in your SVG now render as true DrawingML <a:gradFill> in the exported PPTX (multi-stop, with correct angle math), not a flat mid-point colour. Open the .pptx in PowerPoint and the gradient is fully editable.
🎯 Polished auto-renderer — pure-Python templates now produce hero typography, gradient orbs, numbered bullet markers, and accent edges. The default quickstart flow looks presentable without any LLM in the loop.

What landed in v2.0 and still ships:

5 design themes — dark-tech · light-corporate · warm-editorial · data-forward · vibrant-startup
Multi-role workflow — Strategist plans, Executor authors SVG
Per-project design guide — design_guide.md generated from spec lock with full SVG examples
Permissive SVG QA — gradients, opacity, filters, transforms, classes, styles all allowed; only scripts, animations, and DOM event handlers are banned
AI SVG authoring prompt — slide-skill generate-guide writes a complete brief for the Executor role
170 passing tests — including end-to-end pipeline runs against two themes

Two ways to use it

Slide Skill ships two execution paths. Pick based on whether you have an LLM in the loop:

	Auto mode (default `quickstart`)	LLM Executor mode
Setup	`pip install slide-skill` — that's it	+ LLM API key (OpenAI / Claude / local)
Time to first PPTX	~2 seconds	~30–90 seconds
Visual ceiling	6 polished templates with gradients, decorative orbs, numbered bullets, hero typography	Hand-crafted per-slide SVG following `design_guide.md`
Layout selection	Heuristic (cover, bullet, metric, two-column, divider, closing)	LLM picks per slide
Deterministic	✅ same input → same output	⚠️ depends on model
Best for	Drafts, internal docs, recurring reports, CI pipelines	Pitch decks, conference talks, design-critical work
Edit afterwards	Yes — fully editable PPTX	Yes — fully editable PPTX

Auto mode samples (zero API key, real quickstart output):


_{Auto-rendered cover — dark-tech}	_{Auto-rendered bullet list — dark-tech}

_{Auto-rendered metrics — light-corporate}	_{Auto-rendered closing — dark-tech}

Per-slide stills (rasterised PNGs of the same PPTX, for offline viewing):

Compare with the Pipeline output section below (auto-mode end-to-end with QA reports), and the What it produces showcase above (LLM-Executor reference targets).

## 🌏 中文 / CJK support

All five themes now ship with a CJK font fallback chain (Microsoft YaHei → PingFang SC → Noto Sans SC → Source Han Sans SC). Chinese, Japanese, Korean input renders natively in PowerPoint, Keynote, Google Slides, and (with `noto-fonts-cjk` installed) LibreOffice.

A ready-to-run Chinese sample lives at [`examples/sample.zh-CN.md`](examples/sample.zh-CN.md):

```bash
slide-skill quickstart examples/sample.zh-CN.md --theme dark-tech
```

Real recording of the resulting `.pptx` (rendered through LibreOffice → PDF → MP4):

<div align="center">

<video src="https://github.com/icgma/slide-skill/raw/master/examples/auto-render/zh-CN/preview-zh.mp4" controls autoplay loop muted playsinline width="720" poster="examples/auto-render/zh-CN/slide_03.png">
  Your browser doesn't support inline video — <a href="examples/auto-render/zh-CN/preview-zh.mp4">download MP4</a> or view <a href="examples/auto-render/zh-CN/preview-zh.gif">animated GIF</a>.
</video>

<sub>9-slide Chinese deck rendered with the same dark-tech theme. Same 2-second pipeline, same zero-API-key promise.</sub>

</div>

<table>
<tr>
<td width="50%"><img src="examples/auto-render/zh-CN/slide_01.png" alt="封面 (Cover slide)" /></td>
<td width="50%"><img src="examples/auto-render/zh-CN/slide_03.png" alt="核心数据 (Metrics slide)" /></td>
</tr>
</table>

---

## 🤖 Use as an AI agent skill

`slide-skill` ships with a [`SKILL.md`](SKILL.md) at the repo root, written in the Anthropic Claude Code / Replit Agent skill format. Drop it into any agent that supports the skills convention and the agent will automatically reach for it whenever a user asks for slides, a deck, a presentation, or "做一份 PPT".

**Claude Code** (`~/.claude/skills/`):

```bash
git clone https://github.com/icgma/slide-skill.git ~/.claude/skills/slide-skill
```

**Replit Agent** (`.local/skills/` in your project):

```bash
git clone https://github.com/icgma/slide-skill.git .local/skills/slide-skill
```

**Cursor / other agents**: copy [`SKILL.md`](SKILL.md) into your rules/skills directory, install the package (`pip install -e tools/slide`), and the agent will know to call `slide-skill quickstart <input.md> --theme <theme>` for any slide-related request.

The skill file documents:
- When to activate (English + Chinese trigger phrases)
- The single command that does 90% of the work
- How to choose between the 5 themes
- Markdown authoring conventions (headings, bullets, **bold numbers** for metrics, `### A / ### B` for comparisons)
- The decision flow the agent should follow

---

Pipeline output (real end-to-end run)

The slides above are hand-crafted reference targets. Below is what the pipeline actually produced running slide-skill quickstart against examples/sample.md — an 8-slide "AI-Powered Analytics Platform" deck — once for each of two themes:

Theme	SVGs	PPTX	QA report	Visual sample
`dark-tech`	svg_output/	deck.pptx	QA.md · SVG-QA.md
`light-corporate`	svg_output/	deck.pptx	QA.md · SVG-QA.md

Both runs report status: automated-passed — PPTX Package ✓, SVG Gate ✓, Placeholder Scan ✓. See examples/RUN-LOG.md for exact commands, outputs, and a known follow-up (text overflow on long-paragraph slides).

The reference decks above showcase the design ceiling. The pipeline output here shows the current floor. Every release should narrow that gap.

How it works

┌──────────┐    ┌────────┐    ┌──────────┐    ┌──────────┐
│ Source   │ ─→ │ Spec   │ ─→ │ SVG      │ ─→ │ PPTX     │
│ md/pdf   │    │ JSON   │    │ output/  │    │ deck     │
│ docx/url │    │        │    │ slide_NN │    │ editable │
└──────────┘    └────────┘    └──────────┘    └──────────┘
                    ▲              ▲                ▲
              Strategist      Executor          Export +
              spec +          reads guide,      QA gates:
              generate-guide  writes SVGs       check-svg
                                                validate-pptx

The SVG intermediate is the hero. It is a hand-readable file you can:

Diff in code review
Tweak in any editor (Inkscape, browser, VSCode)
Run through automated QA before binary export
Few-shot into LLM prompts for layout consistency
Hand to a designer for one-off polish without round-tripping through PPTX

Design themes

Theme	Background	Accent	Best for
`dark-tech`	`#0F172A`	`#3B82F6`	Engineering, SaaS, research
`light-corporate`	`#FFFFFF`	`#1D4ED8`	Business, corporate
`warm-editorial`	`#FDF6EE`	`#EA580C`	Humanities, editorial
`data-forward`	`#F1F5F9`	`#0284C7`	Analytics, dashboards
`vibrant-startup`	`#FFFFFF`	`#7C3AED`	Startups, pitch decks

slide-skill themes    # list all themes
slide-skill formats   # list all canvas formats

Multi-role workflow

Strategist role

Prepares all planning artifacts before any SVG is written.

Normalize source: slide-skill source-to-md <file>
Create workspace: slide-skill init <name> --theme <theme>
Write design artifacts: slide-skill spec <project> --source <md> --theme <theme>
Write AI prompt: slide-skill generate-guide <project> --source <md>

Executor role

Writes SVG files guided by the planning artifacts.

Read design_guide.md — palette, typography, layout examples
Read svg_generation_prompt.md — per-slide content breakdown
Write svg_output/slide_NN.svg for each slide
Validate: slide-skill check-svg <project>

SVG authoring rules

Allowed: rect circle ellipse line text tspan image path polygon polyline g defs linearGradient radialGradient stop filter feGaussianBlur feOffset feFlood feComposite feMerge feMergeNode clipPath pattern use

Banned (hard error): script foreignObject iframe animate animateTransform set animateMotion

Banned attributes: onclick onload on* (any DOM event handler)

Fully permitted in v2.0+: opacity fill-opacity transform class style, and fill="url(#local-id)" gradient references — now rendered natively in PPTX.

Layout templates

Template	Use when
`cover`	Deck title / first slide
`section-divider`	Heading with no body text
`bullet-list`	3–7 bullet points
`two-column`	Side-by-side comparison
`metric-highlight`	2–4 large numbers / percentages
`quote`	A single strong quote
`closing`	Thank-you / CTA slide

Full SVG examples for each template are in the generated design_guide.md.

Chrome requirements (every slide)

<!-- Left accent stripe (required) -->
<g id="chrome-stripe">
  <rect x="0" y="0" width="6" height="720" fill="{accent}" />
</g>

<!-- Footer bar (required) -->
<g id="chrome-footer">
  <rect x="0" y="688" width="1280" height="32" fill="{surface}" />
  <text x="1184" y="708" ...>NN / TT</text>
</g>

Speaker notes

Write notes to <project>/notes/total.md:

## Slide 1
Opening remarks.

## Slide 2
Key insight: market opportunity is $50B.

Notes are embedded in the PPTX automatically during export.

Student competition toolkit

slide-skill competitions                    # list templates
slide-skill init <name> --competition internet-plus
slide-skill rehearse <project>              # timing analysis
slide-skill draft-notes <project>           # generate notes draft

Templates: internet-plus · challenge-cup · math-modeling · innovation-training · thesis-defense · course-presentation

TTS narration

# Edge TTS (default — free, offline-friendly)
slide-skill narrate <project> --engine edge-tts --voice zh-CN-XiaoxiaoNeural

# MiMo voice cloning
slide-skill narrate <project> --engine mimo --voice-clone sample.mp3

# MiMo voice design
slide-skill narrate <project> --engine mimo --voice-design "gentle female voice"

Development

# Run the test suite (460 tests)
pytest tests/ -v

# Lint / type-check helpers (if configured)
python -m slide_skill.cli --help

The skills/slide/ directory contains the agent skill documentation:

SKILL.md — main skill entry point for AI agents
guides/intake.md — source conversion and project setup
guides/svg-pipeline.md — design guide, SVG rules, finalization
guides/export.md — PPTX export and validation
guides/editing.md — template operations
guides/qa.md — QA loop and artifact expectations

License

MIT — see LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

icgma

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

3.0.1

May 17, 2026

3.0.0

May 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slide_skill-3.0.1.tar.gz (193.3 kB view details)

Uploaded May 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

slide_skill-3.0.1-py3-none-any.whl (163.5 kB view details)

Uploaded May 17, 2026 Python 3

File details

Details for the file slide_skill-3.0.1.tar.gz.

File metadata

Download URL: slide_skill-3.0.1.tar.gz
Upload date: May 17, 2026
Size: 193.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for slide_skill-3.0.1.tar.gz
Algorithm	Hash digest
SHA256	`6e034d4310cd7f4a1e591d769466a5761a10ff08cdb81453d841494113700f3d`
MD5	`2a48b374302b4db31e7b7c67b235d4f7`
BLAKE2b-256	`9461e0e924d962948de903f8f901376837f6287be5bbcef81dcfe14932a42f1a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for slide_skill-3.0.1.tar.gz:

Publisher: publish.yml on icgma/slide-skill

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: slide_skill-3.0.1.tar.gz
- Subject digest: 6e034d4310cd7f4a1e591d769466a5761a10ff08cdb81453d841494113700f3d
- Sigstore transparency entry: 1563590544
- Sigstore integration time: May 17, 2026
Source repository:
- Permalink: icgma/slide-skill@35a97d9b58089199d1e5e3c54bc23abccd676456
- Branch / Tag: refs/tags/v3.0.1
- Owner: https://github.com/icgma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@35a97d9b58089199d1e5e3c54bc23abccd676456
- Trigger Event: push

File details

Details for the file slide_skill-3.0.1-py3-none-any.whl.

File metadata

Download URL: slide_skill-3.0.1-py3-none-any.whl
Upload date: May 17, 2026
Size: 163.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for slide_skill-3.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`869937b7fdb01e5f79efeae700e2f8992a9401f56e237f83c9f14af909108121`
MD5	`2976aed4cd0ff1484723a41d0547cff8`
BLAKE2b-256	`a598de4875af6b1a176f49eafe169d6b062fd127853789565ebf0ca2a381482d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for slide_skill-3.0.1-py3-none-any.whl:

Publisher: publish.yml on icgma/slide-skill

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: slide_skill-3.0.1-py3-none-any.whl
- Subject digest: 869937b7fdb01e5f79efeae700e2f8992a9401f56e237f83c9f14af909108121
- Sigstore transparency entry: 1563590618
- Sigstore integration time: May 17, 2026
Source repository:
- Permalink: icgma/slide-skill@35a97d9b58089199d1e5e3c54bc23abccd676456
- Branch / Tag: refs/tags/v3.0.1
- Owner: https://github.com/icgma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@35a97d9b58089199d1e5e3c54bc23abccd676456
- Trigger Event: push

slide-skill 3.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Slide Skill

Source material in. Polished, editable PowerPoint out.

Live preview — actual PPTX rendered through LibreOffice

What it produces

Why this exists

Quick start

What's new in v3.0

Two ways to use it

Pipeline output (real end-to-end run)

How it works

Design themes

Multi-role workflow

Strategist role

Executor role

SVG authoring rules

Layout templates

Chrome requirements (every slide)

Speaker notes

Student competition toolkit

TTS narration

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance