AI-friendly llms.txt & sitemap generator for websites. Crawl or import sitemaps, filter pages, and generate curated indexes for LLMs.
Project description
LLMS Sitemap Generator
AI-friendly llms.txt & sitemap generator / 面向 LLM 的站点地图生成器
Automatically collect, filter, and organize website URLs into curated indexes for LLMs and search engines.
自动收集、过滤和组织网站 URL,生成面向 LLM 和搜索引擎的策展式索引。
✨ Features / 功能特性
- 🤖 LLM Training / LLM 训练: Generate
llms.txtfiles for AI model training - 🔍 SEO Optimization / SEO 优化: Create comprehensive sitemaps
- 📊 Content Discovery / 内容发现: Automatically discover and catalog all pages
- 🌐 Multi-language / 多语言: Smart language filtering
- 🏢 Enterprise Sites / 企业网站: Handle complex B2B SaaS sites with multiple subdomains
📦 Installation / 安装
# Basic / 基础安装
pip install llms-sitemap-generator
# With GUI / 带 GUI 支持
pip install llms-sitemap-generator[gui]
🎯 Quick Start / 快速开始
1. Analyze site / 分析站点
llms-sitemap-generator analyze https://example.com
2. Generate / 生成
# Generate llms.txt / 生成 llms.txt
llms-sitemap-generator generate
# Dry-run / 预览模式
llms-sitemap-generator generate --dry-run --max-pages 100
3. GUI / 图形界面
llms-sitemap-generator gui
📝 Configuration / 配置示例
Minimal llmstxt.config.yml:
site:
base_url: "https://example.com"
default_language: "en"
sources:
- type: "sitemap"
url: "https://example.com/sitemap.xml"
filters:
include:
- pattern: "^/products"
group: "Products"
exclude:
- pattern: "blog"
max_urls: 1000
auto_group: true
output:
llms_txt: "llms.txt"
sitemap_xml: "sitemap.xml"
🛠️ Build Windows EXE / 构建 Windows 可执行文件
python build_exe.py
Output: dist/llms-sitemap-generator-gui.exe
📄 License
MIT License - see LICENSE
Made with ❤️ by Thordata
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llms_sitemap_generator-0.2.0.tar.gz.
File metadata
- Download URL: llms_sitemap_generator-0.2.0.tar.gz
- Upload date:
- Size: 54.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
490eaa4630e54df22a4d09625184df89ae2fe6e9a5a7a3fba94c45e8f514afd0
|
|
| MD5 |
0e0ceda3a0cd3ae5773ada7ca85f9aaf
|
|
| BLAKE2b-256 |
b8ba40e36d1c993cc6914c1f7590d49858661c33b94a421f40776ee914d86d65
|
File details
Details for the file llms_sitemap_generator-0.2.0-py3-none-any.whl.
File metadata
- Download URL: llms_sitemap_generator-0.2.0-py3-none-any.whl
- Upload date:
- Size: 57.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
60740b26678024baeb5142fe122012a1a23e1270e0358753ced2162a1c213d0e
|
|
| MD5 |
b18653fcae70dc82291713f05bb8696e
|
|
| BLAKE2b-256 |
fc923034eb30ac2db60540f11f92e814f9bb03a22c7c40bad9fd80e34bff5275
|