AI-friendly llms.txt & sitemap generator for websites. Crawl or import sitemaps, filter pages, and generate curated indexes for LLMs.

LLMS Sitemap Generator

License: MIT | Python 3.9+

AI-friendly llms.txt & sitemap generator

Automatically collect, filter, and organize website URLs into curated indexes for LLMs and search engines.

✨ Features

  • 🤖 LLM Training: Generate llms.txt files for AI model training
  • 🔍 SEO Optimization: Create comprehensive sitemaps
  • 📊 Content Discovery: Automatically discover and catalog all pages
  • 🌐 Multi-language: Smart language filtering
  • 🏢 Enterprise Sites: Handle complex B2B SaaS sites with multiple subdomains

📦 Installation

# Basic
pip install llms-sitemap-generator

# With GUI support (quoted so shells such as zsh do not expand the brackets)
pip install "llms-sitemap-generator[gui]"

🎯 Quick Start

1. Analyze site

llms-sitemap-generator analyze https://example.com
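
The README does not say what the analyze step reports. As a rough illustration of sitemap-based URL discovery in general, the sketch below pulls every <loc> entry out of a sitemap document and counts pages by their first path segment. The names extract_urls and count_by_section and the SAMPLE document are hypothetical and are not part of this package's API.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from urllib.parse import urlsplit

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_urls(sitemap_xml: str) -> list[str]:
    """Return every <loc> value found in a sitemap document (hypothetical helper)."""
    root = ET.fromstring(sitemap_xml)
    return [loc.text.strip() for loc in root.findall(".//sm:loc", SITEMAP_NS) if loc.text]


def count_by_section(urls: list[str]) -> Counter:
    """Count URLs by first path segment; the site root is counted under '/'."""
    counts: Counter = Counter()
    for url in urls:
        segments = [s for s in urlsplit(url).path.split("/") if s]
        counts[segments[0] if segments else "/"] += 1
    return counts


# Illustrative sitemap, not taken from a real site.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/products/a</loc></url>
  <url><loc>https://example.com/products/b</loc></url>
</urlset>"""

urls = extract_urls(SAMPLE)
sections = count_by_section(urls)
```

A per-section count like this is one way to decide which include and exclude patterns are worth writing before generating anything.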

2. Generate

# Generate llms.txt
llms-sitemap-generator generate

# Dry-run (preview mode)
llms-sitemap-generator generate --dry-run --max-pages 100
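
For readers unfamiliar with the target format, here is a minimal sketch of what an llms.txt file looks like, following the public llms.txt proposal: an H1 title, a blockquote summary, then H2 sections containing Markdown link lists. The function render_llms_txt and its inputs are hypothetical. The exact output of this tool may differ.

```python
def render_llms_txt(title: str, summary: str, groups: dict[str, list[tuple[str, str]]]) -> str:
    """Render grouped (name, url) pairs as llms.txt-style Markdown (hypothetical helper)."""
    lines = [f"# {title}", "", f"> {summary}", ""]
    for group, links in groups.items():
        lines.append(f"## {group}")
        lines.append("")
        for name, url in links:
            lines.append(f"- [{name}]({url})")
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


text = render_llms_txt(
    "Example",
    "Demo site",
    {"Products": [("Widget", "https://example.com/products/widget")]},
)
print(text)
```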

3. GUI

llms-sitemap-generator gui

📝 Configuration

Minimal llmstxt.config.yml:

site:
  base_url: "https://example.com"
  default_language: "en"

sources:
  - type: "sitemap"
    url: "https://example.com/sitemap.xml"

filters:
  include:
    - pattern: "^/products"
      group: "Products"
  exclude:
    - pattern: "blog"
  max_urls: 1000
  auto_group: true

output:
  llms_txt: "llms.txt"
  sitemap_xml: "sitemap.xml"
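
To make the filters block more concrete, the sketch below shows one plausible way such rules could be applied. It assumes that patterns are regular expressions searched against the URL path, that exclude rules take precedence over include rules, and that URLs matching no include rule are dropped (auto_group is ignored here). None of these assumptions is confirmed by this README, and classify is a hypothetical name, not the package's API.

```python
import re
from urllib.parse import urlsplit

# Values mirror the minimal config above.
INCLUDE = [("^/products", "Products")]  # (pattern, group)
EXCLUDE = ["blog"]
MAX_URLS = 1000


def classify(urls, include=INCLUDE, exclude=EXCLUDE, max_urls=MAX_URLS):
    """Group URLs by the first matching include rule (assumed semantics)."""
    grouped = {}
    kept = 0
    for url in urls:
        if kept >= max_urls:
            break
        path = urlsplit(url).path or "/"
        # Assumption: an exclude match wins even if an include rule also matches.
        if any(re.search(pattern, path) for pattern in exclude):
            continue
        for pattern, group in include:
            if re.search(pattern, path):
                grouped.setdefault(group, []).append(url)
                kept += 1
                break
    return grouped


result = classify([
    "https://example.com/products/a",
    "https://example.com/products/blog-widget",
    "https://example.com/blog/post",
    "https://example.com/about",
])
```

Under these assumptions the unanchored pattern "blog" also removes /products/blog-widget, so an anchored pattern such as "^/blog" may be the safer choice when only the blog section should be excluded.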

🛠️ Build Windows EXE

python build_exe.py

Output: dist/llms-sitemap-generator-gui.exe

📄 License

MIT License - see LICENSE


Made with ❤️ by Thordata

Download files

Source Distribution

llms_sitemap_generator-0.2.0.tar.gz (54.5 kB view details)

Uploaded Source

Built Distribution

llms_sitemap_generator-0.2.0-py3-none-any.whl (57.6 kB view details)

Uploaded Python 3

File details

Details for the file llms_sitemap_generator-0.2.0.tar.gz.

File metadata

  • Download URL: llms_sitemap_generator-0.2.0.tar.gz
  • Upload date:
  • Size: 54.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for llms_sitemap_generator-0.2.0.tar.gz
Algorithm Hash digest
SHA256 490eaa4630e54df22a4d09625184df89ae2fe6e9a5a7a3fba94c45e8f514afd0
MD5 0e0ceda3a0cd3ae5773ada7ca85f9aaf
BLAKE2b-256 b8ba40e36d1c993cc6914c1f7590d49858661c33b94a421f40776ee914d86d65

File details

Details for the file llms_sitemap_generator-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llms_sitemap_generator-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 60740b26678024baeb5142fe122012a1a23e1270e0358753ced2162a1c213d0e
MD5 b18653fcae70dc82291713f05bb8696e
BLAKE2b-256 fc923034eb30ac2db60540f11f92e814f9bb03a22c7c40bad9fd80e34bff5275
