
This project helps you create documentation for your projects.

Project description

Executive Navigation Tree

Build‑system configuration

Responsibility – Tells PEP 517 build front‑ends how to build the project by specifying the build backend and its minimum required version.

Interactions – When python -m build or pip install . is invoked, the build front‑end imports poetry.core.masonry.api and calls its build_wheel / build_sdist APIs.

Technical details

  • requires = ["poetry-core>=2.0.0"] ensures the build backend is present.
  • build-backend = "poetry.core.masonry.api" points to Poetry’s PEP 517 implementation.

Data flow – Input: the pyproject.toml file itself. Output: built distribution archives (.whl, .tar.gz) placed in dist/.
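To make the front‑end interaction concrete, here is a minimal sketch (not Poetry's or pip's actual code) of how a build front‑end resolves the build-backend string into a callable backend object; the helper name load_build_backend is invented for illustration:

```python
import importlib

def load_build_backend(spec: str):
    """Resolve a PEP 517 build-backend string such as
    "poetry.core.masonry.api" (or "module:object") into the backend object,
    whose build_wheel / build_sdist hooks the front-end then calls."""
    module_path, _, obj_path = spec.partition(":")
    backend = importlib.import_module(module_path)
    # Walk the optional attribute path after the colon.
    for attr in filter(None, obj_path.split(".")):
        backend = getattr(backend, attr)
    return backend
```

With poetry-core installed, load_build_backend("poetry.core.masonry.api") would return the module whose build_wheel and build_sdist hooks produce the archives in dist/.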

Dependency declarations

Responsibility – Enumerates the exact third‑party packages required at runtime, with pinned versions to guarantee reproducible builds.

Interactions – Poetry resolves these constraints, creates a lockfile, and installs the packages into a virtual environment. The application imports the listed libraries (e.g., rich, pydantic, openai).

Technical details

  • requires-python = ">=3.11,<4.0" restricts interpreter compatibility.
  • Each entry follows the format package==exact.version.
  • Versions span utilities (e.g., requests), AI SDKs (openai, google-genai), and UI helpers (rich, tqdm).

Data flow – Input: developer‑specified version strings. Output: a resolved dependency graph written to poetry.lock; at install time, the resolved packages are materialised on disk.

Installation

To set up the automation workflow, follow these steps:

  1. PowerShell‑based environment (Windows)

    • Use PowerShell’s Invoke‑Expression to fetch and execute the remote installer script in a single command:
      irm https://raw.githubusercontent.com/Drag-GameStudio/ADG/main/install.ps1 | iex
      
    • This command downloads the script directly from the project's raw content location and evaluates it on the host machine.
  2. POSIX‑compatible environment (Linux/macOS)

    • Retrieve and execute the installer script with a single pipeline using curl and bash:
      curl -sSL https://raw.githubusercontent.com/Drag-GameStudio/ADG/main/install.sh | bash
      
    • The flags -sSL make the transfer silent, follow redirects, and display errors, while piping to bash runs the script immediately.
  3. GitHub Action secret

    • In the repository’s Actions settings, create a secret named GROCK_API_KEY.
    • Obtain the required key from the documentation hosted at grockdocs.com and paste it into the secret field.
    • The workflow will reference this secret to authenticate calls to the external Grock service.

By running the platform‑specific install command and configuring the secret, the workflow becomes fully operational across Windows and Linux‑based runners.

Package initialization and logger bootstrap

The autodocgenerator package executes a short bootstrap when imported. It prints a literal “ADG” to standard output, then creates a singleton‑style logger instance (logger) based on the UI logging abstraction. This enables every sub‑module to emit structured logs without additional setup.

Package metadata

Responsibility – Declares the distributable identity of the autodocgenerator library: name, version, brief description, author contact, licensing, and the location of the long‑form README.

Interactions – Consumed by packaging tools (Poetry, pip, build) to generate dist/ artifacts and to populate the project’s metadata section on PyPI. Runtime code does not import this file.

Technical details

  • name, version, description are mandatory PEP 621 fields.
  • authors is a list of mappings, enabling tools to render proper author credits.
  • license uses a free‑text block (MIT).
  • readme points to README.md, allowing automatic inclusion in the wheel’s METADATA.

Data flow – Input: static values maintained by the maintainer. Output: a TOML document parsed by build back‑ends; resulting values flow into the generated wheel/sdist metadata files.

ProjectSettings – project‑wide prompt composer

Responsibility – Holds the project name and an arbitrary key/value info map; renders a full system prompt by concatenating BASE_SETTINGS_PROMPT, the project name, and each info entry.

Key members

  • __init__(self, project_name: str) – stores the project name and initialises an empty info dict.
  • add_info(self, key, value) – mutates info.
  • prompt property – builds and returns the composite prompt string.

Data flow – No side‑effects beyond internal state mutation; prompt is read‑only.
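Based on the members listed above, a minimal sketch of the class might look like the following; the BASE_SETTINGS_PROMPT text and the exact prompt layout are placeholders, not the project's real constants:

```python
# Placeholder for the project's real BASE_SETTINGS_PROMPT constant.
BASE_SETTINGS_PROMPT = "You are documenting the following project."

class ProjectSettings:
    """Holds the project name and an arbitrary key/value info map."""

    def __init__(self, project_name: str):
        self.project_name = project_name
        self.info: dict[str, str] = {}

    def add_info(self, key: str, value: str) -> None:
        self.info[key] = value

    @property
    def prompt(self) -> str:
        # Concatenate the base prompt, the project name, and each info entry.
        lines = [BASE_SETTINGS_PROMPT, f"Project name: {self.project_name}"]
        lines += [f"{key}: {value}" for key, value in self.info.items()]
        return "\n".join(lines)
```
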


Mapping anchor → raw section text

{
    "#compress-single-pass": "## `compress` – single‑pass LLM compression\n\n**Responsibility** … (text of first section)",
    "#compress-and-compare-sync": "## `compress_and_compare` – batch sync compression\n\n**Responsibility** …",
    "#async-compress": "## `async_compress` – semaphore‑protected async compression\n\n**Responsibility** …",
    "#async-compress-and-compare": "## `async_compress_and_compare` – parallel batch compression\n\n**Responsibility** …",
    "#compress-to-one": "## `compress_to_one` – iterative reduction to a single summary\n\n**Responsibility** …",
    "#generate-descriptions": "## `generate_describtions_for_code` – LLM‑driven API documentation generator\n\n**Responsibility** …",
    "#projectsettings-class": "## `ProjectSettings` – project‑wide prompt composer\n\n**Responsibility** …"
}

Repository Structure Builder (CodeMix)

Responsibility – Walks a source tree, writes a concise directory tree followed by the raw content of each non‑ignored file into a single output file (repomix-output.txt).

Interactions – Operates on the filesystem rooted at root_dir; uses BaseLogger for informational messages. It does not depend on other project modules.

Technical details

  • should_ignore applies ignore_patterns (glob‑style) to the relative path, filename, and any path component.
  • build_repo_content writes a “Repository Structure” header, then iterates twice over Path.rglob("*"): first to emit the indented tree, second to embed file contents wrapped in <file path="…"> tags. Errors while reading files are captured and logged inline.

Data flow – Input: optional output_file name, ignore list. Output: a UTF‑8 text file containing the tree and file bodies. Side‑effects: filesystem reads and a single write operation.
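A plausible sketch of the should_ignore check described above, using the standard‑library fnmatch for glob‑style matching (the real implementation may differ in detail):

```python
import fnmatch
from pathlib import Path

def should_ignore(rel_path: Path, ignore_patterns: list[str]) -> bool:
    """Glob-style check against the relative path, the filename,
    and every individual path component."""
    candidates = [str(rel_path), rel_path.name, *rel_path.parts]
    return any(
        fnmatch.fnmatch(candidate, pattern)
        for candidate in candidates
        for pattern in ignore_patterns
    )
```
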

Logger instantiation flow

  1. Import logging symbols – BaseLogger, BaseLoggerTemplate, InfoLog, ErrorLog, WarningLog are pulled from autodocgenerator.ui.logging.
  2. Create BaseLogger – logger = BaseLogger() constructs the core logger object, initializing internal handlers (e.g., Rich console).
  3. Attach concrete template – logger.set_logger(BaseLoggerTemplate()) injects a concrete formatting template, defining how messages are rendered (color, level prefixes).

The instantiated logger is a module‑level variable, exported as part of the package's public API, so downstream code can simply from autodocgenerator import logger and start logging.

Logging and Model Interaction

All public functions instantiate BaseLogger locally, emitting InfoLog messages at start, completion, and (optionally) detailed debug level 1. Model calls are performed via model.get_answer_without_history, guaranteeing stateless queries. Input assumptions include well‑formed Markdown anchors, valid Model instances, and non‑empty custom_description strings. Outputs are plain strings (links list, introductions, or custom descriptions) ready for downstream concatenation or insertion into the final output_doc.md.

BaseLogger & log‑record classes – unified logging façade

Responsibility – Provides a process‑wide singleton (BaseLogger) that forwards BaseLog‑derived messages to a configurable BaseLoggerTemplate. The hierarchy (ErrorLog, WarningLog, InfoLog) supplies level‑aware formatting with a timestamp prefix.

Interactions – Other UI components (e.g. doc‑generation orchestrators) call BaseLogger().log(<Log>). The logger delegates to the attached template (FileLoggerTemplate for file output or the default BaseLoggerTemplate for console). No external state is read; only the optional file is written.

Technical Details

  • BaseLog stores message and level; format() returns the raw text.
  • Sub‑classes override format() to prepend [_log_prefix] [LEVEL].
  • _log_prefix builds a human‑readable timestamp from time.time().
  • BaseLoggerTemplate implements global_log() that respects the instance’s log_level filter.
  • FileLoggerTemplate overrides log() to append formatted lines to a file.
  • BaseLogger.__new__ enforces a single shared instance (cls.instance).

Data flow – Input: a BaseLog instance (message + level). Output: side‑effect → print() or file write via the selected template. The method returns None.
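The singleton and formatting behaviour can be sketched as follows; the class bodies are simplified stand‑ins, showing only the __new__ guard and the timestamp‑prefixed format():

```python
import time

class BaseLog:
    """Stores message and level; format() returns the raw text."""
    level = 0

    def __init__(self, message: str):
        self.message = message

    def format(self) -> str:
        return self.message

class InfoLog(BaseLog):
    level = 1

    def format(self) -> str:
        # Sub-classes prepend a human-readable timestamp and the level name.
        stamp = time.strftime("%H:%M:%S", time.localtime(time.time()))
        return f"[{stamp}] [INFO] {self.message}"

class BaseLogger:
    instance = None

    def __new__(cls, *args, **kwargs):
        # Enforce a single shared instance across the whole process.
        if cls.instance is None:
            cls.instance = super().__new__(cls)
        return cls.instance
```
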


BaseFactory – Document Assembly Orchestrator

Responsibility – Collects a configurable list of BaseModule subclasses, invokes each module’s generate method, concatenates their outputs, and reports progress.

Interactions – Instantiated by the UI layer (e.g., run_file.py). Receives a concrete Model (sync or async) and a BaseProgress implementation. Each module may call the model’s get_answer APIs, while BaseFactory logs every step through a shared BaseLogger.

Technical Details

  • BaseModule is an abstract base class defining generate(info: dict, model: Model).
  • DocFactory.__init__(*modules) stores the supplied modules in self.modules.
  • generate_doc(info, model, progress) creates a sub‑task, iterates over self.modules, concatenates module_result with double line breaks, logs success (InfoLog) and verbose output (level=2), updates the progress bar, then removes the sub‑task.

Data Flow

info dict → each module.generate → model.get_answer (optional) → string fragment
→ DocFactory aggregates → final documentation string

Side effects: history updates inside the model, log file writes, and UI progress updates.
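A condensed sketch of the factory contract described above (logging omitted, progress handling simplified; update_task is assumed from the BaseProgress API):

```python
class BaseModule:
    """Abstract contract: every documentation module yields one string fragment."""

    def generate(self, info: dict, model) -> str:
        raise NotImplementedError

class DocFactory:
    def __init__(self, *modules: BaseModule):
        self.modules = list(modules)

    def generate_doc(self, info: dict, model, progress=None) -> str:
        # Concatenate each module's output with double line breaks,
        # updating the progress reporter (if any) after every module.
        parts = []
        for module in self.modules:
            parts.append(module.generate(info, model))
            if progress is not None:
                progress.update_task()
        return "\n\n".join(parts)
```
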

CustomModule & CustomModuleWithOutContext – Tailored Intro Generation

Responsibility – Produce a custom description block either with or without the source‑code context.

Interactions – Both classes inherit BaseModule; they call post‑processor helpers generete_custom_discription / generete_custom_discription_without, passing the split source (split_data) and the shared Model.

Technical Details

  • Constructor stores self.discription.
  • generate extracts info["code_mix"] (or skips it) and info["language"], then delegates to the appropriate helper.
  • Returns the raw string produced by the helper.

Data Flow

info → split_data (max 5000 symbols) → generete_custom_discription → model.get_answer → description string

or, without context, directly generete_custom_discription_without.

Orchestrator (gen_doc in run_file.py)

gen_doc wires together the core engine:

  1. Model layer – creates a synchronous GPTModel and an asynchronous AsyncGPTModel using the global API_KEY.
  2. Manager – instantiated with the target project_path, the parsed Config, both model instances, and a console‑based progress bar (ConsoleGtiHubProgress). The manager centralises file scanning, caching, and doc assembly.
  3. Pipeline
    • manager.generate_code_file() – extracts source files respecting Config.ignore_files.
    • manager.generete_doc_parts(max_symbols=structure_settings.max_doc_part_size) – splits generated text into manageable chunks.
    • manager.factory_generate_doc(DocFactory(*custom_modules)) – runs user‑defined modules through the DocFactory.
    • Conditional steps based on StructureSettings: ordering (manager.order_doc()) and intro links (DocFactory(IntroLinks())).
    • manager.clear_cache() – removes temporary artifacts, keeping the bootstrap lightweight.
  4. Returns the final document via manager.read_file_by_file_key("output_doc").

Factory‑Based Document Assembly

factory_generate_doc(doc_factory)

  • Loads current output_doc and code_mix.
  • Builds info dict (language, full_data, code_mix).
  • Logs the module list and input sizes.
  • Executes doc_factory.generate_doc(info, sync_model, progress_bar) – the orchestrator that runs each BaseModule.
  • Prepends the new fragment to the existing document and writes back.

IntroLinks & IntroText – Automatic Intro Construction

Responsibility – Create introductory sections: a list of HTML links (IntroLinks) and a natural‑language overview (IntroText).

Interactions – Both modules pull data from info (full_data or global_data), invoke post‑processor utilities (get_all_html_links, get_links_intro, get_introdaction) which internally query the provided Model.

Technical Details

  • IntroLinks.generate → get_all_html_links → get_links_intro(model, language).
  • IntroText.generate → get_introdaction(model, language).
  • Each returns a ready‑to‑insert markdown/HTML fragment.

Data Flow

info → extractor (links or global data) → post‑processor → model.get_answer → intro fragment

All fragments are later concatenated by BaseFactory.

Inter‑module Interactions

  • Both wrappers import ModelExhaustedException (raised when no fallback model remains).
  • Logging relies on BaseLogger and concrete InfoLog/WarningLog/ErrorLog objects from the UI layer.
  • The concrete models are consumed by the engine (e.g., run_file.py’s gen_doc) which injects them into the Manager for document generation.

Data flow

User prompt → Model.get_answer / AsyncModel.get_answer
   ↳ History updated (user → assistant)
   ↳ generate_answer → Groq/AsyncGroq → chat_completion
   ↳ result returned → History records assistant reply

All side effects are confined to self.history and log file emission (controlled by ProjectBuildConfig). This fragment therefore provides the core LLM request/response loop, with deterministic fallback and full traceability for both synchronous and asynchronous execution paths.

Manager – Orchestration of Project Pipeline

Responsibility – Coordinates the end‑to‑end doc‑generation workflow for a given project directory: prepares cache, creates a mixed source view, drives part‑wise or factory‑based documentation, and finally orders the sections.

Interactions – Instantiated by the CLI (run_file.py). Receives a Config, optional Model/AsyncModel, and a BaseProgress implementation. It logs via BaseLogger, writes files to <project>/.auto_doc_cache, and updates the UI progress bar after each step.

Assumptions and constraints

  • The logging classes must be importable; any failure in autodocgenerator.ui.logging will raise an ImportError and abort package import.
  • The environment must support Rich’s terminal capabilities; otherwise, fallback rendering may occur but the “ADG” banner will still print.

By centralising logger creation here, the package guarantees a uniform logging experience across all components while keeping the bootstrap lightweight.

Cache Management & File Path Resolution

  • CACHE_FOLDER_NAME designates the hidden cache folder.
  • FILE_NAMES maps logical keys (code_mix, global_info, logs, output_doc) to concrete filenames.
  • get_file_path(key) builds an absolute path inside the cache; read_file_by_file_key returns its contents.
  • Constructor creates the cache directory if absent and attaches a FileLoggerTemplate to BaseLogger.

Data flow, inputs, and side effects

  • Input: No external parameters; the only implicit input is the environment’s standard output stream.
  • Output: The literal string “ADG” is written to stdout; log records are emitted to the console (or configured Rich handlers).
  • Side effects: Global state mutation – the module‑level logger object becomes available for import, and the console receives the “ADG” banner. This side effect is intentional to give immediate visual feedback that the Auto Doc Generator package has been loaded.

split_data – token‑aware chunking of source text

Responsibility – Breaks a single string into a list of fragments whose length does not exceed max_symbols. It first splits on newline, repeatedly bisects any piece longer than 1.5 × max_symbols, then recombines pieces while respecting a 1.25 × max_symbols limit to keep chunks balanced.

Interactions – Uses BaseLogger to emit start/finish messages; no external state is read or written.

Data flow – Input: data: str, max_symbols: int. Output: list[str] of ready‑for‑LLM chunks. Side‑effects are limited to logged messages.
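The three‑phase algorithm can be sketched as follows; the thresholds follow the text (1.5 × and 1.25 × max_symbols), but the exact recombination rules of the real function may differ:

```python
def split_data(data: str, max_symbols: int) -> list[str]:
    # Phase 1: split on newline.
    pieces = data.split("\n")
    # Phase 2: repeatedly bisect any piece longer than 1.5 x max_symbols.
    i = 0
    while i < len(pieces):
        if len(pieces[i]) > 1.5 * max_symbols:
            mid = len(pieces[i]) // 2
            pieces[i:i + 1] = [pieces[i][:mid], pieces[i][mid:]]
        else:
            i += 1
    # Phase 3: recombine while keeping each chunk under 1.25 x max_symbols.
    chunks, current = [], ""
    for piece in pieces:
        if current and len(current) + len(piece) + 1 > 1.25 * max_symbols:
            chunks.append(current)
            current = piece
        else:
            current = f"{current}\n{piece}" if current else piece
    if current:
        chunks.append(current)
    return chunks
```
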

YAML → Runtime Objects (ConfigReader)

The read_config function is the entry point for translating a user‑supplied autodocconfig.yml into three ready‑to‑use objects:

  • Config – holds global ignore patterns, language, project name and a ProjectBuildConfig instance.
  • custom_modules – a list of CustomModule or CustomModuleWithOutContext built from the custom_descriptions section; the leading “%” marker selects the context‑less variant.
  • StructureSettings – controls how the final documentation is sliced (max_doc_part_size) and whether intro links or ordering are injected.

The function:

  1. Loads YAML via yaml.safe_load.
  2. Instantiates Config and populates ignore patterns, language, and project metadata.
  3. Creates a ProjectBuildConfig, applies any build‑time flags (e.g., save_logs, log_level – the package’s uniform logging knobs), and attaches it to Config.
  4. Iterates over ignore_files and project_additional_info to extend the Config state.
  5. Maps custom_descriptions to the appropriate module class.
  6. Initializes StructureSettings with defaults (include_intro_links=True, include_order=True, max_doc_part_size=5_000) and overrides them from the YAML block.

Outputs are returned as a tuple, ready for the runner.
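A sketch of how the StructureSettings defaults and YAML overrides described in step 6 could be applied; the dataclass layout is an assumption for illustration:

```python
from dataclasses import dataclass

@dataclass
class StructureSettings:
    # Defaults as described in the text.
    include_intro_links: bool = True
    include_order: bool = True
    max_doc_part_size: int = 5_000

def read_structure_settings(cfg: dict) -> StructureSettings:
    """Start from defaults and override from the YAML 'structure' block."""
    settings = StructureSettings()
    for key, value in cfg.get("structure", {}).items():
        if hasattr(settings, key):
            setattr(settings, key, value)
    return settings
```
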


AsyncGPTModel – Asynchronous LLM Wrapper

Mirrors GPTModel but uses AsyncGroq (self.client = AsyncGroq(...)) and async‑compatible methods.

  • generate_answer is declared async; all internal steps are identical to the sync version, except the await on self.client.chat.completions.create.
  • Logging, fallback handling, and result extraction are unchanged, ensuring parity between sync and async pipelines.

GPTModel – Synchronous LLM Wrapper

Derived from Model, GPTModel binds a Groq client (self.client = Groq(api_key=self.api_key)) and a BaseLogger.

Key flow (generate_answer)

  1. Log start (InfoLog).
  2. Resolve messages from self.history.history or an explicit prompt.
  3. Loop over self.regen_models_name until a successful completion:
    • Attempt self.client.chat.completions.create(messages=messages, model=model_name).
    • On exception, log a warning and advance self.current_model_index (wrap‑around).
    • If the list is empty, raise ModelExhaustedException.
  4. Extract result = chat_completion.choices[0].message.content.
  5. Log the model used and the raw answer (verbosity level 2).
  6. Return result.

get_answer enriches the history before and after the call, providing a convenient one‑step query interface.
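The fallback loop in step 3 can be sketched as follows (wrap‑around over regen_models_name and logging omitted for brevity; call_model stands in for the Groq completion call):

```python
class ModelExhaustedException(Exception):
    """Raised when every candidate model has failed."""

def generate_answer(call_model, model_names):
    """Try each model name in order; on failure advance to the next candidate."""
    for name in model_names:
        try:
            return call_model(name)
        except Exception:
            continue  # the real code logs a warning here and wraps the index
    raise ModelExhaustedException("no fallback model remains")
```
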

History – Prompt Context Buffer

History initialises with the system prompt (BASE_SYSTEM_TEXT).

  • self.history holds a list of {role, content} dicts.
  • add_to_history appends new entries, used by the higher‑level Model/AsyncModel APIs to record user and assistant turns.
  • Exposed to callers via Model.get_answer and AsyncModel.get_answer, enabling multi‑turn interactions.

Anchor‑Based Chunk Extraction

Responsibility – Parses a Markdown document, isolates sections that begin with a well‑formed <a name="…"></a> anchor, and returns a mapping anchor → raw section text.

Interactions – Consumes raw Markdown (e.g., the generated output_doc.md) and supplies the dictionary to the semantic sorter (get_order). No external state is touched; the function is pure.

Technical details

  • extract_links_from_start scans each chunk’s first line with regex ^<a name=["']?(.*?)["']?></a>; anchors longer than five characters become #anchor.
  • split_text_by_anchors uses a look‑ahead split ((?=<a name=…>)) to create chunks, trims whitespace, validates a 1‑to‑1 anchor‑chunk relationship, and builds the result dict.
  • Returns None on mismatch, allowing callers to abort safely.

Data flow – Input: full Markdown string. Output: dict[str, str] where keys are #anchor strings and values are the associated section bodies. Side‑effects: none.
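A minimal sketch of the anchor‑splitting logic described above; the five‑character anchor filter and logging are omitted for brevity, and the regex is an approximation of the one quoted in the technical details:

```python
import re

ANCHOR_RE = re.compile(r'<a name=["\']?(.*?)["\']?></a>')

def split_text_by_anchors(doc: str):
    """Split on a look-ahead before each anchor and map #anchor -> section text.
    Returns None on mismatch so callers can abort safely."""
    chunks = [c.strip() for c in re.split(r'(?=<a name=)', doc) if c.strip()]
    result = {}
    for chunk in chunks:
        match = ANCHOR_RE.match(chunk.splitlines()[0])
        if match is None:
            return None  # anchor/chunk relationship is not 1-to-1
        result[f"#{match.group(1)}"] = chunk
    return result
```
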
Custom Description Synthesis

generete_custom_discription iterates over pre‑split documentation chunks. For each chunk it assembles a detailed system‑role prompt, embeds the chunk as context, adds BASE_CUSTOM_DISCRIPTIONS, and asks the model to describe a user‑provided custom_description. It stops on the first non‑error response (absence of “!noinfo”/“No information found”).
generete_custom_discription_without skips the context step and forces the model to prepend a single anchor tag with strict content rules (no filenames, extensions, generic terms, or URLs), ensuring a clean, tag‑driven snippet suitable for anchor‑based navigation.

Documentation Chunk Behaviour (StructureSettings)

  • include_intro_links – injects a generated table‑of‑contents block (IntroLinks) at the document head.
  • include_order – sorts generated parts to respect source file order, improving readability.
  • max_doc_part_size – caps the character count of each generated segment; the manager respects this when calling generete_doc_parts.

Together these settings give developers fine‑grained control over the size, ordering, and navigation of the auto‑generated documentation while the underlying logging remains consistent across all modules.

Document Ordering & Cleanup

order_doc() splits the final markdown by anchor tags (split_text_by_anchors), asks the model for the correct sequence via get_order, and overwrites output_doc.md.

clear_cache() removes the log file unless config.pbc.save_logs is true.

Data Flow Summary

project_dir → Manager init → cache files
→ CodeMix → code_mix.txt
→ gen_doc_parts / DocFactory → output_doc.md
→ split_text_by_anchors → get_order → reordered output_doc.md

generate_describtions_for_code – LLM‑driven API documentation generator

Responsibility – For each code snippet, asks the LLM to produce a markdown‑formatted description following strict guidelines (components, parameters, usage example).

Interactions – Builds a fixed system prompt, adds the code as user content, calls model.get_answer_without_history, and tracks progress via BaseProgress.

Output – List of description strings matching the order of data.


gen_doc_parts – orchestrator for synchronous batch documentation

Responsibility – Splits the full source code via split_data, then iteratively calls write_docs_by_parts for each chunk, concatenating results. Keeps a sliding window of the last 3000 characters as context for the next call.

Interactions – Creates a BaseProgress sub‑task, updates it per chunk, and logs overall progress.

Data flow – Inputs: full_code_mix, max_symbols, model, language, progress_bar. Output: single markdown string containing the entire documentation. Side‑effects: progress‑bar mutation and logging.
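The chunk loop with its 3000‑character context window can be sketched as follows, with describe_part standing in for the write_docs_by_parts call:

```python
def gen_doc_parts(chunks, describe_part, context_window=3000):
    """describe_part(part, prev_info) -> markdown fragment; the tail of the
    accumulated output is carried over as context for the next chunk."""
    doc, prev_info = "", ""
    for part in chunks:
        doc += describe_part(part, prev_info) + "\n\n"
        prev_info = doc[-context_window:]  # sliding window of recent output
    return doc
```
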
HTML Link Extraction Logic

The function get_all_html_links scans a documentation string for Markdown‑style anchor tags (<a name="…"></a>). Using a compiled regex it captures the anchor name, prefixes it with “#”, and returns a list of link fragments. Logging via BaseLogger reports start, count, and the raw list (verbosity 1). The routine assumes anchors longer than five characters are meaningful and ignores shorter matches.

Link‑Based Introduction Generation

get_links_intro builds a three‑message prompt for a Model (typically a GPTModel). System messages enforce the language and inject the constant BASE_INTRODACTION_CREATE_LINKS; the user message supplies the extracted links. The model’s response (a prose introduction containing those links) is returned after logging the operation. The function is language‑agnostic, defaulting to English.

Plain Introduction Synthesis

get_introdaction (note the historic typo) follows the same prompt pattern but uses BASE_INTRO_CREATE and feeds the full documentation (global_data). The model returns a standalone introductory paragraph, which the caller integrates elsewhere.
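The get_all_html_links routine described above can be sketched as follows, including the longer‑than‑five‑characters filter (logging omitted):

```python
import re

LINK_RE = re.compile(r'<a name=["\']?(.*?)["\']?></a>')

def get_all_html_links(doc: str) -> list[str]:
    # Anchors longer than five characters are treated as meaningful;
    # shorter matches are ignored.
    return [f"#{name}" for name in LINK_RE.findall(doc) if len(name) > 5]
```
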

ParentModel – Shared Model State

ParentModel centralises configuration for every LLM client.

  • Constructor arguments – api_key, optional History object, use_random flag.
  • State built
    • self.history stores the rolling conversation.
    • self.api_key propagates to concrete clients.
    • self.regen_models_name is a shuffled (if use_random) copy of the global MODELS_NAME list, defining the fallback order when a model fails.
    • self.current_model_index tracks the active candidate.

The class is inherited by both sync (Model) and async (AsyncModel) wrappers, guaranteeing identical fallback logic across execution modes.

BaseProgress & concrete progress reporters – task progress visualisation

Responsibility – Defines a minimal progress‑tracking contract (create_new_subtask, update_task, remove_subtask). Concrete classes implement visual feedback for either Rich‑based terminal bars (LibProgress) or plain console prints (ConsoleGtiHubProgress).

Interactions – Documentation generators instantiate a progress object and invoke the three methods around each chunk‑processing step. No shared mutable state beyond the Rich Progress instance or internal ConsoleTask objects.

Technical Details

  • LibProgress wraps rich.progress.Progress; maintains a base task (_base_task) and a current sub‑task (_cur_sub_task). Updating advances the appropriate task.
  • ConsoleTask prints a start banner and a percentage on each progress() call.
  • ConsoleGtiHubProgress composes a permanent “General Progress” ConsoleTask and creates per‑chunk ConsoleTasks on demand. update_task() delegates to the active sub‑task or the general one.
  • Both concrete classes inherit from the empty BaseProgress stub, ensuring a consistent API.

Data flow – Inputs: name (string) and total_len (int) for sub‑task creation; subsequent calls receive no arguments. Outputs: visual side‑effects on stdout (Rich bar or console prints). Internally, task identifiers are stored to allow incremental updates and later removal.


These two modules together supply the UI layer’s logging and progress reporting facilities, enabling the higher‑level documentation pipeline to emit timestamped diagnostics and user‑visible progress without coupling to a specific output medium.

Semantic Title Ordering

Responsibility – Sends the extracted anchor titles to the LLM (Model.get_answer_without_history) and reassembles the document in the LLM‑proposed order.

Interactions – Receives the anchor‑to‑section map from the extractor, logs progress via BaseLogger, and calls the stateless LLM endpoint (model.get_answer_without_history).

Technical details

  • Constructs a user‑role prompt asking the model to “Sort the following titles semantically … Return ONLY a comma‑separated list … leave # in title”.
  • Parses the comma‑separated response, trims entries, and iterates over the ordered list, concatenating the corresponding chunk text (order_output).
  • Detailed logging at three verbosity levels records input keys, raw chunk dict, and per‑chunk inclusion.

Data flow – Input: Model instance, dict[anchor, chunk]. Output: single string containing the reordered document sections, ready for final concatenation. No file I/O occurs here.
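The response parsing and reassembly step can be sketched as follows; order_output here takes the model's comma‑separated answer directly, and anchors missing from the map are skipped rather than raising:

```python
def order_output(model_answer: str, chunks: dict[str, str]) -> str:
    """Parse the comma-separated anchor list returned by the model and
    concatenate the corresponding section bodies in that order."""
    ordered = [title.strip() for title in model_answer.split(",")]
    return "\n\n".join(chunks[title] for title in ordered if title in chunks)
```
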

Synchronous Documentation Part Generation

generete_doc_parts(max_symbols=5_000)

  • Reads the previously stored code_mix.

  • Calls gen_doc_parts(full_code_mix, max_symbols, sync_model, config.language, progress_bar) which splits the mix, queries the model, and streams partial docs.

  • Persists the assembled output to output_doc.md and updates progress.

Configuration file (autodocconfig.yml)

The file is a simple key‑value list written in YAML. Top‑level keys define the project and its behavior:

  • project_name – a short title for the documentation generator.

  • language – language code for generated text (e.g., “en”).

A build block controls execution details:

  • save_logs – true/false to keep the generation log.
  • log_level – numeric level (higher means more detail).

A structure block influences how the output is organized:

  • include_intro_links – include navigation links at the beginning.
  • include_order – keep sections in the order they appear in the source.
  • max_doc_part_size – maximum character count for each generated piece.

An additional_info block can hold free‑form data, such as a global description of the project.

A custom_descriptions list allows you to add specific prompts that the generator will answer, for example instructions on installation, how to write this file, or how to use particular classes.
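Putting the fields above together, an autodocconfig.yml might look like this; the exact key spellings are inferred from the text and should be checked against the project's own examples:

```yaml
project_name: MyProject
language: en

build:
  save_logs: true
  log_level: 1

structure:
  include_intro_links: true
  include_order: true
  max_doc_part_size: 5000

additional_info:
  description: "A short global description of the project."

custom_descriptions:
  - "How do I install the project?"
  # A leading % selects the context-less variant (CustomModuleWithOutContext).
  - "%How should this config file be written?"
```
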

write_docs_by_parts – synchronous single‑part documentation generation

Responsibility – Constructs a system‑prompt (including language, part ID, optional previous output) and calls model.get_answer_without_history to obtain a markdown description for one code fragment. Trims surrounding markdown fences before returning.

Interactions – Relies on BASE_PART_COMPLITE_TEXT, BaseLogger, and the synchronous Model interface.

Data flow – Inputs: part_id, part, model, optional prev_info, language. Output: cleaned LLM answer (str). Logs request/response sizes.

async_gen_doc_parts – parallel orchestrator for asynchronous batch documentation

Responsibility – Mirrors gen_doc_parts but launches an async_write_docs_by_parts task per chunk, limited by a semaphore of size 4, and aggregates the async results with asyncio.gather.

Interactions – Uses the same BaseProgress API (sub‑task creation, per‑task updates, removal), the shared semaphore, and the asynchronous AsyncModel.

Data flow – Inputs: full_code_mix, global_info, max_symbols, model, language, progress_bar. Output: concatenated markdown documentation (str). Logs start/end and total length.

async_write_docs_by_parts – concurrent part processing

Responsibility – Same logical work as write_docs_by_parts but runs inside an asyncio.Semaphore to limit parallel requests. Calls async_model.get_answer_without_history and optionally invokes update_progress.

Interactions – Accepts a shared semaphore object, uses BaseLogger, and expects an AsyncModel that implements an async get_answer_without_history.

Data flow – Inputs: part, async_model, global_info, semaphore, optional prev_info, language, update_progress. Output: trimmed answer (str). Emits progress callbacks and logs.

async_compress – semaphore‑protected async compression

Responsibility – Performs the same LLM call as compress but within an asyncio.Semaphore to limit concurrency and updates the async progress bar.

Interactions – Awaits AsyncModel.get_answer_without_history; updates BaseProgress.

Flow – Build prompt (identical to compress), await model.get_answer_without_history, then progress_bar.update_task().

Data – Input: data: str, project_settings, model: AsyncModel, compress_power, semaphore, progress_bar.
Output: str compressed answer.
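The semaphore‑bounded fan‑out pattern shared by async_compress and async_compress_and_compare can be sketched as follows, with ask_model standing in for the AsyncModel call:

```python
import asyncio

async def async_compress(data, ask_model, semaphore):
    # At most `limit` coroutines pass this point concurrently.
    async with semaphore:
        return await ask_model(data)

async def run_all(items, ask_model, limit=4):
    """Dispatch one task per item under a shared Semaphore and gather results
    in input order."""
    semaphore = asyncio.Semaphore(limit)
    tasks = [async_compress(item, ask_model, semaphore) for item in items]
    return await asyncio.gather(*tasks)
```
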


async_compress_and_compare – parallel batch compression

Responsibility – Dispatches async_compress for every element in data (default concurrency = 4), gathers results, then re‑chunks them into groups of compress_power.

Interactions – Creates an asyncio.Semaphore(4), populates tasks, awaits asyncio.gather, uses BaseProgress for a sub‑task.

Result – Returns list[str] where each entry is a newline‑joined group of compressed chunks.


compress_and_compare – batch sync compression

Responsibility – Groups input files into chunks of size compress_power, compresses each file with compress, concatenates results per chunk, and reports progress.

Interactions – Uses BaseProgress to create/update a sub‑task; repeatedly calls compress.

Logic

  • Allocate result list sized to ceil(len(data)/compress_power).
  • Loop over data, compute curr_index = i // compress_power, append each compress result plus newline.
  • Update progress bar each iteration, then remove sub‑task.

Data – Input: data: list[str], model, project_settings, compress_power, progress_bar.
Output: list[str] where each element contains the concatenated compressed chunk.
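The grouping logic can be sketched as follows, with compress_fn standing in for the per‑file compress call (progress reporting omitted):

```python
import math

def compress_and_compare(data, compress_fn, compress_power):
    """Compress each item and append it (plus a newline) to its chunk,
    where chunk index = i // compress_power."""
    result = ["" for _ in range(math.ceil(len(data) / compress_power))]
    for i, item in enumerate(data):
        result[i // compress_power] += compress_fn(item) + "\n"
    return result
```
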


compress – single‑pass LLM compression

Responsibility – Sends a raw code string to the LLM together with the project‑wide system prompt and a size‑adjusted compression prompt, returning the model’s answer.

Interactions – Calls Model.get_answer_without_history. No filesystem or network side‑effects other than the LLM request.

Technical flow

  1. Build prompt list (system → project prompt, system → base compress text, user → data).
  2. Invoke model.get_answer_without_history(prompt=prompt).
  3. Return the answer string.

Data – Input: data: str, project_settings: ProjectSettings, model: Model, compress_power: int.
Output: compressed str. No mutation of arguments.


compress_to_one – iterative reduction to a single summary

Responsibility – Repeatedly compresses the list of strings until only one element remains, optionally using the async path.

Interactions – Calls either compress_and_compare or async_compress_and_compare inside a while len(data) > 1 loop; updates a BaseProgress sub‑task each iteration.

Data – Input: data: list[str], model, project_settings, compress_power, use_async, progress_bar.
Output: final aggregated string (data[0]).
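The reduction loop itself is small enough to sketch directly; compress_round stands in for either compress_and_compare or its async counterpart:

```python
def compress_to_one(data, compress_round):
    """compress_round maps list[str] -> shorter list[str]; iterate until
    a single aggregated summary remains, then return it."""
    while len(data) > 1:
        data = compress_round(data)
    return data[0]
```
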


Code Mix Generation Workflow

generate_code_file()

  1. Logs start.
  2. Instantiates CodeMix(project_directory, config.ignore_files).
  3. Calls build_repo_content → writes a concatenated source snapshot to code_mix.txt.
  4. Logs completion and advances the progress bar.
