Skip to main content

Audit trail generator for data processing scripts.

Project description

Annalist

https://img.shields.io/pypi/v/annalist.svg Documentation Status pre-commit.ci status

Audit trail generator for data processing scripts.

Feature Roadmap

This roadmap outlines the planned features and milestones for the development of our deterministic and reproducible process auditing system.

Milestone 1: Audit Logging Framework

  • Develop a custom audit logging framework or class.

  • Capture function names, input parameters, return values, data types, and timestamps.

  • Implement basic logging mechanisms for integration.

Milestone 2: Standardized Logging Format

  • Define a standardized logging format for comprehensive auditing.

  • Ensure consistency and machine-readability of the logging format.

Milestone 3: Serialization and Deserialization

  • Implement serialization and deserialization mechanisms.

  • Store and retrieve complex data structures and objects.

  • Test serialization for data integrity.

Milestone 4: Versioning and Dependency Tracking

  • Capture and log codebase version (Git commit hash) and dependencies.

  • Ensure accurate logging of version and dependency information.

Milestone 5: Integration Testing

  • Create integration tests using the audit logging framework.

  • Log information during the execution of key processes.

  • Begin development of process recreation capability.

Milestone 6: Reproduction Tool (Partial)

  • Develop a tool or script to read and reproduce processes from the audit trail.

  • Focus on recreating the environment and loading serialized data.

Milestone 7: Documentation (Partial)

  • Create initial documentation.

  • Explain how to use the audit logging framework and the audit trail format.

  • Document basic project functionalities.

Milestone 8: Error Handling

  • Implement robust error handling for auditing and reproduction code.

  • Gracefully handle potential issues.

  • Provide informative and actionable error messages.

Milestone 9: MVP Testing

  • Conduct testing of the MVP.

  • Reproduce processes from the audit trail and verify correctness.

  • Gather feedback from initial users within the organization.

Milestone 10: MVP Deployment

  • Deploy the MVP within the organization.

  • Make it available to relevant team members.

  • Encourage usage and collect user feedback.

Milestone 11: Feedback and Iteration

  • Gather feedback from MVP users.

  • Identify shortcomings, usability issues, or missing features.

  • Prioritize and plan improvements based on user feedback.

Milestone 12: Scaling and Extending

  • Explore scaling the solution to cover more processes.

  • Add additional features and capabilities to enhance usability.

Please note that milestones may overlap, and the order can be adjusted based on project-specific needs. We aim to remain flexible and responsive to feedback during development.

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2023-09-13)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data-annalist-0.1.0.tar.gz (11.4 kB view hashes)

Uploaded Source

Built Distribution

data_annalist-0.1.0-py2.py3-none-any.whl (4.9 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page