@cdklabs/genai-idp
Project description
GenAI IDP Core Package
This package is provided on an "as-is" basis, and may include bugs, errors, or other issues. All classes are under active development and subject to non-backward compatible changes or removal in any future version. These are not subject to the Semantic Versioning model. This means that while you may use them, you may need to update your source code when upgrading to a newer version of this package.
Overview
The GenAI IDP Core Package provides the foundational AWS CDK constructs for building intelligent document processing solutions. This package serves as the backbone for the GenAI Intelligent Document Processing (IDP) Accelerator, enabling organizations to transform unstructured documents into structured data at scale using AWS's latest AI/ML services.
For a comprehensive reference of all available constructs and their properties, please explore our API documentation. This detailed guide will help you understand the full capabilities of this package and how to leverage them in your projects.
Features
- Modular Architecture: Composable CDK constructs that can be combined to create complete document processing solutions
- Document Processing Infrastructure: Core components for document ingestion, tracking, and management
- Processing Environment API: GraphQL API for monitoring document processing status and results
- Web Application Support: Optional secure web interface for document tracking and management
- Extensible Design: Designed to work with multiple document processing patterns and AI/ML services
- Security-First Approach: Built-in support for encryption, IAM permissions, and secure data handling
- Observability: Integrated CloudWatch metrics, logs, and alarms for monitoring and troubleshooting
Getting Started
Installation
The package is available through npm for JavaScript/TypeScript projects and PyPI for Python projects.
JavaScript/TypeScript (npm)
# Using npm
npm install @cdklabs/genai-idp
# Using yarn
yarn add @cdklabs/genai-idp
Python (PyPI)
# Using pip
pip install cdklabs.genai-idp
# Using poetry
poetry add cdklabs.genai-idp
Basic Usage
Here's a simple example of how to use the core IDP constructs:
import * as cdk from 'aws-cdk-lib';
import { Construct } from 'constructs';
import * as s3 from 'aws-cdk-lib/aws-s3';
import * as kms from 'aws-cdk-lib/aws-kms';
import { ProcessingEnvironment } from '@cdklabs/genai-idp';
export class MyIdpStack extends cdk.Stack {
constructor(scope: Construct, id: string, props?: cdk.StackProps) {
super(scope, id, props);
// Create encryption key
const key = new kms.Key(this, 'IdpKey', {
enableKeyRotation: true,
});
// Create S3 buckets for input and output
const inputBucket = new s3.Bucket(this, 'InputBucket', {
encryption: s3.BucketEncryption.KMS,
encryptionKey: key,
eventBridgeEnabled: true,
});
const outputBucket = new s3.Bucket(this, 'OutputBucket', {
encryption: s3.BucketEncryption.KMS,
encryptionKey: key,
});
const workingBucket = new s3.Bucket(this, 'WorkingBucket', {
encryption: s3.BucketEncryption.KMS,
encryptionKey: key,
});
// Create processing environment
const environment = new ProcessingEnvironment(this, 'Environment', {
key,
inputBucket,
outputBucket,
workingBucket,
metricNamespace: 'MyIdpSolution',
});
// Attach document processors to the environment
// (processors are provided by separate packages)
}
}
API Reference
Key Components
- ProcessingEnvironment: Main construct that orchestrates the document processing workflow
- ProcessingEnvironmentApi: GraphQL API for monitoring document processing status
- IDocumentProcessor: Interface for document processing implementations
- LogLevel: Enum for controlling logging verbosity
For detailed API documentation, please refer to the TypeScript type definitions and JSDoc comments in the source code.
Contributing
We welcome contributions to the GenAI IDP Core Package! Please follow these steps to contribute:
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add some amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
Please ensure your code adheres to our coding standards and includes appropriate tests.
Related Projects
- @cdklabs/genai-idp-bda-processor: BdaProcessor implementation using Amazon Bedrock Data Automation
- @cdklabs/genai-idp-bedrock-llm-processor: BedrockLlmProcessor implementation for custom extraction using Amazon Bedrock models
- @cdklabs/genai-idp-sagemaker-udop-processor: SagemakerUdopProcessor implementation for specialized document processing using SageMaker endpoints
License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cdklabs_genai_idp-0.0.2.tar.gz.
File metadata
- Download URL: cdklabs_genai_idp-0.0.2.tar.gz
- Upload date:
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6a25ce940311d082d3fcade1a3234194d59d2ac137c2538b67babf6b9c62dcc9
|
|
| MD5 |
c86d8aab1b3a488f6ed0b505795040b6
|
|
| BLAKE2b-256 |
703a41d70a4de0f4c96341ac77117dce4134045e122902ee6f0deecc503bcac3
|
File details
Details for the file cdklabs_genai_idp-0.0.2-py3-none-any.whl.
File metadata
- Download URL: cdklabs_genai_idp-0.0.2-py3-none-any.whl
- Upload date:
- Size: 1.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0dc4c66ea2d696b89acdc15f587e2bdfe0b85bc817e8e0c17137cd1ffa47828e
|
|
| MD5 |
c2b505fe33da4d9bd9efb12cc0e846c0
|
|
| BLAKE2b-256 |
cf696935b8df086cae0cff80bfadacc3ed46807406cc1b46ffdb156d76e74892
|