S3Vectors provider for Stache AI
Project description
stache-ai-s3vectors
S3 Vectors provider for Stache AI - serverless vector database using Amazon S3 Vectors.
Installation
pip install stache-ai-s3vectors
Usage
Install the package and configure the provider in your settings:
from stache_ai.config import Settings
settings = Settings(
vectordb_provider="s3vectors",
s3vectors_bucket_name="my-vector-bucket", # Required
s3vectors_region="us-east-1", # Optional, defaults to AWS_REGION
)
The provider will be automatically discovered via entry points.
Environment Variables
| Variable | Description | Default |
|---|---|---|
VECTORDB_PROVIDER |
Set to s3vectors |
Required |
S3VECTORS_BUCKET_NAME |
S3 Vectors bucket name | Required |
S3VECTORS_REGION |
AWS region for S3 Vectors | us-east-1 |
AWS_REGION |
Fallback region | us-east-1 |
IAM Permissions
The S3 Vectors provider requires specific IAM permissions. Note that S3 Vectors uses its own service namespace (s3vectors:), not the standard S3 namespace.
Minimum Required Permissions
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": [
"s3vectors:GetVectorBucket"
],
"Resource": "arn:aws:s3vectors:REGION:ACCOUNT:vector-bucket/BUCKET_NAME"
},
{
"Effect": "Allow",
"Action": [
"s3vectors:CreateIndex",
"s3vectors:GetIndex",
"s3vectors:ListIndexes",
"s3vectors:PutVectors",
"s3vectors:QueryVectors",
"s3vectors:ListVectors",
"s3vectors:DeleteVectors",
"s3vectors:GetVectors"
],
"Resource": "arn:aws:s3vectors:REGION:ACCOUNT:vector-bucket/BUCKET_NAME/index/*"
}
]
}
SAM Template Example
Resources:
S3VectorsBucket:
Type: AWS::S3Vectors::VectorBucket
Properties:
VectorBucketName: !Sub '${AWS::StackName}-vectors'
StacheFunction:
Type: AWS::Serverless::Function
Properties:
Environment:
Variables:
VECTORDB_PROVIDER: s3vectors
S3VECTORS_BUCKET_NAME: !GetAtt S3VectorsBucket.VectorBucketName
Policies:
- Version: '2012-10-17'
Statement:
- Effect: Allow
Action:
- s3vectors:GetVectorBucket
Resource:
- !GetAtt S3VectorsBucket.VectorBucketArn
- Effect: Allow
Action:
- s3vectors:CreateIndex
- s3vectors:GetIndex
- s3vectors:ListIndexes
- s3vectors:PutVectors
- s3vectors:QueryVectors
- s3vectors:ListVectors
- s3vectors:DeleteVectors
- s3vectors:GetVectors
Resource:
- !Sub '${S3VectorsBucket.VectorBucketArn}/index/*'
Terraform Example
data "aws_iam_policy_document" "s3vectors" {
statement {
effect = "Allow"
actions = [
"s3vectors:GetVectorBucket"
]
resources = [
"arn:aws:s3vectors:${var.region}:${data.aws_caller_identity.current.account_id}:vector-bucket/${var.bucket_name}"
]
}
statement {
effect = "Allow"
actions = [
"s3vectors:CreateIndex",
"s3vectors:GetIndex",
"s3vectors:ListIndexes",
"s3vectors:PutVectors",
"s3vectors:QueryVectors",
"s3vectors:ListVectors",
"s3vectors:DeleteVectors",
"s3vectors:GetVectors"
]
resources = [
"arn:aws:s3vectors:${var.region}:${data.aws_caller_identity.current.account_id}:vector-bucket/${var.bucket_name}/index/*"
]
}
}
Important Notes
Bucket Name vs ARN
The provider uses the bucket name (not the ARN) for API calls, but IAM policies require the full ARN format:
- Environment variable:
S3VECTORS_BUCKET_NAME=my-vectors-bucket(just the name) - IAM Resource:
arn:aws:s3vectors:us-east-1:123456789012:vector-bucket/my-vectors-bucket(full ARN)
S3 Vectors bucket names are globally unique across AWS, so include your account ID or a unique prefix:
# SAM template example
VectorBucketName: !Sub '${AWS::StackName}-vectors-${AWS::AccountId}'
Metadata Limits
S3 Vectors has metadata size limits:
- Filterable metadata: 2KB limit (used in query filters)
- Non-filterable metadata: Part of 40KB total (returned but can't filter)
When creating indexes, specify large fields like text as non-filterable:
aws s3vectors create-index \
--vector-bucket-name my-bucket \
--index-name my-index \
--dimension 1024 \
--distance-metric cosine \
--metadata-configuration 'nonFilterableMetadataKeys=["text"]'
list_vectors Limitation
The list_vectors API does NOT support metadata filtering - only query_vectors supports filtering. For operations that need to filter without a query vector, the provider lists all vectors and filters client-side.
Requirements
- Python >= 3.10
- stache-ai >= 0.1.0
- boto3 >= 1.34.0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file stache_ai_s3vectors-0.1.2.tar.gz.
File metadata
- Download URL: stache_ai_s3vectors-0.1.2.tar.gz
- Upload date:
- Size: 22.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e87fc11154db3bc7d1d6c8de0d5dca0c17a876e8524a14eaf0e83bfd8ea38301
|
|
| MD5 |
d7efb8d2cb56ba2eacc4ded5d59ba0b6
|
|
| BLAKE2b-256 |
e73cea912d6ced0c83808396a170ca9f279a53d389dbba8e567c90bd62996e0b
|
File details
Details for the file stache_ai_s3vectors-0.1.2-py3-none-any.whl.
File metadata
- Download URL: stache_ai_s3vectors-0.1.2-py3-none-any.whl
- Upload date:
- Size: 11.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4053f30bc75881a7ff84e840694034e8727edf73e85114734225a7b267372aed
|
|
| MD5 |
c4b071c329e16595e3933260d9f66d00
|
|
| BLAKE2b-256 |
c38bfad869f8b6c82ed0513aad594ba4dc1abc27fb99b60a3ac852cbf0b77b66
|