Skip to main content

一个支持阿里云OSS和MinIO的对象存储抽象包

Project description

FiuAI S3

一个支持阿里云OSS和MinIO的对象存储抽象包,提供了统一的接口来操作不同的对象存储服务。

特性

  • 支持阿里云OSS和MinIO存储服务
  • 统一的接口设计
  • 工厂模式实现,易于扩展
  • 完整的类型提示
  • 详细的日志记录
  • 异常处理机制

安装

pip install fiuai-s3

快速开始

初始化存储

# file: utils/s3.py
from fiuai_s3 import ObjectStorageFactory
from config.app_config import get_settings

ObjectStorageFactory.initialize(
        # 初始化对象存储
        provider=get_settings().object_storage_config.provider,
        bucket_name=get_settings().object_storage_config.bucket_name,
        endpoint=get_settings().object_storage_config.endpoint,
        access_key=get_settings().object_storage_config.access_key,
        secret_key=get_settings().object_storage_config.secret_key,
        temp_dir=get_settings().object_storage_config.s3_temp_dir,
        use_https=get_settings().object_storage_config.s3_use_https
    
)
S3_Client = ObjectStorageFactory.get_instance()

使用存储实例

# file: app.py
from utils.s3 import S3_Client

# 上传文件
S3_Client.upload_file("test.txt", b"Hello World")

# 下载文件
data = S3_Client.download_file("test.txt")

# 删除文件
S3_Client.delete_file("test.txt")

# 列出文件
files = S3_Client.list_files(prefix="test/")

单据文件管理(推荐业务用法)

支持业务身份(auth_tenant_id, auth_company_id, doc_id)在实例初始化时注入(可为空),各操作方法参数可选,若不传则使用实例属性,若传则覆盖。

初始化带业务身份

from fiuai_s3 import ObjectStorageFactory
from fiuai_s3.object_storage import StorageConfig

config = StorageConfig(
    provider="minio",
    bucket_name="dev",
    endpoint="http://127.0.0.1:19000",
    access_key="devdevdev",
    secret_key="devdevdev"
)
# 业务身份可选
S3_Client = ObjectStorageFactory.create_storage(
    config,
    auth_tenant_id="t1",
    auth_company_id="c1",
    doc_id="d1"
)

上传单据文件

# 方法参数可选,优先级高于实例属性
S3_Client.upload_doc_file(
    filename="发票.pdf",
    data=b"...文件内容...",
    tags={"业务类型": "发票", "年份": "2024"}
    # auth_tenant_id、auth_company_id、doc_id 可不传,使用实例属性
)

为了规范文件, 文档类型的文件上传请按规范进行打标

class DocFileType(Enum):
    """
    文档文件类型, 避免某些场景不规范的文件名
    """
    PDF = "pdf"
    OFD = "ofd"
    XML = "xml"
    DOCTYPE = "doctype"
    TABLE = "table"
    TEXT = "text"
    IMAGE = "image"
    AUDIO = "audio"
    ARCHIVE = "archive"
    OTHER = "other"
class DocSourceFrom(Enum):
    """
    文档来源, 避免某些场景同类文件混淆,比如有2个xml文件,一个用户上传,一个ai生成
    """
    USER = "user"
    AI = "ai"
    INTEGRATION = "integration"
    ETAX = "etax"
    BANK = "bank"
    OTHER = "other"

上传时标注

c = ObjectStorageFactory.get_instance()
    c.upload_doc_file(
        filename="test.txt",
        data=b"test",
        doc_source_from=DocSourceFrom.USER,
        doc_file_type=DocFileType.XML,
        tags={"name": "hilton", "age": "10"},
        auth_tenant_id="t1",
        auth_company_id="c1",
        doc_id="id111",
    )

下载单据文件

data = S3_Client.download_doc_file(
    filename="发票.pdf"
    # auth_tenant_id、auth_company_id、doc_id 可不传,使用实例属性
)

列出单据下所有文件

files = S3_Client.list_doc_files()
  • 存储路径自动为:bucketname/auth_tenant_id/auth_company_id/doc_id/filename
  • tags会自动打到对象存储(Alicloud用OSS Tagging,Minio用metadata)
  • 支持实例初始化时设置默认auth_tenant_id、auth_company_id、doc_id,调用时可覆盖
  • 缺失必要参数会抛出异常

配置参数

参数 类型 必填 默认值 说明
provider str - 存储提供商,支持 "alicloud" 或 "minio"
bucket_name str - 存储桶名称
endpoint str - 存储服务端点
access_key str - 访问密钥
secret_key str - 密钥
temp_dir str "temp/" 临时目录
use_https bool False 是否使用HTTPS

开发

安装开发依赖

uv pip install .

运行测试

python -m pytest tests/

许可证

MIT License

作者

贡献

欢迎提交 Issue 和 Pull Request!

发布

uv publish --username token --password your-pypi-token-here

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fiuai_s3-0.3.6.tar.gz (14.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fiuai_s3-0.3.6-py3-none-any.whl (19.7 kB view details)

Uploaded Python 3

File details

Details for the file fiuai_s3-0.3.6.tar.gz.

File metadata

  • Download URL: fiuai_s3-0.3.6.tar.gz
  • Upload date:
  • Size: 14.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.8

File hashes

Hashes for fiuai_s3-0.3.6.tar.gz
Algorithm Hash digest
SHA256 15340b8cc922831564ec6b3b46bffebaed42898740face5c97a19cbc69ff43be
MD5 f9ca4a7e5a02102691c114cb5c543ce8
BLAKE2b-256 4d72b011e3165bd024688b72ecfc16cd7452092a87e2f2ee764cf41132faba0a

See more details on using hashes here.

File details

Details for the file fiuai_s3-0.3.6-py3-none-any.whl.

File metadata

  • Download URL: fiuai_s3-0.3.6-py3-none-any.whl
  • Upload date:
  • Size: 19.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.8

File hashes

Hashes for fiuai_s3-0.3.6-py3-none-any.whl
Algorithm Hash digest
SHA256 6826313360bb193a6aa79cbab66198dec1a5943166365473b531b4dcd2a054e0
MD5 8a325ccf8642b05c0d4538380e7e7fd1
BLAKE2b-256 5b571618ea14be89edef0ba2ebf45c0d58b8e0a15606b7fd7eaf6e5d4119218c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page