Skip to main content

A unified file handling library for working with files from local filesystem, S3, URLs, and streams.

Project description


About SmooAI

SmooAI is an AI-powered platform for helping businesses multiply their customer, employee, and developer experience.

Learn more on smoo.ai

SmooAI Packages

Check out other SmooAI packages at smoo.ai/open-source

About smooai-file (Python)

File operations that don't lie — magic-byte MIME detection catches spoofed extensions, size + content validation is built in, and local / URL / S3 / bytes / stream sources all speak the same typed async API. Stream-first so a large upload doesn't blow your memory.

PyPI Version PyPI Downloads PyPI Last Update

GitHub License GitHub Actions Workflow Status GitHub Repo stars

Python Package

This is the Python port of @smooai/file, mirroring the feature set of the TypeScript version with idiomatic async/await Python. The package provides the same unified File class with automatic MIME type detection, rich metadata, and full S3 integration.

Install

pip install smooai-file

or with uv:

uv add smooai-file

Multi-Language Support

smooai-file is available as native implementations in TypeScript, Python, Rust, and Go — each built with idiomatic patterns for its ecosystem.

Language Package Install
TypeScript @smooai/file pnpm add @smooai/file
Python smooai-file pip install smooai-file
Rust smooai-file cargo add smooai-file
Go github.com/SmooAI/file/go/file go get github.com/SmooAI/file/go/file

Key Features

Stream-First Design

  • Async-native with asyncio throughout
  • Memory-efficient processing via aiofiles
  • Supports both async iterators and sync file-like objects
  • Lazy byte handling wherever possible

Multiple File Sources

  • Local Filesystem — async read/write with aiofiles, stat metadata
  • URLs — automatic download via httpx, header metadata extraction
  • S3 Objects — direct AWS S3 integration (download and upload) via boto3, ETag and Content-Type extraction
  • Bytes — in-memory buffers with full metadata support
  • Streams — async iterators and sync file-like objects

Intelligent File Type Detection

Automatic MIME type and extension detection using a priority cascade:

  1. Magic-byte inspection of file contents
  2. HTTP response headers (Content-Type, Content-Disposition)
  3. S3 object metadata
  4. File extension fallback

Rich Metadata

  • File name and extension
  • MIME type
  • File size
  • Last modified and created timestamps
  • SHA-256 (and other algorithm) checksums
  • URL and filesystem path
  • Source type (FILE, URL, S3, BYTES, STREAM)

Examples

Basic Usage

import asyncio
from smooai_file import File

async def main():
    # Create from a local path
    file = await File.from_file("/path/to/document.pdf")

    # Read contents
    content = await file.read()          # bytes
    text = await file.read_text()        # str (UTF-8)

    # Access metadata
    print(file.name)           # "document.pdf"
    print(file.mime_type)      # "application/pdf"
    print(file.size)           # 102400
    print(file.extension)      # "pdf"
    print(file.path)           # "/path/to/document.pdf"
    print(file.last_modified)  # datetime(...)

asyncio.run(main())

(back to examples)

URL Download

import asyncio
from smooai_file import File

async def main():
    # Fetch from a URL (uses httpx under the hood)
    file = await File.from_url("https://example.com/report.pdf")

    # MIME type detected from Content-Type header and magic bytes
    print(file.mime_type)   # "application/pdf"
    print(file.size)        # populated from Content-Length header

    # Save to disk
    original, saved = await file.save("/tmp/report.pdf")
    print(saved.path)       # "/tmp/report.pdf"

asyncio.run(main())

(back to examples)

S3 Integration

import asyncio
from smooai_file import File

async def main():
    # Download from S3
    file = await File.from_s3("my-bucket", "reports/report.pdf")

    # Upload to S3 (sets ContentType, ContentLength, ContentDisposition)
    await file.upload_to_s3("my-bucket", "archive/report.pdf")

    # Save to S3 and get a new S3-backed File instance
    original, s3_file = await file.save_to_s3("my-bucket", "archive/report.pdf")

    # Move to S3 (deletes local source if applicable)
    s3_file = await file.move_to_s3("my-bucket", "archive/report.pdf")

    # Generate a pre-signed URL (expires in 1 hour)
    signed_url = await s3_file.get_signed_url(expires_in=3600)
    print(signed_url)

asyncio.run(main())

(back to examples)

Stream Handling

import asyncio
from smooai_file import File

async def my_async_generator():
    yield b"hello "
    yield b"world"

async def main():
    # From an async iterator
    file = await File.from_stream(my_async_generator())
    text = await file.read_text()
    print(text)   # "hello world"

    # From a sync file-like object
    with open("/path/to/file.bin", "rb") as f:
        file = await File.from_stream(f)
    print(file.mime_type)

asyncio.run(main())

(back to examples)

File Operations

import asyncio
from smooai_file import File

async def main():
    file = await File.from_file("/tmp/notes.txt")

    # Append and prepend (local files only)
    await file.append("new line\n")
    await file.prepend("# Header\n")

    # Truncate to 1 KB
    await file.truncate(1024)

    # Compute checksum
    digest = await file.checksum("sha256")
    print(digest)   # 64-char hex string

    # Filesystem checks
    print(await file.exists())       # True
    print(await file.is_readable())  # True
    print(await file.is_writable())  # True

    # Move to a new location (deletes source)
    moved = await file.move("/tmp/archive/notes.txt")

    # Delete
    await moved.delete()

asyncio.run(main())

(back to examples)

Built With

  • Python 3.11+ with full type hints
  • aiofiles — async filesystem I/O
  • httpx — async HTTP client for URL downloads
  • boto3 — AWS SDK for S3 integration
  • puremagic — pure-Python magic-byte MIME detection (no libmagic system dep)

Related Packages

Development

uv sync
uv run poe install-dev
uv run pytest
uv run poe lint
uv run poe lint:fix   # optional fixer
uv run poe format
uv run poe typecheck
uv run poe build

Set UV_PUBLISH_TOKEN before running uv run poe publish to upload to PyPI.

(back to top)

Contact

Brent Rager

Smoo Github: https://github.com/SmooAI

(back to top)

License

MIT © SmooAI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smooai_file-2.2.9.tar.gz (67.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

smooai_file-2.2.9-py3-none-any.whl (20.8 kB view details)

Uploaded Python 3

File details

Details for the file smooai_file-2.2.9.tar.gz.

File metadata

  • Download URL: smooai_file-2.2.9.tar.gz
  • Upload date:
  • Size: 67.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for smooai_file-2.2.9.tar.gz
Algorithm Hash digest
SHA256 0085d3e2bb0ad6975493ac01be5583b65f53d3bb4d2ddef72bb7a96642fc48b3
MD5 e10f58ee656e00c61f9b494ddae6d8fa
BLAKE2b-256 cccf1b8f03d9e3b4caa92340b933a7d4efded65916d40a9f8735270dcd5926bb

See more details on using hashes here.

File details

Details for the file smooai_file-2.2.9-py3-none-any.whl.

File metadata

  • Download URL: smooai_file-2.2.9-py3-none-any.whl
  • Upload date:
  • Size: 20.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for smooai_file-2.2.9-py3-none-any.whl
Algorithm Hash digest
SHA256 cf5fadc9377c5a38783f62132d41e20096b645f9b177cfb17b61d71cd87e237b
MD5 f595dfa9d2f87da0915acf68c196e6fe
BLAKE2b-256 99e12f662d364304a36b62460eafd9aac36c5643ec141c492aa5d44d6f945d3f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page