A unified file handling library for working with files from local filesystem, S3, URLs, and streams.
Project description
About SmooAI
SmooAI is an AI-powered platform for helping businesses multiply their customer, employee, and developer experience.
Learn more on smoo.ai
SmooAI Packages
Check out other SmooAI packages at smoo.ai/open-source
About smooai-file (Python)
File operations that don't lie — magic-byte MIME detection catches spoofed extensions, size + content validation is built in, and local / URL / S3 / bytes / stream sources all speak the same typed async API. Stream-first so a large upload doesn't blow your memory.
Python Package
This is the Python port of @smooai/file, mirroring the feature set of the TypeScript version with idiomatic async/await Python. The package provides the same unified File class with automatic MIME type detection, rich metadata, and full S3 integration.
Install
pip install smooai-file
or with uv:
uv add smooai-file
Multi-Language Support
smooai-file is available as native implementations in TypeScript, Python, Rust, and Go — each built with idiomatic patterns for its ecosystem.
| Language | Package | Install |
|---|---|---|
| TypeScript | @smooai/file |
pnpm add @smooai/file |
| Python | smooai-file |
pip install smooai-file |
| Rust | smooai-file |
cargo add smooai-file |
| Go | github.com/SmooAI/file/go/file |
go get github.com/SmooAI/file/go/file |
Key Features
Stream-First Design
- Async-native with
asynciothroughout - Memory-efficient processing via
aiofiles - Supports both async iterators and sync file-like objects
- Lazy byte handling wherever possible
Multiple File Sources
- Local Filesystem — async read/write with
aiofiles, stat metadata - URLs — automatic download via
httpx, header metadata extraction - S3 Objects — direct AWS S3 integration (download and upload) via
boto3, ETag and Content-Type extraction - Bytes — in-memory buffers with full metadata support
- Streams — async iterators and sync file-like objects
Intelligent File Type Detection
Automatic MIME type and extension detection using a priority cascade:
- Magic-byte inspection of file contents
- HTTP response headers (
Content-Type,Content-Disposition) - S3 object metadata
- File extension fallback
Rich Metadata
- File name and extension
- MIME type
- File size
- Last modified and created timestamps
- SHA-256 (and other algorithm) checksums
- URL and filesystem path
- Source type (
FILE,URL,S3,BYTES,STREAM)
Examples
Basic Usage
import asyncio
from smooai_file import File
async def main():
# Create from a local path
file = await File.from_file("/path/to/document.pdf")
# Read contents
content = await file.read() # bytes
text = await file.read_text() # str (UTF-8)
# Access metadata
print(file.name) # "document.pdf"
print(file.mime_type) # "application/pdf"
print(file.size) # 102400
print(file.extension) # "pdf"
print(file.path) # "/path/to/document.pdf"
print(file.last_modified) # datetime(...)
asyncio.run(main())
URL Download
import asyncio
from smooai_file import File
async def main():
# Fetch from a URL (uses httpx under the hood)
file = await File.from_url("https://example.com/report.pdf")
# MIME type detected from Content-Type header and magic bytes
print(file.mime_type) # "application/pdf"
print(file.size) # populated from Content-Length header
# Save to disk
original, saved = await file.save("/tmp/report.pdf")
print(saved.path) # "/tmp/report.pdf"
asyncio.run(main())
S3 Integration
import asyncio
from smooai_file import File
async def main():
# Download from S3
file = await File.from_s3("my-bucket", "reports/report.pdf")
# Upload to S3 (sets ContentType, ContentLength, ContentDisposition)
await file.upload_to_s3("my-bucket", "archive/report.pdf")
# Save to S3 and get a new S3-backed File instance
original, s3_file = await file.save_to_s3("my-bucket", "archive/report.pdf")
# Move to S3 (deletes local source if applicable)
s3_file = await file.move_to_s3("my-bucket", "archive/report.pdf")
# Generate a pre-signed URL (expires in 1 hour)
signed_url = await s3_file.get_signed_url(expires_in=3600)
print(signed_url)
asyncio.run(main())
Stream Handling
import asyncio
from smooai_file import File
async def my_async_generator():
yield b"hello "
yield b"world"
async def main():
# From an async iterator
file = await File.from_stream(my_async_generator())
text = await file.read_text()
print(text) # "hello world"
# From a sync file-like object
with open("/path/to/file.bin", "rb") as f:
file = await File.from_stream(f)
print(file.mime_type)
asyncio.run(main())
File Operations
import asyncio
from smooai_file import File
async def main():
file = await File.from_file("/tmp/notes.txt")
# Append and prepend (local files only)
await file.append("new line\n")
await file.prepend("# Header\n")
# Truncate to 1 KB
await file.truncate(1024)
# Compute checksum
digest = await file.checksum("sha256")
print(digest) # 64-char hex string
# Filesystem checks
print(await file.exists()) # True
print(await file.is_readable()) # True
print(await file.is_writable()) # True
# Move to a new location (deletes source)
moved = await file.move("/tmp/archive/notes.txt")
# Delete
await moved.delete()
asyncio.run(main())
Built With
- Python 3.11+ with full type hints
- aiofiles — async filesystem I/O
- httpx — async HTTP client for URL downloads
- boto3 — AWS SDK for S3 integration
- python-magic — magic-byte MIME detection
Related Packages
- @smooai/file — TypeScript/Node.js version
- smooai-file (Rust) — Rust version
github.com/SmooAI/file/go/file— Go version
Development
uv sync
uv run poe install-dev
uv run pytest
uv run poe lint
uv run poe lint:fix # optional fixer
uv run poe format
uv run poe typecheck
uv run poe build
Set UV_PUBLISH_TOKEN before running uv run poe publish to upload to PyPI.
Contact
Brent Rager
Smoo Github: https://github.com/SmooAI
License
MIT © SmooAI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file smooai_file-2.2.4.tar.gz.
File metadata
- Download URL: smooai_file-2.2.4.tar.gz
- Upload date:
- Size: 60.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9cd3afa9ab2e3bdab8d861bad92f05c0386f1c18fd2c6c8f06f15c22b84d3a5f
|
|
| MD5 |
1e4f3b8bd33c215f162064c68ccd4ad6
|
|
| BLAKE2b-256 |
63ad2f973aeae38e792c184f9af570f50b3078dfdcdfe7d319263e66f6b5bf85
|
File details
Details for the file smooai_file-2.2.4-py3-none-any.whl.
File metadata
- Download URL: smooai_file-2.2.4-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e296e8f69988ac5f7fc927eced7648ea5966f5173c9f15d4936a90d7c5559b82
|
|
| MD5 |
97a37944e7ae06db1cb7ff0bc1d6d482
|
|
| BLAKE2b-256 |
6a89092e776bc35bf7918bc0d3dfd0a987b7f53d12a5bf47df253581f36e0a8b
|