Visualize s3 data
Project description
English | 简体中文
OSS browser based on s3
Vis3 is a visualization tool for large language models and machine learning data, supporting cloud storage platforms with S3 protocol (AWS, Aliyun OSS, Tencent Cloud) and various data formats (json, jsonl.gz, warc.gz, md, mobi, epub, etc.). It offers interactive visualization through JSON, HTML, Markdown, and image views for efficient data analysis.
Features
- Supports JSON, JSONL, WARC, and more, automatically recognizing data structures for clear, visual insights.
- One-click field previews with seamless switching between Html, Markdown, and image views for intuitive operation.
- Integrates with S3-compatible cloud storage (Aliyun OSS, AWS, Tencent Cloud) and local file parsing for easy data access.
https://github.com/user-attachments/assets/aa8ee5e8-c6d3-4b20-ae9d-2ceeb2eb2c41
Getting Started
# python >= 3.9.2
pip install vis3
Or create a Python environment using conda:
Install miniconda
# 1. Create Python 3.11 environment using conda
conda create -n vis3 python=3.11
# 2. Activate environment
conda activate vis3
# 3. Install vis3
pip install vis3
# 4. Launch (no authentication)
vis3 --open
Upgrade to the latest version
pip install vis3 -U
Variables
ENABLE_AUTH
Enable authentication.
ENABLE_AUTH=1 vis3
BASE_DATA_DIR
Specify database (SQLite) directory.
BASE_DATA_DIR=your/database/path vis3
BASE_URL
Specity base url to the api call.
BASE_URL=/a/b/c
Local Development
conda create -n vis3-dev python=3.11
# Activate virtual environment
conda activate vis3-dev
# Install poetry
# https://python-poetry.org/docs/#installing-with-the-official-installer
# Install Python dependencies
poetry install
# Install frontend dependencies (install pnpm: https://pnpm.io/installation)
cd web && pnpm install
# Build frontend assets (in web directory)
pnpm build
# Start vis3
uvicorn vis3.main:app --reload
React Component 
We provide a React component via npm for customizing your data preview ui.
npm i @vis3/kit
Community
Welcome to join the Opendatalab official WeChat group!
Related Projects
- LabelU Image / Video / Audio annotation tool
- LabelU-kit Web frontend annotation kit (LabelU is developed based on this kit)
- LabelLLM Open-source LLM dialogue annotation platform
- Miner U One-stop high-quality data extraction tool
License
This project is licensed under the Apache 2.0 license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vis3-1.1.6.tar.gz.
File metadata
- Download URL: vis3-1.1.6.tar.gz
- Upload date:
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.0.1 CPython/3.11.12 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0fdab0b3ac8e2bb63f162a35de03fcd3e12f253dcaea7296040ee217330cecbd
|
|
| MD5 |
75054e72047274cf5f392d049f6c29c3
|
|
| BLAKE2b-256 |
df8067404880192487ded3c2ca4d6c1bf9e30505d1ffa93c256a477dc0d51756
|
File details
Details for the file vis3-1.1.6-py3-none-any.whl.
File metadata
- Download URL: vis3-1.1.6-py3-none-any.whl
- Upload date:
- Size: 1.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.0.1 CPython/3.11.12 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
78d25e1666b357ac0ad2cdc8b7196aa01002c937e6fd949e62f5ac11e198234d
|
|
| MD5 |
4d770d545a07da63627fa618a0eacb21
|
|
| BLAKE2b-256 |
edbbc40da2187c902420d3e333f7c33494a66ea46edb0207fd2deb528e983bf9
|