Skip to main content

Visualize s3 data

Project description

English | 简体中文

OSS browser based on s3

A visualization tool designed for large language models and machine learning data. It supports cloud storage platforms with S3 protocol (AWS, Alibaba Cloud) and handles various data formats (json, jsonl.gz, warc.gz, md, etc.). Interactive visualization through JSON, Html, Markdown, and image views for efficient data analysis.

Features

  • Supports multiple formats such as JSON, JSONL, WARC, intelligently recognizes data structures and visually presents key information, making data clear at a glance.

  • One-click preview of any field, supports free switching between multiple view modes such as web pages, Markdown, images, simple and intuitive operation.

  • Seamlessly connects with mainstream cloud storage platforms (Alibaba Cloud, AWS, Tencent Cloud, and other cloud storage platforms that support the S3 protocol), supports local file parsing, making data access easy.

https://github.com/user-attachments/assets/aa8ee5e8-c6d3-4b20-ae9d-2ceeb2eb2c41

Getting Started

# python >= 3.9.2
pip install vis3

Or create a Python environment using conda:

Install miniconda

# 1. Create Python 3.11 environment using conda
conda create -n vis3 python=3.11

# 2. Activate environment
conda activate vis3

# 3. Install vis3
pip install vis3

# 4. Launch
vis3

# ----------------

# Default launch doesn't require authentication, or set ENABLE_AUTH variable to enable authentication.
ENABLE_AUTH=true vis3

# Specify database (sqlite) directory
BASE_DATA_DIR=your/database/path vis3

Local Development

conda create -n vis3-dev python=3.11

# Activate virtual environment
conda activate vis3-dev

# Install poetry
# https://python-poetry.org/docs/#installing-with-the-official-installer

# Install Python dependencies
poetry install

# Install frontend dependencies (install pnpm: https://pnpm.io/installation)
cd web && pnpm install

# Build frontend assets (in web directory)
pnpm build

# Start vis3
uvicorn vis3.main:app --reload

React Component

We provide a standalone React component via npm for customizing your data preview ui.

Community

Welcome to join the Opendatalab official WeChat group!

Related Projects

  • LabelU Image / Video / Audio annotation tool
  • LabelU-kit Web frontend annotation kit (LabelU is developed based on this kit)
  • LabelLLM Open-source LLM dialogue annotation platform
  • Miner U One-stop high-quality data extraction tool

License

This project is licensed under the Apache 2.0 license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vis3-1.0.4.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vis3-1.0.4-py3-none-any.whl (1.9 MB view details)

Uploaded Python 3

File details

Details for the file vis3-1.0.4.tar.gz.

File metadata

  • Download URL: vis3-1.0.4.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.11.12 Linux/6.11.0-1015-azure

File hashes

Hashes for vis3-1.0.4.tar.gz
Algorithm Hash digest
SHA256 899acdaaf62d659935c34a63eef278535e52f9b2e96b3ff9f07583258e446c78
MD5 a9b67a96301f96225ba89a9a665eb4d8
BLAKE2b-256 a55631915b252d3819a9c5598b9b467fa7e20d80d684cdaf280f21ada399b3e8

See more details on using hashes here.

File details

Details for the file vis3-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: vis3-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 1.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.11.12 Linux/6.11.0-1015-azure

File hashes

Hashes for vis3-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 b7a58465a7f838d831981398437a6894d7458e86d87fbc7177a98422755f021a
MD5 d44b3d8c4770d8afba9fa9229164d6ab
BLAKE2b-256 6268265654242e16c83e797e9bd3f8577a1b6c5a619c77656d0648971f2b52b8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page