A universal dataset analysis tool with interactive visualization
📊 Super DataScope
A general-purpose dataset analysis tool that supports Parquet, SQLite, CSV, and other formats. Part of the super series of open-source projects.
Quick Start
Installation
pip install super-datascope
Usage
# Launch the Web UI
super-datascope
# Analyze a specific file
super-datascope data.parquet
# Analyze all data files in a directory (recursive traversal)
super-datascope ./datasets/
# Analyze a SQLite database
super-datascope database.db
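To try the commands above without real data, a small sample dataset can be generated with the Python standard library. This is a minimal sketch: the file names, the `users` table, and the helpers `make_sample_data` and `build_command` are hypothetical and not part of super-datascope. Only the command shapes (`super-datascope` with an optional path) come from this README.

```python
from __future__ import annotations

import csv
import sqlite3
from pathlib import Path


def make_sample_data(root: Path) -> list[Path]:
    """Write a tiny CSV file and SQLite database under `root` (made-up sample data)."""
    root.mkdir(parents=True, exist_ok=True)
    rows = [(1, "alice", 3.5), (2, "bob", 4.0)]

    csv_path = root / "users.csv"
    with csv_path.open("w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow(["id", "name", "score"])
        writer.writerows(rows)

    db_path = root / "database.db"
    conn = sqlite3.connect(db_path)
    try:
        conn.execute("CREATE TABLE IF NOT EXISTS users (id INTEGER, name TEXT, score REAL)")
        conn.execute("DELETE FROM users")  # keep reruns idempotent
        conn.executemany("INSERT INTO users VALUES (?, ?, ?)", rows)
        conn.commit()
    finally:
        conn.close()
    return [csv_path, db_path]


def build_command(target: Path | None = None) -> list[str]:
    """Return the CLI argv: no argument starts the Web UI, a path analyzes it."""
    return ["super-datascope"] if target is None else ["super-datascope", str(target)]


if __name__ == "__main__":
    created = make_sample_data(Path("datasets"))
    print("created:", ", ".join(str(p) for p in created))
    # Only prints the command; pass the list to subprocess.run to launch the tool.
    print("next:", " ".join(build_command(Path("datasets"))))
```

The list returned by `build_command` can be passed to `subprocess.run` once the package is installed.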
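Features
- 📤 File upload analysis (works out of the box)
- 💻 Quick launch from the command line
- 📊 Interactive data visualization (8 chart types)
- 🔍 SQL queries via DuckDB
- 📋 In-depth JSON data analysis (path extraction, wildcards, regular expressions)
- 💾 Session save and restore
- ☁️ MinIO remote storage support (optional)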
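The JSON feature above combines path extraction with wildcards and regular expressions. The sketch below illustrates the general idea in plain Python. It is not the tool's implementation, and the path syntax is invented for this example: segments are separated by dots, `*` matches every key or list index, and a segment wrapped in slashes is treated as a regular expression over keys.

```python
import re
from typing import Any, Iterator, List


def extract(data: Any, path: str) -> Iterator[Any]:
    """Yield every value reached by a dot-separated path (hypothetical syntax).

    Limitation of this sketch: a regex segment cannot contain a dot,
    because the path is split on dots first.
    """
    segments = path.split(".") if path else []
    yield from _walk(data, segments)


def _walk(node: Any, segments: List[str]) -> Iterator[Any]:
    if not segments:
        yield node
        return
    head, rest = segments[0], segments[1:]
    if isinstance(node, dict):
        if head == "*":
            children = list(node.values())
        elif len(head) >= 2 and head.startswith("/") and head.endswith("/"):
            pattern = re.compile(head[1:-1])
            children = [v for k, v in node.items() if pattern.search(k)]
        else:
            children = [node[head]] if head in node else []
    elif isinstance(node, list):
        if head == "*":
            children = node
        elif head.isdecimal() and int(head) < len(node):
            children = [node[int(head)]]
        else:
            children = []
    else:
        children = []  # scalars have no children to descend into
    for child in children:
        yield from _walk(child, rest)


record = {
    "user": {"name": "alice", "tags": ["a", "b"]},
    "meta_v1": {"size": 1},
    "meta_v2": {"size": 2},
}
print(list(extract(record, "user.tags.*")))    # wildcard over a list
print(list(extract(record, "/^meta_/.size")))  # regex over top-level keys
```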
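Data Access Methods
| Method | Description | Complexity |
|---|---|---|
| 📤 File upload | Upload parquet/csv/sqlite files directly in the Web UI | Simplest |
| 💻 Command-line argument | super-datascope ./data/ | Simple |
| ☁️ ObjStorage | Configure MinIO or another object store to browse remote files | Advanced |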
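Passing a directory on the command line makes the tool walk it recursively. To preview which files such a walk might pick up, a rough equivalent can be written with `pathlib`. The extension list below is an assumption based on the formats named in this README; the tool's actual detection rules may differ.

```python
from pathlib import Path
from typing import List

# Assumed extensions, taken from the formats mentioned in this README.
DATA_SUFFIXES = {".parquet", ".csv", ".sqlite", ".db"}


def find_data_files(root: Path) -> List[Path]:
    """Recursively collect files under `root` whose suffix looks like a data file."""
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in DATA_SUFFIXES
    )


if __name__ == "__main__":
    for path in find_data_files(Path(".")):
        print(path)
```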
Advanced Configuration
MinIO Remote Storage
Install MinIO support:
pip install super-datascope[minio]
Create the configuration file ~/.super-datascope/config.yaml:
storage:
  my_minio:
    type: minio
    connection:
      endpoint: localhost:9000
      access_key: minioadmin
      secret_key: minioadmin
      secure: false
    bucket_name: my-datasets
    bucket_root_path: data/
Alternatively, configure it through environment variables:
export DATASCOPE_STORAGE_TYPE=minio
export DATASCOPE_STORAGE_ENDPOINT=localhost:9000
export DATASCOPE_STORAGE_ACCESS_KEY=minioadmin
export DATASCOPE_STORAGE_SECRET_KEY=minioadmin
export DATASCOPE_STORAGE_BUCKET=my-datasets
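Before launching the tool, it can help to check that the variables above are actually set in the current shell. The following is a minimal sketch that collects every `DATASCOPE_STORAGE_*` variable into a dictionary and reports which of the five names listed above are missing. The helper names are hypothetical, and this is not how super-datascope itself reads its configuration.

```python
import os
from typing import Dict, List, Mapping, Optional

PREFIX = "DATASCOPE_STORAGE_"
# The five settings shown in the export lines of this README.
EXPECTED = ("type", "endpoint", "access_key", "secret_key", "bucket")


def storage_settings_from_env(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Map DATASCOPE_STORAGE_FOO=bar to {"foo": "bar"}."""
    env = os.environ if env is None else env
    return {
        key[len(PREFIX):].lower(): value
        for key, value in env.items()
        if key.startswith(PREFIX)
    }


def missing_settings(settings: Mapping[str, str]) -> List[str]:
    """Return the expected setting names that are absent or empty."""
    return [name for name in EXPECTED if not settings.get(name)]


if __name__ == "__main__":
    settings = storage_settings_from_env()
    missing = missing_settings(settings)
    print("missing:", ", ".join(missing) if missing else "none")
```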
Tech Stack
License
MIT