A python package to map your own csv files data using Atlas from NOMIC
Reason this release was yanked:
requirements dependencies
Project description
This is a vesy simple way to map your text data using Altas from NOMIC using the lib click
.
You have to create an account to get API_KEY NOMIC.
Atlas enables you to:
Store, update and organize multi-million point datasets of unstructured text, images and embeddings.
Visually interact with your datasets from a web browser.
Run semantic search and vector operations over your datasets.
Use Atlas to:
- Visualize, interact, collaborate and share large datasets of text and embeddings.
- Collaboratively clean, tag and label your datasets
- Build high-availability apps powered by semantic search
- Understand and debug the latent space of your AI model trains
How to use
Installation
To install the necessary dependencies, run the following command:
python -m venv mymapenv
source mymapenv/bin/activate
pip install --upgrade pip
pip install text2mapviewer
Supported Transformer Models from Hugging Face
This project supports a variety of transformer models, including models from the Hugging Face Model Hub and sentence-transformers. Below are some examples: - Hugging Face Model: 'prajjwal1/bert-mini' - Hugging Face Model: 'Sahajtomar/french_semantic' (french version for semantic search embedding) - Sentence-Transformers Model: 'sentence-transformers/all-MiniLM-L6-v2' etc...
Please ensure that the model you choose is compatible with the project requirements and adjust the --transformer_model_name
option accordingly.
To map your text/csv files
pip install -r requirements.txt
python main.py --transformer-model-name MODEL_NAME --cache_dir CACHE_DIR --batch-size BATCH_SIZE --file-path FILE_PATH
Remarque for the CACHE_DIR : you can setup it like ==>
export TRANSFORMERS_CACHE=/path_to_your/transformers_cache
Give a fidback.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for text2mapviewer-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3174c3e2b7ba87d040f8817d94791de8cf76798440c487a34f253092cf9b9a19 |
|
MD5 | d65a4db82f47fec3efce17c2078f62d0 |
|
BLAKE2b-256 | b1176f053cd0c35e6e72c145bad185e10f380fc61c1f8104a65795f1d60d0089 |