Skip to main content

A python package to map your own csv files data using Atlas from NOMIC

Reason this release was yanked:

update code and readma

Project description

This is a vesy simple way to map your text data using Altas from NOMIC using the lib click.

You have to create an account to get API_KEY NOMIC.

Atlas enables you to:

Store, update and organize multi-million point datasets of unstructured text, images and embeddings.

Visually interact with your datasets from a web browser.

Run semantic search and vector operations over your datasets.

Use Atlas to:

- Visualize, interact, collaborate and share large datasets of text and embeddings.

- Collaboratively clean, tag and label your datasets

- Build high-availability apps powered by semantic search

- Understand and debug the latent space of your AI model trains

How to use

Installation

To install the necessary dependencies, run the following command:

python -m venv mymapenv 
source mymapenv/bin/activate
pip install --upgrade pip 
pip install text2mapviewer

Supported Transformer Models from Hugging Face

This project supports a variety of transformer models, including models from the Hugging Face Model Hub and sentence-transformers. Below are some examples: - Hugging Face Model: 'prajjwal1/bert-mini' - Hugging Face Model: 'Sahajtomar/french_semantic' (french version for semantic search embedding) - Sentence-Transformers Model: 'sentence-transformers/all-MiniLM-L6-v2' etc...

Please ensure that the model you choose is compatible with the project requirements and adjust the --transformer_model_name option accordingly.

To map your text/csv files

pip install -r requirements.txt
python main.py --transformer-model-name MODEL_NAME --cache_dir CACHE_DIR --batch-size BATCH_SIZE --file-path FILE_PATH

Remarque for the CACHE_DIR : you can setup it like ==>

export TRANSFORMERS_CACHE=/path_to_your/transformers_cache

Give a fidback.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2mapviewer-0.2.1.tar.gz (6.8 kB view hashes)

Uploaded Source

Built Distribution

text2mapviewer-0.2.1-py3-none-any.whl (5.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page