Skip to main content

A python package to map your own csv files data using Atlas from NOMIC

Project description

This is a vesy simple way to map your text data using Altas from NOMIC using the lib click.

You have to create an account to get API_KEY NOMIC.

<< Atlas enables you to:

  • Store, update and organize multi-million point datasets of unstructured text, images and embeddings.

  • Visually interact with your datasets from a web browser.

  • Run semantic search and vector operations over your datasets. Use Atlas to:

    • Visualize, interact, collaborate and share large datasets of text and embeddings.
    • Collaboratively clean, tag and label your datasets
    • Build high-availability apps powered by semantic search
    • Understand and debug the latent space of your AI model trains >>

How to use

Installation

To install the necessary dependencies, run the following command:

python -m venv mymapenv 
source mymapenv/bin/activate
pip install --upgrade pip 
pip install text2mapviewer

Login NOMIC server

Login/create your Nomic account:

nomic login

If you have already your account :

nomic login [YOUR_API_TOKEN_NOMIC_HERE] 

Examples :

from NOMIC and with lib text2mapviewer

from text2mapviewer.examples.map_embedding import project

# Use the projet from the lib text2mapviewer 
print(project) 

With the lib click after clone this ripo

python scr/text2mapviewer/examples/map_embedding_click.py --num_embeddings 10000 --embedding_dim 256

The Animation Ouput

Supported Transformer Models from Hugging Face

This project supports a variety of transformer models, including models from the Hugging Face Model Hub and sentence-transformers. Below are some examples: - Hugging Face Model: 'prajjwal1/bert-mini' - Hugging Face Model: 'Sahajtomar/french_semantic' (french version for semantic search embedding) - Sentence-Transformers Model: 'sentence-transformers/all-MiniLM-L6-v2' etc...

Please ensure that the model you choose is compatible with the project requirements and adjust the --transformer_model_name option accordingly.

To map your text/csv files

pip install -r requirements.txt
python main.py --transformer-model-name MODEL_NAME --cache_dir CACHE_DIR --batch-size BATCH_SIZE --file-path FILE_PATH

NOTE: for the CACHE_DIR : you can setup it like ==>

export TRANSFORMERS_CACHE=/path_to_your/transformers_cache

Give a fidback.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2mapviewer-0.2.2.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

text2mapviewer-0.2.2-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file text2mapviewer-0.2.2.tar.gz.

File metadata

  • Download URL: text2mapviewer-0.2.2.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for text2mapviewer-0.2.2.tar.gz
Algorithm Hash digest
SHA256 b3a4b671b947fba3193439ef775f748bace395f384adbcc088e8786e57fa87f1
MD5 75ecf4098480868d7f19e3e4ffe3060d
BLAKE2b-256 a4b95899e4cc4db302f6f2f37d78694094753cfa212e136d86f18389403026c3

See more details on using hashes here.

File details

Details for the file text2mapviewer-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: text2mapviewer-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for text2mapviewer-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 07239f079beeaf95d9e6697a98322319fa02b075db105a9f2f02a2595ae66f1d
MD5 ff4a4d82a9310f1937cd5db233bd1d4b
BLAKE2b-256 c13ade12f0db582b3c75ae7815e67c170170b74339e8bcc4eefe2a0656014264

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page