A Doc2Vec based Embedding Model.
Project description
Swarmauri Embedding Doc2vec
A Gensim-powered Doc2Vec implementation for document
embeddings in the Swarmauri ecosystem. The component registers as
Doc2VecEmbedding and returns vectors as swarmauri_standard.vectors.Vector
instances.
Installation
Install the package with your preferred Python packaging tool:
pip install swarmauri_embedding_doc2vec
poetry add swarmauri_embedding_doc2vec
uv pip install swarmauri_embedding_doc2vec
Usage
from swarmauri_embedding_doc2vec import Doc2VecEmbedding
documents = [
"This is the first document.",
"Here is another document.",
"And a third one.",
]
# Initialize the embedder. Adjust parameters to match your dataset size.
embedder = Doc2VecEmbedding(vector_size=300, window=10, min_count=1, workers=1)
# Fit and transform documents into Vector objects.
vectors = embedder.fit_transform(documents)
# Access the raw embedding values via the Vector.value attribute.
first_vector = vectors[0].value
# Transform new documents (the result is also a Vector).
new_vector = embedder.transform(["This is a new document."])[0]
# Save and load the underlying Doc2Vec model.
model_path = "doc2vec.model"
embedder.save_model(model_path)
embedder.load_model(model_path)
Want to help?
If you want to contribute to swarmauri-sdk, read up on our guidelines for contributing that will help you get started.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file swarmauri_embedding_doc2vec-0.9.0.dev33.tar.gz.
File metadata
- Download URL: swarmauri_embedding_doc2vec-0.9.0.dev33.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5e28a264b6a3070112c4d363e30c1f3a95d875fffbbe3bb2295ba117f41d3250
|
|
| MD5 |
42cb336ff2e2f6fac6dc4d600c994bba
|
|
| BLAKE2b-256 |
38757682a37c18311968b000f9dd5e46bc528039a1002933898df8837e5ed223
|
File details
Details for the file swarmauri_embedding_doc2vec-0.9.0.dev33-py3-none-any.whl.
File metadata
- Download URL: swarmauri_embedding_doc2vec-0.9.0.dev33-py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f169d222301f924607f9dbdee6b08383dbb08faf16d3732fe277da113c321610
|
|
| MD5 |
7c3461ab7b1c1ed818fbb0def4298125
|
|
| BLAKE2b-256 |
c50d6754940b99d695327fd21d77f0cbfda024cf3c9584e3ab1683cdb814153c
|