A semantic search package for hotel data
Project description
Traversaal
Traversaal is a Python package that provides a simple semantic search functionality for hotel data. It leverages large language models such as BERT to encode hotel descriptions and reviews, allowing users to perform semantic search queries and retrieve relevant results based on the provided search query.
Features
- Efficient semantic search for hotel data based on descriptions and reviews.
- Utilizes state-of-the-art language models to encode and compare text embeddings.
- Returns relevant search results with corresponding scores for ranking.
- Supports GPU acceleration for faster encoding and search performance.
Installation
You can install Traversaal using pip:
pip install traversaal
import pandas as pd
import traversaal
model_name = 'bert-base-uncased'
search = traversaal.SemanticSearch(model_name)
df = pd.read_csv('hotels.csv')
encoded_data = search.encode_data(df)
query = 'great location and service'
relevant_results = search.search(encoded_data, query)
print("\nRelevant Results:")
print(relevant_results.head())
For more detailed usage examples and API documentation, please refer to the GitHub repository.
Contributing
Contributions to Traversaal are welcome! If you encounter any issues, have suggestions, or would like to contribute enhancements or new features, please feel free to submit a pull request on the GitHub repository.
License
Traversaal is licensed under the MIT License. See the LICENSE file for more details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file traversaal-0.57-py3-none-any.whl
.
File metadata
- Download URL: traversaal-0.57-py3-none-any.whl
- Upload date:
- Size: 2.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d804f3efaf420c64a4405636429f798c896bba9c5ca57d676bfe7d30d8ff679 |
|
MD5 | 1fa077b3e762bc734bc408d1b289b776 |
|
BLAKE2b-256 | 551813069d461d9dd988c09e291e1a2d976edd4d0c76fadc2f79393c9441b235 |