txtai: AI-powered search engine
txtai builds an AI-powered index over sections of text. The index supports similarity search and serves as the basis for extractive question-answering systems.
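To make the idea of similarity search over sections of text concrete, here is a minimal sketch that uses only the Python standard library. It is not txtai's implementation or API: it swaps the learned sentence embeddings for plain term-count vectors, so it only matches literal words. The function names (`vectorize`, `cosine`, `search`) and the sample sections are hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn text into a sparse term-count vector (a stand-in for a learned embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, sections, limit=1):
    """Rank sections by similarity to the query; returns (index, score) pairs, best first."""
    q = vectorize(query)
    scores = [(i, cosine(q, vectorize(s))) for i, s in enumerate(sections)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:limit]


# Illustrative sections of text (made up for this sketch).
sections = [
    "Maine man wins lottery",
    "Canada's last fully intact ice shelf has suddenly collapsed",
    "Beijing mobilises invasion craft along coast",
]

best, score = search("ice shelf collapsed", sections)[0]
print(sections[best])
```

An embeddings-based index follows the same rank-by-vector-similarity pattern, but because the vectors come from a language model it can also match sections that share meaning without sharing words.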
NeuML uses txtai and/or the concepts behind it to power all of our Natural Language Processing (NLP) applications. Example applications:
- cord19q - COVID-19 literature analysis
- paperai - AI-powered literature discovery and review engine for medical/scientific papers
- neuspo - a fact-driven, real-time sports event and news site
- codequestion - Ask coding questions directly from the terminal
txtai is built on the following stack:
- sentence-transformers
- transformers
- faiss
- Python 3.6+
Installation
The easiest way to install is via pip and PyPI:
pip install txtai
You can also install txtai directly from GitHub. Using a Python Virtual Environment is recommended.
pip install git+https://github.com/neuml/txtai
Python 3.6+ is supported.
Notes for Windows
This project has dependencies that require compiling native code. Linux environments usually work without an issue. Windows requires the following extra steps.
- Install C++ Build Tools - https://visualstudio.microsoft.com/visual-cpp-build-tools/
- If PyTorch errors are encountered, run the following command before installing txtai. See pytorch.org for more information.
pip install torch===1.6.0 torchvision===0.7.0 -f https://download.pytorch.org/whl/torch_stable.html
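After installing, a quick sanity check can confirm that the interpreter meets the Python 3.6+ requirement and that the packages in the stack can be found. The sketch below is an assumption-laden helper, not part of txtai: the function names are hypothetical, and the import names (`sentence_transformers`, `transformers`, `faiss`, `torch`) are the usual module names for those packages.

```python
import importlib.util
import sys

# Import names assumed for the packages txtai is built on.
REQUIRED_MODULES = ["sentence_transformers", "transformers", "faiss", "torch"]


def python_supported(version_info=sys.version_info, minimum=(3, 6)):
    """Return True if the interpreter version is at least the stated minimum."""
    return tuple(version_info[:2]) >= minimum


def missing_modules(names=REQUIRED_MODULES):
    """Return the names that cannot be located, without importing them."""
    return [name for name in names if importlib.util.find_spec(name) is None]


print("Python version OK:", python_supported())
print("Missing modules:", missing_modules())
```

An empty list of missing modules only shows that the packages are discoverable. Native-code problems, such as the PyTorch errors mentioned above, may still surface on first import.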
Examples
The examples directory has a series of examples and notebooks giving an overview of txtai. See the list of notebooks below.
Notebooks
Notebook | Description
---|---
Introducing txtai | Overview of the functionality provided by txtai
Extractive QA with txtai | Extractive question-answering with txtai
Build an Embeddings index from a data source | Embeddings index from a data source backed by word embeddings
Extractive QA with Elasticsearch | Extractive question-answering with Elasticsearch
File details
Details for the file txtai-1.0.0.tar.gz.
File metadata
- Download URL: txtai-1.0.0.tar.gz
- Upload date:
- Size: 13.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.7.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | 29ae9cdecfd1973c57e4f6adaef62cf72c9d7d5d79c6528825a3a51adcb74adc
MD5 | 767354df5c5ffa139fd613072d085d15
BLAKE2b-256 | 929b4ca8c0324ecfc93e518691a1a9d3fc44022b7f42f0f3c0c63697742cf860
File details
Details for the file txtai-1.0.0-py3-none-any.whl.
File metadata
- Download URL: txtai-1.0.0-py3-none-any.whl
- Upload date:
- Size: 17.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.7.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | 6ebae5c06f2dca3b9336b5c3d2dfae762290d3c23261875789c3d769df80f545
MD5 | f4b8894ba10b4d066c453f3a28644b0d
BLAKE2b-256 | c712eab6619bacb0b9b0b6d738d945035192428c58a7d47303d7c018a072f7c9
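A downloaded file can be checked against the published SHA256 digests with the standard library's `hashlib`. The following is a minimal sketch: the helper names (`sha256_of`, `verify`) and the `EXPECTED` mapping are hypothetical, the digests are copied from the tables above, and the files are assumed to have been downloaded separately into the current directory.

```python
import hashlib

# SHA256 digests copied from the file hash tables above.
EXPECTED = {
    "txtai-1.0.0.tar.gz": "29ae9cdecfd1973c57e4f6adaef62cf72c9d7d5d79c6528825a3a51adcb74adc",
    "txtai-1.0.0-py3-none-any.whl": "6ebae5c06f2dca3b9336b5c3d2dfae762290d3c23261875789c3d769df80f545",
}


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_hex):
    """Return True if the file's SHA256 digest matches the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `verify("txtai-1.0.0.tar.gz", EXPECTED["txtai-1.0.0.tar.gz"])` should return `True` for an intact download of the source distribution.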