Experimaestro common module for IR experiments
Project description
Information Retrieval for experimaestro
Information Retrieval module for experimaestro
The full documentation can be read at IR@experimaestro.
You can find experiments built on top of XPMIR on the xpmir github workspace.
Finally, you can find the roadmap.
Install
Base experimaestro-IR can be installed with pip install xpmir.
Functionalities can be added by installing optional dependencies:
pip install xpmir[neural]to install neural-IR packages (torch, etc.)pip install xpmir[anserini]to install Anserini related packages
For the development version, you can:
- If you just want the development version: install with
pip install git+https://github.com/experimaestro/experimaestro-ir.git - If you want to edit the code: clone and then do a
pip install -e .within the directory
What's inside?
- Collection management (using datamaestro)
- Interface for the IR datasets library
- Splitting IR datasets
- Shuffling training triplets
- Representation
- Word Embeddings
- HuggingFace transformers
- Indices
- dense: FAISS interface
- sparse: xpmir-rust library
- Standard Indexing and Retrieval
- Anserini
- Learning to Rank
- Pointwise
- Pairwise
- Distillation
- Neural IR
- Cross-Encoder
- Splade
- DRMM
- ColBERT
- Paper reproduction:
- MonoBERT (Passage Re-ranking with BERT. Rodrigo Nogueira and Kyunghyun Cho. 2019)
- (alpha) DuoBERT (Multi-Stage Document Ranking with BERT. Rodrigo Nogueira, Wei Yang, Kyunghyun Cho, Jimmy Lin. 2019)
- (beta) Splade v2 (SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval, Thibault Formal, Carlos Lassance, Benjamin Piwowarski, and Stéphane Clinchant. SIGIR 2021)
- (planned) ANCE
- Pre-trained models
- HuggingFace integration (direct, through the Sentence Transformers library)
Thanks
Some parts of the code have been adapted from OpenNIR
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file experimaestro_ir-1.3.0.tar.gz.
File metadata
- Download URL: experimaestro_ir-1.3.0.tar.gz
- Upload date:
- Size: 162.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7fa8084a9c953f818ed7b515b0660e92b5333b9ce5f721c9a2d43578eaedc9d9
|
|
| MD5 |
5469a668e6f58d81f3c4df250a0951ea
|
|
| BLAKE2b-256 |
b0307b4aaa090fad9ba891fe17b4f825be1a4c9022d4956eac2501a86cceeee9
|
File details
Details for the file experimaestro_ir-1.3.0-py3-none-any.whl.
File metadata
- Download URL: experimaestro_ir-1.3.0-py3-none-any.whl
- Upload date:
- Size: 192.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3dde1d777b502b16dfcdaec69da621c53d9dc9f2882d54e5d19131f1dd5eccfd
|
|
| MD5 |
45b434dfe227e48b4256c70dea2637fe
|
|
| BLAKE2b-256 |
66c1b12849682ac30ba8a16fd5a66d3a6e330f517e39c25385818aebb3610cb0
|