Experimaestro common module for IR experiments
Information Retrieval for experimaestro
Information Retrieval module for experimaestro
The full documentation can be read at IR@experimaestro.
Install
Base experimaestro-IR can be installed with pip install xpmir.
Functionalities can be added by installing optional dependencies:
- pip install xpmir[neural] to install neural-IR packages (torch, etc.)
- pip install xpmir[anserini] to install Anserini-related packages
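After installing, it can be handy to check which optional dependencies are actually importable. The following is a minimal sketch, not part of xpmir itself: the helper is_installed and the EXTRAS mapping are hypothetical names, and the choice of pyserini as the module behind the anserini extra is an assumption (torch is the one named above).

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


# Hypothetical mapping from an xpmir extra to one representative module.
# "torch" comes from the install notes; "pyserini" is an assumption.
EXTRAS = {"neural": "torch", "anserini": "pyserini"}

for extra, module in EXTRAS.items():
    if is_installed(module):
        print(f"{extra}: {module} is available")
    else:
        print(f"{extra}: {module} is missing (try: pip install xpmir[{extra}])")
```

The check only looks up the module without importing it, so it stays fast even when heavy packages such as torch are present.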
What's inside?
- Collection management (using datamaestro)
  - Interface for the IR datasets library
  - Splitting IR datasets
  - Shuffling training triplets
- Representation
  - Word Embeddings
  - HuggingFace transformers
- Indices
  - dense: FAISS interface
  - sparse: xpmir-rust library
- Standard Indexing and Retrieval
  - Anserini
- Learning to Rank
  - Pointwise
  - Pairwise
  - Distillation
  - (planned) Pipelines (e.g. ANCE)
- Neural IR
  - Cross-Encoder
  - Splade
  - DRMM
  - ColBERT
- Paper reproduction:
  - MonoBERT (Passage Re-ranking with BERT. Rodrigo Nogueira and Kyunghyun Cho. 2019)
  - (planned) DuoBERT (Multi-Stage Document Ranking with BERT. Rodrigo Nogueira, Wei Yang, Kyunghyun Cho, Jimmy Lin. 2019)
  - (planned) Splade v2 (SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval, Thibault Formal, Carlos Lassance, Benjamin Piwowarski, and Stéphane Clinchant. SIGIR 2021)
- Pre-trained models
  - HuggingFace integration (direct, through the Sentence Transformers library)
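To make the pointwise and pairwise learning-to-rank entries above concrete, here is a generic sketch in plain Python. It does not use or reflect xpmir's API: the function names are hypothetical, and the losses shown (binary cross-entropy on a single score, and a RankNet-style logistic loss on a score difference) are common textbook choices assumed for illustration.

```python
import math


def softplus(z: float) -> float:
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def pointwise_loss(score: float, relevant: bool) -> float:
    """Binary cross-entropy for one (query, document) score treated as a logit."""
    # -log(sigmoid(s)) if relevant, -log(1 - sigmoid(s)) otherwise
    return softplus(-score if relevant else score)


def pairwise_loss(pos_score: float, neg_score: float) -> float:
    """Logistic loss on a (query, positive, negative) triplet.

    Equals -log(sigmoid(pos_score - neg_score)): small when the
    positive document is scored above the negative one.
    """
    return softplus(neg_score - pos_score)


# One training triplet: scores for a relevant and a non-relevant document
print(pointwise_loss(2.0, True) + pointwise_loss(-1.0, False))
print(pairwise_loss(2.0, -1.0))
```

The pointwise loss judges each document on its own, while the pairwise loss only depends on the difference between the two scores, which is why training triplets are needed for the latter.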
Examples
Thanks
Some parts of the code have been adapted from OpenNIR.