Skip to main content

Intuitive interface to many IR axioms.

Project description

DOI PyPi CI Code coverage Python Issues Commit activity Downloads License

↕️ ir_axioms

Intuitive axiomatic retrieval experimentation.

ir_axioms is a Python framework for experimenting with axioms in information retrieval in a declarative way. It includes reference implementations of many commonly used retrieval axioms and is well integrated with the PyTerrier framework and the Pyserini toolkit. Re-rank your search results today with ir_axioms and understand your retrieval systems better by analyzing axiomatic preferences!

Presentation video on YouTube Poster
Presentation video Poster

Usage

The ir_axioms framework is easy to use. Below, we've prepared some notebooks showcasing the main features. If you have questions or need assistance, please contatct us.

Example Notebooks

We include several example notebooks to demonstrate re-ranking and preference evaluation in PyTerrier using ir_axioms. You can find all examples in the examples/ directory.

Backends

You can experiment with ir_axioms in PyTerrier and Pyserini. However, we recommend PyTerrier as not all features are implemented for the Pyserini backend.

PyTerrier (Terrier index)

To use ir_axioms with a Terrier index, please use our PyTerrier transformers (modules):

Transformer Class Type Description
AggregatedPreferences 𝑅 → 𝑅𝑓 Aggregate axiom preferences for each document
EstimatorKwikSortReranker 𝑅 → 𝑅′ Train estimator for ORACLE, use it to re-rank with KwikSort.
KwikSortReranker 𝑅 → 𝑅′ Re-rank using axiom preferences aggregated by KwikSort.
PreferenceMatrix 𝑅 → (𝑅×𝑅)𝑓 Compute an axiom’s preference matrix.

You can also directly instantiate a index context object from a Terrier index if you want to build custom axiomatic modules:

from ir_axioms.backend.pyterrier import TerrierIndexContext
context = TerrierIndexContext("/path/to/index/dir")
axiom.preference(context, query, doc1, doc2)

Pyserini (Anserini index)

We don't have modules for Pyserini to re-rank or analyze results out of the box. However, you can still comute axiom preferences to integrate retrieval axioms into your search pipeline:

from ir_axioms.backend.pyserini import AnseriniIndexContext
context = AnseriniIndexContext("/path/to/index/dir")
axiom.preference(context, query, doc1, doc2)

Citation

If you use this package or its components in your research, please cite the following paper describing the ir_axioms framework and its use-cases:

Alexander Bondarenko, Maik Fröbe, Jan Heinrich Reimer, Benno Stein, Michael Völske, and Matthias Hagen. Axiomatic Retrieval Experimentation with ir_axioms. In 45th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2022), July 2022. ACM.

You can use the following BibTeX entry for citation:

@InProceedings{bondarenko:2022d,
  author =    {Alexander Bondarenko and
               Maik Fr{\"o}be and
               {Jan Heinrich} Reimer and
               Benno Stein and
               Michael V{\"o}lske and
               Matthias Hagen},
  booktitle = {45th International ACM Conference on Research and Development
               in Information Retrieval (SIGIR 2022)},
  month =     jul,
  publisher = {ACM},
  site =      {Madrid, Spain},
  title =     {{Axiomatic Retrieval Experimentation with ir_axioms}},
  year =      2022
}

Development

To build ir_axioms and contribute to its development you need to install the build, and setuptools and wheel packages:

pip install build setuptools wheel

(On most systems, these packages are already pre-installed.)

Installation

Install dependencies for developing the ir_axioms package:

pip install -e .

If you want to develop the Pyserini backend, install dependencies like this:

pip install -e .[pyserini]

If you want to develop the PyTerrier backend, install dependencies like this:

pip install -e .[pyterrier]

Testing

Install test dependencies:

pip install -e .[test]

Verify your changes against our test suite to verify.

flake8 ir_axioms tests
pylint -E ir_axioms tests.unit --ignore-paths=^ir_axioms.backend
pytest ir_axioms/ tests/unit/ --ignore=ir_axioms/backend/

Please also add tests for the axioms or integrations you've added.

Testing backend integrations

Install test dependencies (replace <BACKEND> with either pyserini or pyterrier):

pip install -e .[<BACKEND>]

Verify your changes against our test suite to verify.

pylint -E ir_axioms.backend.<BACKEND> tests.integration.<BACKEND>
pytest tests/integration/<BACKEND>/

Build wheel

A wheel for this package can be built by running:

python -m build

Support

If you hit any problems using ir_axioms or reproducing our experiments, please write us an email or file an issue:

We're happy to help!

License

This repository is released under the MIT license. If you use ir_axioms in your research, we'd be glad if you'd cite us.

Abstract

Axiomatic approaches to information retrieval have played a key role in determining basic constraints that characterize good retrieval models. Beyond their importance in retrieval theory, axioms have been operationalized to improve an initial ranking, to “guide” retrieval, or to explain some model’s rankings. However, recent open-source retrieval frameworks like PyTerrier and Pyserini, which made it easy to experiment with sparse and dense retrieval models, have not included any retrieval axiom support so far. To fill this gap, we propose ir_axioms, an open-source Python framework that integrates retrieval axioms with common retrieval frameworks. We include reference implementations for 25 retrieval axioms, as well as components for preference aggregation, re-ranking, and evaluation. New axioms can easily be defined by implementing an abstract data type or by intuitively combining existing axioms with Python operators or regression. Integration with PyTerrier and ir_datasets makes standard retrieval models, corpora, topics, and relevance judgments—including those used at TREC—immediately accessible for axiomatic experimentation. Our experiments on the TREC Deep Learning tracks showcase some potential research questions that ir_axioms can help to address.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ir_axioms-0.2.7.tar.gz (295.7 kB view details)

Uploaded Source

Built Distribution

ir_axioms-0.2.7-py3-none-any.whl (50.2 kB view details)

Uploaded Python 3

File details

Details for the file ir_axioms-0.2.7.tar.gz.

File metadata

  • Download URL: ir_axioms-0.2.7.tar.gz
  • Upload date:
  • Size: 295.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for ir_axioms-0.2.7.tar.gz
Algorithm Hash digest
SHA256 19fb23ee4898cfd29c692624f3fbd48caa73212b4e8fbb91579fe951d7c63f67
MD5 90f70107c8f52d28bdd7b76f9bd3b82d
BLAKE2b-256 8804aebf2394db19806997c3962e41a73405703349a13cc129833dc9780c2cf1

See more details on using hashes here.

File details

Details for the file ir_axioms-0.2.7-py3-none-any.whl.

File metadata

  • Download URL: ir_axioms-0.2.7-py3-none-any.whl
  • Upload date:
  • Size: 50.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for ir_axioms-0.2.7-py3-none-any.whl
Algorithm Hash digest
SHA256 d799da720d1d685748c3702d01ad4746cb78a576fb00a8ceec3408a4f7f8eb99
MD5 9088f4365aba4cae5ed09e4f864d095d
BLAKE2b-256 697779e48b3f0c8278bfad683c6508fcc3767dd37fa7b059ec099be229c8aca0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page