A Python package for working with biblical references in ordinary human text.

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Religion
- Text Processing :: Indexing

Project description

RefSpy Logo

Version: 0.11.4 Beta Status: BETA

Refspy is a Python package for working with biblical references in human text.

README

eukras/refspy on Github | Report Issue | Contact Author
refspy on PyPI → pip install refspy.
refspy on ReadTheDocs → See docs/ dir.
CHANGES.md | TODO.md

Online Demonstration

Bible Rocket is an online demonstration of most of the features of the refspy package. You can use to make indexes from any text.

It will also graph the distribution of references found.

Multi-language Demonstration

The script demo.py will generate file like demo/en_US.html for all languages, which are used to make the following screenshots.

`en_US` (@eukras)	`fr_FR` (@a2ohm)

Features

Find biblical references matching human writing styles (spaces, commas, prefixes, abbreviations)
Match international and European syntaxes (Matt 5:3,7-9 vs Matt 5,3.7-9).
Use contextual book references to interpret partial references (e.g. "in Romans see 5:4").
Sequentially replace matched references.
Compile indexes of all matched references, or all verse ranges covered; list the most referenced chapters (hotspots) in a list of references.
Format references as names, abbreviated names, and URL parameters.
Generate HTML links of any required format.
Construct and manipulate verses, ranges, and references.
Compare and sort verses, ranges, and references.
Collate references by library and book for iteration.
Test if references or ranges contain, overlap, or adjoin one another; merge overlapping references and join adjacent ones.
Store verses as UNSIGNED INT(12) for database indexing.
Generally follow established conventions, so SBL style in English.

Not implemented:

Old style references: Rev. 2 and 3, Matt. 28. 20; Acts 1. 8, 2 Cor. 9. 1-12 (1930s). Matt. xxv. 46. Dan. xii. 2. 2 Thes. i. 7-10. Ephes. iv. 18 (1790s).

HOWTO

The Reference Manager

Initialising refspy with corpus and language names will return a reference manager. This provides a single convenient interface for the whole library. By default, refspy provides a Protestant canon in English.

from refspy import refspy
__ = refspy()

Or, to create specific canons:

from refspy.languages.english import ENGLISH
from refspy.languages.french import FRENCH
from refspy.libraries.en_US import DC, DC_ORTHODOX, NT, OT
from refspy.manager import Manager

# Protestant
__ = refspy('protestant', 'en_US')
__ = Manager(libraries=[OT, NT], language=ENGLISH)

# Catholic
__ = refspy('catholic', 'fr_FR')
__ = Manager(libraries=[OT, DC, NT], language=FRENCH)

# Orthodox
__ = refspy('orthodox', 'en_US')
__ = Manager(libraries=[OT, DC, DC_ORTHODOX, NT], language=ENGLISH)

Additionally, a third argument can specify international or European syntax for references, that is, whether to write Matt 5:3,7-9 or Matt 5,3.7-9. If not argument is given, the default syntax for the specified language will be used.

The file refspy/setup.py shows valid names for libraries and languages. There's only English initially. The en_US libraries conform to the SBL Style Guide for book names and abbreviations. Other libraries can be defined and added locally following the structure in refspy.libraries.en_US. If they follow established academic usage where possible, please contribute them to the project.

Creating references

Shortcut functions can create simple references using any book name, abbreviation, or alias in the libraries list. Firstly, we can create references from strings:

ref = __.r('Rom 2:6,9,1,2')

We can construct references more programmatically with __.bcv():

assert __.name(__.bcv('Rom')) == 'Romans'
assert __.name(__.bcv('Rom', 2)) == 'Romans 2'
assert __.name(__.bcv('Rom', 2, 2)) == 'Romans 2:2'
assert __.name(__.bcv('Rom', 2, 2, 3)) == 'Romans 2:2-3'

Or __.bcr() to specify book, chapter and verse ranges:

assert __.name(__.bcr('Rom', 2, [(2, 3), 7])) == "Romans 2:2-3,7"

Options

Both refspy() and refspy.matcher.Matcher() take the following optional argument, which modify this usage. To obtain more fine-grained control of aliasing, make your own copy of refspy.libraries.en_US and use it with Manager(libraries=[MY_LIB]).

`include_two_letter_aliases=True` (default: `True`)

Allow two-letter book aliases to be matched: e.g. Ge for Genesis or 1 Ti for First Timothy. Note short names given as abbrevs (like Ps) will not be filtered like aliases are. Aliases (e.g. Is, Am) that are also common words will be only be matched as part of references (So Am 3:1 but not Am); the list of ambiguous_aliases for a language is supplied in the language file, e.g. refspy/languages/english.py.

from refspy import refspy
__ = refspy(include_two_letter_aliases=True)
match, ref = __.first_reference('2Ti 1')
assert ref == __.bcv('2 Tim', 1)

Formatting references

ref = __.r('Rom 2:3-4,7')

assert __.name(ref) == 'Romans 2:3–4, 7'
assert __.book(ref) == 'Romans'
assert __.numbers(ref) == '2:3–4, 7'
assert __.abbrev_name(ref) == 'Rom 2:3–4, 7'
assert __.abbrev_book(ref) == 'Rom'
assert __.abbrev_numbers(ref) == '2:3–4, 7'

Some utility functions can be used to turn these into usable URL parameters if needed:

from refspy.utils import url_param, url_escape

ref = __.r('2 Cor 3:4-5')
assert url_param(__.abbrev_name(ref)) == '2+cor+3.4-5'
assert url_escape(__.abbrev_name(ref)) == '2%20Cor%203%3A4-5'

In general, though, templating is a better way to make links.

Templating references

bible_gateway = (
    '<a href="https://www.biblegateway.com/passage/'
  + '?search={LINK}&version=NRSVA">'
  + '{NAME}'
  + '</a>'
)
ref = __.r('2 Cor 3:4-5')
link = __.template(reference, bible_gateway)

assert link.find('2%20Cor%203%3A4-5') > 0
assert link.find('2 Corinthians 3:4–5') > 0

The full list of template fields is:

Field	Output
`{LINK}`	`1%20Cor%202%3A3-4`
`{NAME}`	`1 Corinthians 2:3–4`
`{BOOK}`	`1 Corinthians`
`{NUMBERS}`	`2:3–4`
`{ASCII_NUMBERS}`	`2:3-4`
`{ABBREV_NAME}`	`1 Cor 2:3–4`
`{ABBREV_BOOK}`	`1 Cor`
`{ABBREV_NUMBERS}`	`2:3–4`
`{ESC_NAME}`	`1%20Corinthians%202%3A3-4`
`{ESC_BOOK}`	`1%20Corinthians`
`{ESC_NUMBERS}`	`2%3A3-4`
`{ESC_ABBREV_NAME}`	`1%20Cor%202%3A3-4`
`{ESC_ABBREV_BOOK}`	`1%20Cor`
`{ESC_ABBREV_NUMBERS}`	`2%3A3-4`
`{PARAM_NAME}`	`1+cor+2.3-4`
`{PARAM_BOOK}`	`1+cor`
`{PARAM_NUMBERS}`	`2.3-4`

The {LINK} field is like {ESC_ABBREV_NAME}, but will use English-style verse formatting with any language, which suits linking to sites like Bible Gateway.

Templates can be passed as optional arguments to other rendering functions, say to generate links within indexes.

__.make_index(references, template=bible_gateway)

Comparing references

A reference can be a set of any valid verses and verse ranges spread across multiple books or even libraries.

rom_2 = __.r('Rom 2')
rom_4 = __.r('Rom 4')
rom_4a = __.bc('rom', 4)

assert rom_2 < rom_4
assert not rom_2 >= rom_4
assert rom_4 == rom_4a

Because references can be compared using the < operator, they can also be sorted without any special functions, and used in min() and max().

assert __.sort_references([rom_4, rom_2]) == [rom_2, rom_4]
assert sorted([rom_4, rom_2]) == [rom_2, rom_4]  # <-- Same
assert min([rom_4, rom_2]) == rom_2

Contains, Overlaps, Adjoins

We will commonly want to know if one reference contains(), or overlaps() another. The adjoins() function works out adjacency for chapters and verses, but note it is limited by not knowing the lengths of chapters.

gen1 = __.r('Gen 1')
gen2 = __.r('Gen 2')
gen1_22_23 = __.r('Gen 1:22-23')
gen1_24_28 = __.r('Gen 1:24-28')

assert gen1.contains(gen1_22_23)
assert gen1_22_23.overlaps(gen1)
assert gen1_22_23.adjoins(gen1_24_28)
assert gen1.adjoins(gen2)
assert not gen1.overlaps(gen2)

Sort, Merge, and Combine

References can be simplified by merging overlapping ranges and combining those that are adjacent.

assert __.merge_references([gen1_22_23, gen1]) == gen1
assert __.combine_references([gen1, gen2]) == __.r('Gen 1-2')
assert __.combine_references([gen1_22_23, gen1_24_28]) == __.r('Gen 1:22-28')

Under the hood, these methods just join the range lists together and merge or combine them into a new reference. (Note the * operator to unpack lists into arguments for reference().)

from refspy.range import merge_ranges, combine_ranges

assert reference(*merge_ranges(ranges)) == __.merge_references([reference(*ranges)])
assert reference(*combine_ranges(ranges)) == __.combine_references([reference(*ranges)])

Manipulating references

Among other transformations, references can be turned into their (first) book objects or references to just their books or chapters.

ref1 = __.ref('Rom 2:3-4,7')

assert __.get_book(ref1).chapters == 16
assert __.name(ref1.book_reference()) == 'Romans'
assert __.name(ref1.chapter_reference()) == 'Romans 2'

Navigating references

assert __.next_chapter(rom_2) == rom_3
assert __.prev_chapter(rom_2) == rom_1
assert __.prev_chapter(rom_1) == acts_28
assert __.prev_chapter(matt_1) is None

To create chapter references:

from refspy.libraries.en_US import NT

nt_chapter_refs_ = [
  __.bcv(book.name, ch)
  for ch in range(1, book.chapters)
  for book in NT.books
]

Matching references in text

To find references in text and print HTML links for them:

url = 'https://www.biblegateway.com/passage/?search=%s&version=NRSVA"

text = "Rom 1; 1 Cor 8:3,4; Rev 22:3-4"

strs, refs = __.find_references(text)
for match_str, ref in zip(strs, refs):
   print(f"{match_str} -> {url % __.param(ref)}")

Replacing references in text

To produce the demo image above, we use the sequential_replace function from refspy/utils:

from refspy.utils import sequential_replace

matches = __.find_references(text, include_books=True)
strs, tags = [], []
for match_str, ref in matches:
    strs.append(match_str)
    if ref.is_book():
        tags.append(f'<span class="yellow">{match_str}</span>')
    else:
        tags.append(
            f'<span class="green">{match_str}</span><sup>{__.abbrev(ref)}</sup>'
        )
html = sequential_replace(text, strs, tags)}

Collating and Indexing

To produce the index for the demo image above:

matches = __.find_references(text)

index = []
for library, book_collation in __.collate(
    sorted([ref for _, ref in matches])
):
    for book, reference_list in book_collation:
        new_reference = __.merge(reference_list)
        index.append(__.abbrev(new_reference))

html_list = "; ".join(index)

Internals

github.com/AlDanial/cloc v 2.04  T=0.04 s (1285.4 files/s, 171816.9 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Python                          44            933           1122           4396
Markdown                         6            177              0            519
TOML                             1              5              0             35
Text                             3              1              0             30
-------------------------------------------------------------------------------
SUM:                            54           1116           1122           4980
-------------------------------------------------------------------------------

Glossary

Book, Format, Index, Language, Library, Number, Range, Reference, and Verse are Pydantic types that will raise ValueErrors if initialised with bad data, say if a verse has Numbers outside the range 0..999, or if a Range has a start verse that is greater than its end verse. These appear in the models and types folders.

Book. A book has id, name, abbrev, aliases, and chapters. No verse counts.
Format. The Format objects define what properties and characters to use when formatting references for various purposes.
Index. An integer which results from expanding a verse by powers of 1000; verse(1, 7, 16, 1) becomes the integer 1007016001. Provided for database indexing if required.
Language. A language has verse_markers (e.g. v. and .vv.), ambiguous_aliases (e.g. Is and Am, which are also words), and number prefixes (e.g. Second and II for 2).
Library. A library has id, name, abbrev, and a list of Books. See e.g. libraries/en_US.py. Library IDs are spaced out in a roughly historical order: OT is 200, NT is 400.
Number. An integer 1..999. We assume verses/chapters/books/libraries are limited to this size. This may need modifying to accommodate, say, zero verses in the Septuagint.
Range. A pair of (start, end) verses; 1 Cor 16:1-2 becomes range(verse(400, 7, 16, 1), verse(400, 7, 16, 2)).
Reference. A list of ranges; 1 Cor 16:1-2,6 becomes reference([range(verse(400, 7, 16, 1), verse(400, 7, 16, 2)), range(verse(400, 7, 16, 6), verse(400, 7, 16, 6))]). They do not automatically sort or simplify the ranges.
Syntax. The correct characters for matching and formatting references; international or European.
Verse. A quadruple of (library, book, chapter, verse) numbers; 1 Cor 16:1 becomes verse(400, 7, 16, 1)

Data Structures

Libraries and Books

Libraries and books are Pydantic BaseModels, which apply validation checks to the data whenever created or modified. These are defined by in locale files, such as refspy/languages/en_US.py:

OT = Library(
    id=200,
    name="Old Testament",
    abbrev="OT",
    books=[
        Book(
            id=1,
            name="Genesis",
            abbrev="Gen",
            aliases=['Ge'],
            chapters=50,
        ),
      ...
  ]
)

Any major class has its own file, so Library is defined in refspy/library.py and so on.

Books and libraries have names, abbrevs, and aliases. If URL params are needed, they are generated from these. So, the name '1 Corinthians' has the abbrev '1 Cor', and generates the param '1+cor'. The languages/english.py file says that numeric prefixes must also match I and First (etc). The params are lowercase with no spaces.

Verses

A verse contains library, book, chapter, and verse numbers. The library, book, chapter, and verse numbers are all in 1-3 digits, in the range 1-999. It is assumed that there will not be 1000 or more verse in a chapter, chapters in a book, books in a library, or total libraries.

Refspy does not know or care how many verses there are in a chapter. It is expected that this will be determined from a database of texts by any client application, especially since not all verse numbers actually exist in texts (e.g. due to copying errors in the Vulgate at the time verse numbers were assigned). However, knowing the number of chapters per book allows the previous and next chapter to be determined, say, for navigating a library.

Verse(library=1, book=2, chapter=3, verse=4)
verse(1, 2, 3, 4)
verse(1, 2, 3, 1004)  # <-- ValueError

Index Numbers

Verses convert to an index value, verse(1, 2, 3, 4).index() == 1002003004 (UNSIGNED INT(12)), which allows efficient indexing in databases:

sql_clause = " OR ".join([
    f"({column_name} BETWEEN {range.start.index()} AND {range.end.index()})"
    for range in reference.ranges
])

Verses are read back with the class method refspy.verse.Verse.from_index().

Ranges

A range contains start and end verses.

A whole chapter is referenced as range(verse(1, 1, 1, 1), verse(1, 1, 1, 999)).

A whole book is referenced as range(verse(1, 1, 1, 1), verse(1, 1, 999, 999)).

So, a chapter or book contains every range and verse within that chapter or book, however long the chapter or book are.

Verses and ranges convert to tuples e.g. ((1, 2, 3, 4), (1, 2, 3, 5)) which can be sorted and compared.

Ranges can be tested for containment, overlap, or adjacency. Note that this does not take account of which verse numbers actually exist in any given text.

# Make ranges...
gen1 = range(verse(1, 1, 1, 1), verse(1, 1, 1, 999))
gen1_22_23 = range(verse(1, 1, 1, 22), verse(1, 1, 1, 23))
gen1_24_28 = range(verse(1, 1, 1, 24), verse(1, 1, 1, 28))
gen1 = range(verse(1, 1, 1, 1), verse(1, 1, 1, 999))
gen = range(verse(1, 1, 1, 1), verse(1, 1, 999, 999))
exod = range(verse(1, 2, 1, 1), verse(1, 2, 999, 999))

assert gen.is_book()
assert gen1.is_chapter()

assert gen1.contains(gen1_22_23)
assert gen1_22_23.overlaps(gen1)
assert gen1_22_23.adjoins(gen1_24_28)
assert gen1.adjoins(gen2)
assert not gen1.overlaps(gen2)
assert gen.adjoins(exod)
assert not gen.overlaps(exod)

Comparison operators can also be used, as well as sorting:

assert gen1 < gen2
assert not gen1 == gen2

gen_1_and_2 = range(verse(1, 1, 1, 1), verse(1, 1, 2, 999))

assert gen1 + gen2 == gen_1_and_2
assert sorted([gen2, gen1]) == [gen1, gen2]
assert min([gen2, gen1]) == gen1

References

References are lists of verse ranges. These are entirely numerical objects.

References, ranges, and verses have shorter constructor functions for programming convenience. Note reference() does not require list brackets.

Range(start=verse_1, end=verse_2)
range(verse_1, verse_2)

Reference(ranges=[range_1, range_2])
reference(range_1, range_2)

ref_1 = reference(
  range(verse(1, 1, 1, 1), verse(1, 1, 1, 3))
)

The reference module contains standalone functions for numeric reference construction that parallel the reference manager's __.bcv() method.

book = __.books[NT.id, 1]  # <-- Matthew is NT book ID 1
assert book_reference(NT.id, 1) == __.bcv(book.id)
assert chapter_reference(NT.id, 1, 2) == __.bcv(book.id, 2)
assert verse_reference(NT.id, 1, 2, 3) == __.bcv(book.id, 2, 3)
assert verse_reference(NT.id, 1, 2, 3, 4) == __.bcv(book.id, 2, 3, 4)

The same comparison operations that work on ranges also work on references. So references can be sorted(), min(), or max(). This becomes less intuitive the more complex their list of ranges becomes.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Religion
- Text Processing :: Indexing

Release history Release notifications | RSS feed

0.11.9

Apr 29, 2026

0.11.7

Apr 24, 2026

0.11.6

Apr 23, 2026

0.11.5

Mar 14, 2026

This version

0.11.4

Nov 18, 2025

0.11.3

Nov 18, 2025

0.10.4

Sep 19, 2025

0.10.3

Jun 25, 2025

0.10.2

Jun 25, 2025

0.10.1

Mar 2, 2025

0.9.18

Nov 20, 2024

0.9.17

Nov 19, 2024

0.9.16

Nov 19, 2024

0.9.15

Nov 17, 2024

0.9.14

Nov 17, 2024

0.9.13

Jul 20, 2024

0.9.12

Jul 19, 2024

0.9.11

Jul 19, 2024

0.9.10

Jul 19, 2024

0.9.9

Jul 19, 2024

0.9.8

Jul 16, 2024

0.9.7

Jul 10, 2024

0.9.6

Jul 6, 2024

0.9.5

Jul 6, 2024

0.9.4

Jun 29, 2024

0.9.3

Jun 29, 2024

0.9.2

Jun 29, 2024

0.9.1

Jun 28, 2024

0.9.0

Jun 27, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

refspy-0.11.4.tar.gz (55.5 kB view details)

Uploaded Nov 18, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

refspy-0.11.4-py3-none-any.whl (46.5 kB view details)

Uploaded Nov 18, 2025 Python 3

File details

Details for the file refspy-0.11.4.tar.gz.

File metadata

Download URL: refspy-0.11.4.tar.gz
Upload date: Nov 18, 2025
Size: 55.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for refspy-0.11.4.tar.gz
Algorithm	Hash digest
SHA256	`7aa0494a7b54b1daa6249eae8d1bd08fd5795f80752a11370d5b6bf8d1c91fb1`
MD5	`e211747d82920936d909bba5750d42c6`
BLAKE2b-256	`54955e849d4b1ae4550ff7ceb4444ed4fcf7b21f09fec4605e7116a0ac0d0831`

See more details on using hashes here.

File details

Details for the file refspy-0.11.4-py3-none-any.whl.

File metadata

Download URL: refspy-0.11.4-py3-none-any.whl
Upload date: Nov 18, 2025
Size: 46.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for refspy-0.11.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`248c8dedabe8cb1bad3a8412b9ec4e261660af90932d55549e15072426ece973`
MD5	`b8a157a33e2a5c701264d4c5e2c6f811`
BLAKE2b-256	`60481fc33532715f8c37e90094ce90230e2d381152c3dd6dd7b53d84bbb281b2`

See more details on using hashes here.

refspy 0.11.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

README

Online Demonstration

Multi-language Demonstration

Features

HOWTO

The Reference Manager

Creating references

Options

include_two_letter_aliases=True (default: True)

Formatting references

Templating references

Comparing references

Contains, Overlaps, Adjoins

Sort, Merge, and Combine

Manipulating references

Navigating references

Matching references in text

Replacing references in text

Collating and Indexing

Internals

Glossary

Data Structures

Libraries and Books

Verses

Index Numbers

Ranges

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`include_two_letter_aliases=True` (default: `True`)