Retrieves, processes and downloads Bible passages from Bible Gateway
Project description
Meaningless is a Python library used to retrieve, process and download Bible passages from Bible Gateway.
Features include:
- Passage retrieval from the Bible Gateway site or from a local YAML/JSON/XML/CSV file.
- Different output formats for different purposes:
- Multi-line strings for printing Bible passages.
- Python list of strings (or in-memory data structure) for passing Bible passages to other Python logic.
- YAML/JSON/XML/CSV files for persistent storage of Bible passages or as input for other applications and scripts.
- Handling of edge case passages, such as those with tabular data and omitted passages in certain translations.
- Flags to enable particular content modifications, such as ignoring passage numbers.
- Filtering on Bible passages from a local file based on a given text input or regular expression.
Now accepting feature requests! If you want to see a certain feature included, please create an issue describing all the necessary details.
Installation
pip install meaningless
API Documentation
Documentation is generated using Sphinx.
Online
The API documentation is hosted through GitHub Pages at the following link: https://daniel-tran.github.io/meaningless/
Offline
You can view the API documentation as static HTML documents from docs\index.html
. After cloning this repo, you can load the HTML files in a web browser, which allows you to navigate to other sections.
Supported translations
English
- ASV
- AKJV
- BRG
- EHV
- ESV
- ESVUK
- GNV
- GW
- ISV
- JUB
- KJV
- KJ21
- LEB
- MEV
- NASB
- NASB1995
- NET
- NIV
- NIVUK
- NKJV
- NLT
- NLV
- NMB (New Testament only)
- NOG
- NRSV
- NRSVUE
- WEB
- YLT
Español
- RVA
Usage
Web Extractor
The Web Extractor is used to obtain passage information directly from Bible Gateway.
from meaningless import WebExtractor
if __name__ == '__main__':
bible = WebExtractor()
passage = bible.get_passage('Ecclesiastes', 1, 2)
print(passage)
Output:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
YAML Downloader
The YAML Downloader is which formats passages obtained from the Bible Gateway website (using the Web Extractor) into a YAML structure and writes it out to a file:
from meaningless import YAMLDownloader
if __name__ == '__main__':
downloader = YAMLDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
Output:
Running the above code would produce a file called Ecclesiastes.yaml
in the current working directory with the approximate contents:
Ecclesiastes:
1:
2: "² “Meaningless! Meaningless!”\n says the Teacher.\n“Utterly meaningless!\n\
\ Everything is meaningless.”"
Info:
Language: English
Translation: NIV
Timestamp: '0000-00-00T00:00:00.000000+00:00'
Meaningless: 0.0.0
YAML Extractor
The YAML Extractor uses the generated files from the YAML Downloader to find passages. This is faster than the Web Extractor, since it is not retrieving information from the Internet and is also unaffected by bandwidth limitations.
from meaningless import YAMLExtractor
if __name__ == '__main__':
bible = YAMLExtractor()
passage = bible.get_passage('Ecclesiastes', 1, 2)
print(passage)
Output:
Assuming the YAML downloader has already generated a YAML file in the current directory called Ecclesiastes.yaml
which contains the book of Ecclesiastes in YAML format:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
YAML File Interface
The YAML File Interface is a set of helper methods used to read and write YAML files. This can be useful if you need to do some customised processing on a downloaded YAML file.
from meaningless import YAMLDownloader, yaml_file_interface
if __name__ == '__main__':
downloader = YAMLDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
bible = yaml_file_interface.read('./Ecclesiastes.yaml')
bible['Info']['Customised?'] = True
yaml_file_interface.write('./Ecclesiastes.yaml', bible)
Output:
Running the above code would produce a file called Ecclesiastes.yaml
in the current working directory with the approximate contents:
Ecclesiastes:
1:
2: "² “Meaningless! Meaningless!”\n says the Teacher.\n“Utterly meaningless!\n\
\ Everything is meaningless.”"
Info:
Language: English
Translation: NIV
Timestamp: '0000-00-00T00:00:00.000000+00:00'
Meaningless: 0.0.0
Customised?: true
JSON Downloader
The JSON Downloader is effectively the same as the YAML Downloader except the resulting file is in JSON format and has a different file extension.
from meaningless import JSONDownloader
if __name__ == '__main__':
downloader = JSONDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
Output:
Running the above code would produce a file called Ecclesiastes.json
in the current working directory with the approximate contents:
{
"Ecclesiastes": {
"1": {
"2": "² “Meaningless! Meaningless!”\n says the Teacher.\n“Utterly meaningless!\n Everything is meaningless.”"
}
},
"Info": {
"Language": "English",
"Meaningless": "0.0.0",
"Timestamp": "0000-00-00T00:00:00.000000+00:00",
"Translation": "NIV"
}
}
JSON Extractor
Much like the YAML Extractor, the JSON Extractor uses the generated files from the JSON Downloader to find passages.
from meaningless import JSONExtractor
if __name__ == '__main__':
bible = JSONExtractor()
passage = bible.get_passage('Ecclesiastes', 1, 2)
print(passage)
Output:
Assuming the JSON downloader has already generated a JSON file in the current directory called Ecclesiastes.json
which contains the book of Ecclesiastes in JSON format:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
JSON File Interface
The JSON File Interface is a set of helper methods used to read and write JSON files. Similar to the YAML File Interface, it can be used to do customised processing on a JSON file or its contents.
from meaningless import JSONDownloader, json_file_interface
if __name__ == '__main__':
downloader = JSONDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
bible = json_file_interface.read('./Ecclesiastes.json')
bible['Info']['Customised?'] = True
json_file_interface.write('./Ecclesiastes.json', bible)
Output:
Running the above code would produce a file called Ecclesiastes.json
in the current working directory with the approximate contents:
{
"Ecclesiastes": {
"1": {
"2": "² “Meaningless! Meaningless!”\n says the Teacher.\n“Utterly meaningless!\n Everything is meaningless.”"
}
},
"Info": {
"Customised?": true,
"Language": "English",
"Meaningless": "0.0.0",
"Timestamp": "0000-00-00T00:00:00.000000+00:00",
"Translation": "NIV"
}
}
XML Downloader
The XML Downloader is effectively the same as the YAML Downloader except the resulting file is in a specific XML format and has a different file extension.
from meaningless import XMLDownloader
if __name__ == '__main__':
downloader = XMLDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
Output:
Running the above code would produce a file called Ecclesiastes.xml
in the current working directory with the approximate contents:
<?xml version="1.0" encoding="utf-8"?>
<root>
<info>
<language>English</language>
<translation>NIV</translation>
<timestamp>0000-00-00T00:00:00.000000+00:00</timestamp>
<meaningless>0.0.0</meaningless>
</info>
<book name="Ecclesiastes" tag="_Ecclesiastes">
<chapter number="1" tag="_1">
<passage number="2" tag="_2">² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”</passage>
</chapter>
</book>
</root>
XML Extractor
Much like the YAML Extractor, the XML Extractor uses the generated files from the XML Downloader to find passages.
from meaningless import XMLExtractor
if __name__ == '__main__':
bible = XMLExtractor()
passage = bible.get_passage('Ecclesiastes', 1, 2)
print(passage)
Output:
Assuming the XML downloader has already generated an XML file in the current directory called Ecclesiastes.xml
which contains the book of Ecclesiastes in XML format:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
XML File Interface
The XML File Interface is a set of helper methods used to read and write XML files. Unlike the other file interfaces, this is more geared towards the specific document format used by the XML Downloader and Extractor, so you may observe some strange behaviour if you try using this for general purpose XML file interactions.
from meaningless import XMLDownloader, xml_file_interface
if __name__ == '__main__':
downloader = XMLDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
bible = xml_file_interface.read('./Ecclesiastes.xml')
bible['Info']['Customised'] = True
xml_file_interface.write('./Ecclesiastes.xml', bible)
Output:
Running the above code would produce a file called Ecclesiastes.xml
in the current working directory with the approximate contents:
<?xml version="1.0" encoding="utf-8"?>
<root>
<info>
<language>English</language>
<translation>NIV</translation>
<timestamp>0000-00-00T00:00:00.000000+00:00</timestamp>
<meaningless>0.0.0</meaningless>
<customised>true</customised>
</info>
<book name="Ecclesiastes" tag="_Ecclesiastes">
<chapter number="1" tag="_1">
<passage number="2" tag="_2">² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”</passage>
</chapter>
</book>
</root>
Note that you are allowed to write badly formed XML documents using this file interface, but they will cause runtime errors in your code upon trying to read and process them.
CSV Downloader
The CSV Downloader is effectively the same as the YAML Downloader except the resulting file is in CSV format and has a different file extension.
from meaningless import CSVDownloader
if __name__ == '__main__':
downloader = CSVDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
Output:
Running the above code would produce a file called Ecclesiastes.csv
in the current working directory with the approximate contents:
Book,Chapter,Passage,Text,Language,Translation,Timestamp,Meaningless
Ecclesiastes,1,2,"² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”",English,NIV,0000-00-00T00:00:00.000000+00:00,0.0.0
CSV Extractor
Much like the YAML Extractor, the CSV Extractor uses the generated files from the CSV Downloader to find passages.
from meaningless import CSVExtractor
if __name__ == '__main__':
bible = CSVExtractor()
passage = bible.get_passage('Ecclesiastes', 1, 2)
print(passage)
Output:
Assuming the CSV downloader has already generated a CSV file in the current directory called Ecclesiastes.csv
which contains the book of Ecclesiastes in CSV format:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
CSV File Interface
The CSV File Interface is a set of helper methods used to read and write CSV files. This is geared towards the CSV document format used by the CSV Downloader and Extractor and cannot be used to add custom attributes to the output file when writing CSV data.
from meaningless import CSVDownloader, csv_file_interface
if __name__ == '__main__':
downloader = CSVDownloader()
downloader.download_passage('Ecclesiastes', 1, 2)
bible = csv_file_interface.read('./Ecclesiastes.csv')
bible['Info']['Language'] = 'English (EN)'
csv_file_interface.write('./Ecclesiastes.csv', bible)
Output:
Running the above code would produce a file called Ecclesiastes.csv
in the current working directory with the approximate contents:
Book,Chapter,Passage,Text,Language,Translation,Timestamp,Meaningless
Ecclesiastes,1,2,"² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”",English (EN),NIV,0000-00-00T00:00:00.000000+00:00,0.0.0
Text searching within files
All file-based extractors support passage filtering by search text or by regular expression.
The section below is a simple example that prints passages containing "meaningless" from Ecclesiastes 1:
from meaningless import YAMLDownloader, YAMLExtractor
if __name__ == '__main__':
downloader = YAMLDownloader()
downloader.download_chapter('Ecclesiastes', 1)
bible = YAMLExtractor()
print(bible.find_text_in_chapter('meaningless', 'Ecclesiastes', 1))
Output:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
¹⁴ I have seen all the things that are done under the sun; all of them are meaningless, a chasing after the wind.
The section below is another simple example that prints passages containing "meaningless" or "wisdom" from Ecclesiastes 1:
from meaningless import YAMLDownloader, YAMLExtractor
if __name__ == '__main__':
downloader = YAMLDownloader()
downloader.download_chapter('Ecclesiastes', 1)
bible = YAMLExtractor()
print(bible.find_text_in_chapter('meaningless|wisdom', 'Ecclesiastes', 1, is_regex=True))
Output:
² “Meaningless! Meaningless!”
says the Teacher.
“Utterly meaningless!
Everything is meaningless.”
¹³ I applied my mind to study and to explore by wisdom all that is done under the heavens. What a heavy burden God has laid on mankind!
¹⁴ I have seen all the things that are done under the sun; all of them are meaningless, a chasing after the wind.
¹⁶ I said to myself, “Look, I have increased in wisdom more than anyone who has ruled over Jerusalem before me; I have experienced much of wisdom and knowledge.”
¹⁷ Then I applied myself to the understanding of wisdom, and also of madness and folly, but I learned that this, too, is a chasing after the wind.
¹⁸ For with much wisdom comes much sorrow;
the more knowledge, the more grief.
Q&A
How to report potential bugs and other feedback?
To report bugs and other problems, create an issue in this repo that details:
- A brief description of the problem encountered
- Steps to recreate the problem (or a code sample that demonstrates the problem)
- Expected result
- The version of this library being used
If you have any questions, complaints, compliments or even ideas to improve this library, you can also leave them as a GitHub issue with the appropriate label. Or you can also send an email to dantran.au@gmail.com, although a response will likely take longer than replying to a GitHub issue.
Should I manually edit the downloaded file?
This is NOT recommended under normal circumstances, as it may cause problems with the library API when using the modified file.
If multiple translations are supported, why aren't there more unit tests for these?
The Base Extractor and Downloader all use the same overall structure to represent passage contents for all translations.
For the Web Extractor, the page structure of the Bible Gateway site is mostly the same across different translations, so as long as the translation-specific differences are handled correctly, the same set of unit tests should suffice.
Will this library ever drop support for certain translations?
Short answer: It depends.
The primary case for dropping support for a certain Bible translation is if there is an observed problem in the extracted Bible contents which is considerably difficult to address due to how the Biblical contents is structured (in terms of HTML) on the Bible Gateway site. With that in mind, reinstating support for Bible translations can be reconsidered when such problems subside.
How are omitted passages determined for each translation?
Without having to go through every single passage and check if it is omitted, a set of common omitted passages are found here:
- https://en.wikipedia.org/wiki/List_of_New_Testament_verses_not_included_in_modern_English_translations
- http://textus-receptus.com/wiki/List_of_Omitted_Bible_Verses#List_of_Bible_verses_totally_omitted
These passages are checked on the Bible Gateway site, and then added to the Base Downloader's internal list of omitted passages for the relevant translation.
If you notice any problems such as unhandled omitted passages or incorrect tagging of an omitted passage in the Base Downloader, please create an issue to report it.
Does this library provide support for the Apocrypha books?
At the moment, you can use the Web Extractor's search()
, search_multiple()
and other related functions to obtain passages from the Apocrypha books.
There is currently no official support for the Apocrypha books in the other downloaders and extractors, at least until the issues identified here become easier to work around.
Contributors
- daniel-tran (Creator & current maintainer)
To make a contribution to this library, refer to CONTRIBUTING.md
.
Disclaimer
This library is not endorsed by nor affiliated with Bible Gateway or any of its partners.
Change Log
1.1.0
- Fixed an issue where
get_passage_range
in the Web Extractor would incorrectly cap the upper limit of passages to 100 - Drop library support for Python 3.8
- Python version support now extends to the 4 earliest versions (subject to its EOL date) which are at least receiving security fixes
1.0.0
- Removed legacy_xml_file_interface module and the
use_legacy_mode
flag in the XML Downloader and Extractor - Added experimental Apocrypha Web Extractor for the NRSVUE translation
- Added translation support for: NMB
0.7.0
- Refined xml_file_interface to use a more standard XML document structure with easier integration with XSLT
- Added
use_legacy_mode
flag to XML Downloader and Extractor to continue using the original behaviour and assist with transitioning to the updated XML file interface - Added legacy_xml_file_interface module for backward compatibility with XML files using the previous (deprecated) document structure
- Added
- Added download timestamp and library version information to output files
- Drop library support for Python 3.7
0.6.1
- Fixed an issue where the JSON file interface was writing Unicode characters incorrectly to output files
0.6.0
- Added text searching functionality into Base Extractor
- Added a dedicated area for experimental functionality and helper scripts
- With enough refinement, some of these might become new features in future releases
- Added translation support for: NRSVUE
- Removed translation support for: CJB
0.5.0
- Added CSV Extractor and CSV Downloader
- Added csv_file_interface module to assist with CSV file access
- Added translation support for: NIVUK
- Drop library support for Python 3.6
0.4.0
- Documentation is now publicly available through GitHub Pages. See https://daniel-tran.github.io/meaningless/
- This was previously an undocumented change as part of the 0.3.0 release.
- Added XML Extractor and XML Downloader
- Added xml_file_interface module to assist with XML file access
- Added translation support for: RVA
0.3.0
- Added initial continuous integration workflow with GitHub Actions
- Added translation support for: BRG, CJB, EHV, ESVUK, GNV, GW, ISV, JUB, NASB1995, NOG
- Drop library support for Python 3.5
0.2.0
- Added Base Extractor, Base Downloader, JSON Extractor and JSON Downloader
- Base Extractor contains shared logic for both the YAML and JSON Extractors
- Base Downloader contains shared logic for both the YAML and JSON Downloaders
- Added json_file_interface module to assist with generic JSON file access
- Added translation support for: ASV, AKJV, KJ21, LEB, MEV, NET, NLV, YLT
0.1.0
- Initial release!
- Added Web Extractor, YAML Extractor and YAML Downloader
- Added yaml_file_interface module to assist with generic YAML file access
- Added translation support for: ESV, KJV, NASB, NIV, NKJV, NLT, NRSV, WEB
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file meaningless-1.1.0.tar.gz
.
File metadata
- Download URL: meaningless-1.1.0.tar.gz
- Upload date:
- Size: 38.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 675bcb7c68853eb6a300fa5f77eb1c4f45ef35d5dcf952988bcfa40f147ff78e |
|
MD5 | e547e0e94a5ad24c2cb6d3bd7f303a17 |
|
BLAKE2b-256 | abfb876eeb84f25118c42e2bd47ac1f9d93792c9d5b7d8a05ff19cedf3ec1cf4 |
File details
Details for the file meaningless-1.1.0-py3-none-any.whl
.
File metadata
- Download URL: meaningless-1.1.0-py3-none-any.whl
- Upload date:
- Size: 42.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 02c6f6350a4bfebaa839986e7dd9c25c1ee431487c9b16c01a747ddb04304508 |
|
MD5 | 1234280d9ab259158938eb7748a7119c |
|
BLAKE2b-256 | cbe2c8b1ba91a6adabd0c13dc434bf716ee622534500562ae54fb73cccf8de1d |