Scrape norwegian Bibles on Youversion website and generate a Logos compatible format
Project description
I'm trying to make the code:
-
Scrape the Bible text off bible.com for any defined Bible translation. The URL for the Bible text is: https://www.bible.com/no/bible/"+TRANSLATION_NUMBER+"/"+BOOK+"."+CHAPTER+".nb" The translation number defines which translation is used. E.g. the Norwegian 1988 translation is version number 102. The Bible book is a three-letter abbreviation for the book. The abbreviations used on the site are defined in the array "booklist" (not to be confused with bbooklist, which are the abbreviations used on Logos)
-
Separate out just the text - remove all HTML tags, and add Logos program tags. The information on Logos formatting is found on https://wiki.logos.com/Personal_Books What is needed here is basically the "field on/off:bible" and Bible verse link. For instance, in John 1:1, you might get:
{{field-off:bible}} KAPITTEL 1 [[@BibleNO2011:jn 1:1]] 1 {{field-on:bible}}I begynnelsen var Ordet, og Ordet var hos Gud, og Ordet var Gud. {{field-off:bible}}
For the Bible verse link in Logos, the syntax is: [[@Bible "TRANSLATION" : "BOOK" "CHAPTER":"VERSE"]] The Bible book abbreviations in Logos are different than the ones used on bible.com, and are found in the list "bbooklist".
I'm attaching my awesome code.
Thanks - this will make studying the Bible much easier both for me, and for many other pastors and Bible students in Norway.
Versions specific problems
102
Stars, stars everywhere 1 chr 29 : 22, 1 chr 22 : 9, 1 chr 21 : 1
2216
Not split on the digits, be a bit smarter, all the numbers in Ezekiel disappear https://www.bible.com/bible/2216/EZK.29.BGO
29
The quotes are in a different container div.class=qt e.g. https://www.bible.com/bible/29/ACT.8.N11BM John 8 The passage about the adulterous woman, basically fix the numbering https://www.bible.com/bible/29/JHN.8.N11BM
115
Numbering issues https://www.bible.com/bible/115/REV.21.ELB verse 18, quotation issues 1 peter 2 : 6 https://www.bible.com/bible/115/1PE.2.ELB
30
Numbering issues, John 8, Micha 1 : 2+ quotations...
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file bibelen-0.1.4.tar.gz
.
File metadata
- Download URL: bibelen-0.1.4.tar.gz
- Upload date:
- Size: 17.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.13 CPython/3.10.7 Linux/5.19.7-200.fc36.x86_64
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3b064d06de4a59707a370a8ef45b888e9dc552ae8beccb2fde2c43db80df7b2 |
|
MD5 | eb063662f9bf790b9cde5b72c8e475ff |
|
BLAKE2b-256 | 34275676dba4e9da5557e898576059dfe8c9b2c7b1a546c43051e7317e997aaf |
File details
Details for the file bibelen-0.1.4-py3-none-any.whl
.
File metadata
- Download URL: bibelen-0.1.4-py3-none-any.whl
- Upload date:
- Size: 17.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.13 CPython/3.10.7 Linux/5.19.7-200.fc36.x86_64
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e0baa04ed222b268ae83ff2b982480e60fc40eb59a6f2eda22497364bf100472 |
|
MD5 | 8c33596073559f2717e1ba111f41220c |
|
BLAKE2b-256 | 0371722de773f6376364ca498cdc9e12485241a978c313cfb6f2f2cab0229333 |