A tool to efficiently check if a Bitcoin Address has ever been used before
Project description
used_addr_check (Python)
A tool to efficiently check if a Bitcoin Address has ever been used before
Description
Based on loyce.club's "all bitcoin addresses ever used" list, this library and CLI tool can search the list very, very fast and efficiently.
Features
- Lookup either a single address, or a long list of potential addresses.
- CLI and library options
- Generates a binary search tree index from the text file, and then uses that to search for the address.
Getting Started
This project depends on the large address list being downloaded and extracted.
# Download the zipped text file of used addresses, where each line is a used Bitcoin address.
wget http://alladdresses.loyce.club/all_Bitcoin_addresses_ever_used_sorted.txt.gz
# Extract the file
gunzip -d ./all_Bitcoin_addresses_ever_used_sorted.txt.gz --stdout | pv > all_Bitcoin_addresses_ever_used_sorted.txt
Optionally, if you intend to use the scan_file
subcommand, install ripgrep
from optimal performance:
# first, install cargo/rust
# then, run:
cargo install ripgrep
Usage - CLI
pip install used_addr_check
# download and extract the required file:
wget http://alladdresses.loyce.club/all_Bitcoin_addresses_ever_used_sorted.txt.gz
gunzip -d ./all_Bitcoin_addresses_ever_used_sorted.txt.gz --stdout | pv > addr_list.txt
# generate the index file (optional):
used_addr_check index -f ./addr_list.txt
# the index file is now at: ./addr_list.index.parquet
# search a couple of addresses:
used_addr_check search -f ./addr_list.txt -s moW9o415jNfgyuzytEMZD84Kovri5DJ64e -s mncqTEYTidNdbqGZnXTd1JFYRrruuh5StV
# search for a long list of addresses (extracted by regex):
used_addr_check scan_file -f ./addr_list.txt -n file_with_addresses_to_lookup.txt
Usage - Library
This example will generate the index file, if required.
from used_addr_check import search_multiple_in_file
needles = [
'moW9o415jNfgyuzytEMZD84Kovri5DJ64e',
'mncqTEYTidNdbqGZnXTd1JFYRrruuh5StV'
]
haystack_file_path = './addr_list.txt'
addresses_found_list: List[str] = search_multiple_in_file(
haystack_file_path=haystack_file_path,
needles=needled,
)
print(f"{addresses_found_list=}")
Performance Notes
- With the default indexing size of one index entry per 1000 addresses in the "haystack" file, the index is a 140MB Parquet file.
- On a 2023 mid-range laptop with an SSD:
- Indexing takes 4 minutes.
- Addresses can be searched at 20 query addresses ("needles") per sec.
There are certainly opportunities for further improvement, but this performance is adequate.
Contributing
Please Star this repo if it's helpful.
Please open GitHub Issues and Pull Requests with features/bugs/fixes.
Future Features
- Optionally disable
loguru
logging in subfunctions - Convert "found address" result to an iterator.
- Test cases.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file used_addr_check-0.1.6.tar.gz
.
File metadata
- Download URL: used_addr_check-0.1.6.tar.gz
- Upload date:
- Size: 15.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4b5f42c079cf355c6d5e8a5431a8399f71d1d3be9b7acef4c8bc64baf3e44b0a |
|
MD5 | d91510766e50bfeb0aa6ff5fe2a2a355 |
|
BLAKE2b-256 | 64c720361eeedcae2f55c3198d19251b0994a5e73b824ad2fed67927bd23cb23 |
File details
Details for the file used_addr_check-0.1.6-py3-none-any.whl
.
File metadata
- Download URL: used_addr_check-0.1.6-py3-none-any.whl
- Upload date:
- Size: 12.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1c1350540ada3b10fec3df59aff97493f308ac12c20b06ea671dd5bf131758d2 |
|
MD5 | 7d4f021fbf0df4f5213c3a41088e99ba |
|
BLAKE2b-256 | f2ef1a5a01520aa1c7b02d1aae9237b6fbfdcbc48d716a45bda8195374d43280 |