Simple scraper for kleinanzeigen.de searches with notifications for new ads.
Project description
ek-scraper
Simple scraper for kleinanzeigen.de searches with notifications for new ads.
Installation
Install this package from PyPi in a separate virtual environment using pipx
.
pipx install ek-scraper
Usage
For the full usage check the
ek-scraper --help
command
Create a configuration file using
ek-scraper create-config <path/to/config.json>
The example configuration file will look like this:
{
"filter": {
"exclude_topads": true,
"exclude_patterns": []
},
"notifications": {
"pushover": {
"token": "<your-app-api-token>",
"user": "<your-user-api-token>",
"device": []
},
"ntfy.sh": {
"topic": "<your-private-topic>",
"priority": 3
},
},
"searches": [
{
"name": "Wohnungen in Hamburg Altona",
"url": "https://www.kleinanzeigen.de/s-wohnung-mieten/altona/c203l9497",
"recursive": true
}
]
}
See Configuration for details on all configuration options.
- Configure one or more searches in the
searches
section of the configuration, see Searches for more details - Configure notifications in the
notifications
section of the configuration, see Notifications for details on notification configuration - (Optional) Configure filters in the
filter
section of the configuration, see Filter for more details
Run the following command to initialize the data store without sending any notifications:
ek-scraper run --no-notifications path/to/config.json
Afterwards, run
ek-scraper run path/to/config.json
to receive notifications according to your notifications
configuration.
Development
Follow the steps below to set up a development environment for this project.
-
Clone this repository
git clone git@github.com:jonasehrlich/ek-scraper.git
-
Change directory into the repository
cd ek-scraper
-
Create a virtual environment using poetry
poetry install
-
(Optional) Install pre-commit environment
$ pre-commit [INFO] Installing environment for https://github.com/pre-commit/pre-commit-hooks. [INFO] Once installed this environment will be reused. [INFO] This may take a few minutes... [INFO] Installing environment for https://github.com/psf/black. [INFO] Once installed this environment will be reused. [INFO] This may take a few minutes... Check Yaml...........................................(no files to check)Skipped Fix End of Files.....................................(no files to check)Skipped Trim Trailing Whitespace.............................(no files to check)Skipped black................................................(no files to check)Skipped
Configuration
Searches
Searches can be configured in the searches
array of the configuration file.
Each of the searches can be configured with the following parameters.
Name | Description |
---|---|
name |
Name of the search, use a descriptive one (required) |
url |
URL of the first page of your search (required) |
recursive |
Whether to follow all pages of the search result (optional, defaults to true) |
Filter
Filters can be configured in the filter
section of the configuration file to exclude specific ads
from your scrape results on the client side. The following settings can be configured.
Name | Description |
---|---|
exclude_topads |
Whether to exclude top ads from the results (optional, defaults to true) |
exclude_patterns |
Case-insensitive regular expression patterns used to exclude ads (optional) |
Notifications
Notifications can be configured in the notifications
section of the configuration file.
Push notifications using Pushover
ek-scraper
supports push notifications to your devices using Pushover.
For further information on the service check their terms and conditions.
The implementation for Pushover notifications will send a single notification per search, if new ads are discovered.
To configure Pushover for notifications from the scraper, first register at the service and create
an application (e.g. ek-scraper
). To use the service in ek-scraper
, add the pushover
object
to the notifications
object in your configuration file and fill the API tokens. Selection of the
devices which will receive the notifications, is supported using the device
array.
Name | Description |
---|---|
token |
API token of the Pushover app (required) |
user |
API token of the Pushover user (required) |
device |
List of device names to send the notifications to (optional, defaults to all devices) |
Push notifications using ntfy.sh
ek-scraper
supports push notifications to your devices using ntfy.sh.
For further information on the service check their terms and conditions.
The implementation for ntfy.sh notifications will send a single notification per search, if new ads are discovered.
To configure ntfy.sh for notifications from the scraper, define a topic and subscribe to it in the mobile app.
Note that topic names are public, so it's wise to choose something that cannot be guessed easily. This can be done by including a UUID, e.g. by running the following command in your shell:
echo "ek-scraper-$(uuidgen)"
To use the service in ek-scraper
, add the ntfy.sh
object to the notifications
object in your
configuration file and add the topic you previously subscribed to.
Name | Description |
---|---|
topic |
Topic to publish the notifications to |
priority |
Priority to send the notifications with (optional, defaults to 3) |
Running ek-scraper
regularly
It should be avoided to run the tool too often to avoid getting your IP address blocked by kleinanzeigen.de
In order to run ek-scraper
regularly on a Unix-like system, configure it as a cronjob.
To configure a cronjob, run
crontab -e
Edit the crontab table to run the command you want to run. A handy tool to check schedule configurations for cronjobs is crontab.guru.
For more information on configuring cronjobs use your favorite search engine.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ek_scraper-0.2.3.tar.gz
.
File metadata
- Download URL: ek_scraper-0.2.3.tar.gz
- Upload date:
- Size: 14.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.10.12 Linux/6.2.0-1019-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d9687197b5b4c16d598ebadaa45ddfa4a37eb5be4786ab66da9736edfa93a612 |
|
MD5 | d57acc455f18185bbc6047d1550c0c13 |
|
BLAKE2b-256 | 4040c8ae55dbdff709fed12d4cbae673e79bf9f40d20aefb60a79534b2c622ab |
File details
Details for the file ek_scraper-0.2.3-py3-none-any.whl
.
File metadata
- Download URL: ek_scraper-0.2.3-py3-none-any.whl
- Upload date:
- Size: 16.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.10.12 Linux/6.2.0-1019-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d9c9aa27a87969e3f6cc6b83b325f0e9a7345f25f708318f955ef3bc490af8f5 |
|
MD5 | dc5f61e96518188b5c1a5c1bde7538f6 |
|
BLAKE2b-256 | eb005d1bfea6c5f040cbd9b80ae576bbb9a4b9b146a9b6c978b5250088ba0c4b |