Filecrawl

A simple crawler that downloads all available files from the campus management platforms Stud.IP and Moodle at the University of Trier.

Disclaimer

This version works only with the University of Trier and Stud.IP 4. Use the Stud.IP REST API where possible, as many universities forbid crawlers.

Other open-source clients support the API, e.g. studip-fuse or STUD.IP-FileSync.

Installation

Windows

  1. Download and install Python 3.4 or later.
  2. Download filecrawl and unzip it.
  3. Open cmd.
  4. cd /path/to/filecrawl/
  5. python -m pip install -r requirements.txt

MacOS and Linux

  1. Download and install Python 3.4 or later.
  2. Download filecrawl and unzip it.
  3. Open a terminal.
  4. cd /path/to/filecrawl/
  5. pip3 install -r requirements.txt
    • If you use a Linux distribution that is not supported by the keyring backends (e.g. Raspbian), you also need to run pip3 install keyrings.alt
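
Both installation paths assume Python 3.4 or later. The following is a minimal sketch for checking the interpreter before installing the requirements. It is not part of filecrawl; the function name meets_minimum and the constant MIN_VERSION are made up for illustration.

```python
import sys

# Minimum version taken from the installation steps above.
MIN_VERSION = (3, 4)


def meets_minimum(version_info=None, minimum=MIN_VERSION):
    """Return True if the (major, minor, ...) version is at least `minimum`."""
    if version_info is None:
        # Default to the interpreter that is running this script.
        version_info = sys.version_info
    # Compare only major and minor, e.g. (3, 7) >= (3, 4).
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if meets_minimum():
        print("Python version is sufficient for filecrawl.")
    else:
        print("Python {}.{} or later is required.".format(*MIN_VERSION))
```

Run it with the same interpreter you intend to use for filecrawl (python on Windows, python3 on MacOS and Linux), since several Python versions may be installed side by side.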

Usage

Windows

From the filecrawl folder, run the script with python filecrawl.py. On the first run, it guides you through the setup.

MacOS and Linux

From the filecrawl folder, run the script with python3 filecrawl.py. On the first run, it guides you through the setup.

  • On macOS, using help during the setup may produce an error stating that FIFinderSyncExtensionHost is implemented in two paths. You can ignore it. It also opens a Finder window that stays stuck for as long as the script is running.

Crontabs

To run filecrawl as a cron job, you need root privileges or must run everything with sudo. You may also have to change the path where the config is saved (filecrawl/L233, config_handling/L89 and config_handling/L100), because the root user has no /User/ folder in which to store the config.
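
One way to avoid a hard-coded home folder is to resolve the config location at runtime. The sketch below only illustrates the idea and does not reproduce filecrawl's actual code: the environment variable FILECRAWL_CONFIG_DIR, the file name filecrawl.cfg and the function config_path are all hypothetical.

```python
import os


def config_path(filename="filecrawl.cfg", env=None):
    """Return the path where the config file would be stored.

    FILECRAWL_CONFIG_DIR and the default file name are assumptions
    made for this example; filecrawl itself does not define them.
    """
    if env is None:
        env = os.environ
    # An explicit override wins, so a root cron job can point anywhere.
    base = env.get("FILECRAWL_CONFIG_DIR")
    if not base:
        # Otherwise fall back to the home directory of the current user.
        base = os.path.expanduser("~")
    return os.path.join(base, filename)


if __name__ == "__main__":
    print(config_path())
```

With an approach like this, the cron job would set the variable to a directory that the root user can read and write, and interactive runs would keep using the home directory.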

Credits & Licence

Released under the MIT Licence.
