Skip to main content

hu_wiwi_grades is a Python library for searching, viewing and scraping published students grading of the Faculty of Economics and Business Administration of the Humboldt University of Berlin

Project description

Table of Content

hu_wiwi_grades

hu_wiwi_grades is a Python library for searching, viewing and scraping published students' grading of the Faculty of Economics and Business Administration of the Humboldt University of Berlin.

Please note: The functionality maybe interrupted in case any changes in the publication occur or in case the website is not available.

Background

This library was primarily created for testing/training purposes, such as extracting information from PDF files, writing and publishing of code. It nevertheless aims to offer a use-case. Current and historical grading information may be of interest for (prospective) students, examiners or potentially even employers.

Installation

Available as soon as the package has been added to The Python Package Index (PyPI) (not supported yet!):

Use the package manager pip to install hu_wiwi_grades

pip install hu_wiwi_grades

or through the author's Github repository

pip install git+https://github.com/NDelventhal/hu_wiwi_grades

Requirements

The following libraries are required:

  • tabula
  • pandas
  • numpy
  • requests
  • beautifulsoup4

These libraries can be installed via the package manager pip.

pip install tabula numpy pandas requests bs4

Usage

import hu_wiwi_grades as hu

hu.list_sources() 
# scrapes URL sources that list grading overviews and returns a dictionary containing the semesters as keys and the URLs as values. 

df = hu.scrape_overview(exam = "Economics") 
# Scrapes the latest grading overview and returns the overview or a subset based on the entered exam specification.
# In this example solely "Finance" exams are returned. The exam arguments defaults to "" (no filtering).  

df = hu.scrape_all_overviews(exam = "Valuation") 
# Same as above, but instead of solely the latest overview all historical overviews are pulled. Typically, a few semesters are available.

df = hu.get_grading(exam="", only_current_semester = True) 
# Scrapes the grades from the URLs listed in the overview pages of either only the latest semester (only_current_semester = True) or all (only_current_semester = False). 
# An exam filter may be specified as in the examples above or not.
# Returns a dataframe listing the number of participants, the examiner and all grades as variables. 

df2 = hu.prepare_for_analysis(df) 
# Prepares the dataframe output of get_grading() for further analysis, such as visualisations, descriptive statistics or regression analysis.

Further usuage examples are listed here

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hu_wiwi_grades-0.1.0.tar.gz (5.7 kB view details)

Uploaded Source

File details

Details for the file hu_wiwi_grades-0.1.0.tar.gz.

File metadata

  • Download URL: hu_wiwi_grades-0.1.0.tar.gz
  • Upload date:
  • Size: 5.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.8.5

File hashes

Hashes for hu_wiwi_grades-0.1.0.tar.gz
Algorithm Hash digest
SHA256 af97a8780a59c4b37d09445c9ae8226830a6cb70b83e707ddff7cab818da10df
MD5 26ee6b89aa05c88b1de6e1d1892cfd81
BLAKE2b-256 b4eeba098a94afee98d358576e66e46b4a4e1a7743b877ee2e386908e5d8ebd1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page