Skip to main content

Langara Course Information Aggregator

Project description

LangaraCourseInfo

This project collects course and transfer information from Langara College, Vancouver, into an SQLite database.

Once built, the database weighs around 15 MB for all data (~250 MB with source HTML/PDFs), which should be lightweight enough for most uses.

The transfer agreement scraper currently takes an excruciating amount of time - approximately an hour - this will be improved with multithreading in the future.

Collected Data

  • Course Information: course description & other attributes
  • Course Offerings: dating from 1999 - present.
  • Transfer Information: only active transfer agreements are collected.

Table Definitions

  • TransferInformation(subject, course_code, source, destination, credit, effective_start, effective_end)

  • CourseInfo(subject, course_code, credits, title, description, lecture_hours, seminar_hours, lab_hours, AR, SC, HUM, LSC, SCI, SOC, UT)

  • Sections(year, term, seats, waitlist, subject, course_code, crn, credits, additional_fees, repeat_limit)

  • Schedules(year, term, crn, type, days,, time, start_date, end_date, room, instructor)

  • SemesterHTML(year, term, sectionHTML, catalogueHTML, attributeHTML)

  • TransferPDF(subject, pdf)

Stack

  • SQLite
  • Selenium
  • Beautifulsoup

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

LangaraCourseInfo-1.0.3.tar.gz (7.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

LangaraCourseInfo-1.0.3-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file LangaraCourseInfo-1.0.3.tar.gz.

File metadata

  • Download URL: LangaraCourseInfo-1.0.3.tar.gz
  • Upload date:
  • Size: 7.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for LangaraCourseInfo-1.0.3.tar.gz
Algorithm Hash digest
SHA256 c4382df38fe97dd1651dd76edc26768b00e0c45a98d1f2a4fc5877c4cbf0c09b
MD5 323c43013a5525d317b0218f08740c28
BLAKE2b-256 f0a8c60e151152981956ce57f7dfa3eca5350521298eb7b1df7fc5d4e0829873

See more details on using hashes here.

File details

Details for the file LangaraCourseInfo-1.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for LangaraCourseInfo-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 0fab361729b2e8937ed06481a03dda7ddf82b9708c14ea76099a9c5ed25fab30
MD5 f81bc6e22299ec799270a942ae037341
BLAKE2b-256 4b0c931cff9046624aeb68bda4b6c028350061f9bd1d0953649b4f02f94f53e5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page