Skip to main content

Langara Course Information Aggregator

Project description

LangaraCourseInfo

This project collects course and transfer information from Langara College, Vancouver, into an SQLite database.

Once built, the database weighs around 15 MB for all data (~250 MB with source HTML/PDFs), which should be lightweight enough for most uses.

The transfer agreement scraper currently takes an excruciating amount of time - approximately an hour - this will be improved with multithreading in the future.

Collected Data

  • Course Information: course description & other attributes
  • Course Offerings: dating from 1999 - present.
  • Transfer Information: only active transfer agreements are collected.

Table Definitions

  • TransferInformation(subject, course_code, source, destination, credit, effective_start, effective_end)

  • CourseInfo(subject, course_code, credits, title, description, lecture_hours, seminar_hours, lab_hours, AR, SC, HUM, LSC, SCI, SOC, UT)

  • Sections(year, term, seats, waitlist, subject, course_code, crn, credits, additional_fees, repeat_limit)

  • Schedules(year, term, crn, type, days,, time, start_date, end_date, room, instructor)

  • SemesterHTML(year, term, sectionHTML, catalogueHTML, attributeHTML)

  • TransferPDF(subject, pdf)

Stack

  • SQLite
  • Selenium
  • Beautifulsoup

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

LangaraCourseInfo-1.0.4.tar.gz (7.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

LangaraCourseInfo-1.0.4-py3-none-any.whl (24.3 kB view details)

Uploaded Python 3

File details

Details for the file LangaraCourseInfo-1.0.4.tar.gz.

File metadata

  • Download URL: LangaraCourseInfo-1.0.4.tar.gz
  • Upload date:
  • Size: 7.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for LangaraCourseInfo-1.0.4.tar.gz
Algorithm Hash digest
SHA256 53c980fe31ceb3d5e5cc5a779eb5a30257ca4051bcb1d76b3b9b0556ec0544d4
MD5 b4178cfa72c287e74ef71a9d4569248e
BLAKE2b-256 0f20ef365e4dec03162cb2fe2284fc96cb7c5d5f37486261aabacefc015f64fd

See more details on using hashes here.

File details

Details for the file LangaraCourseInfo-1.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for LangaraCourseInfo-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 148e18fe4bd0131ead17d9694a0494abfbed36f4a8e817aebf1c54cb540b3e97
MD5 a426011c26ea46e20708aa3d24e26c20
BLAKE2b-256 56af4581a1b7bed474f78637043958bb5dd6933d65e4c0e8b657f5a258f5f4c8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page