Skip to main content

This Python script utilizes Selenium to scrape data from Coursera, providing detailed information about various courses such as link, title, rating, recent views, students enrolled, time requirement, skills, learner count, difficulty level, duration, and sub-course details.

Project description

Coursera Scraper

This repository contains a Python script that utilizes Selenium to scrape data from Coursera.

Scraped Fields

The script scrapes the following fields for each course:

  1. Link: Link of the course
  2. Title: The title of the course.
  3. Institute: The institution offering the course (if available).
  4. Rating: The course rating.
  5. Recent Views: The number of recent views for the course.
  6. Students Enrolled: The number of students enrolled in the course.
  7. Time Requirement: The approximate time required to complete the course.
  8. Skills: A list of skills covered in the course.
  9. Learner Count: The total number of learners who have taken the course.
  10. Difficulty Level: The difficulty level of the course.
  11. Duration: The duration of the course.
  12. Sub-course: Additional information about the course or its sub-courses.

The scraped data is stored in a dictionary format for each course.

Prerequisites

You can install the required Python packages by running the following command:

pip install coursera-scraper

from scraper.main import scraper

for course_detail in scraper(keyword='python'):
    # do some processing on course detail

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coursera-scraper-0.1.3.tar.gz (4.2 kB view hashes)

Uploaded Source

Built Distribution

coursera_scraper-0.1.3-py3-none-any.whl (4.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page