A coding challenge webscraper for leetcode, and other websites!

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

leetscraper · · ·

leetscraper is a coding challenge webscraper for leetcode, and other websites!
It was created as a way to gather coding problems to solve without having to sign up to a website or submit code to a problem checker.

Usage

Install package and dependencies

pip install leetscraper tqdm urllib3 beautifulsoup4 selenium webdriver-manager

Examples

Import the module and Instantiate the class. The class has some kwargs options to control the behaviour of the scraper. However, all the default values will start to scrape all problems from leetcode.com to the cwd.

The most basic usage looks like this:

from leetscraper import Leetscraper

if __name__ == "__main__":
    Leetscraper()

The avaliable kwargs to control the behaviour of the scraper are:

"""
website_name: "leetcode.com", "projecteuler.net", "codechef.com" ("leetcode.com" is set if ignored)
scraped_path: "path/to/save/scraped_problems" (Current working directory is set if ignored)
scrape_limit: Integer of how many problems to scrape at a time (-1 is set if ignored, which is no limit)
auto_scrape: "True", "False" (True is set if ignored)
"""

Example of how to automatically scrape the first 50 problems from projecteuler.net to a directory called SOLVE-ME:

from leetscraper import Leetscraper

if __name__ == "__main__":
    Leetscraper(website_name="projecteuler.net", scraped_path="~/SOLVE-ME", scrape_limit=50)

Example of how to scrape all problems from all supported websites:

from leetscraper import Leetscraper

if __name__ == "__main__":
    websites = ["leetcode.com", "projecteuler.net", "codechef.com"]

    for site in websites:
        Leetscraper(website_name=site)

You can pass through different arguments for different websites to control exactly how the scraper behaves. You can also disable scraping problems at time of instantiation by using the kwarg auto_scrape=False. This allows you to call the class functions in different order, or one at a time. This will change how the scraper works, as its designed to look in a directory for already scraped problems to avoid duplicates. I would encourage you to look at the function docstrings if you wish to use this scraper outside of its intended automated use.

Contributing

If you would like to contribute, adding support for a new coding challenge website, or fixing current bugs is always appreciated! I would encourage you to see CONTRIBUTING.md for further details. If you would like to report bugs or suggest websites to support, please add a card to Issues. Thank you to all contributors to this project!

Code of Conduct

Contributing to this project means you are willing to follow the same conduct that others are held to! Please see Code of Conduct for further details.

Licence

This project uses the GPL-2.0 License, As generally speaking, I want you to be able to do whatever you want with this project, But still have the ability to add your changes to this codebase should you make improvements or extend support. For further details on what this licence allows, please see LICENSE.md

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.4.3

Dec 1, 2022

2.4.2

Nov 1, 2022

2.4.1

Oct 1, 2022

2.4.0

Sep 1, 2022

2.3.0

Jul 1, 2022

2.2.1

Jun 1, 2022

2.2.0

May 22, 2022

2.1.2

May 17, 2022

2.1.1

May 17, 2022

2.1.0

May 16, 2022

2.0.2

May 6, 2022

2.0.1

May 6, 2022

2.0.0

May 6, 2022

1.5.0

Apr 29, 2022

1.4.2

Mar 6, 2022

1.4.1

Feb 20, 2022

1.4.0

Feb 14, 2022

1.3.0

Feb 13, 2022

1.2.0

Feb 11, 2022

1.1.2

Feb 9, 2022

This version

1.1.1

Feb 7, 2022

1.1.0

Feb 6, 2022

1.0.3

Jan 30, 2022

1.0.2

Jan 30, 2022

1.0.1

Jan 30, 2022

1.0.0

Jan 30, 2022

0.0.0

Jan 30, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

leetscraper-1.1.1.tar.gz (14.0 kB view hashes)

Uploaded Feb 7, 2022 Source

Built Distribution

leetscraper-1.1.1-py3-none-any.whl (13.4 kB view hashes)

Uploaded Feb 7, 2022 Python 3

Hashes for leetscraper-1.1.1.tar.gz

Hashes for leetscraper-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`0db391d0bf262f32318c3b2598f19c0eb492b5363b2ec0a36e88ab4d9c415d73`
MD5	`75fbf3ccc390b68af912fd34d2d8e09e`
BLAKE2b-256	`56586d4eaa4a5f4edbd82b2af83b53b013806b127f390a2f36a615db8960da5a`

Hashes for leetscraper-1.1.1-py3-none-any.whl

Hashes for leetscraper-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f3625b1154e793ddbef438a357fe1e3afe33a3447371eb4ead39a10b1486fc5e`
MD5	`fb005c2b163aeeee7ec784f3ae8f1a19`
BLAKE2b-256	`d12d9327b44dae0cbe158a84918b96d351a2a4dd3e77db9f5fa4887571e76e89`