A Python library that enables downloading articles from DergiPark (https://dergipark.org.tr) related to a topic or a search phrase.

Project description

A Python library for searching and downloading articles from DergiPark, a website where people can browse articles using numerous filters.

  1. First, install a browser driver from the driver links listed here: https://pypi.org/project/selenium/

  2. To download articles, call the searchAndDownload method. It takes the following parameters:

    1. topic: (default None) The topic to search for. It is limited to the topics listed on DergiPark. To see these topics and their corresponding links, check the 'topicLinkDict.pickle' file. The file may be at a different path if you specified one.
    2. searchPhrase: (default None) The phrase to search for. There is no restriction on its content.
    3. maxArticle: (default None) DergiPark lists at most 9600 articles. If maxArticle is not specified, the code downloads at most 9600 articles. Set this parameter to download fewer than 9600 articles. It must not be greater than 9600.
    4. headless: (default True) Set it to False to see the browser while downloading.
    5. language: (default türkçe) The article language. Articles whose authors did not specify a language are not downloaded. Give one language at a time.
    6. path: (default None) Where to download the articles. If not specified, an articles folder is created.
    7. progress: (default True) Whether to show a progress bar while downloading. Set it to False to hide it.
    8. console_logging: (default False) Whether to print a line to the console for each downloaded article.
    9. driver_path: (default None) The path to the driver.
  3. To refresh the topic dictionary, call the refreshTopicPickle method with these parameters:

    1. dictPath: (default topicLinkDict.pickle) The pickle file to write the dictionary to.
    2. driverPath: (default None) The path to the driver.
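
The parameters in step 2 can be gathered into a single call. What follows is a minimal sketch, not the library's documented usage. The import path `from spyderDergiPark import searchAndDownload` is an assumption, and `build_download_kwargs` and `download` are hypothetical helpers introduced for illustration. Only the parameter names, their defaults, and the 9600-article ceiling are taken from the list above.

```python
MAX_LISTED_ARTICLES = 9600  # per the description: DergiPark lists at most 9600 articles


def build_download_kwargs(topic=None, searchPhrase=None, maxArticle=None,
                          headless=True, language="türkçe", path=None,
                          progress=True, console_logging=False,
                          driver_path=None):
    """Hypothetical helper: enforce the documented ceiling, return keyword arguments."""
    if maxArticle is not None and maxArticle > MAX_LISTED_ARTICLES:
        raise ValueError(f"maxArticle must not exceed {MAX_LISTED_ARTICLES}")
    return {
        "topic": topic,
        "searchPhrase": searchPhrase,
        "maxArticle": maxArticle,
        "headless": headless,
        "language": language,
        "path": path,
        "progress": progress,
        "console_logging": console_logging,
        "driver_path": driver_path,
    }


def download(**overrides):
    """Hypothetical wrapper around searchAndDownload."""
    # Assumed import path; check the installed package for the real location.
    from spyderDergiPark import searchAndDownload
    return searchAndDownload(**build_download_kwargs(**overrides))


# Build the arguments for a small search; nothing is downloaded at this point.
kwargs = build_download_kwargs(searchPhrase="machine learning", maxArticle=50)
print(kwargs["maxArticle"], kwargs["language"])
```

Under these assumptions, calling `download(searchPhrase="machine learning", maxArticle=50, driver_path=...)` with the location of your own driver would start the browser and run the search. Checking `maxArticle` before the browser launches gives an immediate error for values above 9600.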

All tests were conducted using chromedriver.
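
To see which topics are accepted by the topic parameter, the pickle file can be opened with Python's standard `pickle` module. This is a sketch under the assumption that 'topicLinkDict.pickle' holds a plain dictionary mapping topic names to links, as the description suggests. `load_topic_links` and `find_topics` are hypothetical helper names, not part of the library.

```python
import pickle
from pathlib import Path


def load_topic_links(dict_path="topicLinkDict.pickle"):
    """Load the topic dictionary (assumed to map topic names to links)."""
    # Only unpickle files you trust: pickle.load can run code embedded in the file.
    with Path(dict_path).open("rb") as handle:
        topic_links = pickle.load(handle)
    if not isinstance(topic_links, dict):
        raise TypeError(f"expected a dict, got {type(topic_links).__name__}")
    return topic_links


def find_topics(topic_links, fragment):
    """Return the topic names that contain `fragment`, ignoring case."""
    needle = fragment.casefold()
    return sorted(name for name in topic_links if needle in str(name).casefold())
```

For example, `find_topics(load_topic_links(), "bil")` would list every topic whose name contains "bil", and any of those names could then be passed as topic.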
