Skip to main content

OSCAR Corpus Download Tool

Project description

OSCAR Corpus Downloader

Simple tool to download the OSCAR corpus.

1. Installation

Installation can be done using pypi:

$ pip install oscar-corpus-downloader

2. Usage

Submit an OSCAR access request following the procedure described on the project page.

Once you have received your credentials, you can use the command line interface to download an OSCAR corpus part.

$ export OSCAR_USERNAME=username
$ export OSCAR_PASSWORD=password
$ oscar download --help
Usage: oscar download [OPTIONS]

Options:
  -u, --url TEXT         OSCAR corpus url  [required]
  -o, --output-dir TEXT  Output directory  [required]
  --resume               Resume download
  --help                 Show this message and exit.

$ oscar download \
  --url https://oscar-prive.huma-num.fr/2301/fr_meta \
  -o ./oscar-fr

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oscar_corpus_downloader-0.1.0.tar.gz (5.3 kB view details)

Uploaded Source

Built Distribution

oscar_corpus_downloader-0.1.0-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file oscar_corpus_downloader-0.1.0.tar.gz.

File metadata

  • Download URL: oscar_corpus_downloader-0.1.0.tar.gz
  • Upload date:
  • Size: 5.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.0 CPython/3.11.4 Linux/6.2.0-35-generic

File hashes

Hashes for oscar_corpus_downloader-0.1.0.tar.gz
Algorithm Hash digest
SHA256 34beddcbd4b43cd52a31aab28ef02b1afea39e2c4bbd63436d15f3997851049e
MD5 591d719ba6d291800d7a967fb7ef2157
BLAKE2b-256 84843908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b

See more details on using hashes here.

File details

Details for the file oscar_corpus_downloader-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for oscar_corpus_downloader-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2e5b4169839cbd09b84cc7a57568957a338be3856714701c6ca35cd6263a8e75
MD5 0f917de2429b2d8d717d403bda67da3b
BLAKE2b-256 590166fc217f3fba3fbf343264e7fdb64bb80ec4bcd806549306a247dfca4ffa

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page