A package that uses natural language processing to convert PDF calendars to JSONs and to convert Google Calendar events to Excel files

Project description

This Python package is great for taking Arthur Murray Vernon’s Google Calendar events and arranging them in a calendar structure in an Excel file. That structure can then be copied and pasted into Microsoft Office Publisher to create a printable PDF calendar.

If you want to go from a printable PDF calendar to a digital one, you’re in luck! I use machine learning to parse printable PDF calendars and create JSONs from them, where each event has a title, dance_style and time (if applicable), ready to be turned into Google Calendar events.
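
To make the event shape concrete, here is a minimal sketch of what parsed events might look like and how they could be filtered. Only the three field names (title, dance_style, time) come from the description above; the sample values, the time format, the list layout, and the helper name events_for_style are assumptions for illustration, not the package's actual output.

```python
# Hypothetical sample events. Only the keys title, dance_style and time
# come from the project description; the values and layout are made up.
sample_events = [
    {"title": "Beginner Group Class", "dance_style": "Waltz", "time": "7:00 PM"},
    {"title": "Practice Party", "dance_style": "Tango", "time": "8:00 PM"},
    {"title": "Guest Coaching", "dance_style": "Waltz", "time": None},  # time is optional
]


def events_for_style(events, dance_style):
    """Return the titles of events with the given dance_style (hypothetical helper)."""
    return [event["title"] for event in events if event.get("dance_style") == dance_style]


print(events_for_style(sample_events, "Waltz"))
```

A flat list of small dictionaries like this would map naturally onto Google Calendar events, one dictionary per event.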

While this project is geared towards use at Arthur Murray Dance Studios, feel free to take a look at the source code and modify it for your own calendar’s needs.

Have fun!

Setup from source code (GitHub)

  1. Clone the repo: git clone https://github.com/vincentchov/amvernon-cal.git
  2. Install Python 3.x with pip.
  3. Install Java 8.
  4. Create and activate a virtual environment.
  5. Install the TextBlob corpora: python -m textblob.download_corpora
  6. Install dependencies: pip install -r requirements.txt.
  7. Profit!
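
If you want to sanity-check the prerequisites from the steps above before installing, a small script along these lines may help. It is only a sketch: the function name check_prerequisites is made up for this example, and it only checks that a java executable is on your PATH, not that it is Java 8.

```python
import importlib.util
import shutil
import sys


def check_prerequisites():
    """Report which prerequisites from the setup steps appear to be present."""
    return {
        # Step 2: Python 3.x with pip
        "python3": sys.version_info >= (3,),
        "pip": importlib.util.find_spec("pip") is not None,
        # Step 3: a java executable on PATH (the version is not checked)
        "java": shutil.which("java") is not None,
        # Step 4: inside a virtual environment, sys.prefix differs from sys.base_prefix
        "virtualenv": sys.prefix != getattr(sys, "base_prefix", sys.prefix),
    }


for name, present in check_prerequisites().items():
    print(f"{name}: {'ok' if present else 'missing'}")
```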

Setup from PyPI (Pip)

  1. Follow steps 2-5 from above.
  2. Install amvernoncal from PyPI: pip install amvernoncal

How to go from Google Calendar to an Excel file

  1. Activate the Google Calendar API for your account and obtain your client_secret.json file.
  2. Activate your virtual environment.
  3. Import the module that will use your client secret: from amvernoncal.gcal_to_xlsx import gcal_events_to_xlsx.
  4. Call the gcal_events_to_xlsx() function with the month and year to search and the name of the Google Calendar you’re converting from, passing both arguments as quoted strings. Example: gcal_events_to_xlsx('September 2017', 'Classes')
  5. This creates three folders: JSONs, PDFs, and Output. Your Excel file will be in the Output folder.
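
The steps above can be sketched as a short script. The import and the call to gcal_events_to_xlsx() are taken from the steps; the helpers month_label and export_month are hypothetical names for this example, and the 'Month YYYY' format is inferred from the single 'September 2017' example, so treat it as an assumption.

```python
import calendar


def month_label(year, month):
    """Build a 'Month YYYY' string such as 'September 2017' (format assumed from the example)."""
    # calendar.month_name follows the current locale; it is English by default.
    return f"{calendar.month_name[month]} {year}"


def export_month(year, month, calendar_name):
    """Export one month of a Google Calendar to Excel (hypothetical wrapper).

    Needs amvernoncal installed and a client_secret.json file available.
    """
    # Imported lazily so month_label() works without the package installed.
    from amvernoncal.gcal_to_xlsx import gcal_events_to_xlsx

    gcal_events_to_xlsx(month_label(year, month), calendar_name)


print(month_label(2017, 9))  # September 2017
```

Calling export_month(2017, 9, 'Classes') would then mirror the example in step 4.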

Alternatively, you can run gcal_events_to_xlsx() directly from the terminal with the amvernon_gcal_to_xlsx command, which comes with a help screen thanks to Docopt.

How to go from a printable PDF calendar to a JSON

  1. Follow steps 1 and from above.
  2. Import the function that will parse your calendar: from amvernoncal.pdfproc.pdf_to_json import parse_calendar
  3. Call the parse_calendar() function with the path to your calendar PDF, named after its month and year, and tell it whether to save the result to a JSON file or just return the JSON. Example: parse_calendar('september_2017.pdf', to_file=True)
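
As a rough sketch of these steps, the snippet below builds a file name in the same style as the example and passes it to parse_calendar(). The import, the function, and the to_file argument come from the steps above; the helpers calendar_pdf_name and parse_month are hypothetical, and the lowercase month_year.pdf naming pattern is inferred from the single 'september_2017.pdf' example.

```python
import calendar


def calendar_pdf_name(year, month):
    """Build a file name like 'september_2017.pdf' (pattern assumed from the example)."""
    return f"{calendar.month_name[month].lower()}_{year}.pdf"


def parse_month(year, month, to_file=True):
    """Parse the PDF calendar for one month (hypothetical wrapper).

    Needs amvernoncal installed and the PDF present in the working directory.
    """
    # Imported lazily so calendar_pdf_name() works without the package installed.
    from amvernoncal.pdfproc.pdf_to_json import parse_calendar

    return parse_calendar(calendar_pdf_name(year, month), to_file=to_file)


print(calendar_pdf_name(2017, 9))  # september_2017.pdf
```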

Release history

0.0.2.dev4 (this version), 0.0.2.dev3, 0.0.2.dev2, 0.0.2.dev1, 0.0.1.dev86, 0.0.1.dev84, 0.0.1.dev83, 0.0.1.dev82, 0.0.1.dev81, 0.0.1.dev80, 0.0.1.dev79, 0.0.1.dev78, 0.0.1.dev77, 0.0.1.dev76, 0.0.1.dev75, 0.0.1.dev74, 0.0.1.dev73

Download files

amvernoncal-0.0.2.dev4.tar.gz (10.9 MB), source distribution, uploaded May 24, 2018
