Skip to main content

Python package for Automating Videos building dataset for ML/DL Workflow using Tkinter GUI

Project description

Automate-Images + Videos-download

Introduction :

Data collection is one of the important steps in Data analysis , Machine learning and Deep learning. It is the process of gathering information of our interest, in our case it's the images of any type we want to gather. Without data, we can't model our problem and do analysis of it.

Objective :

👉 Automate the collection and building of images using Google API's for search and videos using Youtube API's.
👉 Option to send mails to anyone containing collected datasets as attachment.

Motivation :

This is a part of project - IV (Task-II) made for UCS633 - Data analytics and visualization at TIET.
@Author : Sourav Kumar
@Roll no. : 101883068

Getting credentials for Google API and Youtube API:

For Google image API For Youtube API

Getting Started Locally :

  • After getting the credentials , we can now run the script for automation.
  • Firstly download all the files on your system using cmd (terminal) or git bash :
    git clone https://github.com/souravs17031999/AUTOMATE-DATA-PIPELINE

With Tkinter GUI :

For IMAGES :

  • Move to the directory where script is downloaded.
  • Now run the script in following way :
    python data_image_automation.py
    and arguments can be passed from interactive GUI.

For VIDEOS :

  • Move to the directory where script is downloaded.
  • Now run the script in following way :
    python data_video_automation.py
    and arguments can be passed from interactive GUI.
    Note : Two important args :

Client secrets file path: Give absolute path.
Chrome driver path : Give absolute path with .exe extension.

Output :

Some samples of images : output
output
output
output
output

Limitations and Scope for future work:

There are few limitations :

  • Currently Google and Youtube allows 100 requests in one API call and so the script is able to download in total 100API's in one go but this behaviour can be changed from the script by calling it multiple times after tweaking some parameters which controls the after and before id's of returned images and videos and can be checked in their API documentation.
  • [For Images] Considering big file sizes for sending as mails, i have resized , gray-scaled and then zipped the files and upload as attachment.
    This behaviour can be changed from the script by disabling some of the functions passing a optional parameter.
  • [For videos] Considering big file sizes for sending as mails, i have compressed the file sizes and reduced the duration of videos.
    This behaviour can be easily manipulated in the script itself.
  • Also, mail can be only sent for GMAIL users as of now, this can also be changed by manipulating the Ports of all email servers and controlling their behaviours in the script.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file videos-automate-python-souravdlboy-1.0.tar.gz.

File metadata

  • Download URL: videos-automate-python-souravdlboy-1.0.tar.gz
  • Upload date:
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.0b5

File hashes

Hashes for videos-automate-python-souravdlboy-1.0.tar.gz
Algorithm Hash digest
SHA256 940c95e153f5a87e869fca643520ca86b07994ab0e38bd0e83b7aff9a6aca3e5
MD5 9ce9ebbcca737287a49e01ce7e41c4cc
BLAKE2b-256 ae07e7294917dd87c3e21288cdb8b89449339ff25adafbef28e242a855b8d997

See more details on using hashes here.

File details

Details for the file videos_automate_python_souravdlboy-1.0-py3-none-any.whl.

File metadata

  • Download URL: videos_automate_python_souravdlboy-1.0-py3-none-any.whl
  • Upload date:
  • Size: 3.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.0b5

File hashes

Hashes for videos_automate_python_souravdlboy-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 48c68931c1c535acd3c150a1e658c56a57aae36c2224844d278e6d1c5c87cc37
MD5 00aff8f4f0fb7fd36c2a8ac60957201c
BLAKE2b-256 9bf05acf0ca473943abce5ec28f3847c873e26861cd84d6a41cf1f0c77cd13ca

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page