Skip to main content

A package that can grab all data of Covid-19 cases in India

Project description

# covdata.covidindia Module

Description

This covidindia Module has the power to bring all the data related to Covid-19 outbreak in India in a tabular form. This module will bring the whole set of raw data in a date-wise and processed format from the COVID-19 Indian Dataset maintained by University of Kalyani, which will be easier to deal with.

Once installed, we can get an up-to-date information about the Covid-19 affected people, till the day the data is available at the source. We can visualize the data of our choice for any given range of dates. It is capable of giving information about the total number of confirmed, recovered and deceased patients, between any two dates, of any city, district or related to any particular gender (male/female). We can also visualize the whole up-to-date data without using any aforesaid filters using our module.

How it works?

It gathers information from our own maintained database. Database data are collected from various websites, news channels and covid19india.org crowdsourcing initiative. From the collected data this module will help user to represent in user friendly format with all types of filters in one place. No need to search any website. Just use this module for India based Covid-19 projects.

Covidindia Server

This server is mainly made to give the data related to covid-19 in India. Thsi server mainly focused on showing the data in interactive way. Link: Covidindia Server

Installation

 pip install covdata

What's new!!

  • District wise data for each state with daily basis is available now.
  • Number of subject tested data for every state with daily basis is now availabe.
  • All types of graphs can be separately generated as confirmed,deceased or recovered. No more subplots will be generated.
  • With all graphs data new extra data can be pushed to generate multiple line graphs to track the any relevance information.This extra data must be a dictionary or list of dictionary where for each dictionary contains a list of data points with a suitable key name.This key name will the legend name of the garph.

Note

There might be some issues to start the server for most probably Mac users.

 Files can't be made due to some error.

if above line is seen in the terminal please let me know by opening an issue in github.If this happens server will start normally but download function will not work.

Up the server

 covdata cov-server   #to start the server with default port
 covdata cov-server -b :8000      #to bind the port at 8000

Usage

 #import module
 from covdata.covidindia import initializer,Demographic_overview,Data,visualizer

 #Initialize the module to collect the data
 Init=initializer()

 #shows the collected data in Dataframe format of Total Confimed daily count data.(if daily is false cumulative data will be shown)
 Init.show_data(of='Total Confirmed',daily=True)

 obj1=Data(init)       #defining the object for Data class
 cumData=obj1.get_cum_dataset_between_date('30/1/2020','5/4/2020',by='total recovered')   #gives the cumulative count between 30/1/2020 and 5/4/2020

 obj1.rank(10,'Total Confirmed',kind='top',cummulative=True,date='4/3/2020')   #gives the top 10 total confirmed cases for date 4/3/2020

 obj2=Demographic_overview(init)     # object for demographic_overview class

 obj2.demography('Salt Lake','30/1/2020-10/4/2020')    #Gives Male and Female counts of confirmed cases for salt lake city in west bengal between 30/1/2020 and 10/4/2020

 obj3=visualizer(init)       #initiazing visualizer class

 obj3.whole(daily=True)       #shows line plot for daily counts of confirmed cases,recovered cases and death cases against date.

 obj3.whole(typeof='confirmed',daily=False,xlabel='Dates',ylabel='Daily Counts')   #cumulative count plot of india of confirmed case .

 #cumulative recovery plot of Maharashtra between 3/6/2020 and 10/6/2020
 obj3.graph_by_date(startDate='3/6/2020', endDate='10/6/2020', typeof='recovered',title='recovery data', daily=False,state='mh')

Examples

Examples are in github repository project folder.

Terminal Usage

 #get help
 covdata -h

 #get dataset of user choice
 covdata -f Total_Confirmed --options show_data

 #save the above data as csv
 covdata -f Total_Confirmed --options show_data --save \where\to\save\filename.csv

 #get demographic overview by place and date
 covdata -p wb -d all --options demography   #please use state code.here 'wb' means 'West bengal'

 #get above data for fast n rows for simplicity
 covdata -p wb -d all --options demography -H 10      #head(10)

 #get data by state
 covdata -s mp --options state_dataset       #for madhyapradesh

 #get daily count data for any state
 covdata -s mp --options state_dataset -D y   #daily basis count for madhyapradesh.

 #get daily count for all states of total confirmed/total recovered/total death
 covdata -f Total_Confirmed --options state_dataset        #Total Confirmed cases for daily basis for all states.

 #get data by date
 covdata -d 10/4/2020 --options data_by_date

 #get count by date
 covdata -d 11/4/2020 -f Total_Recovered --options count_by_date

 #get top n values in India
 covdata -r top -n 10 -f Total_Confirmed --options rank    #top 10 values of total confirmed cases as cumulative

 #get top 10 values of Total Recovered on the basis of daily counts for any date
 covdata -r top -n 10 -f Total_Recovered -D y --options rank -d 15/3/2020

 #get bottom 10 values of Total Recovered cases as cumulative counts for any date
 covdata -r bottom -n 10 -f Total_Recovered -D n --options rank -d 15/3/2020

 #get top 10 values of Total Recovered cases as cumulative counts for any date
 covdata -r top -n 10 -f Total_Recovered -D n --options rank -d 15/3/2020

 #get top 10 values of Total Recovered cases as cumulative counts for all dates if date is not mentioned
 covdata -r top -n 10 -f Total_Recovered -D n --options rank

 #get cumulative data between two given dates
 covdata -d 15/3/2020-30/3/2020 -f Total_Recovered --options cumulative_between_date     #Total recovered cumulative data between 15/3/2020 and 30/3/2020

 #get daily count between two dates
 covdata -d 15/3/2020-30/3/2020 -f Total_Confirmed --options count_between_date   #Daily count data between 15/3/2020 and 30/3/2020

 #graph of whole data
 covdata -g whole -D y -f Total_Confirmed --options graph    #plot the whole data with dailly counts of
                                                               confirmed data
 (if -f flag is not mentioned combibed graph will be plotted)
 #graph whole data of any state
 covdata -g whole -D y -s wb -f Total_Recovered --options graph    #plots the daily graph of west bengal
                                                                    of recovery data

 #graph of 1st n data of any selected state ,if state not mentioned all together state count will be shown
 covdata -g head -r 15 -D y -s wb --options graph   #plots 1st 15 days daily counts for west bengal
 covdata -g head -r 15 -D n -s wb --options graph   #plots 1st 15 days cumulative counts for west bengal

 #get a daily or cumulative graph between two dates for any states
 covdata -g graph_by_date -d 25/3/2020-4/4/2020 -D y -s mh --options graph

 #if state is not mentioned graph is shown for all states together
 covdata -g graph_by_date -d 25/3/2020-4/4/2020 -D y --options graph

 #latitude vs confirmed cases
 covdata -g latitude --options graph

 #similarly -g tail will work same and please use state code.Terminal feature demonstrates some methods only that are written in --options flag(type 'covdata -h' to see options). -H -T and --save flag can be used for every method that returns a dataframe not a graph.

Documentation

Help on module covidindia:

NAME covidindia - Created on Tue Mar 31 15:24:51 2020

DESCRIPTION @author: Dripta Senapati

CLASSES builtins.object Demographic_overview initializer Data visualizer

 class Data(initializer)
 |  Data(init)
 |
 |  # Data class can apply various filters on collected datasets(Confirmed,Recovered,Deceased)
 |  # based on User's choice
 |
 |  Method resolution order:
 |      Data
 |      initializer
 |      builtins.object
 |
 |  Methods defined here:
 |
 |  __init__(self, init)
 |      This is a class that will scrap the data from source upto the previous day of
 |      day of using that package.
 |
 |      Returns
 |      -------
 |      None.
 |
 |  get_count_between_date(self, startDate, endDate, by)
 |      Gives daily count data for all states between two dates
 |
 |      Parameters
 |      ----------
 |      startDate : character
 |          date format dd/mm/yyyy
 |      endDate : character
 |          date format dd/mm/yyyy.
 |      by : character
 |          'Confirmed' or 'Recovered' or 'Death'.
 |
 |      Raises
 |      ------
 |      Exception
 |          Startdate must be less than enddate.If not it raise anexception.
 |          it also raise error for wrong input in by parameter.
 |
 |      Returns
 |      -------
 |      df : DataFrame
 |          DataFrame consisting daily counts for between two given dates
 |          for given by parameter.
 |
 |  get_count_by_date(self, by, date=None)
 |      Gives the daily count of a given date or all dates by 'confirmed' or 'recovered'
 |      or 'death'
 |
 |      Parameters
 |      ----------
 |      by : character
 |          'Confirmed' or 'Recovered' or 'Death'.
 |      date : character, optional
 |          if date(dd/mm/yyyy) is given count will be shown for that date. The default is None.
 |
 |      Raises
 |      ------
 |      Exception
 |          If by argument is not within above mentioned and if year is not 2020 it will
 |          raise an exception.
 |
 |      Returns
 |      -------
 |      df : dataframe
 |          dataframe consisting of counts of given date or all dates for all states .
 |
 |  get_cum_dataset_between_date(self, startDate, endDate, by)
 |      This method of Data class will give cumulative counts between two given dates
 |      for all states
 |
 |      Parameters
 |      ----------
 |      startDate : character
 |          date format dd/mm/yyyy.
 |      endDate : character
 |          date format dd/mm/yyyy.
 |      by : character
 |          'Total Confirmed' or 'Total Recovered' or 'Total Death'.
 |
 |      Raises
 |      ------
 |      Exception
 |          startdate should be less than endDate---if not it will raise exception.
 |
 |      Returns
 |      -------
 |      df : Dataframe
 |          returns a dataframe of cumulative counts between two dates for all states by
 |          'Total Confirmed' or 'Total Recovered' or 'Total Death'.
 |
 |  get_dataset_by_date(self, date)
 |      This method of Data Class will allow user to get cumulative count of all states of
 |      total confirmed,total death and total recovered
 |      India for a particular given date
 |
 |      Parameters
 |      ----------
 |      date : character
 |          Should be in dd/mm/yyyy format.
 |
 |      Returns
 |      -------
 |      df : Dataframe
 |          Dataframe consisting all cumulative values of total confirmed,total recovered,total
 |          death for a given date.
 |
 |  get_dataset_state(self, state='Whole', daily=False)
 |      this method of Data class will allow user to get cumulative counts of a particular
 |      state.
 |
 |      Parameters
 |      ----------
 |      state : character, optional
 |          name of state in India. The default is 'Whole'.
 |          state code is also applicable.If not given data for all states will be shown.
 |      daily : bool, optional
 |          If True Daily Count data will be shown. The default is False.
 |
 |      Raises
 |      ------
 |      Exception
 |          If state/state code is wrong it will raise exception.
 |
 |      Returns
 |      -------
 |      df : DataFrame
 |          if state is whole and daily is Flase then it Returns a dataframe consisting all states having
 |          cumulative count of totalconfirmed,total recovered,total death till the previous day of the day of using this package.
 |          Otherwise for daily is True it will return dictionary of dataframes having confirmed,recovered and death cases for all states.
 |          if state is mentioned and daily is true then it will return a dataframe consisting all dates showing daily confirmed,daily recovered and
 |          daily death.
 |
 |          if state is mentioned and daily is False then a dataframe will be returned consisting of name of districts of that states along with only
 |          confirmed data.
 |
 |  rank(self, num, by, kind='top', cumulative=False, date=None)
 |      Gives top n or bottom n values as cumulative or daily basis for a date or
 |      combining whole dates filtered with by parameter.
 |
 |      Parameters
 |      ----------
 |      num : integer
 |          number of rows user want to see.
 |          e.g num=10 -> top/bottom 10 data will be shown
 |      by : character
 |          'Total Confirmed' or 'Total Recovered' or 'Total Death'.
 |      kind : character, optional
 |          'top' or 'bottom' by which data will be filtered. The default is 'top'.
 |      cummulative : bool, optional
 |         if True it will show cumulative counts. The default is False.
 |      date : character, optional
 |          (must be in dd/mm/yyyy format)if date is given then method will return cumulative or daily count
 |          for that date. The default is None.
 |          if None it will return all cumulative/daily counts
 |
 |      Raises
 |      ------
 |      Exception
 |          if date is None and cumulative is false then it is not possible to show
 |          data for top n or botom n rows..
 |
 |      Returns
 |      -------
 |      dataframe/dictionary
 |          if date is not given and cummulative is False then it prompt to input state(state code or name)
 |          if state is set to 'all' it will return dictionary consisting all states having top/bottom counts
 |          otherwise a dataframe for a given state.
 |          whenever date is mentioned a dataframe will be returned consisting top/bottom cumulative count or
 |          daily count for that date.
 |
 |  ----------------------------------------------------------------------
 |  Methods inherited from initializer:
 |
 |  show_data(self, of)
 |      This method only assembles the collected data
 |
 |      Returns
 |      -------
 |      list
 |          returns collected data as 3 dataframe format.
 |
 |  ----------------------------------------------------------------------
 |  Data descriptors inherited from initializer:
 |
 |  __dict__
 |      dictionary for instance variables (if defined)
 |
 |  __weakref__
 |      list of weak references to the object (if defined)

class Demographic_overview(initializer)
 |  Demographic_overview(init, silent=False)
 |
 |  Method resolution order:
 |      Demographic_overview
 |      initializer
 |      builtins.object
 |
 |  Methods defined here:
 |
 |  __init__(self, init, silent=False)
 |      This Demographic_overview class has the power of filtering in terms of state,
 |      district,city,date or range of date
 |
 |      Returns
 |      -------
 |      None.
 |
 |  demography(self, place='all', date='all')
 |      This method can show the male and female count of confirmed cases for a state or a district or
 |      a city for a given date of range of date.
 |
 |      Parameters
 |      ----------
 |      place : character, optional
 |          place can be state or district or city.this method automatically recognize the place
 |          and gives user data for that place. The default is 'all'.If place is mentioned data for all places
 |          will be shown.
 |      date : character, optional
 |          by which date or range of date the data will be filtered.Date must be dd/mm/yyyy format.
 |          The default is 'all'.if date is not mentioned data for all dates will be shown
 |
 |      Raises
 |      ------
 |      Exception
 |          If no data is found for a date or a place it will raise exceptions.
 |
 |      Returns
 |      -------
 |      Dataframe
 |          dataframe consists of male female counts of confirmed cases for given date and place.
 |
 |  ----------------------------------------------------------------------
 |  Methods inherited from initializer:
 |
 |  show_data(self, of)
 |      This method only assembles the collected data
 |
 |      Returns
 |      -------
 |      list
 |          returns collected data as 3 dataframe format.
 |
 |  ----------------------------------------------------------------------
 |  Data descriptors inherited from initializer:
 |
 |  __dict__
 |      dictionary for instance variables (if defined)
 |
 |  __weakref__
 |      list of weak references to the object (if defined)

class initializer(builtins.object)
 |  initializer(silent=False)
 |
 |  Methods defined here:
 |
 |  __init__(self, silent=False)
 |      This is a class that will scrap the data from source upto the previous day of
 |      day of using that package.
 |
 |      Returns
 |      -------
 |      None.
 |
 |  show_data(self, of)
 |      This method only assembles the collected data
 |
 |      Returns
 |      -------
 |      list
 |          returns collected data as 3 dataframe format.
 |
 |  ----------------------------------------------------------------------
 |  Data descriptors defined here:
 |
 |  __dict__
 |      dictionary for instance variables (if defined)
 |
 |  __weakref__
 |      list of weak references to the object (if defined)

class visualizer(initializer)
 |  visualizer(init)
 |
 |  Method resolution order:
 |      visualizer
 |      initializer
 |      builtins.object
 |
 |  Methods defined here:
 |
 |  __init__(self, init)
 |      Gather all information to perform the visualization
 |
 |      Parameters
 |      ----------
 |      init : TYPE
 |          None.
 |
 |      Returns
 |      -------
 |      None.
 |
 |  graph_by_date(self, startDate, endDate, state=None, title=None, daily=False)
 |      Gives the visualization of cumulative data or daily data between two given
 |      dates for a given state or as whole india.
 |
 |      Parameters
 |      ----------
 |      title : Character, optional
 |          Sets the title of the subplots. The default is None.
 |      startDate : character
 |          dd/mm/yyyy format(e.g 02/04/2020).
 |      endDate : character
 |          dd/mm/yyyy format(e.g 02/04/2020).
 |      state : character, optional
 |          for which state graph will be plotted.If None graph will be plotted for whole india. The default is None.
 |          states input also takes state codes.
 |      daily : bool, optional
 |          if True graph will plotted daily basis. The default is False.
 |
 |      Raises
 |      ------
 |      Exception
 |          startdate must be less than enddate and if states are given wrong it will raise exception.
 |
 |      Returns
 |      -------
 |      3 subplots consisting date vs total confirmed,date vs total recovered,date vs total deceased for a state or whole india
 |      between two dates.
 |      may be daily or cumulative based on daily parameter passed.
 |
 |  head(self, num, title=None, daily=False)
 |      Gives graphical visualization of first n(=num) dayes based on daily or cumulative data
 |
 |      Parameters
 |      ----------
 |      title : Character, optional
 |          Sets the title of the subplots. The default is None.
 |      num : integer
 |          setd the number of dates from start user wants to see.
 |      daily : bool, optional
 |          if true graph is plotted based on daily data. The default is False.
 |
 |      Returns
 |      -------
 |      3 subplots consisting date vs total confirmed,date vs total recovered,date vs total deceased.
 |      may be daily or cumulative based on daily parameter passed.
 |
 |  plot_by_latitude(self, title=None)
 |      Gives the visualization of counts with respect to state latitudes
 |
 |      Parameters
 |      ----------
 |      title : Character, optional
 |          Sets the title of the subplots. The default is None.
 |
 |      Returns
 |      -------
 |      3 subplots consisting latitude vs latitude confirmed,latitudes vs total recovered,latitudes vs total deceased.
 |
 |  tail(self, num, title=None, daily=False)
 |      Gives graphical visualization of latest n(=num) dayes based on daily or cumulative data
 |
 |      Parameters
 |      ----------
 |      title : Character, optional
 |          Sets the title of the subplots. The default is None.
 |      num : integer
 |          setd the number of latest dates user wants to see.
 |      daily : bool, optional
 |          if true graph is plotted based on daily data. The default is False.
 |
 |      Returns
 |      -------
 |      3 subplots consisting date vs total confirmed,date vs total recovered,date vs total deceased.
 |      may be daily or cumulative based on daily parameter passed.
 |
 |  whole(self, title=None, daily=False)
 |      Generate 3 subplots of whole collected data
 |
 |      Parameters
 |      ----------
 |      title : Character, optional
 |          Sets the title of the subplots. The default is None.
 |      daily : bool, optional
 |          if True garph will be plotted on daily counts otherwise on cumulative counts. The default is False.
 |
 |      Returns
 |      -------
 |      3 subplots consisting date vs total confirmed,date vs total recovered,date vs total deceased.
 |      may be daily or cumulative based on daily parameter passed.
 |
 |  ----------------------------------------------------------------------
 |  Methods inherited from initializer:
 |
 |  show_data(self, of)
 |      This method only assembles the collected data
 |
 |      Returns
 |      -------
 |      list
 |          returns collected data as 3 dataframe format.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

covdata-1.2.2.tar.gz (54.0 kB view hashes)

Uploaded Source

Built Distribution

covdata-1.2.2-py3-none-any.whl (78.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page