Generate an automated data dictionary from files and databases
DataDictionary
AutoDataDictionary generates a data dictionary from source files and database tables. For each column it reports:
- Column Name
- Sample Value
- Source File Name
- Non Null Values Count
- Unique Values Count
- Data Type
- Unique Values List
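To make the fields above concrete, here is a minimal sketch of how such per-column entries could be computed for a single CSV source using only the Python standard library. It is not the package's implementation: `summarize_columns` and `infer_type` are hypothetical helpers, empty strings are assumed to count as nulls, and the type inference is deliberately coarse (int, float, or str).

```python
import csv
import io


def infer_type(values):
    """Guess a coarse type name from a list of non-empty string values."""
    if not values:
        return "unknown"
    for caster, name in ((int, "int"), (float, "float")):
        try:
            for value in values:
                caster(value)
            return name
        except ValueError:
            continue
    return "str"


def summarize_columns(csv_text, source_name, sep=","):
    """Return one summary dict per column of the given CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=sep)
    rows = list(reader)
    summary = []
    for col in reader.fieldnames or []:
        # Assumption of this sketch: empty or missing cells are nulls.
        values = [row[col] for row in rows if row[col] not in ("", None)]
        unique = sorted(set(values))
        summary.append({
            "Column Name": col,
            "Sample Value": values[0] if values else None,
            "Source File Name": source_name,
            "Non Null Values Count": len(values),
            "Unique Values Count": len(unique),
            "Data Type": infer_type(values),
            "Unique Values List": unique,
        })
    return summary


sample = "id,city,score\n1,Oslo,3.5\n2,,4.0\n3,Oslo,\n"
result = summarize_columns(sample, "sample.csv")
for entry in result:
    print(entry)
```

Each dict corresponds to one row of a data dictionary; the real package may name, order, or compute these fields differently.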
How to use
Generate data dictionary from db
import autodatadictionary as ad
ad.to_dictionary_from_db(
    sql_alchemy_connection_string='postgresql://username:password@domain:5432/db',
    schema='schema')
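Connection strings of this form break when the username or password contains characters such as `@` or `/`, so those parts generally need to be URL-encoded first. The sketch below shows one way to assemble the string with `urllib.parse.quote_plus`; `build_connection_string` is a hypothetical helper, not part of autodatadictionary, and the credentials are placeholders.

```python
from urllib.parse import quote_plus


def build_connection_string(user, password, host, port, database,
                            dialect="postgresql"):
    """Assemble a SQLAlchemy-style URL, URL-encoding the credentials."""
    return (f"{dialect}://{quote_plus(user)}:{quote_plus(password)}"
            f"@{host}:{port}/{database}")


# Placeholder credentials; the password contains '@' and '/'.
url = build_connection_string("username", "p@ss/word", "domain", 5432, "db")
print(url)
```

The resulting string could then be passed as `sql_alchemy_connection_string`.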
Generate data dictionary from csv files
import autodatadictionary as ad
ad.to_dictionary_from_file(['/path/data1.csv', '/path/data2.csv', '/path/dataN.csv'], sep=',')
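When there are many files, the path list does not have to be typed by hand. As a small sketch, assuming all CSV files sit directly in one directory, the list can be collected with `pathlib`; `collect_csv_paths` is a hypothetical helper, and the temporary directory only stands in for a real data folder.

```python
import tempfile
from pathlib import Path


def collect_csv_paths(directory):
    """Return the sorted paths of all .csv files directly inside directory."""
    return sorted(str(p) for p in Path(directory).glob("*.csv"))


with tempfile.TemporaryDirectory() as tmp:
    # Create two CSV files and one unrelated file for the demonstration.
    for name in ("data1.csv", "data2.csv", "notes.txt"):
        Path(tmp, name).write_text("a,b\n1,2\n")
    paths = collect_csv_paths(tmp)
    print(paths)
    # The list could then be passed on, for example:
    # ad.to_dictionary_from_file(paths, sep=',')
```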
Generate data dictionary from dataframe
import pandas as pd
import autodatadictionary as ad
titanic = pd.read_csv('https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv')
# Pass a list of dataframes
ad.to_dictionary_from_dataframe([titanic])
# Optionally give each dataframe a source name
ad.to_dictionary_from_dataframe(dataframes=[titanic], source_names=['titanic_dataframe'], source=True)
Hashes for autodatadictionary-0.0.6-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | b69e940fd7e9873df5aed47ba11f7a75cd209e4681155cf1800b7a7bd30d356b |
MD5 | 224762752777207e1058bd693f7f40e4 |
BLAKE2b-256 | 57f5a897f87020a8cc7a8b2b03954fcde58c16171f93fd52718fd285436517d4 |