Generate an automated data dictionary from files and databases
DataDictionary
AutoDataDictionary generates a data dictionary from source files and database tables. For each column, the dictionary includes:
- Column Name
- Sample Value
- Source File Name
- Non Null Values Count
- Unique Values Count
- Data Type
- Unique Values List
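To make the fields above concrete, here is a minimal pandas sketch that computes the same seven fields for a small DataFrame. It is an illustration only, not the package's implementation: the helper name `describe_columns`, the `max_unique` cap, and the sample data are assumptions made for this example.

```python
import pandas as pd


def describe_columns(df: pd.DataFrame, source_name: str, max_unique: int = 10) -> pd.DataFrame:
    """Hypothetical helper: build one row of metadata per column of `df`."""
    rows = []
    for column in df.columns:
        series = df[column]
        non_null = series.dropna()
        unique_values = non_null.unique().tolist()
        rows.append({
            "Column Name": column,
            # First non-null value, or None if the column is entirely empty.
            "Sample Value": non_null.iloc[0] if not non_null.empty else None,
            "Source File Name": source_name,
            "Non Null Values Count": int(series.count()),
            "Unique Values Count": int(series.nunique()),
            "Data Type": str(series.dtype),
            # Cap the list so high-cardinality columns stay readable (assumed behavior).
            "Unique Values List": unique_values[:max_unique],
        })
    return pd.DataFrame(rows)


# Made-up sample data, used only to exercise the helper.
people = pd.DataFrame({"name": ["Ann", "Bob", None], "age": [31, 45, 45]})
dictionary = describe_columns(people, source_name="people.csv")
print(dictionary.to_string(index=False))
```

Null values are excluded from both counts and from the unique values list, which is how `Series.count()` and `Series.nunique()` behave by default.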
How to use
Generate a data dictionary from a database
import autodatadictionary as ad

# Replace the placeholder credentials, host, database, and schema with your own.
ad.to_dictionary_from_db(
    sql_alchemy_connection_string='postgresql://username:password@domain:5432/db',
    schema='schema')
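A connection string in this URL form needs special characters in the username or password (such as `@` or `/`) to be percent-encoded, otherwise the URL is parsed incorrectly. The sketch below builds the string with the standard library only; the helper name `build_connection_string` and the sample credentials are hypothetical and not part of autodatadictionary.

```python
from urllib.parse import quote


def build_connection_string(user, password, host, port, database, dialect="postgresql"):
    """Hypothetical helper: assemble a database URL with escaped credentials."""
    # safe="" forces every reserved character, including "/" and "@", to be encoded.
    safe_user = quote(user, safe="")
    safe_password = quote(password, safe="")
    return f"{dialect}://{safe_user}:{safe_password}@{host}:{port}/{database}"


# Made-up credentials containing characters that would otherwise break the URL.
connection_string = build_connection_string("username", "p@ss/word", "domain", 5432, "db")
print(connection_string)
```

The resulting string can be passed as `sql_alchemy_connection_string`. If SQLAlchemy 1.4 or later is available, `sqlalchemy.engine.URL.create` is another way to assemble such a URL without manual escaping.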
Generate a data dictionary from CSV files
import autodatadictionary as ad
ad.to_dictionary_from_file(['/path/data1.csv', '/path/data2.csv', '/path/dataN.csv'], sep=',')
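When a directory holds many CSV files, the list of paths can be collected programmatically rather than typed out. This is a small sketch using only `pathlib`; the helper name `collect_csv_paths` is an assumption for illustration, and the call to the package is shown as a comment because it requires autodatadictionary to be installed.

```python
from pathlib import Path


def collect_csv_paths(directory):
    """Hypothetical helper: sorted paths of the .csv files directly inside `directory`."""
    # glob("*.csv") does not descend into subdirectories.
    return sorted(str(path) for path in Path(directory).glob("*.csv"))


# Usage ('/path' is the placeholder directory from the example above):
# import autodatadictionary as ad
# ad.to_dictionary_from_file(collect_csv_paths('/path'), sep=',')
```

Sorting keeps the order of files stable between runs, which makes successive dictionaries easier to compare.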
Generate a data dictionary from a DataFrame
import pandas as pd
import autodatadictionary as ad

titanic = pd.read_csv('https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv')
# Pass one or more DataFrames as a list.
ad.to_dictionary_from_dataframe([titanic])
# The same call with an explicit source name for each DataFrame.
ad.to_dictionary_from_dataframe(dataframes=[titanic], source_names=['titanic_dataframe'], source=True)
Hashes for autodatadictionary-0.0.7-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 529ae9a7f7649d8ba051c5b237a5d4d51559dd71c4c7ab3f61d1f66814435d5b
MD5 | e630ce06db07e1b8cd7cecd5dfb5b311
BLAKE2b-256 | 9feb647b812c6f8c1d36d0f3fc5ccc569c72fd4043668189c19c267343d09547