Python Library of General Data Science Solutions
Project description
BloomTechLib
BloomTech Labs Python Library of General Data Science Solutions
BloomTechLib Developer Guidelines
- No PEP8 violations.
- No global state.
- Must be backwards compatible to 3.6.x
- Must be forward compatible up to the latest version of Python 3.9.x
- Should avoid dependencies outside the standard library.
- Every feature will be documented in detail.
- Code examples will be included for each feature.
Analysis
CSV Similarity Score
Compares two csv files and returns a score between 0.0 and 1.0 to indicate how similar the data is.
Assumptions
- The data files have the same header, delimiter and number of rows.
- Each row of data should be a unique observation, each column representing a single aspect.
- CSV is a convenient format, but a database adapter could be useful in the future.
- Data will be primitive strings or numbers and not more complex types.
DataBase Ops
DataModelMongo Class
find(dict) -> dict
insert(dict)
find_many(dict, int) -> Iterator[dict]
insert_many(dict)
get_df() -> DataFrame
DataModelSQL Class
db_action(str)
db_query(str) -> list
HTML to DataFrame
html_to_df(str, int) -> DataFrame
DevOps API
- WIP
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
BloomTechLib-0.0.1.tar.gz
(4.2 kB
view hashes)
Built Distribution
Close
Hashes for BloomTechLib-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1e9e7aa6cd4725ad2db113af8090a456d3212fc3f93f2bed59aae506b6e58a65 |
|
MD5 | ff1f5f04825436e667ede4fbde000233 |
|
BLAKE2b-256 | 80e25922caf7455a211b0a602ffd91ca5a0ed2fd71f3ed23f999dc1718c0d594 |