The ingestion game CLI
Project description
The Ingestion Game
The Ingestion Game consists of:
- A data file with a varying number of entries containing some data. It is shaped as a CSV where each row has
an arbitrary number of
key=value
pairs separated by comma.key1=value1,key2=value2 key3=value3 key4=value4,key5=value5,key6=value6
- Not all
key
s are always important! Each time we play the game, the keys to process can be different. The rest are just considered noise. We want to ingest all the important keys. - All important keys are always going to be present in the data rows of the input file.
- Each
key
supports a specific data type. We should only process rows where the values match the expected datatype of each key. - Each row consists of an Entity of a given type. The type will be informed as
type=T
in the data.type
is always considered as an important key and we can assume it is astr
. - Entities have hierarchies defined in the games' rules. For example, given the rule
A -> B
, we cannot process any Entity of typeB
until we have processed all Entities of typeA
.
Your task is to win the Ingestion Game with the following rules:
- The important keys are
id:int
,name:str
,food:str
andtype:str
. - Entities follow the hierarchy
A -> B -> C
.
The solution must be coded in Python and you can use any public libraries. The solution must be an CLI that outputs
the processed data to stdout
in CSV format following the important keys, e.g.,:
id,name,food,type
1,levy,veggies,A
2,lima,pizza,A
3,john,fish,B
We are going to provide a data file with the keys and hierarchies mentioned above, but your CLI should accept any
important keys and hierarchies as inputs. You can see an example on input data in the provided input.txt
.
🤓 We value in the solution
- Good software design
- Proper documentation
- Compliance to Python standards and modern usages (e.g.: PEP8)
- Proper use of data structures
- Ergonomy of the command line interface
- Setup/Launch instructions if required
Solution
Clone the project from GitHub:
git clone git@github.com:collate-hiring/python-test-txemac.git
Set up environment and install requirements:
python3 -m venv env
pip install -r src/requirements.txt
Run
You can see the help:
python src/main.py --help
For to run, you need to add 3 parameters to the call:
- IMPORTANT_KEYS: [required] separate by commas without spaces
example:
id:int,name:str,food:str,type:str
- HIERARCHY: [required] separate by commas without spaces
example:
A,B,C
- PATH: [required] path of the input file
example:
/files/input.txt
Example:
python main.py id:int,name:str,type:str A,B,C /file/input.txt
Tests
Install requirements:
pip install -r tests/requirements.txt
pytest -vvv
Flake8
flake8
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ingestion_game-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2bc0ff36ebe724a98270f83f3046cf089efd42c6210097562ebe13429fb019a4 |
|
MD5 | 97ac2d7a0aa78a4172b8fe43e1ea00f2 |
|
BLAKE2b-256 | fbf17b5c30a719c34989bf5bb59d2cac2da4217c083cfdb0941634945b1fbfc6 |