Generic Etl Job template that can be imported
Project description
aind-data-transformation
Usage
Please import this package and extend the abstract base class to define a new transformation job
from aind_data_transformation.core import (
BasicJobSettings,
GenericEtl,
JobResponse,
)
# An example JobSettings
class NewTransformJobSettings(BasicJobSettings):
# Add extra fields needed, for example, a random seed
random_seed: Optional[int] = 0
# An example EtlJob
class NewTransformJob(GenericEtl[NewTransformJobSettings]):
# This method needs to be defined
def run_job(self) -> JobResponse:
"""
Main public method to run the transformation job
Returns
-------
JobResponse
Information about the job that can be used for metadata downstream.
"""
job_start_time = datetime.now()
# Do something here
job_end_time = datetime.now()
return JobResponse(
status_code=200,
message=f"Job finished in: {job_end_time-job_start_time}",
data=None,
)
Contributing
The development dependencies can be installed with
pip install -e .[dev]
Adding a new transformation job
Any new job needs a settings class that inherits the BasicJobSettings class. This requires the fields input_source and output_directory and makes it so that the env vars have the TRANSFORMATION_JOB prefix.
Any new job needs to inherit the GenericEtl class. This requires that the main public method to execute is called run_job
and returns a JobResponse.
Linters and testing
There are several libraries used to run linters, check documentation, and run tests.
- Please test your changes using the coverage library, which will run the tests and log a coverage report:
coverage run -m unittest discover && coverage report
- Use interrogate to check that modules, methods, etc. have been documented thoroughly:
interrogate .
- Use flake8 to check that code is up to standards (no unused imports, etc.):
flake8 .
- Use black to automatically format the code into PEP standards:
black .
- Use isort to automatically sort import statements:
isort .
Pull requests
For internal members, please create a branch. For external members, please fork the repository and open a pull request from the fork. We'll primarily use Angular style for commit messages. Roughly, they should follow the pattern:
<type>(<scope>): <short summary>
where scope (optional) describes the packages affected by the code changes and type (mandatory) is one of:
- build: Changes that affect build tools or external dependencies (example scopes: pyproject.toml, setup.py)
- ci: Changes to our CI configuration files and scripts (examples: .github/workflows/ci.yml)
- docs: Documentation only changes
- feat: A new feature
- fix: A bugfix
- perf: A code change that improves performance
- refactor: A code change that neither fixes a bug nor adds a feature
- test: Adding missing tests or correcting existing tests
Semantic Release
The table below, from semantic release, shows which commit message gets you which release type when semantic-release
runs (using the default configuration):
Commit message | Release type |
---|---|
fix(pencil): stop graphite breaking when too much pressure applied |
|
feat(pencil): add 'graphiteWidth' option |
|
perf(pencil): remove graphiteWidth option BREAKING CHANGE: The graphiteWidth option has been removed. The default graphite width of 10mm is always used for performance reasons. |
(Note that the BREAKING CHANGE: token must be in the footer of the commit) |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file aind_data_transformation-0.1.0.tar.gz
.
File metadata
- Download URL: aind_data_transformation-0.1.0.tar.gz
- Upload date:
- Size: 35.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 91f2a8ff92f409412856bafbf4cceadb9cba5fc7dc780e62ad05adcbe765533c |
|
MD5 | de44313b98f3a5d2d6e9c038d31c35ed |
|
BLAKE2b-256 | f355c3f4e9103a155d7905f3df11cc0d040ef9ec35713c6210a882646641a954 |
File details
Details for the file aind_data_transformation-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: aind_data_transformation-0.1.0-py3-none-any.whl
- Upload date:
- Size: 5.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 39b1de5f5bcf93a97d0abb72f7c2422804cb65a33bbd50f2a479d24e2c08a3eb |
|
MD5 | 7edfb1e97eee8122b1dca08402bed5ec |
|
BLAKE2b-256 | b5ac1448943f125f37a8152a96f7eb21695b63312b20a010e3b5bff2643a9b99 |