Skip to main content

SimCATS-Datasets is a Python package that simplifies the creation and loading of SimCATS datasets.

Project description

SimCATS logo

SimCATS-Datasets

SimCATS-Datasets is a Python package that simplifies the creation and loading of SimCATS datasets. Please have a look at this repository regarding SimCATS itself.

Installation

The framework supports Python versions 3.7 - 3.11 and installs via pip:

pip install simcats-datasets

Alternatively, the SimCATS-Datasets package can be installed by cloning the GitHub repository, navigating to the folder containing the setup.py file, and executing

pip install .

For installation in development/editable mode, use the option -e.

Documentation

The official documentation is hosted on ReadtheDocs but can also be built locally. To do this, first install the packages sphinx, sphinx-rtd-theme, sphinx-autoapi, myst-nb , and jupytext with

pip install sphinx sphinx-rtd-theme sphinx-autoapi myst-nb jupytext

and then, in the docs folder, execute the following command:

.\make html

To view the generated HTML documentation, open the file docs\build\html\index.html.

Loading Datasets

Datasets created with SimCATS-Datasets are stored in HDF5 files. These datasets can be loaded using the function load_dataset from simcats_datasets.loading.

The return value of the function is a named tuple. The fields can be accessed by their name or index. As with normal tuples, it is also possible to unpack the returned fields directly into separate variables. The available fields depend on which data was specified to be loaded. Please look at the docstring for further information.

Additionally, SimCATS-Datasets offers a pytorch dataset (see torch.utils.data.Dataset) implementation called SimcatsDataset. It allows the direct use of SimCATS datasets for machine learning purposes with Torch and can be imported from simcats_datasets.loading.pytorch.

Creating Datasets

To create a simulated dataset, import create_simulated_dataset from simcats_datasets.generation. This function allows the creation of simulated CSDs with ground truth very easily. It is also possible to add further CSDs to already existing datasets. The function will detect the existing dataset automatically. For the function's usage, please have a look at its docstring.

:warning: WARNING
The functionalities for creating and extending simulated datasets using SimCATS expect that the SimCATS simulation uses the IdealCSDInterface implementation called IdealCSDGeometric. Other implementations might cause problems because the expected information for creating labeled lines etc. might be unavailable.

Alternatively, to using create_simulated_dataset and directly simulating a dataset with SimCATS, it is also possible to create a SimCATS-Dataset compatible dataset with existing data (for example, experimentally measured data or data simulated with other frameworks). This can be done using create_dataset from simcats_datasets.generation.

Citations

@article{hader2024simcats,
  author={Hader, Fabian and Fleitmann, Sarah and Vogelbruch, Jan and Geck, Lotte and Waasen, Stefan van},
  journal={IEEE Transactions on Quantum Engineering}, 
  title={Simulation of Charge Stability Diagrams for Automated Tuning Solutions (SimCATS)}, 
  year={2024},
  volume={5},
  pages={1-14},
  doi={10.1109/TQE.2024.3445967}
}

License, CLA, and Copyright

CC BY-NC-SA 4.0

This work is licensed under a GNU General Public License 3.

GPLv3

Contributions must follow the Contributor License Agreement. For more information, see the CONTRIBUTING.md file at the top of the GitHub repository.

Copyright © 2024 Forschungszentrum Jülich GmbH - Central Institute of Engineering, Electronics and Analytics (ZEA) - Electronic Systems (ZEA-2)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simcats_datasets-2.5.0.tar.gz (69.1 kB view details)

Uploaded Source

Built Distribution

simcats_datasets-2.5.0-py3-none-any.whl (60.5 kB view details)

Uploaded Python 3

File details

Details for the file simcats_datasets-2.5.0.tar.gz.

File metadata

  • Download URL: simcats_datasets-2.5.0.tar.gz
  • Upload date:
  • Size: 69.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.11

File hashes

Hashes for simcats_datasets-2.5.0.tar.gz
Algorithm Hash digest
SHA256 982611d0f36bfb52beb63e3ee4b78befb5984bbb11c740bd1264ea4eb3fd5a7d
MD5 3abcea5317e7706b1b3b9c3f54daf141
BLAKE2b-256 33d5b545fadd39d5f6d94edfbd6483192b9cbe4685c2ec0a34b61ac953b50bac

See more details on using hashes here.

File details

Details for the file simcats_datasets-2.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for simcats_datasets-2.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 60ce5b725bba7c145cac27b887b574bebcd7bd1d0e53e10a6e725f6d8d13c877
MD5 af310c8c480b5820039b8921bfffbbd1
BLAKE2b-256 7fe736bb321a113991a9d5b2a72a8509bba4154e00990291635a0df2227eb57b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page