Skip to main content

A Python package providing sampling methods via matrix-based distance measures to mitigate autocorrelation

Project description

SAMBA - Sampling Algorithms with Matrix-Based Weight Allocation

SAMBA is a Python package providing sampling methods via matrix-based distance measures to mitigate autocorrelation.

Installation and Usage

The package can be installed via pip:

$ pip install samba_sampler

Detailed information on the usage of the package can be found in the documentation. For a quick start, the following example shows how to use the package:

import samba_sampler as samba
sampler = samba.LanguageSampler() # Default parameters
print(sampler.sample(5))

Changelog

Version 0.3.2 (2023-07-18)

  • Added taxa filtering, applying to the Glottolog data

Version 0.3.1 (2023-07-17)

  • Added first version of the walking distance matrix derived from Guzman Naranjo & Jäger (2023).

Version 0.3 (2023-07-13)

  • Initial release, following on the arcaverborum project.

Community Guidelines

While the author can be contacted directly for support, it is recommended that third parties use GitHub standard features, such as issues and pull requests, to contribute, report problems, or seek support.

Contributing guidelines, including a code of conduct, can be found in the CONTRIBUTING.md file.

Author, Citation, and Acknowledgements

The library is developed by Tiago Tresoldi (tiago@tresoldi.org).

The library is developed in the context of the Cultural Evolution of Texts project, with funding from the Riksbankens Jubileumsfond (grant agreement ID: MXM19-1087:1).

If you use samba_sampler, please cite it as:

Tresoldi, Tiago (2023). SAMBA (Sampling Algorithms with Matrix-Based Weight Allocation): a Python package providing sampling methods via matrix-based distance measures to mitigate autocorrelation. Version 0.3. Uppsala: Uppsala University.

In BibTeX:

@misc{Tresoldi2023samba,
  author = {Tresoldi, Tiago},
  title = {SAMBA (Sampling Algorithms with Matrix-Based Weight Allocation): a Python package providing sampling methods via matrix-based distance measures to mitigate autocorrelation. Version 0.3.},
  howpublished = {\url{https://github.com/tresoldi/samba_sampler}},
  address = {Uppsala},
  published = {Upssala University},
  year = {2023}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

samba_sampler-0.3.2.tar.gz (122.0 MB view details)

Uploaded Source

Built Distribution

samba_sampler-0.3.2-py3-none-any.whl (122.0 MB view details)

Uploaded Python 3

File details

Details for the file samba_sampler-0.3.2.tar.gz.

File metadata

  • Download URL: samba_sampler-0.3.2.tar.gz
  • Upload date:
  • Size: 122.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for samba_sampler-0.3.2.tar.gz
Algorithm Hash digest
SHA256 cc9a85621a23a26ad0e28b71d584b9786954d74c7aa7bb95a10fa110880a04ee
MD5 b7e5d3bc543ca4ca508a8f13316d392b
BLAKE2b-256 ba3d791eca03676520bbac442d9ba48179718c36fc24ff21e4255a68c9ffe44b

See more details on using hashes here.

File details

Details for the file samba_sampler-0.3.2-py3-none-any.whl.

File metadata

File hashes

Hashes for samba_sampler-0.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 cf580e181f1d3f9a8892a2bdd7995be7ba95de8d3631756f27a41a47b6f929e0
MD5 bbdfbf398021d8a242ed940b715f275b
BLAKE2b-256 e4a91e4d7e2328b3672031483098b946beeb163f90c191b50d596eae10a5696a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page