ELFEN - Efficient Linguistic Feature Extraction for Natural Language Datasets

This Python package provides efficient linguistic feature extraction for text datasets (i.e., datasets with N text instances in a tabular structure).

For further information, see the GitHub repository and the documentation.

Use of third-party resources in this package

The extraction of psycholinguistic, emotion/lexicon, and semantic features relies on third-party resources such as lexicons. Please refer to the original authors' licenses and conditions for usage, and cite the resources if you use them through this package in your analyses.

For an overview of which features use which resource, and how to export all third-party resource references as a BibTeX string, consult the documentation.

Acknowledgements

While all feature extraction functions in this package are written from scratch, the choice of features in the readability and lexical richness feature areas partially follows the readability and lexicalrichness Python packages.
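
To make the lexical richness feature area concrete, here is a minimal sketch of one classic measure, the type-token ratio (distinct tokens divided by total tokens), computed per row of a small tabular dataset. It uses only the Python standard library and is not ELFEN's API: the function names, the regex tokenizer, and the column names are hypothetical, chosen only for illustration.

```python
import re


def tokenize(text):
    """Lowercase regex tokenizer; a simple stand-in for a real NLP pipeline."""
    return re.findall(r"[a-z0-9']+", text.lower())


def type_token_ratio(text):
    """Distinct tokens divided by total tokens; 0.0 for empty text."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


def add_features(rows, text_column="text"):
    """Return copies of the rows with two illustrative feature columns added."""
    out = []
    for row in rows:
        text = row[text_column]
        out.append({
            **row,
            "n_tokens": len(tokenize(text)),
            "ttr": type_token_ratio(text),
        })
    return out


# Hypothetical two-row dataset: one dict per text instance.
dataset = [
    {"id": 1, "text": "The cat sat on the mat."},
    {"id": 2, "text": "Colorless green ideas sleep furiously."},
]
features = add_features(dataset)
```

The first row repeats "the", so its ratio falls below 1, while the second row has no repeated tokens. A production extractor would typically rely on a proper tokenizer and a columnar dataframe rather than a list of dicts.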

We use the wn Python package to extract Open Multilingual Wordnet synsets.

Citation

If you use this package in your work, please cite the following for now:

@misc{maurer-2025-elfen,
  author = {Maurer, Maximilian},
  title = {ELFEN - Efficient Linguistic Feature Extraction for Natural Language Datasets},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/mmmaurer/elfen}},
}

Download files

Source Distribution

elfen-1.0.3.tar.gz (45.2 kB)

Uploaded Source

Built Distribution

elfen-1.0.3-py3-none-any.whl (51.2 kB)

Uploaded Python 3

File details

Details for the file elfen-1.0.3.tar.gz.

File metadata

  • Download URL: elfen-1.0.3.tar.gz
  • Upload date:
  • Size: 45.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for elfen-1.0.3.tar.gz
Algorithm Hash digest
SHA256 1921877ceddc004c137cf1a083709a3d10cce44cc2f02103b008d702a1f15f1d
MD5 0e8a3fb32c1e16f2334d63ad9775c27d
BLAKE2b-256 9f04f57c6bc9966c37e8196c85938908d8cbaceadeb3a9b529d4abbd546de35d
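
A downloaded file can be checked against the published SHA256 digest before installing it. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of_file and matches_published_hash are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib
import hmac


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    expected = expected_hex.strip().lower()
    return hmac.compare_digest(sha256_of_file(path), expected)
```

For example, with the source distribution saved in the current directory, matches_published_hash("elfen-1.0.3.tar.gz", "1921877ceddc004c137cf1a083709a3d10cce44cc2f02103b008d702a1f15f1d") should return True for an intact download.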

File details

Details for the file elfen-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: elfen-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 51.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for elfen-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 58c9a2c78b5f62576e38027b54db5864f0c7314e8df5e0dc03612850d11d6891
MD5 c47c52398d8b99ec5bd3c0338178c2a6
BLAKE2b-256 6a65a93e2a4988c3918cbd3287e8e3b23864e38310e6d5bd13d101aed934a382
