Using a Markov chain to detect gibberish in text.

Project description

gibberishpy

gibberishpy is a Python package that analyzes a given string and identifies gibberish in it. It is based on Markov chains, mathematical systems that move from one state to another within a state space, and it computes both an additive and a multiplicative cumulative probability for the input. gibberishpy also allows users to build their own model from a custom .txt corpus file.
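To make the idea concrete, here is a minimal, self-contained sketch of character-level Markov scoring. It is not gibberishpy's implementation: the function names `build_transitions` and `score` are hypothetical, the toy corpus is made up, and treating the additive score as the mean transition probability is an assumption about what "additive" means here.

```python
from collections import defaultdict


def build_transitions(corpus: str) -> dict:
    """Estimate P(next_char | current_char) from character bigram counts."""
    counts = defaultdict(lambda: defaultdict(int))
    for current, nxt in zip(corpus, corpus[1:]):
        counts[current][nxt] += 1
    probs = {}
    for current, followers in counts.items():
        total = sum(followers.values())
        probs[current] = {nxt: n / total for nxt, n in followers.items()}
    return probs


def score(text: str, probs: dict) -> tuple:
    """Return (additive, multiplicative) scores over the transitions in text.

    Assumption: "additive" is taken to be the mean transition probability and
    "multiplicative" the product of all transition probabilities.
    """
    transitions = [probs.get(a, {}).get(b, 0.0) for a, b in zip(text, text[1:])]
    if not transitions:
        return 0.0, 0.0
    additive = sum(transitions) / len(transitions)
    multiplicative = 1.0
    for p in transitions:
        multiplicative *= p  # a single unseen transition drives this to 0.0
    return additive, multiplicative


# Toy corpus for illustration only; a real model needs far more text.
corpus = "the cat sat on the mat and the rat ate the oat"
probs = build_transitions(corpus)
print(score("the cat", probs))  # every transition was seen in the corpus
print(score("xqzvkj", probs))   # no transition was seen in the corpus
```

In this sketch a string made only of unseen transitions scores zero on both measures, while text resembling the corpus scores above zero on both.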

Usage Examples

Build Model

from gibberishpy.scanner import GibberishScanner


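if __name__ == "__main__":
    scanner = GibberishScanner()
    # Build a transition model from a plain-text corpus using n-grams of size 2.
    scanner.build_model(corpus_path="path/to/corpus.txt", n_gram_size=2)
    # Save the model so it can be loaded later without rebuilding it.
    scanner.save_model("transition_matrix_2d.tm", encoding="utf-8")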

Scan Gibberish

from gibberishpy.scanner import GibberishScanner


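if __name__ == "__main__":
    scanner = GibberishScanner()
    # Load the model saved in the "Build Model" example.
    scanner.load_model(path="transition_matrix_2d.tm")
    # scan() returns the additive and multiplicative cumulative probabilities.
    additive_cum_proba, multiplicative_cum_proba = scanner.scan("ldfjgnkdfjnd")
    print(additive_cum_proba)
    print(multiplicative_cum_proba)

    # Output:
    # 0.00022810218978102192
    # 0.0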
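The two scores still have to be turned into a yes/no decision. The sketch below shows one possible way, assuming that lower additive scores indicate gibberish. Both helpers (`calibrate_threshold`, `is_gibberish`) are hypothetical and not part of gibberishpy, and the "real text" and extra gibberish scores are placeholder numbers for illustration; only 0.00022810218978102192 comes from the example above.

```python
def calibrate_threshold(real_scores, gibberish_scores) -> float:
    """Return the midpoint between the lowest real-text score and the
    highest gibberish score (hypothetical calibration rule)."""
    return (min(real_scores) + max(gibberish_scores)) / 2


def is_gibberish(additive: float, threshold: float) -> bool:
    """Flag text whose additive score falls below the threshold."""
    return additive < threshold


# Placeholder scores for illustration; in practice, collect them by calling
# scanner.scan() on labeled samples of your own data.
real_scores = [0.05, 0.08, 0.06]
gibberish_scores = [0.00022810218978102192, 0.001]

threshold = calibrate_threshold(real_scores, gibberish_scores)
print(threshold)
print(is_gibberish(0.00022810218978102192, threshold))
```

The multiplicative score is left out of this rule on purpose: as the output above shows, it can be 0.0, which makes it hard to compare across strings.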

Download files

Source Distribution

gibberishpy-1.0.3.tar.gz (4.9 kB)

Uploaded Source

Built Distribution

gibberishpy-1.0.3-py3-none-any.whl (5.5 kB)

Uploaded Python 3

File details

Details for the file gibberishpy-1.0.3.tar.gz.

File metadata

  • Download URL: gibberishpy-1.0.3.tar.gz
  • Upload date:
  • Size: 4.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for gibberishpy-1.0.3.tar.gz

Algorithm    Hash digest
SHA256       3fd07b6b61f16b1c5330b90756eb71d02077be761f180ffcd026f3ba9831e400
MD5          f032315c94fa370c7b38de767bfb724e
BLAKE2b-256  ff4fc559378837090a8c3d8614e3895fe4009aa932171c02c962884c183a8f45

File details

Details for the file gibberishpy-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: gibberishpy-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 5.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for gibberishpy-1.0.3-py3-none-any.whl

Algorithm    Hash digest
SHA256       ef4312fea849c8fd53696ed14cd122a5d8a2aca4ec8a44c299380c6bee99aacf
MD5          dc9b98cae48b4dee6f5f0d3d9e85f87f
BLAKE2b-256  e20d273eaba40394a353c8c23ffbe3d44c8b44d94f30eff87260738a610ccf7f
