Skip to main content

Analyze multiple files for similarity and/or uniqueness.

Project description

runner

Analyze multiple files for similarity and/or uniqueness. Finding similarities of works duplicated from one or more part of different works to create a seemingly unique one can be difficult because of different strategies being used, but this can be done better with a software like runner.

Requirements

Usage

Install the latest abero package, upcoming versions might introduce unannounced changes, so a virtual environment is a must have before installation.

pip install -U abero

To integrate abero into your Python codes, check the code snippet below:

import abero

abero.analyze(directory, extension="txt", threshold=80, template=None, skipnames=0, group=0, unzip=0, reset=0)

CLI Usage

# usage: runner [-h] -d directory [-e extension] [-c control] [-t threshold] [-u unzip] [-s skipnames] [-g group] [-r reset]

py runner.py -d "<path_to_files>" -e "txt" -c "<path_to_control_file>" -t 1 -u 1 -s 1 -r 1
  1. -d <path> - Full path of the dirctory containing the files to analyze.
  2. -e <txt> - List of allowed file extensions to analyze.
  3. -c <*.txt> - Full path of the control file.
  4. -t <80> - Threshold level for uniqueness, treats similarity below threshold as unique (1-100; default = 0)
  5. -u <0> - Unzip/extract ZIP files (0-1; default = 0)
  6. -s <0> - Skip files with common names (0-1; default = 0)
  7. -g <1> - Only compare if files contains the same identifier (0-1; default = 1)

Example: student1*_set1*.py >> student2*_set1*.py

  1. -r <0> - Reset analytics before execution (0-1; default = 0)

Control File

Control file contains words or phrases, checked line-by-line, that are deem allowed to be contained in all files to analyzed; therefore, if found on the test files, it will not be flagged as duplicate work.

Features

  • Unzip feature
  • File comparison
  • Threshold levels
  • Skip / group compare
  • Diff tool, content viewer

Did you know?

The repository name abero was inspired from the words aberrant and runner (Latin), which may mean deviating or being absent.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abero-1.0.6.tar.gz (125.4 kB view details)

Uploaded Source

Built Distribution

abero-1.0.6-py3-none-any.whl (8.3 kB view details)

Uploaded Python 3

File details

Details for the file abero-1.0.6.tar.gz.

File metadata

  • Download URL: abero-1.0.6.tar.gz
  • Upload date:
  • Size: 125.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for abero-1.0.6.tar.gz
Algorithm Hash digest
SHA256 d0c06e64e4660a86d367c02856a70c9f6a6b9689d5d5bebc394deee490297af2
MD5 181ee2a3e38e31085096f55ff64a1c2e
BLAKE2b-256 65fd227662c4e09bcb68ef635e2c2c47a827fc4433f49651b5de7c705fad7a34

See more details on using hashes here.

File details

Details for the file abero-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: abero-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 8.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for abero-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 80c9c28dbeadb7833beb5b6c7188e961af983ad22724c8d58c0545904cfa7097
MD5 1598d285715c83a33fce5ba5e6f24f3d
BLAKE2b-256 15c51068f2cb66ced04f2f7e5bb332188f80cb98162c8b98cdd252942c1ef70a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page