
Source Code Analyzing Machine is an application for analyzing similarities between separate files. It currently supports checking Python, Java, and '*.txt' files.

Project description

SourceAnalyzer

GitHub Repo Link

  • A locally run application that demonstrates different matching algorithms
  • The current release compares files one to one
  • Outputs a similarity percentage and a highlighted visualization of the matching sections of the input documents
  • Currently supports raw text files and Python files, with C++ and Java planned for future releases
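To make "similarity percentage" and "matching sections" concrete, here is a minimal sketch built on Python's standard difflib module. It is an illustration only: the function names and the min_len threshold are assumptions, and this is not necessarily the matching algorithm the application itself uses.

```python
from difflib import SequenceMatcher


def similarity_percent(text_a: str, text_b: str) -> float:
    """Return a 0-100 similarity score for two strings (illustrative only)."""
    matcher = SequenceMatcher(None, text_a, text_b, autojunk=False)
    return 100.0 * matcher.ratio()


def matching_sections(text_a: str, text_b: str, min_len: int = 5):
    """Return (start_in_a, start_in_b, length) for each shared block.

    min_len is a hypothetical threshold that hides very short matches.
    """
    matcher = SequenceMatcher(None, text_a, text_b, autojunk=False)
    return [
        (block.a, block.b, block.size)
        for block in matcher.get_matching_blocks()
        if block.size >= min_len
    ]


if __name__ == "__main__":
    a = "hello world"
    b = "say hello world"
    print(similarity_percent(a, b))
    print(matching_sections(a, b))
```

The offsets returned by matching_sections are the kind of information a highlighted view needs: each tuple says where a shared block starts in each document and how long it is.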

Manual pdf link:

https://www.dropbox.com/s/tdtd9n7aubkxf9u/Codalyzers%20Project.pdf?dl=0

Getting Started

STEP 1

It is recommended to create and activate a Python virtual environment before running the script.

python3.8 -m venv <dir> 
source <dir>/bin/activate

STEP 2

Install the source_analyzer Python package from the Python Package Index. Note: Python 3.8 or greater is required.

pip install scam
OR

Download the latest built release archive (source_analyzer-X.X.X.tar.gz), then install the downloaded file:

pip3 install /<path_to_file>/scam-0.0.2.tar.gz

STEP 3

Make sure XLaunch is running, then run the script:

scam

Known Errors/Issues

Errors: When the same substring occurs multiple times in file B, only the first instance of that substring is returned.

Issues: Python files containing a large number of print statements may skew the results.
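The first-instance behaviour described under Errors can be illustrated with plain string searching. The sketch below is hypothetical and does not reproduce the application's code: find_first mirrors the reported limitation, while find_all shows one way every occurrence could be collected.

```python
def find_first(haystack: str, needle: str) -> int:
    """Return the index of the first occurrence only, or -1 if absent."""
    return haystack.find(needle)


def find_all(haystack: str, needle: str) -> list:
    """Return the start index of every occurrence, overlaps included."""
    if not needle:
        return []  # an empty needle would otherwise match at every index
    positions = []
    start = haystack.find(needle)
    while start != -1:
        positions.append(start)
        # Resume one character later so overlapping matches are found too.
        start = haystack.find(needle, start + 1)
    return positions


if __name__ == "__main__":
    file_b = "print(x)\nprint(y)\nprint(x)\n"
    print(find_first(file_b, "print(x)"))
    print(find_all(file_b, "print(x)"))
```

The same sample also hints at the print-statement issue: a line such as print(x) is likely to recur across unrelated programs, so files dominated by such lines can look more alike than they really are.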

Test Files

Test files can be found in:

/lib/python3.8/site-packages/source/test_file

This directory contains Python files and .txt files. Use the file filter at the bottom to switch between them, since only one type is shown at a time.
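If the directory is hard to find inside a virtual environment, a short script can build the likely path and list what is there. This is a sketch under assumptions: it supposes the package was installed into the active environment's site-packages, that the source/test_file subpath quoted above applies, and the helper names are invented for illustration.

```python
import sysconfig
from pathlib import Path


def candidate_test_dir(subpath: str = "source/test_file") -> Path:
    """Build the expected test-file directory under site-packages.

    The default subpath is taken from the path quoted above; it is not
    checked against what the package actually installs.
    """
    return Path(sysconfig.get_paths()["purelib"]) / subpath


def list_test_files(directory, patterns=("*.py", "*.txt")):
    """Return sorted files in directory that match any of the patterns."""
    directory = Path(directory)
    if not directory.is_dir():
        return []
    return sorted(p for pattern in patterns for p in directory.glob(pattern))


if __name__ == "__main__":
    test_dir = candidate_test_dir()
    print(test_dir)
    for path in list_test_files(test_dir):
        print(path.name)
```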

Project Group: Codalyzers

  • Djoni Austin | @dcaust1n
  • Jared Dawson | @lukinator1
  • Shane Eising | @seising99
  • Julian Marott | @jmmoratta

References:

https://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf
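The paper linked above describes winnowing, a document fingerprinting technique. The following is a minimal sketch of that idea for orientation; it is not taken from this package, and the parameter defaults (k, window), the CRC32 hash, and the Jaccard-style overlap score are all assumptions chosen for brevity.

```python
import zlib


def winnow(text: str, k: int = 5, window: int = 4) -> set:
    """Return a set of (hash, position) fingerprints for text.

    Steps: strip whitespace and lowercase, hash every k-gram, then keep
    the minimum hash of each sliding window (rightmost minimum on ties).
    """
    normalized = "".join(ch.lower() for ch in text if not ch.isspace())
    hashes = [
        zlib.crc32(normalized[i:i + k].encode("utf-8"))
        for i in range(len(normalized) - k + 1)
    ]
    if not hashes:
        return set()
    # For very short inputs, shrink the window so one fingerprint survives.
    window = min(window, len(hashes))
    fingerprints = set()
    for start in range(len(hashes) - window + 1):
        current = hashes[start:start + window]
        smallest = min(current)
        # Index of the rightmost occurrence of the minimum in this window.
        offset = window - 1 - current[::-1].index(smallest)
        fingerprints.add((smallest, start + offset))
    return fingerprints


def fingerprint_overlap(text_a: str, text_b: str, k: int = 5, window: int = 4) -> float:
    """Return the Jaccard overlap (0.0-1.0) of the two fingerprint hash sets."""
    hashes_a = {h for h, _ in winnow(text_a, k, window)}
    hashes_b = {h for h, _ in winnow(text_b, k, window)}
    if not hashes_a or not hashes_b:
        return 0.0
    return len(hashes_a & hashes_b) / len(hashes_a | hashes_b)


if __name__ == "__main__":
    print(fingerprint_overlap("for i in range(10): print(i)",
                              "for j in range(10): print(j)"))
```

Because whitespace and case are removed before hashing, reformatting a file does not change its fingerprints, which is one reason this family of techniques suits source code comparison.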

Release history

This version

0.2.7

Download files


Source Distribution

scam-0.2.7.tar.gz (34.0 kB view details)

Uploaded Source

Built Distribution


scam-0.2.7-py2-none-any.whl (291.8 kB view details)

Uploaded Python 2

File details

Details for the file scam-0.2.7.tar.gz.

File metadata

  • Download URL: scam-0.2.7.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.25.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/2.7.17

File hashes

Hashes for scam-0.2.7.tar.gz
Algorithm Hash digest
SHA256 6548933cdf9b92b0caffc8127958ccd4d3427ac935e9cd8367c4556e2ef9b102
MD5 ee74eb84af93e2415b6b78ff49b6939f
BLAKE2b-256 38b107e55a33819ed072f653c8a0c1ceb84a360738bffb05e25a0da30d8023ba
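A downloaded archive can be checked against the SHA256 digest listed above before installing it. The sketch below uses only the standard hashlib module; the function names and the local file path in the usage comment are hypothetical.

```python
import hashlib


def sha256_of_file(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex: str) -> bool:
    """Compare a file's digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example usage (the path is a placeholder for wherever the file was saved):
# matches_published_hash(
#     "scam-0.2.7.tar.gz",
#     "6548933cdf9b92b0caffc8127958ccd4d3427ac935e9cd8367c4556e2ef9b102",
# )
```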


File details

Details for the file scam-0.2.7-py2-none-any.whl.

File metadata

  • Download URL: scam-0.2.7-py2-none-any.whl
  • Upload date:
  • Size: 291.8 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.25.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/2.7.17

File hashes

Hashes for scam-0.2.7-py2-none-any.whl
Algorithm Hash digest
SHA256 2664d324c1ce026241a71bb66ef48fd42ce3e3af85eef43d04a53f9efa3237c7
MD5 396f7505a9d68767c24ebd85e3e20e13
BLAKE2b-256 6fa16385985e9efc31ce4504617cfa9cdd6bdeea101bd56fe4906a34e2338c6c

