Source Code Analyzing Maching is an pplication for the analysis of similarities between separate files. Currently with Python, Java, and '*.txt' file checking capabilities.
Project description
SourceAnalyzer
- A locally run application that demonstrates different matching algorithms
- Current release compares files as a one to one connection
- Outputs given percentage of similarity and highlighted visualization of the matching sections of input documents
- Supports, raw text files and python files currently, with C++ and java planned in the future releases
Manual pdf link:
https://www.dropbox.com/s/tdtd9n7aubkxf9u/Codalyzers%20Project.pdf?dl=0
Getting Started
STEP 1
it is recommended to create and navigate to virtual environemnt using python to run script
python3.8 -m venv <dir>
source <dir>/bin/activate
STEP 2
install source_analyzer python package from Python Package Index. Note: must be running python version 3.8 or greater
pip install scam
OR
Download the latest built compressed file release from source_analyzer-X.X.X.tar.gz
then install downloaded file
pip3 install /<path_to_file>/scam-0.0.2.tar.gz
STEP 3
(make sure xlaunch is running) run script
scam
Known Errors/Issues
Errors- Multiple of the same substring found in file B will return only the first instance of that substring.
Issues- Python files featuring a heavy amount of print statements may cause skewed data.
Test Files
Test files can be found in:
/lib/python3.8/site-packages/source/test_file there are python files and .txt files, make sure to change the file filter down below to be able to see either of them.Project Group: Codalyzers
- Djoni Austin | @dcaust1n
- Jared Dawson | @lukinator1
- Shane Eising | @seising99
- Julian Marott | @jmmoratta
References:
https://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scam-0.2.7.tar.gz.
File metadata
- Download URL: scam-0.2.7.tar.gz
- Upload date:
- Size: 34.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.25.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/2.7.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6548933cdf9b92b0caffc8127958ccd4d3427ac935e9cd8367c4556e2ef9b102
|
|
| MD5 |
ee74eb84af93e2415b6b78ff49b6939f
|
|
| BLAKE2b-256 |
38b107e55a33819ed072f653c8a0c1ceb84a360738bffb05e25a0da30d8023ba
|
File details
Details for the file scam-0.2.7-py2-none-any.whl.
File metadata
- Download URL: scam-0.2.7-py2-none-any.whl
- Upload date:
- Size: 291.8 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.25.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/2.7.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2664d324c1ce026241a71bb66ef48fd42ce3e3af85eef43d04a53f9efa3237c7
|
|
| MD5 |
396f7505a9d68767c24ebd85e3e20e13
|
|
| BLAKE2b-256 |
6fa16385985e9efc31ce4504617cfa9cdd6bdeea101bd56fe4906a34e2338c6c
|