
Automatic assignment grading for instructor use in programming courses


This utility aims to provide a simple, yet secure and highly configurable way to autograde programming assignments.

I consider it to be finished. From now on, I will only be adding extra grading languages if necessary and fixing bugs if any are reported. Autograder has been tested on a real university class with hundreds of students and has proven to be error-free (in terms of grades), almost completely safe from cheating, and fast.

Table of Contents

Features
Installation
Quickstart
Supported Programming Languages
Usage
Advanced Usage
Writing testcases
Helper functions
Command line help
Implementation details
Anti Cheating
Adding Programming Languages

Features

  • Most features are demonstrated in the examples/ directory
  • Easy grading (simply run autograder on a directory with assignments and testcases)
  • Easy-to-write testcases
  • A testcase grade can be based on the student's output in stdout
  • A per-testcase grade can be any number out of 100 points
  • Support for grading C, C++, Java, and Python code
  • A file with testcase results can be generated for each student (done by default)
  • You can customize the total points for the assignment, the timeout for the running time of the student's program, the file names considered for grading, and the formatters used to check student output
  • Anti-cheating capabilities that make it nearly impossible for students to break the grader and choose their results (precompilation of testcases, verification of who exited the program, and removal of testcase source files before testing). You can read more on this in the implementation details section below
  • You can pass arguments to language compilers during testcase (or submission) precompilation and compilation using config.ini
  • You can grade submissions in multiple programming languages at once, as long as there are testcases written in each language
  • Most of these features are described in detail in autograder/default_config.ini and in the implementation details and command line help sections below

Installation

  • Currently Linux-only; requires Python >= 3.6. OS X has not been tested. Windows and Python < 3.6 are not supported at all.
  • Run pip3 install assignment-autograder
  • If you want to update to a newer version, run pip3 install --upgrade --no-cache-dir assignment-autograder

Quickstart

  • Run autograder path/to/directory/you'd/like/to/grade --guide. The guide will create all of the necessary configurations and directories for grading and will explain how to grade.

Supported Programming Languages

  • Java (only through the javac and java commands)
  • C (only through gcc)
  • C++ (only through g++)
  • CPython (3.6-3.10)

Usage

  1. Create a tests directory in the same directory as the student submissions. It has to follow the same structure as one of the examples (it can be created automatically using the instructions from the quickstart section).
  2. Write testcases as described below. You can use examples/ as a reference.
  3. Create input and output text files in their respective directories for each testcase. If a test does not require input and/or output, the respective text file is also not required.
  4. Run autograder path/to/submissions/dir from the command line. If you are in the same directory as the submissions, you can simply run autograder.
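
For orientation, a grading directory might look roughly like the layout below. The directory and file names here are purely illustrative; the authoritative structure is the one used in the examples/ directory.

    submissions/
        student1.c
        student2.c
        tests/
            config.ini             # optional, see advanced usage
            output_formatters.py   # optional, see advanced usage
            testcases/
                test1.c            # hypothetical testcase name
            input/
                test1.txt          # stdin for test1, if it needs input
            output/
                test1.txt          # expected stdout for test1, if checked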

Advanced Usage

  • If you create config.ini in tests, you can customize the grader's behavior. Use autograder --guide if you want all optional directories and configurations set up for you. If you remove some configuration fields from config.ini, the grader will use the respective fields from the default config.
  • To check output, you can specify output formatters in a file output_formatters.py in the directory with your testcase folder. They format the student's output, which lets you give credit even when the output is not exactly the same as expected. To see how to write this file, refer to the examples or to default_formatters.py; a sketch follows below.
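
For illustration, a formatter might normalize whitespace and case before comparison. This is a minimal sketch only: the exact function name and signature that autograder expects from output_formatters.py are defined in default_formatters.py, so treat format_output below as a hypothetical placeholder.

    # output_formatters.py -- minimal sketch; the function name and signature
    # that autograder actually expects are defined in default_formatters.py,
    # so format_output here is a hypothetical placeholder.
    def format_output(output: str) -> str:
        """Normalize the student's stdout before it is compared to the
        expected output file: strip each line and lowercase everything so
        that, e.g., 'Hello World ' still matches 'hello world'."""
        return "\n".join(line.strip().lower() for line in output.splitlines())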

Writing testcases

  • Write a main that follows the same structure as one of the examples in your programming language. The main should usually call the student's code, check its result, and call one of the helper functions. (When working with output, you don't check the result; you simply let autograder handle grading by calling CHECK_OUTPUT().)
  • Assume that the student's code is available in your namespace. The examples demonstrate exactly how to call students' functions.
  • Assume that the helper functions CHECK_OUTPUT(), RESULT(int r), PASS(), and FAIL() are predefined, and use them to return student scores to the grader (a minimal sketch follows this list)
  • Each helper function terminates the execution of the program and returns a respective exit code that signifies the student's score for the testcase
  • Each testcase is graded out of 100% and lets you specify grades down to a single percent, which means you fully control how much partial credit is given
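
To make this concrete, here is a minimal Python testcase sketch. It assumes the student's submission defines a function add(a, b) -- a hypothetical name used purely for illustration -- and, as stated above, that the student's code and the helper functions are already available in the testcase's namespace.

    # Minimal Python testcase sketch; add() is a hypothetical student function.
    result = add(2, 3)       # call the student's code
    if result == 5:
        PASS()               # full credit, equivalent to RESULT(100)
    elif isinstance(result, int):
        RESULT(50)           # partial credit: returned an int, but the wrong one
    else:
        FAIL()               # no credit, equivalent to RESULT(0)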

Helper functions

  • CHECK_OUTPUT() indicates that we do not check the student's return values for the testcase and only care about their output (stdout), which autograder checks automatically against the output file with the same name stem as the testcase. (Beware: printing anything within your testcase will break this functionality; see the sketch after this list.)
  • RESULT(int r) returns the student's score r (0-100) back to the grader
  • PASS() returns a score of 100% back to the grader and is equivalent to RESULT(100)
  • FAIL() returns a score of 0% back to the grader and is equivalent to RESULT(0)
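
For output-based grading, a testcase reduces to running the student's code and handing control back to the grader. Again, greet() is a hypothetical student function used only for illustration.

    # Output-checking sketch; greet() is a hypothetical student function that
    # prints to stdout. The testcase prints nothing itself, since any extra
    # output would break the automatic comparison.
    greet()          # the student's code writes to stdout
    CHECK_OUTPUT()   # let autograder compare stdout to the expected output file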

Command line help

usage: autograder [-h] [-v] [-p [min_score]] [--no_output] [-s [<name> [<name> ...]]] [-g] [submission_path]

positional arguments:
  submission_path       Path to directory that contains student submissions

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show the version number
  -p [min_score], --print [min_score]
                        Use after already graded to print assignments with score >= min_score
  --no_output           Do not output any code to the console
  -s [<name> [<name> ...]], --submissions [<name> [<name> ...]]
                        Only grade submissions with specified file names (without full path)
  -g, --guide           Guide you through setting up a grading environment
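
For example, a typical session might look like this (the directory and file names are illustrative; all flags are documented above):

    autograder ./submissions                   # grade every submission in the directory
    autograder ./submissions -p 90             # after grading, print submissions with score >= 90
    autograder ./submissions -s lab1.c lab2.c  # grade only the listed submissions
    autograder ./submissions --guide           # set up the grading environment interactively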

Implementation details

  • I use exit codes to specify student grades. Currently, I choose an integer offset, add it to the student's grade when returning from the testcase, and subtract it when grading; CHECK_OUTPUT has its own exit code. This lets us encode student scores in exit codes (see the sketch after this list). The exit code range I use is carefully picked so that it does not collide with any exit codes occupied by the system. Even though this method seems prone to cheating at first, that is mitigated by the methods described in the anti-cheating section.
  • At the time of writing this README, output checking is a PASS or FAIL process (i.e., no partial credit is possible). The reason is that allowing for 'partial similarity' of outputs is too error-prone and could yield too many points for students who did not actually complete the task properly. If you want to increase the chances of students' output matching, use the formatters described in the advanced usage section.
  • If you don't prototype the student functions you want to test in your C/C++ testcases, you will run into undefined behavior because of how C handles linking: without a prototype, the compiler falls back on an implicit (and likely mismatched) declaration, and calling a function through a declaration that does not match its definition is undefined behavior.
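
To illustrate the exit-code scheme from the first point above, here is a self-contained sketch. The offset value and the inline "testcase" are invented for this example; the real grader picks its own offset and runs actual testcase programs.

    # Illustration of grade-in-exit-code encoding; OFFSET is a made-up value.
    import subprocess, sys

    OFFSET = 103  # hypothetical; the real range avoids system-reserved exit codes

    # "Testcase" side: terminate with the encoded grade (student earned 87/100).
    testcase = f"import sys; sys.exit(87 + {OFFSET})"

    # Grader side: run the testcase and decode the grade from the exit code.
    proc = subprocess.run([sys.executable, "-c", testcase])
    print("decoded grade:", proc.returncode - OFFSET)  # -> decoded grade: 87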

Anti Cheating

One of the main weaknesses of automatic grading is how prone it is to cheating. Autograder tries to solve this problem with the methods described in this section. Currently, it is impossible to cheat autograder in Python, C, and C++ (as far as I've read and tested).

It is very unlikely that a student will be able to cheat autograder when using Java, because it would require them to read and understand the source code of the grader and make private methods public in the testcase file (technically still possible, but not easy at all), or to get the validation string from the testcase class (I have made sure that this is impossible, but I doubt any solution can be bullet-proof if the student has a lot of time and great Java experience). A student capable of all of these steps could easily pass most bachelor-level courses where autograder can be applied without attempting to cheat. However, the possibility of cheating in Java is still nonzero, which is why I am planning to implement protections against making private methods public. The description of the anti-cheating features can be found below.

  • To prevent the student from exiting the process and returning an exit code with the grade of their choice, I validate test output using a pseudorandom key called the validation string. Autograder passes the string to the testcase as an environment variable, which is erased right after the testcase saves it; the testcase then automatically prints it on the last line of stdout before exiting. The autograder pops it from stdout and verifies that it is the same string it sent. If it is not, the student gets the respective error message and a 0 on the testcase. (A minimal sketch of this handshake appears after this list.)
  • To prevent students from simply importing the string from the testcase file, the test helper files (described above) all have some way of disallowing imports. For C/C++ it is the static identifier, for Java it is the private method modifiers, and for Python it is raising an error if __name__ != "__main__". I assume that similar precautions can be implemented in almost any language added to autograder.
  • Simply parsing the validation string out of the testcase file is impossible because it is only saved at runtime.
  • As an additional (and maybe unnecessary) security measure, autograder precompiles testcases without linking for all languages except Java, decreasing the chance that a student can parse the testcase file and figure out the correct return values if the measure above fails.
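
From the testcase's point of view, the first point boils down to the following sketch. The environment variable name here is hypothetical; the real name and the precise protocol live in autograder's test helper files.

    # Validation-string handshake sketch; the variable name is made up.
    import os

    # Save the key and erase it immediately so the student's code cannot read it.
    _validation_string = os.environ.pop("AUTOGRADER_VALIDATION_STRING", "")

    # ... run the student's code and decide on a grade ...

    # Print the key on the last line of stdout so the grader can verify that
    # the testcase, not the student, terminated the process.
    print(_validation_string)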

Adding Programming Languages

  • If you want to add a new language for grading, you have to (a hedged sketch follows this list):

    1. create a new module with a subclass of TestCase in autograder/testcases/
    2. add it to the ALLOWED_LANGUAGES dictionary in autograder/testcases/__init__.py
    3. write a respective test helper module in the autograder/testcases/test_helpers directory
  • Use the other testcase subclasses and test helpers as reference

  • This step is optional, but if you want full anti-cheating capabilities for your new language, you will need to consider three things:

    • Does your language support getting and unsetting environment variables? This is required to save the validation string in your code without leaking it to students.
    • Does your language support private-to-file functions/classes/methods/variables? This is required to prevent the student from simply importing the helper functions and the validation string.
    • Does your language support precompilation (conversion to bytecode without linking)? This is not as important as the other points, but it can speed up grading and hide testcase code from students.
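
Putting the steps together, a new-language module might look roughly like this. Everything below other than the TestCase class and the ALLOWED_LANGUAGES dictionary (both named in this README) is invented for illustration; copy the actual attributes and hooks from the existing subclasses.

    # autograder/testcases/mylang.py -- hypothetical sketch only; mirror the
    # real attributes and methods of the existing TestCase subclasses.
    from autograder.testcases import TestCase  # assumed import location

    class MyLangTestCase(TestCase):
        source_suffix = ".ml"  # hypothetical attribute: extension this language grades

        def run(self, *args, **kwargs):  # hypothetical hook; see real subclasses
            raise NotImplementedError

    # autograder/testcases/__init__.py -- register the new language:
    # ALLOWED_LANGUAGES["mylang"] = MyLangTestCase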
