A benchmark based on SWE-bench that evaluates the conceptual reasoning capabilities of LLMs on software engineering tasks.

Project description

The author of this package has not provided a project description.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

code_conductor_bench-0.1.5.dev2.tar.gz (1.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

code_conductor_bench-0.1.5.dev2-py3-none-any.whl (3.3 kB view details)

Uploaded Python 3
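The wheel file name above follows the standard wheel naming layout: distribution, version, an optional build tag, then the Python, ABI, and platform tags. As a rough illustration, the sketch below splits such a name into its fields. The helper `parse_wheel_filename` and the `WheelName` type are hypothetical names introduced here, not part of this package.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    """Fields of a wheel file name (hypothetical helper type)."""
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated fields."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # Optional build tag sits between version and Python tag.
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


if __name__ == "__main__":
    print(parse_wheel_filename("code_conductor_bench-0.1.5.dev2-py3-none-any.whl"))
```

For the file listed here, the tags `py3`, `none`, and `any` indicate a pure-Python wheel that is not tied to a specific ABI or platform.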

File details

Details for the file code_conductor_bench-0.1.5.dev2.tar.gz.

File hashes

Hashes for code_conductor_bench-0.1.5.dev2.tar.gz
Algorithm    Hash digest
SHA256       ec116051c5d24140536b3fdc5be0883b71dc412b831b691ff4d52cd78794c5d5
MD5          c9aec034f1946f2a68cd31878c5d5b69
BLAKE2b-256  66cf60dfec3f4b87efba7db555a22dbd86f7b374e30b4586b587333b72121fb3

See more details on using hashes here.
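A published digest is only useful if the downloaded file is checked against it. The following is a minimal sketch of such a check using Python's standard `hashlib`; it assumes the source distribution was saved locally under its original file name, and the helper names `sha256_of` and `matches_expected` are illustrative only.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for code_conductor_bench-0.1.5.dev2.tar.gz.
EXPECTED_SHA256 = "ec116051c5d24140536b3fdc5be0883b71dc412b831b691ff4d52cd78794c5d5"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 equals the expected hex digest."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the archive sits in the current directory.
    archive = Path("code_conductor_bench-0.1.5.dev2.tar.gz")
    if archive.exists():
        print("digest OK" if matches_expected(archive) else "digest MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

pip can perform the same check at install time through its hash-checking mode, where each requirement line carries a `--hash=sha256:...` option.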

File details

Details for the file code_conductor_bench-0.1.5.dev2-py3-none-any.whl.

File hashes

Hashes for code_conductor_bench-0.1.5.dev2-py3-none-any.whl
Algorithm    Hash digest
SHA256       b51a258142b05f030169ce5ca041f69b5f9f18c5543ba8b13f8f96e654041ce6
MD5          09afe1868099351850006a36b7390189
BLAKE2b-256  69066253192ccaa2a904984e54cda5903867eb961f08ed1816a26fd1c8ece8b1
