A Python wrapper over NESTFUL data
NESTFUL
This is the official repository for NESTFUL.
- Paper Title: NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls
- Link: https://arxiv.org/abs/2409.03797v2
- HuggingFace Data Link: https://huggingface.co/datasets/ibm-research/nestful
Data
The latest NESTFUL evaluation set is in the data_v2 directory:
- nestful_data.jsonl: contains 1861 evaluation samples for nested sequencing.
- executable_functions: contains the implementation of all the functions in the benchmark.
The data_v1 directory includes the data for the previous version of the paper - link.
- executable: contains data and specs with the information needed to execute them through RapidAPI.
- non-executable: contains the nested sequencing data from SGD and GLAIVE, hand-picked by human annotators from data synthetically generated using an LLM.
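Since nestful_data.jsonl is a JSON Lines file, it can be read with the standard library alone. The following is a minimal sketch, not the wrapper's own API: the helper name `load_jsonl` is hypothetical, and it makes no assumption about the fields inside each sample.

```python
import json
from pathlib import Path
from typing import Any, Dict, List


def load_jsonl(path: str) -> List[Dict[str, Any]]:
    """Read a JSON Lines file: one JSON object per non-empty line.

    Hypothetical helper for illustration; it is not part of nestful_wrapper.
    """
    records: List[Dict[str, Any]] = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines between records
                records.append(json.loads(line))
    return records
```

Assuming the repository is checked out locally, `load_jsonl("data_v2/nestful_data.jsonl")` should return a list whose length matches the 1861 samples stated above. Inspecting `samples[0].keys()` is a safer way to discover the schema than guessing field names.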
Download files
Source Distribution
nestful_wrapper-0.0.3.tar.gz (94.6 kB)
Built Distribution
File details
Details for the file nestful_wrapper-0.0.3.tar.gz.
File metadata
- Download URL: nestful_wrapper-0.0.3.tar.gz
- Upload date:
- Size: 94.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9d5e4b03e2e568487b91b418698d8189dd8e3d0b281fbfb113b6faee63b4e62e |
| MD5 | a5a49811fc828eee1fa964613f3dd146 |
| BLAKE2b-256 | 8e9bb0c7ae48fb002f68a6a79b1de16c39d85c3a5f4663a4323e648874f07b63 |
File details
Details for the file nestful_wrapper-0.0.3-py3-none-any.whl.
File metadata
- Download URL: nestful_wrapper-0.0.3-py3-none-any.whl
- Upload date:
- Size: 118.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 395040b24b8f8d304af562ca1591862a485223c7074a68aea3cc59e0d1053554 |
| MD5 | f2213a9be5d02c52de33c77448b91cd0 |
| BLAKE2b-256 | d83aa06c40f549dc33bcfca5d2b1ff870a341442c38153a9b53f8a243dea82a1 |
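The published digests can be used to check a downloaded file before installing it. The sketch below is one way to do that with Python's `hashlib`. The SHA256 values are copied from the file hashes listed above, while the helper names `sha256_of` and `matches_published_hash` are hypothetical and the local file path is whatever you saved the download as.

```python
import hashlib

# SHA256 digests copied from the file hash listings for release 0.0.3.
EXPECTED_SHA256 = {
    "nestful_wrapper-0.0.3.tar.gz":
        "9d5e4b03e2e568487b91b418698d8189dd8e3d0b281fbfb113b6faee63b4e62e",
    "nestful_wrapper-0.0.3-py3-none-any.whl":
        "395040b24b8f8d304af562ca1591862a485223c7074a68aea3cc59e0d1053554",
}


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, filename: str) -> bool:
    """Compare a local file against the digest published for `filename`."""
    return sha256_of(path) == EXPECTED_SHA256[filename]
```

For example, `matches_published_hash("nestful_wrapper-0.0.3.tar.gz", "nestful_wrapper-0.0.3.tar.gz")` should return `True` for an unmodified download of the source distribution.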