BaseBenchmark
- name (str): Name of the benchmark.
- data_dir (str): Path to the data directory.
- save_to (str): Path to save the results.
- processes (int): Number of processes to use for parallel processing. :(default: :obj:
1)
init
- name (str): Name of the benchmark.
- data_dir (str): Path to the data directory.
- save_to (str): Path to save the results.
- processes (int): Number of processes to use for parallel processing. :(default: :obj:
1)
download
load
- force_download (bool): Whether to force download the data.
train
valid
test
run
- agent (ChatAgent): The chat agent.
- on (str): The data split to run the benchmark on.
- randomize (bool): Whether to randomize the data.
- subset (int): The subset of the data to run the benchmark on.