Industry standard for evaluating large models across thousands of dimensions.

Library

evaluation, benchmark, framework

Notes

Date approximate.