Get your Maihem API key and install the SDK before
you start.
Integrate with your codebase
1
Add target agent (if you haven't already)
2
Add a decorator to each step of your workflow
This is an example of a basic RAG workflow. Add a decorator to each step of the workflow as shown below.See a full list of supported evaluators and metrics and their required input and output maps.
Example agent workflow in Python
Upload data and run test
4
Format your test data
Make sure the data you want to upload is in the following format:
3
Upload dataset
dataset_a
will now be available to use in your tests.4
Create test
5
Run the test
A
test run
will generate:- Simulated conversations between your target agent and Maihem
- Evaluations of the conversations
- A list of detected failures
6