feat: add logic to track eval runs #3019
base: main
@@ -56,6 +58,8 @@ class ReliabilityEval:
    file_path_to_save_results: Optional[str] = None
    # Enable debug logs
    debug_mode: bool = getenv("AGNO_DEBUG", "false").lower() == "true"
    # Log the results to the Agno platform
    monitoring: bool = getenv("AGNO_MONITOR", "").lower() == "true"
We should also be able to set this on initialization, same as for agent/team.
And that parameter should default to True I think.
- Yes, they can already be set on initialization! We don't override __init__ for these classes, but they're decorated with @dataclass
- Good one, agreed, will make it default to True
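To illustrate the point about @dataclass, here is a minimal sketch, not the actual agno code. ReliabilityEvalSketch is a hypothetical stand-in that keeps only the fields shown in the diff, and the "true" fallback for AGNO_MONITOR is just one assumed way to make the flag default to True.

```python
from dataclasses import dataclass
from os import getenv
from typing import Optional


@dataclass
class ReliabilityEvalSketch:
    # Hypothetical stand-in for ReliabilityEval; only the fields from the diff are kept.
    file_path_to_save_results: Optional[str] = None
    # Enable debug logs
    debug_mode: bool = getenv("AGNO_DEBUG", "false").lower() == "true"
    # Assumed variant: fall back to "true" when AGNO_MONITOR is unset.
    # The default is computed once, when the class body is executed.
    monitoring: bool = getenv("AGNO_MONITOR", "true").lower() == "true"


# The __init__ generated by @dataclass accepts every field as a keyword argument,
# so an explicit value always wins over the environment-derived default.
explicit_off = ReliabilityEvalSketch(monitoring=False)
explicit_on = ReliabilityEvalSketch(monitoring=True)
```

Because the default is evaluated when the class is defined, changing AGNO_MONITOR after import would not affect new instances in this sketch; passing monitoring explicitly is the reliable override.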
LGTM, one last thing
- agno/api/evals.py and agno/eval/utils.py: logic to call the API and track evaluation runs
- track_results flag and logic to track the runs when needed