Apex: An advanced autonomous coding agent for VS Code featuring total autonomy modes, recursive chain-of-thought reasoning, council-of-critics self-critique, persistent memory, dynamic personas, and extensive tool use capabilities.
Comprehensive benchmark of 44 open source language models across creative writing, logic puzzles, counterfactual reasoning, and programming tasks. Tested on Apple M4 Max with detailed performance analysis.
# Open Source Language Model Benchmark

This repository evaluates 43 open source language models across tasks like creative writing and programming. 🚀 It offers insights into model performance, showing that speed does not always equal accuracy. 🐱💻

## Overview

This benchmark evaluates where we currently stand with open source language models, exa