j-94

7 followers · 25 following

Starred repositories

i-am-shodan / USBArmyKnife

USB Army Knife – the ultimate close access tool for penetration testers and red teamers.

C++ 1,735 167 Updated Jul 14, 2025

NomenAK / SuperClaude

A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.

Python 7,904 695 Updated Jul 14, 2025

HazyResearch / cartridges

Storing long contexts in tiny caches with self-study

Python 87 5 Updated Jul 15, 2025

webtui / webtui

MDX 1,948 37 Updated Jul 14, 2025

snap-stanford / POPPER

Automated Hypothesis Testing with Agentic Sequential Falsifications

Python 210 21 Updated May 14, 2025

zou-group / textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Python 2,742 231 Updated Jul 9, 2025

Paper2Poster / Paper2Poster

Open-source Multi-agent Poster Generation from Papers

Python 2,317 139 Updated Jun 17, 2025

octotools / octotools

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 1,225 161 Updated Jul 3, 2025

aws / amazon-q-developer-cli-autocomplete

Rust 28 29 Updated Jul 15, 2025

CharlesQ9 / Alita

665 37 Updated Jun 6, 2025

smtg-ai / claude-squad

Manage multiple AI terminal agents like Claude Code, Aider, Codex, OpenCode, and Amp.

Go 2,909 196 Updated Jul 13, 2025

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,516 321 Updated Jun 12, 2025

openai / preparedness

Releases from OpenAI Preparedness

Python 793 79 Updated May 30, 2025

openai / openai-python

The official Python library for the OpenAI API

Python 27,365 4,063 Updated Jul 14, 2025

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 16,556 2,755 Updated Dec 18, 2024

openai / simple-evals

Python 3,828 384 Updated Jul 9, 2025

morph-labs / SWELancer-Benchmark

Forked from openai/SWELancer-Benchmark

Set up SWE-Lancer 50X faster on Morph Cloud

Python 7 Updated Apr 3, 2025

openai / SWELancer-Benchmark

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,435 136 Updated May 16, 2025

frdel / agent-zero

Agent Zero AI framework

Python 10,983 2,125 Updated Jul 14, 2025

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,054 165 Updated Feb 7, 2025

Terry-Xu-666 / NodeRAG

The official repository of NodeRAG

Python 318 42 Updated Mar 19, 2025

ryokun6 / ryos

ryOS, made with Cursor

TypeScript 412 63 Updated Jul 14, 2025

Ziems / arbor

A framework for optimizing DSPy programs with RL

Python 89 9 Updated Jul 14, 2025

aaronwangy / Data-Science-Cheatsheet

A helpful 5-page machine learning cheatsheet to assist with exam reviews, interview prep, and anything in-between.

TeX 5,225 738 Updated Mar 15, 2023

SALT-NLP / collaborative-gym

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 88 9 Updated Apr 8, 2025

coder / agentapi

HTTP API for Claude Code, Goose, Aider, and Codex

Go 648 49 Updated Jul 7, 2025

coder / anyclaude

Claude Code with any LLM

TypeScript 46 7 Updated May 27, 2025

toolhouse-community / mcp-server-toolhouse

Python 13 10 Updated Mar 5, 2025

UdaraJay / concept-explorer

Dive endlessly deeper into a single concept using AI

Python 84 12 Updated Apr 12, 2025

jaw9c / awesome-remote-mcp-servers

Remote MCP Servers

568 52 Updated Jul 14, 2025

Lists (12)

agent

agents

AI4SWE

DATA

DSPY

Home visual

INFRA

layout

LMDEV

nextstack

SEARCH

UX

Starred repositories

prompt-engineering