sharpboy2008 (Simon Ye) / Starred · GitHub

Starred repositories

Code repository for O'Reilly book

Jupyter Notebook 3,019 1,862 Updated Jan 7, 2025

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

Python 679 111 Updated Jun 23, 2025

LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain

Python 70 8 Updated Oct 30, 2024

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

Python 1,537 431 Updated Aug 27, 2021

Distributed crawling infrastructure running on top of serverless computation, cloud storage (such as S3) and sophisticated queues.

TypeScript 430 37 Updated Dec 30, 2022

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

HTML 2,703 746 Updated Jul 3, 2021

JavaScript scraping module based on Puppeteer for many different search engines...

HTML 561 127 Updated Dec 30, 2022

The ultimate LLM/AI application development framework in Golang.

Go 4,607 369 Updated Jun 25, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 3,254 236 Updated Jun 23, 2025

Statistics of Common Crawl monthly archives mined from URL index files

Python 183 11 Updated May 28, 2025

Automatic extraction of relevant features from time series:

Jupyter Notebook 8,829 1,242 Updated Feb 16, 2025

A research prototype of a human-centered web agent

Python 5,756 577 Updated Jun 25, 2025

Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python 35,301 3,570 Updated Jun 25, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 24,530 3,298 Updated Jun 25, 2025

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Python 2,684 224 Updated Apr 1, 2025

🧠 Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Including: ChatGPT, Claude, Perplexity, Manus, Claude-Code, Loveable, v0, Grok, same new,…

TypeScript 2,653 428 Updated Jun 25, 2025

Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app

MDX 1,842 154 Updated Jun 23, 2025

Examples of using E2B

TypeScript 1,007 183 Updated May 17, 2025

Surf is a computer use AI agent powered by OpenAI that interacts with E2B's virtual desktop environment through natural language instructions

TypeScript 446 73 Updated May 23, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,278 1,709 Updated Jun 19, 2025

AI computer use powered by open source LLMs and E2B Desktop Sandbox

Python 1,301 178 Updated Jun 5, 2025

Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.

JavaScript 3,543 337 Updated Jun 4, 2025

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Jupyter Notebook 4,116 1,125 Updated Aug 31, 2024

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,506 99 Updated Jun 23, 2025

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python 4,889 677 Updated Jun 23, 2025

Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2

Python 1,533 338 Updated May 12, 2025

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 16,471 2,339 Updated Dec 26, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,918 2,953 Updated Jun 25, 2025