More
<
8000
/summary>
Stars
🔥Highlighting the top ML papers every week.
Robust and fast topic models with sentence-transformers.
Curated resources for discovering, reading, and working with arXiv papers
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
A lightweight LMM-based Document Parsing Model
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Designing Multi-Agent Systems with Zero Supervision
Perform transformations on your data with natural language using LLMs
Research papers and blogs to transition to AI Engineering
A Python toolkit for chain-of-thought prompting 🐍
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework. It leverages Google's Gemini 2.0 Flash and OpenAI's GPT-4.1…
FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection layout modes
explore token trajectory trees on instruct and base models
Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The inference framework can be sglang, or it can be adapted/modified t…
Tool for generating high quality Synthetic datasets
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini) and includes benchmark tools to test on your own data. Searches 10+ sources - arXiv, PubMed, GitHub, web, and your…
Fine-tune LLMs for free with 100+ Notebooks on Google Colab, Kaggle, and more.
Self-contained worked examples of Apache Lucene features and functionality
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.