-
inferencemachines
Starred repositories
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face model…
List of references and online resources related to data science, machine learning and deep learning.
FastAPI Backend for a Conversational Agent using Cohere, (Azure) OpenAI, Langchain & Langgraph and Qdrant as VectorDB
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Composable building blocks to build Llama Apps
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
📺 Discover the latest machine learning / AI courses on YouTube.
Desktop app for prototyping and debugging LangGraph applications locally.
Create an open source toy dataset for finetuning LLMs with reasoning abilities
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
A GPT-based autonomous multi-agent AI in Next.js that research & recommends Instagram Viral Posts reflecting your personality.
TAG-Bench: A benchmark for table-augmented generation (TAG)
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Open-Sora: Democratizing Efficient Video Production for All
A composable and fully extensible C++ execution engine library for data management systems.
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A efficient and effective few-shot NL2SQL method on GPT-4.
The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user-interfaces (GUIs) by using only natural language. Uses Visu…