Carnegie Mellon University
Pittsburgh, PA
Stars
Utilities intended for use with Llama models.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Migrate to PostgreSQL in a single command!
Open-source resources on agents for computer use.
TAG-Bench: A benchmark for table-augmented generation (TAG)
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
A generative world for general-purpose robotics & embodied AI learning.
🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL
A curated list of GraphRAG, Knowledge Graph, and other graphy GenAI resources
Collection of text2cypher datasets, evaluations, and finetuning instructions
A topic-centric list of HQ open datasets.
Calibrated Seq2Seq Models for Efficient and Generalizable Ultra-fine Entity Typing (EMNLP 2023)
🤖 A Python library for learning and evaluating knowledge graph embeddings
A system for agentic LLM-powered data processing and ETL
[WWWJ 2024] LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
Code and dataset for Bhola et al. (2020), "Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework"
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…
Agentic components of the Llama Stack APIs
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?