This repository processes the Contract Understanding Atticus Dataset (CUAD) using LangChain to extract structured data from legal contracts and build a comprehensive knowledge graph. The system includes a LangGraph-powered agent for intelligent querying and analysis of contract information.
The Contract Understanding Atticus Dataset (CUAD) consists of 500 contracts with annotations for 41 legal clauses. This dataset provides a rich source of legal text for information extraction and analysis.
- Contract Processing: Extract structured data from 500 CUAD contracts using LangChain
- Knowledge Graph Construction: Build a comprehensive graph database of contract relationships and clauses
- Intelligent Query System: Utilize LangGraph agents to query and analyze the contract database
- Clone the repository
- Copy the environment file:
copy .env.example to .env
- Start the application:
docker-compose up