8000 GitHub - ash-hun/BERGEN-UP: E2E Evaluation Pipeline for ONLY RAG. Benchmark to BERGEN from NAVER Labs (a.k.a. BERGEN UP✨)
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

E2E Evaluation Pipeline for ONLY RAG. Benchmark to BERGEN from NAVER Labs (a.k.a. BERGEN UP✨)

License

Notifications You must be signed in to change notification settings

ash-hun/BERGEN-UP

Repository files navigation

BERGEN-UP

New version of BERGEN (a.k.a BERGEN UP✨)

BERGEN (BEnchmarking Retrieval-augmented GENeration) is a library designed to benchmark RAG systems with a focus on question-answering (QA) by NAVER Labs. It addresses the challenge of inconsistent benchmarking in comparing approaches and understanding the impact of each component in a RAG pipeline. Unlike BERGEN, BERGEN-UP is an end-to-end evaluation pipeline that enhanced focuses on the diversity of RAG pipelines and the functionality of each modules.

🍒 Key Feature

  • E2E Evaluation Pipeline for RAG
    • Chunking
      • token level
        • recall
        • precision
        • iou
    • Pre-Retrieval
    • Retrieval
    • Post-Retrieval
    • Generation
  • Extra Module for RAG
    • Generate Synthetic Dataset
      • QA (= Question Answering)

🥑 How to run pipeline?

1. Write your evaluation in conf/config.yaml
2. Run only below script
$ uv run pipeline.py label='__experiments_name__'

🍊 Core points Each Module

Chunking Module
  • 핵심 기능

  • 사용법

    • conf/config.yamlchunking 섹션에 아래 내용을 참고하여 작성한다.
    chunking:
        strategies: 
            - question_set_path: "${hydra:runtime.cwd}/data/chunking/question_set/questions_df_chatlogs.csv"
            - corpora_id_paths:
                chatlogs: "${hydra:runtime.cwd}/data/chunking/corpora/chatlogs.md"
            - Semantic Chunking:
                mode: openai
                embedding_model: "text-embedding-3-large"
                custom_url: "custom_embedding_function_api_address"
            - Recursive Token Chunking:
                chunk_size: 800
                chunk_overlap: 400
            - Fixed Token Chunking:
                chunk_size: 800
                chunk_overlap: 400

About

E2E Evaluation Pipeline for ONLY RAG. Benchmark to BERGEN from NAVER Labs (a.k.a. BERGEN UP✨)

Topics

Resources

License

Stars

Watchers

Forks

Languages

0