BERGEN-UP

New version of BERGEN (a.k.a BERGEN UP✨)

BERGEN (BEnchmarking Retrieval-augmented GENeration) is a library designed to benchmark RAG systems with a focus on question-answering (QA) by NAVER Labs. It addresses the challenge of inconsistent benchmarking in comparing approaches and understanding the impact of each component in a RAG pipeline. Unlike BERGEN, BERGEN-UP is an end-to-end evaluation pipeline that enhanced focuses on the diversity of RAG pipelines and the functionality of each modules.

🍒 Key Feature

E2E Evaluation Pipeline for RAG
- Chunking
  - token level
    - recall
    - precision
    - iou
- Pre-Retrieval
- Retrieval
- Post-Retrieval
- Generation
Extra Module for RAG
- Generate Synthetic Dataset
  - QA (= Question Answering)

🥑 How to run pipeline?

1. Write your evaluation in `conf/config.yaml`

2. Run only below script

$ uv run pipeline.py label='__experiments_name__'

🍊 Core points Each Module

Chunking Module

핵심 기능
- Token Level 평가
  - Metric : (https://research.trychroma.com/evaluating-chunking)
    - iou
    - precision
    - recall

사용법

conf/config.yaml의 chunking 섹션에 아래 내용을 참고하여 작성한다.

chunking:
    strategies: 
        - question_set_path: "${hydra:runtime.cwd}/data/chunking/question_set/questions_df_chatlogs.csv"
        - corpora_id_paths:
            chatlogs: "${hydra:runtime.cwd}/data/chunking/corpora/chatlogs.md"
        - Semantic Chunking:
            mode: openai
            embedding_model: "text-embedding-3-large"
            custom_url: "custom_embedding_function_api_address"
        - Recursive Token Chunking:
            chunk_size: 800
            chunk_overlap: 400
        - Fixed Token Chunking:
            chunk_size: 800
            chunk_overlap: 400

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
conf		conf
data/chunking		data/chunking
modules		modules
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
config.py		config.py
pipeline.py		pipeline.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BERGEN-UP

🍒 Key Feature

🥑 How to run pipeline?

1. Write your evaluation in `conf/config.yaml`

2. Run only below script

🍊 Core points Each Module

About

Languages

License

ash-hun/BERGEN-UP

Folders and files

Latest commit

History

Repository files navigation

BERGEN-UP

🍒 Key Feature

🥑 How to run pipeline?

1. Write your evaluation in conf/config.yaml

2. Run only below script

🍊 Core points Each Module

About

Topics

Resources

License

Stars

Watchers

Forks

Languages

1. Write your evaluation in `conf/config.yaml`