Official repository of the paper "Aligning Language Models with Observational Data: Opportunities and Risks from a Causal Perspective". The project explores methods for fine-tuning large language models (LLMs) on observational data, addressing challenges such as spurious correlations and confounding. We propose DECONFOUNDLM, a novel approach that mitigates these issues by correcting for known confounders in the reward signals.
You can find the project webpage at deconfoundlm.github.io.
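To make the deconfounding idea concrete, the sketch below shows one plausible form such a correction could take: residualizing the reward against known confounder features before fine-tuning. This is a minimal sketch under a linear-adjustment assumption; the function name `deconfound_rewards` and the OLS residualization are illustrative and are not the implementation released in this repository.

```python
# A minimal sketch, assuming the correction amounts to removing the
# component of the reward linearly explained by known confounders.
# Illustrative only -- not the paper's actual method.
import numpy as np

def deconfound_rewards(rewards: np.ndarray, confounders: np.ndarray) -> np.ndarray:
    """Remove the reward variation linearly explained by known confounders.

    rewards:     shape (n,)   raw reward signal, one value per sample
    confounders: shape (n, k) observed confounder features per sample
    """
    # Center the confounders so the adjustment preserves the mean reward.
    centered = confounders - confounders.mean(axis=0)
    # Ordinary least squares fit: reward ~ confounders.
    beta, *_ = np.linalg.lstsq(centered, rewards, rcond=None)
    # Subtract the confounder-predicted component, keeping the residual.
    return rewards - centered @ beta
```

A synthetic usage example (also illustrative), where the raw reward is contaminated by one confounder:

```python
rng = np.random.default_rng(0)
c = rng.normal(size=(1000, 2))               # known confounders
r = 0.8 * c[:, 0] + rng.normal(size=1000)    # reward contaminated by c[:, 0]
r_adj = deconfound_rewards(r, c)
print(np.corrcoef(r_adj, c[:, 0])[0, 1])     # correlation near zero after correction
```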