The Compressor-Retriever Architecture for Language Model OS

Yuan Yang, Siheng Xiong, Ehsan Shareghi and Faramarz Fekri

🚧 This repo is under heavy development

Overview

LLMs nowadays can process multimodal data, long documents, use tools, and browse web. Can we integrate all these and make a language model OS? Where the LLM acts as a CPU that processes data stored in a context window (RAM).

We argue the the key challenge towards LM OS is managing the life-long context and ensuring statefulness across sessions. To address this, we introduce compressor-retriever, a model-agnostic architecture designed for life-long context management. Our approach exclusively uses the base model's forward function to compress and retrieve context, ensuring end-to-end differentiability. Preliminary experiments demonstrate the effectiveness of this architecture in in-context learning tasks, marking a step towards the development of a fully stateful LLM OS.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
imgs		imgs
model		model
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
experiment.py		experiment.py
generate.py		generate.py
run_sft.sh		run_sft.sh
sft.py		sft.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Compressor-Retriever Architecture for Language Model OS

🚧 This repo is under heavy development

Overview

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

gblackout/LM-OS

Folders and files

Latest commit

History

Repository files navigation

The Compressor-Retriever Architecture for Language Model OS

🚧 This repo is under heavy development

Overview

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages