Sesame-explorations

This is the companion codebase to my short explorations around the Sesame model.

Current status

Currently preparing the training dataset from a 4-hour NotebookLM dataset: https://huggingface.co/datasets/thomwolf/notebooklm-sample

The processing scripts are in this folder: ./scripts

[✅] Extract and convert audio
[✅] Diarize dataset
[ ] Audio and text tokenization
[ ] Finetuning CSM
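
As an illustration of the "Extract and convert audio" step, here is a minimal sketch that builds an ffmpeg command to convert one source file to mono 16-bit WAV. It is not the code in ./scripts: the function name `build_ffmpeg_command`, the example paths, and the 24 kHz target sample rate are all assumptions made for this example, so adjust them to whatever the tokenizer you use expects.

```python
from pathlib import Path


def build_ffmpeg_command(src, dst_dir, sample_rate=24000):
    """Return an ffmpeg argument list converting `src` to mono 16-bit PCM WAV.

    Hypothetical helper: the 24 kHz default is an assumption, not a value
    taken from this repository.
    """
    src = Path(src)
    dst = Path(dst_dir) / (src.stem + ".wav")
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", str(src),           # input file (audio or video container)
        "-vn",                    # ignore any video stream
        "-ac", "1",               # downmix to a single channel
        "-ar", str(sample_rate),  # resample to the target rate
        "-c:a", "pcm_s16le",      # 16-bit little-endian PCM
        str(dst),
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without ffmpeg.
    print(" ".join(build_ffmpeg_command("raw/episode_01.mp4", "wav")))
```

To actually run the conversion, the returned list can be passed to `subprocess.run(cmd, check=True)`, provided ffmpeg is installed and the output folder exists.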
