Stars
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exam.
The Elastic stack (ELK) powered by Docker and Compose.