-
Yandex
-
11:31
(UTC +04:00)
Stars
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Tile primitives for speedy kernels
Machine Learning Engineering Open Book
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
An interactive exploration of Transformer programming.
Gin provides a lightweight configuration framework for Python
Simple code for generating a color-coded latex table from raw data
Run evaluation on LLMs using human-eval benchmark
A simple cost estimator for batch text generation with OpenAI LLMs
Generate textbook-quality synthetic LLM pretraining data
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
LLM Workshop by Sourab Mangrulkar
Large Language Model Text Generation Inference
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Finetuning large language models for GDScript generation.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
A repository of links with advice related to grad school applications, research, phd etc
📘 dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)