8000 syzymon (Szymon Tworkowski) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View syzymon's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@kakainet @Vatican-X-Formers

Block or report syzymon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.

Jupyter Notebook 62 4 Updated Jan 25, 2025

Grok open release

Python 50,279 8,354 Updated Aug 30, 2024

[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training

Python 21 5 Updated Aug 18, 2024

All things prompt engineering

Python 5,606 313 Updated Jun 4, 2024

Your pair programming wingman. Supports OpenAI, Anthropic, or any LLM on your local inference server.

TypeScript 70 11 Updated Jun 26, 2024

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

669 36 Updated Apr 7, 2024

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]

Jupyter Notebook 371 49 Updated Aug 25, 2024

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 1,069 75 Updated Feb 22, 2024

GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.

Python 650 44 Updated Jan 7, 2025

C++ implementation of Qwen-LM

C++ 587 52 Updated Dec 6, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,323 1,502 Updated Apr 29, 2025

Inference code for CodeLlama models

Python 16,300 1,914 Updated Aug 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,336 6,854 Updated Dec 9, 2024

A Python Search Engine for Humans 🥸

Python 219 26 Updated Apr 22, 2024

Premise Selection Data in Isabelle

Python 10 Updated Mar 13, 2023

Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)

Python 79 3 Updated Aug 30, 2023

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Python 1,455 86 Updated Nov 7, 2023

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,726 141 Updated Aug 4, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 943 244 Updated Oct 31, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,991 309 Updated Apr 16, 2025
Python 31 3 Updated Oct 14, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,703 7,506 Updated May 20, 2025

Long-context pretrained encoder-decoder models

Python 94 13 Updated Oct 28, 2022

Simple Cloudflare bypass for ChatGPT

Go 1,335 329 Updated Jul 9, 2023

jax-triton contains integrations between JAX and OpenAI Triton

Python 390 46 Updated May 2, 2025

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,489 398 Updated Jul 16, 2023

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

666 49 Updated Jan 7, 2024
Python 1,470 110 Updated May 12, 2023
Next
0