- University of Warsaw
- Warsaw, Poland
- https://syzymon.github.io
- @s_tworkowski
- https://scholar.google.com/citations?user=1V8AeXYAAAAJ&hl=en
Stars
The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
Your pair programming wingman. Supports OpenAI, Anthropic, or any LLM on your local inference server.
A collection of awesome prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training chat LLMs such as ChatGPT; gathers a wide variety of instruction datasets for training ChatLLM-style models.
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
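A minimal usage sketch for LongLLaMA, assuming the publicly released syzymon/long_llama_3b checkpoint on Hugging Face and the standard transformers generation API; `trust_remote_code=True` is needed because the Focused Transformer attention ships as custom modeling code with the checkpoint.

```python
import torch
from transformers import AutoModelForCausalLM, LlamaTokenizer

# Checkpoint name assumed from the LongLLaMA release on the Hugging Face Hub.
checkpoint = "syzymon/long_llama_3b"

tokenizer = LlamaTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    torch_dtype=torch.float32,
    trust_remote_code=True,  # loads the custom FoT attention modules
)

# Standard greedy generation on a short illustrative prompt.
prompt = "The Focused Transformer extends the effective context length by"
inputs = tokenizer(prompt, return_tensors="pt")
output_ids = model.generate(inputs.input_ids, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```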
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
A framework for the evaluation of autoregressive code generation language models.
General technology for enabling AI capabilities with LLMs and MLLMs.
A high-throughput and memory-efficient inference and serving engine for LLMs
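A minimal sketch of vLLM's offline batched inference interface, using a small placeholder model name for illustration; the `LLM` and `SamplingParams` entry points are the library's standard Python API.

```python
from vllm import LLM, SamplingParams

# Any Hugging Face causal LM id works here; opt-125m is just a small example.
llm = LLM(model="facebook/opt-125m")
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

prompts = [
    "The capital of Poland is",
    "Long-context language models are useful because",
]

# Requests are batched and scheduled internally (PagedAttention KV cache).
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```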
Long-context pretrained encoder-decoder models
jax-triton contains integrations between JAX and OpenAI Triton
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).