8000 t1end4t (Le Tien Dat) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View t1end4t's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Phenikaa University
  • Vietnam

Block or report t1end4t

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of tricks and tools to speed up transformer models

TeX 159 9 Updated Apr 2, 2025

A repo lists papers related to LLM based agent

Python 1,623 90 Updated May 9, 2025

Must-read Papers on LLM Agents.

2,359 139 Updated May 8, 2025
Haskell 399 100 Updated Jan 25, 2024

Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.

C++ 31,751 7,365 Updated Nov 24, 2024

All Algorithms implemented in Rust

Rust 23,884 2,383 Updated Apr 10, 2025

All Algorithms implemented in Python

Python 200,334 46,724 Updated May 10, 2025

A curated list of awesome Haskell frameworks, libraries and software.

442 23 Updated May 1, 2025

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

63,533 7,962 Updated May 4, 2025

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 242,966 25,654 Updated Aug 11, 2024

A curated list of Rust code and resources.

Rust 50,248 2,897 Updated May 10, 2025

🦀 A curated list of Rust tools, libraries, and frameworks for working with LLMs, GPT, AI

396 21 Updated Mar 6, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,579 912 Updated May 7, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,314 567 Updated Oct 28, 2024

Transformer related optimization, including BERT, GPT

C++ 6,149 903 Updated Mar 27, 2024

Fast inference from large lauguage models via speculative decoding

Python 723 68 Updated Aug 22, 2024

The score code of FastBERT (ACL2020)

Python 605 90 Updated Oct 29, 2021

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 423 36 Updated Dec 20, 2023

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Python 375 19 Updated Feb 12, 2024

[KDD'22] Learned Token Pruning for Transformers

Python 97 18 Updated Feb 27, 2023

Trax — Deep Learning with Clear Code and Speed

Python 8,204 825 Updated Apr 10, 2025

SpotServe: Serving Generative Large Language Models on Preemptible Instances

118 10 Updated Feb 22, 2024

Ongoing research training transformer models at scale

Python 12,312 2,755 Updated May 10, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 581 61 Updated Apr 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,274 4,360 Updated May 10, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,210 2,620 Updated Mar 4, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,527 831 Updated Apr 29, 2025

A plugin for Jupyter Notebook to run CUDA C/C++ code

Jupyter Notebook 227 93 Updated Sep 13, 2024
Next
0