LLMNexus
Popular repositories Loading
-
Compact-Language-Models-via-Pruning-and-Knowledge-Distillation
Compact-Language-Models-via-Pruning-and-Knowledge-Distillation PublicForked from alperiox/Compact-Language-Models-via-Pruning-and-Knowledge-Distillation
Unofficial implementation of https://arxiv.org/pdf/2407.14679
Python 1
-
llmperf
llmperf PublicForked from ray-project/llmperf
LLMPerf is a library for validating and benchmarking LLMs
Python 1
-
TensorRT-LLM
TensorRT-LLM PublicForked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ 1
-
-
-
build-nanogpt
build-nanogpt PublicForked from karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
Python
Repositories
- LLMs-from-scratch Public Forked from rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
LLMNexus/LLMs-from-scratch’s past year of commit activity - GOT-OCR2.0 Public Forked from Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
LLMNexus/GOT-OCR2.0’s past year of commit activity - llm-awq Public Forked from mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
LLMNexus/llm-awq’s past year of commit activity - Awesome-LLM-Strawberry Public Forked from hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
LLMNexus/Awesome-LLM-Strawberry’s past year of commit activity - awesome-llm-apps Public Forked from Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
LLMNexus/awesome-llm-apps’s past year of commit activity - nano-sparse-attention Public Forked from PiotrNawrot/nano-sparse-attention
The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
LLMNexus/nano-sparse-attention’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…