Orion-zhen (Orion) · GitHub
💥
CUDA Out Of Memory

Pinned

  1. abliteration Public

    Make abliterated models with transformers, easy and fast

    Python · 77 stars · 13 forks

  2. turboderp-org/exllamav2 Public

    A fast inference library for running LLMs locally on modern consumer-class GPUs

    Python · 4.2k stars · 317 forks

  3. theroyallab/tabbyAPI Public

    The official API server for Exllama. OAI compatible, lightweight, and fast.

    Python · 995 stars · 112 forks

  4. hiyouga/LLaMA-Factory Public

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python · 53.5k stars · 6.6k forks

  5. SJTU-IPADS/PowerInfer Public

    High-speed Large Language Model Serving for Local Deployment

    C++ · 8.2k stars · 434 forks

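  6. CrazyBoyM/llama3-Chinese-chat Public

    Repository of Chinese post-trained versions of Llama3 and Llama3.1: interesting fine-tuned and modified weights, plus tutorial videos and documentation on training, inference, evaluation, and deployment.

    Python · 4.2k stars · 338 forks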
