jatali/KuiperLLama_yes

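Build Your Own LLM Inference Framework

🙋🙋🙋 The build-your-own LLM inference framework course is in full swing. Add the WeChat account below to learn more.

The course walks you through writing, from scratch, an LLM framework that supports LLaMA inference and CUDA acceleration.

Course outline

Third-party dependencies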

  1. google glog https://github.com/google/glog
  2. google gtest https://github.com/google/googletest
  3. sentencepiece https://github.com/google/sentencepiece
  4. armadillo + openblas https://arma.sourceforge.net/download.html

OpenBLAS serves as the math backend for Armadillo and accelerates operations such as matrix multiplication. Intel MKL can be used instead.
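
To make concrete what the backend accelerates, here is a minimal sketch of a naive matrix-vector product, the operation that dominates single-token LLM inference. It is plain standard C++ and deliberately does not use Armadillo or OpenBLAS. The function name `matvec` and the row-major layout are assumptions for illustration, not code from this repository.

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper (not from the repository): computes y = W * x,
// where W is stored row-major as rows x cols floats.
// A BLAS backend such as OpenBLAS produces the same result with
// optimized kernels; this loop is only a readable reference.
std::vector<float> matvec(const std::vector<float>& w,
                          const std::vector<float>& x,
                          std::size_t rows, std::size_t cols) {
  if (w.size() != rows * cols || x.size() != cols) {
    throw std::invalid_argument("matvec: shape mismatch");
  }
  std::vector<float> y(rows, 0.0f);
  for (std::size_t r = 0; r < rows; ++r) {
    float acc = 0.0f;  // dot product of row r with x
    for (std::size_t c = 0; c < cols; ++c) {
      acc += w[r * cols + c] * x[c];
    }
    y[r] = acc;
  }
  return y;
}
```

For a 7B-parameter model this loop runs over matrices with thousands of rows and columns for every generated token, which is why delegating it to an optimized backend matters.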

Model download links

  1. llama2: https://pan.baidu.com/s/1PF5KqvIvNFR8yDIY1HmTYA?pwd=ma8r or https://huggingface.co/fushenshen/lession_model/tree/main

Build instructions

  # Assumes the third-party dependencies listed above are already installed
  mkdir build 
  cd build
  cmake ..
  make -j16

Generating text

./llama_infer llama2_7b.bin tokenizer.model
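
The command above takes two positional arguments: the model checkpoint first, then the tokenizer model. As a rough sketch of how such a command line could be validated, the snippet below checks the argument count and keeps the two paths. `InferArgs` and `parse_infer_args` are hypothetical names chosen for illustration, and this is not the repository's actual `main`.

```cpp
#include <optional>
#include <string>

// Hypothetical container for the two positional arguments.
struct InferArgs {
  std::string checkpoint_path;  // e.g. llama2_7b.bin
  std::string tokenizer_path;   // e.g. tokenizer.model
};

// Returns the parsed arguments, or std::nullopt when the argument
// count is not exactly: program name + checkpoint + tokenizer.
std::optional<InferArgs> parse_infer_args(int argc, const char* const argv[]) {
  if (argc != 3) {
    return std::nullopt;
  }
  return InferArgs{argv[1], argv[2]};
}
```

A caller would typically print a usage line such as `usage: ./llama_infer <checkpoint> <tokenizer>` and exit with a non-zero status when the function returns `std::nullopt`.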
