-
lmdeploy Public
Forked from InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
C++ Apache License 2.0 Updated Feb 7, 2024 -
Chinese-LLaMA-Alpaca-2 Public
Forked from ymcui/Chinese-LLaMA-Alpaca-2
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, including 16K long-context models
Python Apache License 2.0 Updated Dec 13, 2023 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 1, 2023 -
DeepSpeed-MII Public
Forked from deepspeedai/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Python Apache License 2.0 Updated Oct 12, 2023 -
InferLLM Public
Forked from MegEngine/InferLLM
A lightweight LLM inference framework
C++ Apache License 2.0 Updated Oct 10, 2023 -
sentencepiece Public
Forked from google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 Updated May 25, 2023 -
mystars Public
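Forked from wuwenjie1992/StarryDivineSky
An awesome list, mainly covering machine learning, deep learning, NLP, GNN, recommender systems, biomedicine, machine vision, and more. Continuously updated! Stars welcome! 😀😀😀
Other Updated Mar 6, 2022 -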
mindspore Public
Forked from mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
C++ Apache License 2.0 Updated Mar 3, 2022 -
onnxruntime Public
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License Updated Feb 21, 2022 -
TNN Public
Forked from Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its…
C++ Other Updated Feb 14, 2022 -
leetcode-master Public
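Forked from youngyangyang04/leetcode-master
《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀
Updated Feb 9, 2022 -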
libcudacxx Public
Forked from NVIDIA/libcudacxx
The C++ Standard Library for your entire system.
C++ Other Updated Feb 8, 2022 -
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jan 19, 2022 -
Paddle-Lite Public
Forked from PaddlePaddle/Paddle-Lite
Multi-platform high performance deep learning inference engine for PaddlePaddle (『飞桨』)
C++ Apache License 2.0 Updated Jan 19, 2022 -
server Public
Forked from triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
C++ BSD 3-Clause "New" or "Revised" License Updated Jan 19, 2022 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
C++ Apache License 2.0 Updated Jan 14, 2022 -
vosk-server Public
Forked from alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Python Apache License 2.0 Updated Dec 26, 2021 -
CudaSteps Public
Forked from QINZHAOYU/CudaSteps
A CUDA learning path based on the book 《CUDA 编程:基础与实践》 (CUDA Programming: Basics and Practice) by Fan Zheyong (樊哲勇).
Cuda Updated Dec 22, 2021 -
dl_inference Public
Forked from wuba/dl_inference
A general-purpose deep learning inference tool that can quickly bring deep learning models trained with TensorFlow, PyTorch, or Caffe into production.
Java Other Updated Dec 21, 2021 -
fastseq Public
Forked from microsoft/fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
Python MIT License Updated Dec 14, 2021 -
TurboTransformers Public
Forked from Tencent/TurboTransformers
A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
C++ Other Updated Sep 30, 2021 -
funNLP Public
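Forked from fighting41love/funNLP
Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …
Python Updated Aug 20, 2021 -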
fastertransformer_backend Public
Forked from PerkzZheng/fastertransformer_backend
C++ BSD 3-Clause "New" or "Revised" License Updated Aug 4, 2021 -
nlp-tutorial Public
Forked from graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
Jupyter Notebook MIT License Updated Jul 25, 2021 -
brpc-server Public
A high-performance server built on a wrapped brpc library, including Kafka, MySQL, Redis, an HTTP client, and so on.