GitHub

Introduction

A inference benchmark tool, you can base on this tool to extend usage for different benchmark purpose. You can add a dataloader to provide your dataset for benchmarking, and also you can add your backend for benchmarking on different framework or models.

This tool provides several basic benchmarkers, like mlperf, direct, nlp_generative.

Usage

Install python packages

    pip install -r requirement.txt

Benchmark on single gpu

    python runner.py -m facebook/opt-1.3b -b 1 -s 16

Benchmark on multiple gpus

    deepspeed --num_gpus 2 runner.py -m facebook/opt-1.3b -b 1 -s 16

Builtin supported models

- facebook/opt-1.3b
- t5-3b
- EleutherAI/gpt-j-6B
- decapoda-research/llama-7b-hf
- decapoda-research/llama-13b-hf
- decapoda-research/llama-30b-hf
- decapoda-research/llama-65b-hf 
- bigscience/bloom-7b1
- bigscience/bloom
- microsoft/bloom-deepspeed-inference-fp16

For more details info

python runner.py -h

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
benchmark_model_distributed		benchmark_model_distributed
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction

Usage

Install python packages

Benchmark on single gpu

Benchmark on multiple gpus

Builtin supported models

For more details info

About

Uh oh!

Releases

Packages

Languages

scjzhang/tools

Folders and files

Latest commit

History

Repository files navigation

Introduction

Usage

Install python packages

Benchmark on single gpu

Benchmark on multiple gpus

Builtin supported models

For more details info

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages