8000 farazkh80 (Faraz) · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View farazkh80's full-sized avatar
đź’­
Ambitious
đź’­
Ambitious

Highlights

  • Pro

Organizations

@uw-midsun @castorini

Block or report farazkh80

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
farazkh80/README.md

Hi, I’m Faraz.

I completed my Software Engineering degree at the University of Waterloo. I spent two years at Cohere AI working on model inference optimization and post‑training/finetuning. Now, I work on TensorRT-LLM at NVIDIA.



Thanks For Visiting and Feel Free to Connect:

Pinned Loading

  1. TensorRT-LLM TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

    C++

  2. NVIDIA/TensorRT-Incubator NVIDIA/TensorRT-Incubator Public

    Experimental projects related to TensorRT

    MLIR 107 16

0