8000 basetenlabs repositories · GitHub

More Web Proxy on the site http://driver.im/

Baseten

All

58 repositories

truss-examples
Public
Examples of models deployable with Truss
python docker machine-learning ai examples mlops
Python
•
MIT License
•44•174•13•52•Updated Jun 5, 2025Jun 5, 2025
truss
Public
The simplest way to serve AI/ML models in production
open-source machine-learning packaging artificial-intelligence falcon easy-to-use whisper inference-server model-serving inference-api
Python
•
MIT License
•85•996•63•15•Updated Jun 5, 2025Jun 5, 2025
SGLang-Workshop
Public
Workshop materials for AI Engineer World's Fair
Jupyter Notebook
•
MIT License
•9•1•0•0•Updated Jun 3, 2025Jun 3, 2025
TensorRT-LLM
Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
C++
•
Apache License 2.0
•1.5k•0•0•0•Updated Jun 3, 2025Jun 3, 2025
TensorRT-Model-Optimizer
Public
A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Python
•
Other
•78•0•0•2•Updated May 30, 2025May 30, 2025
action-junit-report
Public
Reports junit test results as GitHub Pull Request Check
TypeScript
•
Apache License 2.0
•140•0•0•2•Updated May 15, 2025May 15, 2025
nx-set-shas
Public
✨ A Github Action which sets the base and head SHAs required for `nx affected` commands in CI
TypeScript
•
MIT License
•84•0•0•1•Updated May 15, 2025May 15, 2025
changed-files
Public
Github action to retrieve all (added, copied, modified, deleted, renamed, type changed, unmerged, unknown) files and directories.
TypeScript
•
MIT License
•299•0•0•1•Updated May 15, 2025May 15, 2025
frontend-log-viewer-challenge
Public
TypeScript
•0•1•0•1•Updated May 15, 2025May 15, 2025
create-pull-request
Public
A GitHub action to create a pull request for changes to your repository in the actions workspace
TypeScript
•
MIT License
•480•0•0•1•Updated May 15, 2025May 15, 2025
lws
Public
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Go
•
Apache License 2.0
•81•0•0•1•Updated Apr 16, 2025Apr 16, 2025
action-slack
Public
Provides the function of slack notification to GitHub Actions.
TypeScript
•
MIT License
•139•0•0•1•Updated Mar 28, 2025Mar 28, 2025
inference-optimization-interview
Public
1•0•0•0•Updated Mar 24, 2025Mar 24, 2025
honeymarker
Public
Add Honeycomb Markers to your GitHub Actions workflows.
Dockerfile
•
Other
•6•0•0•0•Updated Mar 17, 2025Mar 17, 2025
setup-mpi
Public
Set up your GitHub Actions workflow to use MPI
Shell
•
MIT License
•5•0•0•0•Updated Mar 17, 2025Mar 17, 2025
llm-tools
Public
Python
•
MIT License
•0•0•0•0•Updated Mar 12, 2025Mar 12, 2025
flashinfer
Public
FlashInfer: Kernel Library for LLM Serving
Cuda
•
Apache License 2.0
•324•0•0•0•Updated Feb 6, 2025Feb 6, 2025
.github
Public
0•0•0•0•Updated Jan 13, 2025Jan 13, 2025
autoscaler
Public
Autoscaling components for Kubernetes
Go
•
Apache License 2.0
•4.1k•0•0•3•Updated Dec 11, 2024Dec 11, 2024
axolotl
Public
Go ahead and axolotl questions
Python
•
Apache License 2.0
•1k•0•0•2•Updated Nov 7, 2024Nov 7, 2024
HackMIT-2024
Public
Jupyter Notebook
•1•2•0•0•Updated Sep 14, 2024Sep 14, 2024
Workshop-TRT-LLM
Public
Python
•12•19•0•0•Updated Jun 26, 2024Jun 26, 2024
gpu-operator
Public
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Go
•
Apache License 2.0
•350•0•0•3•Updated Apr 19, 2024Apr 19, 2024
triton-inference-server
Public
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Python
•
BSD 3-Clause "New" or "Revised" License
•1.6k•1•0•0•Updated Jan 11, 2024Jan 11, 2024
tensorrtllm_backend
Public
The Triton TensorRT-LLM Backend
Python
•
Apache License 2.0
•122•0•0•0•Updated Jan 11, 2024Jan 11, 2024
python_backend
Public
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
C++
•
BSD 3-Clause "New" or "Revised" License
•170•0•0•0•Updated Jan 11, 2024Jan 11, 2024
langchain
Public
⚡ Building applications with LLMs through composability ⚡
Python
•
MIT License
•18k•0•0•0•Updated Dec 22, 2023Dec 22, 2023
diffusers
Public
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python
•
Apache License 2.0
•6k•0•0•0•Updated Nov 27, 2023Nov 27, 2023
truss-public-gh-repo-test
Public
A public github repo for testing truss deploy flow
Python
•0•0•0•0•Updated Oct 25, 2023Oct 25, 2023
chainlit-cookbook
Public
Chainlit's cookbook repo
Python
•399•0•0•0•Updated Aug 17, 2023Aug 17, 2023

0