8000 guialfaro053 (Guillermo Alfaro) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View guialfaro053's full-sized avatar

Block or report guialfaro053

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A plain vanilla transformer implementation in Rust using the Candle ML framework

Rust 29 5 Updated May 21, 2024

NVIDIA Math Libraries for the Python Ecosystem

Cython 332 23 Updated Jun 10, 2025< 8000 /relative-time>

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 313 17 Updated Jul 8, 2025

high performance in-memory cache

Python 400 8 Updated May 19, 2025

Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.

Python 47 12 Updated Jun 26, 2025

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 356 87 Updated Jun 25, 2025

🐮 A utility to load environment variables from a .env file

C++ 62 19 Updated Apr 29, 2025

C++ extensions in PyTorch

Python 1,112 237 Updated Jul 8, 2025

Nvidia contributed CUDA tutorial for Numba

Jupyter Notebook 251 40 Updated Aug 23, 2022

An LLM playground you can run on your laptop

TypeScript 6,354 491 Updated May 2, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,609 172 Updated Jul 8, 2025

Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.

Python 684 65 Updated Aug 22, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,009 559 Updated Apr 11, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,226 75 Updated Jun 3, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,766 1,023 Updated Jul 1, 2025

ONNX and TensorRT implementation of Whisper

Python 64 5 Updated May 27, 2023

A plugin for Jupyter Notebook to run CUDA C/C++ code

Jupyter Notebook 236 94 Updated Sep 13, 2024

CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.

Cuda 185 5 Updated Jun 11, 2025

Puzzles for learning Triton

Jupyter Notebook 1,747 139 Updated Nov 18, 2024

Speech-to-text server framework with next-gen Kaldi

C++ 729 125 Updated Jul 7, 2025

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

Python 310 35 Updated Jun 17, 2024

Real-time transcription using faster-whisper

HTML 474 80 Updated Jul 23, 2024

Deep learning at the speed of light.

Rust 1,747 108 Updated Jul 9, 2025

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Python 804 56 Updated Jul 2, 2025

This repository contains tutorials and examples for Triton Inference Server

Python 732 121 Updated Jun 10, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,765 1,190 Updated Jul 8, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 891 63 Updated Dec 23, 2024

Serving multiple LoRA finetuned LLM as one

Python 1,070 52 Updated May 8, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,602 6,558 Updated Jun 10, 2025

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,798 306 Updated Mar 14, 2023
Next
0