8000 ofirzaf (Ofir Zafrir) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ofirzaf's full-sized avatar

Block or report ofirzaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A library for researching neural networks compression and acceleration methods.

Python 139 24 Updated Aug 30, 2024

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 2,877 532 Updated Apr 30, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,525 4,699 Updated Apr 12, 2025

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

Python 339 60 Updated Mar 24, 2023

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

Python 1,844 273 Updated May 7, 2025

Efficient Retrieval Augmentation and Generation Framework

Python 1,532 141 Updated Jan 9, 2025

Pattern Based Aspect Term Extraction

Python 5 Updated Sep 18, 2023

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,172 211 Updated Oct 8, 2024

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook 460 130 Updated May 6, 2025

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,478 241 Updated Apr 11, 2025

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,394 266 Updated May 7, 2025

Sparsity-aware deep learning inference runtime for CPUs

Python 3,142 184 Updated May 5, 2025
TypeScript 6 Updated Feb 1, 2025

🐶 Kubernetes CLI To Manage Your Clusters In Style!

Go 29,605 1,872 Updated May 6, 2025

Repository containing code for "How to Train BERT with an Academic Budget" paper

Python 313 46 Updated Sep 18, 2023

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 20,074 2,819 Updated May 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,231 4,358 Updated May 2, 2025
Python 190 54 Updated Jan 16, 2021

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 89,735 24,078 Updated May 7, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 143,930 28,859 Updated May 6, 2025

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Python 2,940 448 Updated Nov 7, 2022
0