Stars
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Efficient few-shot learning with Sentence Transformers
A library for researching neural networks compression and acceleration methods.
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks