SqueezeAILab · GitHub

SqueezeAILab

SqueezeAI is part of the Berkeley AI Research Lab at UC Berkeley and focuses on AI systems research.

Popular repositories

  1. LLMCompiler (Public)

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python · 1.7k stars · 124 forks

  2. SqueezeLLM (Public)

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python · 688 stars · 45 forks

  3. TinyAgent (Public)

    [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!

    Python · 407 stars · 63 forks

  4. KVQuant (Public)

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

    Python · 355 stars · 31 forks

  5. LLM2LLM (Public)

    [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    Python · 183 stars · 13 forks

  6. SqueezedAttention (Public)

    Squeezed Attention: Accelerating Long Prompt LLM Inference

    Python · 46 stars · 6 forks

Repositories

Showing 10 of 10 repositories
