8000 Changerzz / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Changerzz's full-sized avatar

Block or report Changerzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for do…

Python 920 68 Updated May 9, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,893 443 Updated Aug 7, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,083 611 Updated Apr 27, 2025

Run generative AI models in sophgo BM1684X/BM1688

Python 209 36 Updated May 14, 2025

YOLOv12: Attention-Centric Real-Time Object Detectors

Python 1,696 212 Updated Apr 15, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 140,513 11,756 Updated May 15, 2025
Python 1,837 103 Updated Nov 20, 2024

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python 547 46 Updated Mar 10, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,342 567 Updated May 14, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 5,969 676 Updated May 15, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 81,702 9,820 Updated May 13, 2025

Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)

Python 507 54 Updated May 10, 2024

The LLM Evaluation Framework

Python 6,307 554 Updated May 15, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,202 431 Updated Feb 19, 2025

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 683 121 Updated Apr 11, 2024

Noise supression using deep filtering

Python 3,041 281 Updated Oct 17, 2024

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 194 22 Updated May 14, 2025

A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.

Python 1,321 88 Updated Nov 4, 2024

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 687 107 Updated May 9, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,623 699 Updated Apr 15, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 1,727 91 Updated May 14, 2025

Official inference repo for FLUX.1 models

Python 21,680 1,538 Updated Feb 6, 2025

[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models

Python 58 6 Updated Sep 22, 2024

A pytorch quantization backend for optimum

Python 935 73 Updated Apr 24, 2025
Python 323 57 Updated Nov 30, 2023

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,687 252 Updated Mar 28, 2024
Next
0