Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Updated Jun 18, 2025 - Python
💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.
SGLang is a fast serving framework for large language models and vision language models.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, Phi4, ...) (AAAI 2025).
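The PEFT option above typically means adapters such as LoRA, which freeze the pretrained weight and learn a low-rank update instead. A minimal NumPy sketch of the idea (toy dimensions, not tied to any model in the list):

```python
import numpy as np

# Toy sketch of LoRA, the low-rank adaptation behind many PEFT recipes:
# keep W frozen and learn a rank-r update (alpha/r) * B @ A.
d_out, d_in, rank, alpha = 64, 64, 8, 16

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in))        # frozen pretrained weight
A = rng.standard_normal((rank, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, rank))                   # trainable up-projection (zero init)

def lora_forward(x):
    # Base output plus scaled low-rank delta: (W + (alpha/rank) * B @ A) @ x
    return W @ x + (alpha / rank) * (B @ (A @ x))

x = rng.standard_normal(d_in)
# With B initialized to zero, the adapter starts as a no-op delta.
assert np.allclose(lora_forward(x), W @ x)

full_params = W.size            # 4096 weights if fine-tuned fully
lora_params = A.size + B.size   # 1024 trainable adapter weights
```

Only A and B are trained, which is where the memory savings over full-parameter fine-tuning come from.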
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
📚A curated list of Awesome LLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.
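Paged-Attention, one of the techniques catalogued above, manages the KV cache in fixed-size physical blocks with a per-sequence block table. A rough bookkeeping sketch (block size and class names are illustrative, not any library's API):

```python
BLOCK_SIZE = 4  # tokens per physical cache block (illustrative)

class PagedKVCache:
    """Toy block-table bookkeeping in the spirit of Paged-Attention."""

    def __init__(self, num_blocks):
        self.free = list(range(num_blocks))  # pool of physical block ids
        self.tables = {}                     # seq_id -> list of block ids
        self.lengths = {}                    # seq_id -> tokens stored

    def append_token(self, seq_id):
        table = self.tables.setdefault(seq_id, [])
        n = self.lengths.get(seq_id, 0)
        # Allocate a new physical block only when the last one is full,
        # so memory grows in block-sized steps instead of max-length slabs.
        if n % BLOCK_SIZE == 0:
            table.append(self.free.pop())
        self.lengths[seq_id] = n + 1

    def free_seq(self, seq_id):
        # Finished sequences return their blocks to the shared pool.
        self.free.extend(self.tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)

cache = PagedKVCache(num_blocks=8)
for _ in range(5):                 # 5 tokens -> ceil(5/4) = 2 blocks
    cache.append_token("seq-a")
```

The point of the indirection is that sequences share one pool and nothing is reserved up front for the maximum context length.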
A higher-performance OpenAI-compatible LLM service than vLLM serve: a pure C++ implementation built on GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal inputs, and a Gradio chat interface.
A user-friendly command-line/SDK tool that makes it quicker and easier to deploy open-source LLMs on AWS.
Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling
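Token throttling as named above can be pictured as token-level admission control: cap the total tokens in flight so no pipeline stage is flooded. A minimal scheduler sketch under that assumption (budget value and queue policy are illustrative, not gLLM's actual algorithm):

```python
from collections import deque

def schedule(pending, token_budget):
    """Admit requests FIFO until the token budget is exhausted.

    pending: deque of (request_id, num_tokens); admitted items are
    removed from the deque and returned as the next batch.
    """
    batch, used = [], 0
    while pending and used + pending[0][1] <= token_budget:
        req = pending.popleft()
        batch.append(req)
        used += req[1]
    return batch

pending = deque([("r1", 30), ("r2", 50), ("r3", 40)])
batch = schedule(pending, token_budget=100)  # admits r1 and r2 only
```

Requests that do not fit stay queued for the next scheduling step, keeping per-step token load bounded and pipeline stages balanced.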
An efficient sampling method for long-CoT LLMs based on fractured CoT.
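A rough sketch of the fractured-CoT idea, assuming the core move is to branch new samples from intermediate prefixes of a long reasoning trace rather than always regenerating from scratch; the step splitting and the `continue_from` generator are hypothetical stand-ins, not the paper's method:

```python
import random

def fractured_samples(cot_steps, continue_from, n_samples, rng):
    """Branch samples from random truncation points of a CoT trace.

    cot_steps: the reasoning trace split into steps (illustrative).
    continue_from: callable producing a continuation for a given prefix
    (here a stand-in for an actual LLM call).
    """
    samples = []
    for _ in range(n_samples):
        # Truncate the trace at a random intermediate depth, then let the
        # model continue from that "fracture" point.
        cut = rng.randrange(len(cot_steps) + 1)
        prefix = cot_steps[:cut]
        samples.append(prefix + continue_from(prefix))
    return samples

rng = random.Random(0)
steps = ["step 1", "step 2", "step 3"]
out = fractured_samples(steps, lambda p: ["<continuation>"], 5, rng)
```

Reusing shared prefixes this way amortizes the cost of long chains of thought across many samples.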
Comprehensive benchmark of 44 open source language models across creative writing, logic puzzles, counterfactual reasoning, and programming tasks. Tested on Apple M4 Max with detailed performance analysis.
Scripts for combining SmolVLM and LLM