unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Updated Jun 12, 2025
Python

sinanuozdemir / oreilly-pytorch-dl

Code for Deep Learning for Modern AI

deep-learning mnist neural-networks llama quantization clip bert diffusion distillation multimodal llms dreambooth unsloth llama3 dreambooth-finetuning

Updated Mar 11, 2025
Jupyter Notebook

GAD-cell / VLM_GRPO

An implementation of GRPO for training Unsloth's VLMs

reinforcement-learning vlm huggingface trl unsloth grpo grpotrainer

Updated Jun 12, 2025
Python

shaheennabi / Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

Updated Jan 30, 2025
Jupyter Notebook

Breeze648 / MedCoT-7B

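This project fine-tunes Deepseek-R1-Distill-Qwen-7B on medical-domain chain-of-thought (CoT) data, using QLoRA quantization and Unsloth to accelerate training, and significantly improves the model's slow-thinking ability on complex medical reasoning tasks. Knowledge distillation gives the lightweight model the reasoning strengths of larger models, yielding an efficient, accurate, and interpretable medical question-answering system.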

nlp ai lora medical-application distillation llm chain-of-thought qlora qwen unsloth deepseek-r1 4-bit-quantization

Updated Mar 10, 2025
Python

deep-div / Fine-Tuning-LLMs-and-VisionModels

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.): a practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on custom datasets.

transformers llama gemma huggingface large-language-models llm generative-ai finetuning-llms deepseek llama-factory unsloth keras-finetuning

Updated Jun 2, 2025
Jupyter Notebook

0xZee / DeepSeek-R1-FineTuning

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation

reinforcement-learning lora qlora unsloth deepseek-r1

Updated Feb 9, 2025
Jupyter Notebook

QuangNguyen2910 / AutClothingChatbot

PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and store information.

chatbot clothing rag vector-database llm llms langchain unsloth

Updated May 23, 2024
Jupyter Notebook

Eviltr0N / Make-AI-Clone-of-Yourself

Cloning yourself using your WhatsApp chat history 10000 and training a model on it.

ai whatsapp-bot finetuning ai-project whatsapp-python whatsapp-clone llm ollama llm-project unsloth llama3 llama3-rag llama3-finetune ai-clones llama3-8b

Updated Aug 14, 2024
Jupyter Notebook

alisonmitchell / Biomedical-Knowledge-Graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

Updated Dec 7, 2024
Jupyter Notebook

IAmSkyDra / finetune-quantize-llms

Materials for CSE Summer School Hackathon 2024

fine-tune quantize llms llama-factory unsloth

Updated Nov 8, 2024
Jupyter Notebook

SrikarVeluvali / Astor-AI

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the Llama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.

react flask mongodb transformers huggingface llm ollama unsloth llama3

Updated Nov 9, 2024
Jupyter Notebook

bastienpo / unsloth_finetuning

Finetuning of Gemma-2 2B for structured output

python ai fine-tuning llamacpp unsloth gemma2

Updated Aug 19, 2024
Jupyter Notebook

jkanalakis / finetuning-llama-model-for-text-generation-using-unsloth

Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI

text text-generation fine-tuning llm generative-ai unsloth llama3

Updated Jan 8, 2025
Jupyter Notebook

muhammad-fiaz / finetune-web-ui

Finetune Web UI is a user interface for training and deploying pre-trained models.

transformers gpt datasets gradio finetune fine-tuning huggingface large-language-models generative-ai finetuning-llms finetune-llms finetune-gpt unsloth finetune-web-ui

Updated Jun 13, 2025
Python

harshit433 / ResurrectAI

ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, ResurrectAI enables users to interact with AI avatars of iconic figures, gaining access to their insights, guidance, and philosophical teachings in real time.

python flask machine-learning firebase chatbot artificial-intelligence flutter language-model html-css-javascript conversational-ai finetuning-llms ollama unsloth llama3-1

Updated Oct 14, 2024
Dart

mirabdullahyaser / LLaMA3-Financial-Analyst

LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.

finance chatbot embeddings question-answering lora financial-analysis fine-tuning rag huggingface vector-database llm supervised-finetuning unsloth llama3 fiass

Updated Feb 9, 2025
Jupyter Notebook

hyeonsangjeon / PDF2LLM-Tuning-Studio

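A solution that automatically generates high-quality question-answer (QA) data from PDF documents using GPU-accelerated processing and efficiently fine-tunes LLMs. It generates domain-specific QA pairs with the Unstructured library and AWS Bedrock Claude, and trains lightweight models with LoRA.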

processing docker aws gpu cuda bedrock data-extraction pdf-generation claude unstructured distillation finetuning sagemaker pdf-text-extraction data-argumantation llm unsloth processing-job text-disti

Updated Jun 5, 2025
Jupyter Notebook

Asad-Shahab / sudokuLLM

LLM finetuning for Sudoku solving

llm-training unsloth grpo

Updated May 21, 2025
Python

xphot / app

Advanced Data Analysis with Causality and Reinforcement Learning

reinforcement-learning bug-tracker data-preprocessing experimental-psychology causal-machine-learning shap-analysis hypergraph-neural-network llms-reasoning llm-fine-tuning explainability-metric unsloth gguf-quantization

Updated Feb 27, 2025
Jupyter Notebook

unsloth

Here are 97 public repositories matching this topic...
