krohak

🎧

Rohak krohak

🎧

34 followers · 207 following

Achievements

Organizations

Lists (1)

Sort

🔮 Future ideas

Stars

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,503 1,421 Updated May 22, 2025

pytorch / opacus

Training PyTorch models with differential privacy

Jupyter Notebook 1,801 368 Updated May 14, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,186 444 Updated Apr 30, 2025

Eladlev / AutoPrompt

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,520 213 Updated Apr 10, 2025

AnswerDotAI / RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,454 232 Updated May 17, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,392 423 Updated Apr 7, 2025

Scale3-Labs / langtrace

Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector…

TypeScript 930 90 Updated May 4, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 55,236 1,551 Updated May 22, 2025

protectai / llm-guard

The Security Toolkit for LLM Interactions

Python 1,695 218 Updated May 19, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,709 12,353 Updated May 22, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,232 696 Updated May 20, 2025

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python 3,149 186 Updated May 10, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,377 132 Updated May 22, 2025

modal-labs / modal-examples

Examples of programs built using Modal

Python 849 208 Updated May 22, 2025

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,170 3,069 Updated May 22, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 80,701 11,867 Updated May 22, 2025

567-labs / instructor

structured outputs for llms

Python 10,488 784 Updated May 22, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,992 228 Updated May 19, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,856 7,554 Updated May 22, 2025

microsoft / TaskTracker

TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a simple linear probe-based method and a more sophisticated m…

Jupyter Notebook 55 7 Updated Mar 7, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,965 754 Updated Dec 17, 2024

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 11,661 1,054 Updated May 22, 2025