Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Janus-Series: Unified Multimodal Understanding and Generation Models
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A command-line tool that uses Gemini API to generate summaries of academic papers.
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)
Eedi - Mining Misconceptions in Mathematics 5th place solution
Multiple Imputation with LightGBM in Python
Python tool for converting files and office documents to Markdown.
⚡️ 80x faster Fasttext language detection out of the box | Split text by language
Approaching (Almost) Any Machine Learning Problem
utilities for decoding deep representations (like sentence embeddings) back to text
Codebase for BirdClef 2023 solution
Gather around the table, and have a discussion to catch up the latest trend of machine learning 🤖
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Evaluates search accuracy and relevance using Elasticsearch and datasets
Chronon is a data platform for serving for AI/ML applications.
experiments for business logic patterns
Scorta is a framework that runs side-by-side with your ML projects to provide a pleasant development experience.
Site Reliability Engineer Interview Preparation Guide
NLP2024 チュートリアル3 作って学ぶ日本語大規模言語モデル - 環境構築手順とソースコード / NLP2024 Tutorial 3: Practicing how to build a Japanese large-scale language model - Environment construction and experimental source codes
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch