Stars
阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…
🇨🇳Open source Chinese HSK vocabulary list with example sentences
verl: Volcano Engine Reinforcement Learning for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Awesome speech/audio LLMs, representation learning, and codec models
Speech, Language, Audio, Music Processing with Large Language Model
Align Anything: Training All-modality Model with Feedback
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
数据挖掘、计算机视觉、自然语言处理、推荐系统竞赛知识、代码、思路
Ongoing research training transformer language models at scale, including: BERT & GPT-2
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
博客信息
Source code and demo for memory bank and SiliconFriend
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Code for HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current issues and future directions.
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
Python llama.cpp HTTP Server and LangChain LLM Client