Stars
Bringing BERT into modernity via both architecture changes and scaling
A Guide to Building White-Box Large Models: a fully hand-built Tiny-Universe
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
yangjianxin1 / unsloth
Forked from unslothai/unsloth
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
Pytorch-Named-Entity-Recognition-with-BERT
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
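A collection of Chinese text classification models, including CNN, LSTM, BERT, and others, ready to use out of the box
LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers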
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's large language model course series
RSTutorials: A Curated List of Must-read Papers on Recommender System.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
CTR prediction models based on deep learning (deep-learning-based CTR estimation models for ad recommendation)
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
This project demonstrates how to run and save predictions locally using an exported TensorFlow Estimator model
Resume templates for Chinese programmers. A series of programmer resume templates, including templates for PHP, iOS, Android, web front-end, Java, C/C++, and NodeJS programmers, for architects, and a general-purpose programmer template
📄 A collection of resume templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon
Source code for Twitter's Recommendation Algorithm