waterzxj (Ramos) / Starred · GitHub
  • BUPT
  • beijing


Example models using DeepSpeed

Python 6,525 1,094 Updated Jun 9, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,546 538 Updated May 3, 2024

Chat with any PDF. Easily upload the PDF documents you'd like to chat with. Instant answers. Ask questions, extract information, and summarize documents with AI. Sources included.

Jupyter Notebook 1,487 219 Updated Jun 12, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,714 1,902 Updated Jun 9, 2025

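A repository for pretraining from scratch and then SFT-ing a small-parameter Chinese LLaMa2; it runs on a single 24G GPU and yields a chat-llama2 with simple Chinese question-answering ability.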

Python 2,804 333 Updated May 21, 2024

Making large AI models cheaper, faster and more accessible

Python 40,945 4,523 Updated Jun 9, 2025

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 860 51 Updated May 8, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,125 295 Updated Nov 8, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,618 1,037 Updated Nov 18, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,063 5,213 Updated Jun 27, 2024

SGPT: GPT Sentence Embeddings for Semantic Search

Jupyter Notebook 868 53 Updated Feb 17, 2024

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

Python 12,886 2,067 Updated Jun 10, 2025

Inference code for Llama models

Python 58,347 9,783 Updated Jan 26, 2025

Home of StarCoder: fine-tuning & inference!

Python 7,421 525 Updated Feb 27, 2024

Source code for Twitter's Recommendation Algorithm

Scala 1 Updated Apr 3, 2023

A simple and fast KD-tree for points in Python, for kNN or nearest-point queries. (Damn short at just ~60 lines.) No libraries needed.

Python 175 35 Updated Apr 14, 2024

Knowledge Distillation from BERT

Python 52 31 Updated Jan 7, 2019

BERT distillation (distillation experiments based on BERT)

Python 313 86 Updated Jul 30, 2020

Implementation of some baseline models built on BERT for text classification

Python 691 151 Updated Sep 19, 2019

UNF(Universal NLP Framework)

Python 70 10 Updated Mar 6, 2020

Collecting, organizing, and publishing Chinese natural language processing corpora and datasets, to advance Chinese NLP together with like-minded people.

Jupyter Notebook 6,234 1,418 Updated Jan 29, 2019
Python 231 74 Updated Nov 27, 2019

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Jupyter Notebook 4,515 1,179 Updated Mar 27, 2024

NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Python 1,896 447 Updated Jun 30, 2022

Large Scale Chinese Corpus for NLP

9,729 1,559 Updated May 23, 2024

A natural language modeling framework based on PyTorch

Python 6,326 796 Updated Oct 17, 2022

All kinds of text classification models, and more, with deep learning

Python 7,916 2,570 Updated Sep 28, 2023

Classic papers and resources on recommendation

Python 3,371 811 Updated Jun 13, 2020

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,885 410 Updated Sep 6, 2023

A system for quickly generating training data with weak supervision

Python 5,864 858 Updated May 2, 2024