thongnt99 (Thong Nguyen) / Starred · GitHub



Quantized Attention achieves speedups of 2-5x and 3-11x compared to FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.

Cuda 1,819 139 Updated Jul 1, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,776 1,292 Updated Jun 27, 2025

Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali

Python 2,279 153 Updated Jul 1, 2025

[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 345 46 Updated Mar 17, 2025

[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Python 123 6 Updated May 13, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,559 105 Updated Jun 2, 2025

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Python 60 2 Updated May 23, 2025
Python 50 1 Updated Feb 27, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 276 20 Updated Jun 30, 2025

An Open-source Knowledgeable Large Language Model Framework.

Python 1,326 132 Updated Jan 11, 2025
Python 21 3 Updated Apr 17, 2023

Late Interaction Models Training & Retrieval

Python 462 34 Updated Jun 10, 2025

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Python 619 60 Updated Mar 10, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 8,452 652 Updated May 29, 2025

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 138 9 Updated Jun 28, 2025

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 1,067 73 Updated Jun 17, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,546 125 Updated Jan 24, 2025

Scalable training for dense retrieval models.

Python 298 32 Updated Jun 10, 2025

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,572 937 Updated Apr 24, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 110,501 17,964 Updated Jul 1, 2025

LLM training code for Databricks foundation models

Python 4,273 568 Updated Jun 30, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,989 176 Updated Jun 30, 2025

Line-by-line profiling for Python

Python 2,995 131 Updated May 23, 2025

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,544 247 Updated May 17, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,341 3,292 Updated Jul 1, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,658 687 Updated Jun 25, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,789 4,724 Updated Jun 2, 2025

An open-source implementation for training LLaVA-NeXT.

Python 403 22 Updated Oct 23, 2024

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,956 493 Updated Feb 7, 2025
Python 3,967 375 Updated Jun 13, 2025