Stars
Quantized Attention achieves speedups of 2-5x over FlashAttention and 3-11x over xformers, without losing end-to-end metrics across language, image, and video models.
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali.
[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Minimalistic 4D-parallelism distributed training framework for educational purposes
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
An Open-Source Knowledgeable Large Language Model Framework.
Is ChatGPT Good at Search? LLMs as Re-Ranking Agents [EMNLP 2023 Outstanding Paper Award]
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
A method to increase the speed and lower the memory footprint of existing vision transformers.
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Scalable training for dense retrieval models.
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
🦜🔗 Build context-aware reasoning applications
LLM training code for Databricks foundation models
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Hackable and optimized Transformers building blocks, supporting a composable construction.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
An open-source implementation for training LLaVA-NeXT.
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).