8000 fwtan (fwtan) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View fwtan's full-sized avatar

Block or report fwtan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for LTX-Video

Python 7,003 601 Updated Jul 9, 2025

[CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context

Python 8 1 Updated Apr 15, 2025

Scalable and memory-optimized training of diffusion models

Python 1,208 131 Updated Jun 4, 2025

Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

Cuda 1,978 152 Updated Jul 11, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 91,462 24,655 Updated Jul 11, 2025

Codebase for the Progressive Mixed-Precision Decoding paper.

Python 13 Updated Jan 27, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,579 110 Updated Jul 7, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 717 48 Updated Mar 6, 2025

Puzzles for learning Triton

Jupyter Notebook 1,755 139 Updated Nov 18, 2024

Moments Retrieval Project Webpage (temporal)

Python 31 3 Updated Jan 17, 2024

LLM101n: Let's build a Storyteller

33,965 1,844 Updated Aug 1, 2024

[ICML'24] Recurrent Early Exits for Federated Learning with Heterogeneous Clients

Python 10 1 Updated Jul 11, 2024

Efficient LLM Inference Acceleration using Prompting

Python 48 2 Updated Oct 22, 2024

This repository contains an implementation of the models introduced in the paper Dialog-based Interactive Image Retrieval. The network is implemented using PyTorch and the rest of the framework is …

Python 69 16 Updated Oct 4, 2020
0