- University of Michigan
- Ann Arbor, MI, US
- https://huangyibo.github.io/
Stars
A bounded multi-producer multi-consumer concurrent queue written in C++11
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepEP: an efficient expert-parallel communication library
❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents
🌱 a curated list of tools to help you with your research/life; I built a front end around this repo, please use the link below [This repo is deprecated. Instead, I maintain all the contents using t…
Running large language models on a single GPU for throughput-oriented scenarios.
Scaling Up Memory Disaggregated Applications with SMART
Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"
Real-time Monitoring and Analysis of Data Streams
A collection of bypasses and exploits for eBPF-based cloud security.
An implementation of a deep learning recommendation model (DLRM)
This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code.
Tutorials for writing high-performance GPU operators in AI frameworks.
[ACM CoNEXT22 Best Paper Award] Case Study: A Fast Nginx and Apache Bench Tool over NTSocks for high performance file transfer services.
Set of datasets for the deep learning recommendation model (DLRM).
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
How to Index Item IDs for Recommendation Foundation Models
http://vlsiarch.eecs.harvard.edu/research/recommendation/