Team leader of the DUT Supercomputing Team
ASC16: First Prize
The Chinese University of Hong Kong
Stars
Artifact repository for the paper "Automatic Generation of High-Performance Quantized Machine Learning Kernels"
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
Accelerating neural network inference over video
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Bugfixing fork of Python bindings for the NVIDIA GPU Management Library