Stars
This project uses OpenMP, MPI, and CUDA to perform LU decomposition
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
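A personally compiled Chinese e-book edition of "Computer Systems: A Programmer's Perspective" (3rd edition of the original), with lab materials: https://hansimov.gitbook.io/csapp/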
Instructions, Docker images, and examples for Nsight Compute and Nsight Systems
Code base and slides for ECE408: Applied Parallel Programming on GPU.
All the course resources for UCB's CS61C course: Great Ideas in Computer Architecture
My implementation of the CS61C labs (Summer 2020 version)
The reference implementation for the paper "Deep Geometric Prior for Surface Reconstruction"
LAVIS - A One-stop Library for Language-Vision Intelligence
Code for ALBEF: a new vision-language pre-training method
An implementation of 1D, 2D, and 3D positional encoding in PyTorch and TensorFlow
Measurements and panorama orthorectification on Google Street View
Official PyTorch implementation of TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition (IJCAI 2023)
PyTorch implementation of "Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition" (AAAI 2019)
This is an implementation of "Show, Attend and Read" in TensorFlow
Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized with code and dataset
Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…
An open course platform for Stanford CS224n (Winter 2020)
The Hoard Memory Allocator: A Fast, Scalable, and Memory-efficient Malloc for Linux, Windows, and Mac.
Fast and flexible physics-based battery models in Python
Official repo for INFO 153B/253B, Spring '23