Lists (1)
Sort Name ascending (A-Z)
Stars
[ICLR 2024] Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
Simulator framework for analysis of performance, energy consumption, area and cost of multi-node multi-chiplet tile-based manycore designs
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
Denoising Diffusion Probabilistic Models
This is a Chinese translation of the CUDA programming guide
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
📚 Collaborative cheatsheets for console commands
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
Original reference implementation of "EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis"
Enhance CHISEL for Smooth and Comfortable Chip Design
TCZ-Tao / PupilRay
Forked from mchenwang/PupilRayPupil Ray is a rendering methods playground for personal learning, witch is based on the HWRT framework PupilOptixLab(OptiX7.5).
This is an implementation for the paper '3D Gaussian Ray Tracing: Fast Tracing of Paticle Scenes"
Set of advanced samples for the NVIDIA OptiX Ray Tracing Engine.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Analog is an out-of-the-box feature-rich blog template with Next.js.
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
[NeurIPS'23] Speculative Decoding with Big Little Decoder
A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Accelerator
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.