-
Purdue University
- West Lafayette
-
accel-sim-framework Public
Forked from accel-sim/accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
Python Other Updated Mar 11, 2025 -
gpu-app-collection Public
Forked from accel-sim/gpu-app-collection
A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.
Cuda Updated Feb 19, 2025 -
website-hugo-source Public
Forked from tgrogers/website-hugo-source
TeX MIT License Updated Oct 9, 2024 -
mgpu-gpgpu-sim_distribution Public
Forked from accel-sim/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.
-
LLM4HWDesign_Starting_Toolkit Public
Forked from GATECH-EIC/LLM4HWDesign_Starting_Toolkit
LLM4HWDesign Starting Toolkit
Python Updated Jul 8, 2024 -
dlrm_syn Public
Forked from facebookresearch/dlrm
An implementation of a deep learning recommendation model (DLRM)
Python MIT License Updated Jun 25, 2024 -
superblock Public
Forked from pytorch-labs/superblock
A block-oriented training approach for inference-time optimization.
Python MIT License Updated May 3, 2024 -
ECE60827_simulation_project_part4-bonus Public template
Repo for the HW simulation project part 4 (bonus)
C++ BSD 2-Clause "Simplified" License Updated Apr 9, 2024 -
ECE60827_simulation_project_part3 Public template
C++ BSD 2-Clause "Simplified" License Updated Apr 1, 2024 -
ECE60827_simulation_project_part2 Public template
C++ BSD 2-Clause "Simplified" License Updated Mar 22, 2024 -
ECE60827_simulation_project_part1 Public template
C++ BSD 2-Clause "Simplified" License Updated Mar 7, 2024 -
Part 1 of HW simulation project
Python Other Updated Mar 7, 2024 -
python-mastery Public
Forked from dabeaz-course/python-mastery
Advanced Python Mastery (course by @dabeaz)
Python Creative Commons Attribution Share Alike 4.0 International Updated Nov 17, 2023 -
ml-engineering Public
Forked from stas00/ml-engineering
Machine Learning Engineering Guides and Tools
Python Creative Commons Attribution Share Alike 4.0 International Updated Oct 14, 2023 -
tpu_graphs Public
Forked from google-research-datasets/tpu_graphs
C++ Apache License 2.0 Updated Sep 22, 2023 -
ggml Public
Forked from ggml-org/ggml
Tensor library for machine learning
C MIT License Updated Sep 20, 2023 -
flash-llm Public
Forked from AlibabaResearch/flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Cuda Apache License 2.0 Updated Sep 19, 2023 -
TinyLlama Public
Forked from jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Python Apache License 2.0 Updated Sep 4, 2023 -
ml-fastvit Public
Forked from apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization"
Python Other Updated Aug 15, 2023 -
CodeGen Public
Forked from salesforce/CodeGen
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Python BSD 3-Clause "New" or "Revised" License Updated Aug 15, 2023 -
llama Public
Forked from meta-llama/llama
Inference code for LLaMA models
Python Other Updated Jul 18, 2023 -
LLM-Pruner Public
Forked from horseee/LLM-Pruner
LLM-Pruner: On the Structural Pruning of Large Language Models
Python Apache License 2.0 Updated Jul 13, 2023 -
pytorch-direct_dgl Public
Forked from K-Wu/pytorch-direct_dgl
PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)
Updated Jul 1, 2023 -
oss-arch-gym Public
Forked from srivatsankrishnan/oss-arch-gym
Open source version of ArchGym project.
Jupyter Notebook Updated Jun 14, 2023 -
AITemplate Public
Forked from facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Python Apache License 2.0 Updated Jun 3, 2023