koowoo3 / Starred · GitHub

Bringing Language Models to the Most Resource Constrained Devices

Python 20 3 Updated Dec 23, 2024

Utilities intended for use with Llama models.

Python 7,018 1,157 Updated May 7, 2025

LLM inference in C/C++

C++ 80,816 11,893 Updated May 25, 2025
Jupyter Notebook 99 8 Updated Nov 11, 2024

Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models

Python 34 3 Updated May 7, 2025

Efficient and easy multi-instance LLM serving

Python 415 31 Updated May 23, 2025

Repository to host and maintain scale-sim-v2 code

Python 298 116 Updated Apr 23, 2025
Python 3 Updated Oct 19, 2023

(CVPR 2021, Oral) Dynamic Slimmable Network

Python 229 19 Updated Dec 31, 2021

Conditional channel- and precision-pruning on neural networks

Python 73 14 Updated Mar 4, 2020
Python 3 Updated Nov 21, 2022
Python 1 Updated Dec 3, 2021
C 1 1 Updated Aug 22, 2022

Code for "Effective Bayesian Heteroscedastic Regression with Deep Neural Networks" (NeurIPS 2023)

Python 20 2 Updated Mar 21, 2025

Energy-aware Timing Analysis of Intermittent Programs

HTML 3 1 Updated Feb 26, 2022

Open source software accompanying the publication: "Improving the forward progress of Transient

C 1 Updated Jul 13, 2022

Code for Adaptive Deep Neural Network Inference Optimization with EENet

Python 12 2 Updated Mar 28, 2024

Ratchet source code from OSDI 2016

C++ 10 4 Updated Jan 29, 2017

LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks

Python 14 Updated Mar 25, 2022

Overview of conditional computation and dynamic CNNs for computer vision, with a focus on reducing computational complexity

43 4 Updated Jun 30, 2022
Jupyter Notebook 128 31 Updated Oct 3, 2023

This repository contains Adaptive Early-Exit (AdaEE).

Python 2 Updated May 20, 2022

This repository contains the program used to train and evaluate a Branched DNN capable of early-exit semantic segmentation, suited for an edge-cloud co-inference scenario in smart cities.

Python 2 Updated Feb 25, 2025

Improve a model's accuracy by distilling knowledge to the earlier layers of the model. Improves accuracy and performance of lightweight DNN models.

Jupyter Notebook 6 Updated Jan 25, 2023

A curated list of early exiting (LLM, CV, NLP, etc)

49 4 Updated Aug 21, 2024

Improving Low-Latency Predictions in Multi-Exit Neural Networks via Block-Dependent Losses

Python 3 1 Updated Nov 7, 2023

Code and model for "Peeking into the Future: Predicting Future Person Activities and Locations in Videos", Liang et al., CVPR 2019

Python 355 99 Updated Mar 24, 2023
Jupyter Notebook 65 12 Updated Sep 19, 2023

Mayfly language specification and compiler.

Java 5 1 Updated Nov 1, 2017
Python 70 10 Updated Mar 16, 2023