Code for this paper "On the effectiveness of discrete representations in sparse mixture of experts".
-
Notifications
You must be signed in to change notification settings - Fork 0
giangdip2410/VQMoE
About
Code for this paper "On the effectiveness of discrete representations in sparse mixture of experts".
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published