8000 nicholasbaard (Nic Baard) · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View nicholasbaard's full-sized avatar
💭
Currently working on the building blocks of RL and Deep Learning!
💭
Currently working on the building blocks of RL and Deep Learning!

Block or report nicholasbaard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. DQN DQN Public

    A replication of the Deep Q-Network Algorithm as seen in "Human-level control through deep reinforcement learning" - Mnih et. al. 2015

    Python 1

  2. Model-Free-Learning Model-Free-Learning Public

    An implementation of model-free learning techniques including SARSA, Q-Learning and SARSA-Lambda

    Python 1

  3. CNNs-for-Image-Classification CNNs-for-Image-Classification Public

    A Pytorch implementation of Convolutional Neural Networks, using LeNet5 for Image Classification

    Python 1

  4. Dynamic-Programming Dynamic-Programming Public

    A python implementation of dynamic programming methods to solve the Bellman Equation.

    Python 1

  5. Multi-Armed-Bandits Multi-Armed-Bandits Public

    A python implementation of the multi-armed bandit problem using reinforcement learning. The repo contains implementations of the epsilon greedy, optimistic initialization and upper confidence bound…

    Python 1

  6. Multi-Layered-Perceptron Multi-Layered-Perceptron Public

    A python implementation of a feed forward neural network from first principles. The network is trained via gradient descent and backpropagation on the MNIST dataset.

    Python

0