8000 nicholasbaard (Nic Baard) · GitHub

More Web Proxy on the site http://driver.im/

nicholasbaard

Follow

💭

Currently working on the building blocks of RL and Deep Learning!

Nic Baard nicholasbaard

💭

Currently working on the building blocks of RL and Deep Learning!

Follow

RL Enthusiast! Exploring Reinforcement Learning! I'm inspired by DeepMind's work. Building on research to contribute to positive AI for humanity.

7 followers · 6 following

South Africa
in/nicholas-baard-7227181b5

Achievements

Achievements

Pinned Loading

DQN DQN Public

A replication of the Deep Q-Network Algorithm as seen in "Human-level control through deep reinforcement learning" - Mnih et. al. 2015

Python 1
Model-Free-Learning Model-Free-Learning Public

An implementation of model-free learning techniques including SARSA, Q-Learning and SARSA-Lambda

Python 1
CNNs-for-Image-Classification CNNs-for-Image-Classification Public

A Pytorch implementation of Convolutional Neural Networks, using LeNet5 for Image Classification

Python 1
Dynamic-Programming Dynamic-Programming Public

A python implementation of dynamic programming methods to solve the Bellman Equation.

Python 1
Multi-Armed-Bandits Multi-Armed-Bandits Public

A python implementation of the multi-armed bandit problem using reinforcement learning. The repo contains implementations of the epsilon greedy, optimistic initialization and upper confidence bound…

Python 1
Multi-Layered-Perceptron Multi-Layered-Perceptron Public

A python implementation of a feed forward neural network from first principles. The network is trained via gradient descent and backpropagation on the MNIST dataset.

Python

0