[mlpack] GSoC 2017: Interested in Reinforcement Learning project

Marcus Edel marcus.edel at fu-berlin.de
Mon Mar 13 16:16:17 EDT 2017


Hello Shangtong,

Great to see you back!

> It has been two years since I last coded in mlpack, on mean shift and CNN.
> I’m now an MSc student at the University of Alberta, supervised by Prof. Richard
> Sutton. My primary interest is reinforcement learning. I wrote Python code for
> the book Reinforcement Learning: An Introduction (2nd Edition). However, I don’t
> have much experience with deep RL, so this project interests me.

You did some great work there! I think I'll add the book and the repository
as further references to the project idea; they should be helpful for diving
into the topic.

> To warm up, I proposed a framework for Q-learning with an implementation of DQN
> with experience replay and a target network. The pull request is here
> <https://github.com/mlpack/mlpack/pull/934>. I tested it on the Mountain Car task.

That is a really nice PR. I'll take a closer look once I get the chance, and we
can discuss the details over there.

Thanks,
Marcus

> On 13 Mar 2017, at 16:14, Shangtong Zhang <zhangshangtong.cpp at gmail.com> wrote:
> 
> Hello everyone,
> 
> I’m Shangtong Zhang. It has been two years since I last coded in mlpack, on mean shift and CNN. I’m now an MSc student at the University of Alberta, supervised by Prof. Richard Sutton. My primary interest is reinforcement learning. I wrote Python code <https://github.com/ShangtongZhang/reinforcement-learning-an-introduction> for the book Reinforcement Learning: An Introduction (2nd Edition). However, I don’t have much experience with deep RL, so this project interests me.
> 
> To warm up, I proposed a framework for Q-learning with an implementation of DQN with experience replay and a target network. The pull request is here <https://github.com/mlpack/mlpack/pull/934>. I tested it on the Mountain Car task. The PR is just to show my design; it’s not ready to merge. I think we need to support batch updates in our network component to make deep RL more efficient. I look forward to any feedback.
> 
> Thanks,
> 
> Shangtong Zhang,
> First year graduate student,
> Department of Computing Science,
> University of Alberta
> Github <https://github.com/ShangtongZhang> | Stackoverflow <http://stackoverflow.com/users/3650053/slardar-zhang>
