Latest posts

Compilation ICU 65.1

Voilà comment j'ai procéder pour réussir à compiler la library ICU en version 65.1 sous Windows, sans passer par cygwin.

→ Read more...

2019/12/30 08:24
,

Construction d'un cadre de lit

Bon j'avoue, je bosse souvent allongé sur un matelas, avec une petite table de lit pour mon ordinateur. Et Quand je ne suis pas jsute dans mon lit, je suis dans mon bureau, ou j'ai installer un petit matelas de fortune par terre…

Bon ça ne paie pas de mine, mais tout de même: quand vous êtes confortablement installé, vous avez de meilleurs chances d'atteindre le “flow”, et moi c'est ce que je recherche ;-)!

→ Read more...

2019/03/15 11:54

Introduction to Deep Q-Networks

Now finally something that sounds a bit more challenging on my Reinforcement Learning journey: Deep Q-Networks! Even if, when you start analysis the structure of those networks, they are still pretty simple in the end. But still, there are 3 key elements that we should consider here: convolutional layers, experience replay, and Target network usage.

On top of that we will also initroduce the Double DQN and Dueling DQN extensions, that are used to improve the system stability and performances. Let's get started!

→ Read more...

2019/03/10 11:36

Model-based policy networks

Continuing on our reinforcement learning path, we will consider here how to build a “model for our environment” and then use this model to train our policy network, instead of the “actual environment”.

→ Read more...

2019/03/09 12:27

Multi-actions vanilla policy Gradient

As a small extension to the previous policy gradient implementation we discussed, we are now going to study how to support multiple actions (ie. num_actions > 2) in the policy network.

→ Read more...

2019/03/09 08:54

<< Newer entries | Older entries >>

Blog History

2017-09: 7 entries 2017-11: 1 entry 2018-12: 6 entries 2019-01: 5 entries 2019-02: 2 entries 2019-03: 9 entries 2020-12: 1 entry 2020-04: 5 entries

2020

April

December

2019

March

February

January

2018

December

2017

November

September