Saturday, November 14, 2020
Sunday, November 1, 2020
In the context of reinforcement learning we will show that a specific scheme of Monte Carlo control is monotonic if Q(a, pi) is well estimated by the exploration stage.  https://drive.google.com/file/d/11Aa92Mr3nMF1Gxa5r0kIiHfg-9wn_rkI/view?usp=sharing
Subscribe to:
Comments (Atom)
Our next ML study group meeting will take place on Monday the 8 th of October. I'll cover the contraction theorem. See relevant s...
- 
Following the meeting yesterday I have added an example of a not continuous function that has continuous partial derivatives https://drive...
- 
Ml crash directory Are you familiar with regression - https://m.youtube.com/watch?v=aq8VU5KLmkY ? One way to view Ml is regression o...
- 
Ml crash directory Are you familiar with regression - https://m.youtube.com/watch?v=aq8VU5KLmkY ? One way to view Ml is regression on ster...
