Saturday, November 14, 2020
Sunday, November 1, 2020
In the context of reinforcement learning we will show that a specific scheme of Monte Carlo control is monotonic if Q(a, pi) is well estimated by the exploration stage. https://drive.google.com/file/d/11Aa92Mr3nMF1Gxa5r0kIiHfg-9wn_rkI/view?usp=sharing
Subscribe to:
Posts (Atom)
Our next ML study group meeting will take place on Monday the 8 th of October. I'll cover the contraction theorem. See relevant s...
-
Ml crash directory Are you familiar with regression - https://m.youtube.com/watch?v=aq8VU5KLmkY ? One way to view Ml is regression on ster...
-
We'll cover LDA in tw's meeting. Here is the slide - https://drive.google.com/open?id=1KRoCA4vo9H9oJOl3iD-qRqIHl9qQq9vf This is ...
-
Whiteboard from today's meeting on Bayesian ML: Cox: P(A, B) = P(A|B)P(B) = P(B|A)P(A) => P(A|B) = [P(B|A)P(A)] / P(B) (Bayes) ...