A good reinforcement learning resource -
https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf
Tuesday, May 28, 2019
Sunday, May 26, 2019
We'll go back to reinforcement learning and discuss the difference between Monte Carlo (MC) and temporal-difference (TD) methods - https://drive.google.com/file/d/1QDjeHLUEk0kL6fyxTE_DGANDcyLK5zF0/view?usp=sharing
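To make the MC vs. TD contrast concrete, here is a minimal sketch, not taken from the meeting notes. It assumes a 5-state random walk (start in the middle, move left or right with equal probability, reward 1 only when exiting on the right), for which the true state values are 1/6, 2/6, ..., 5/6. The names `run_episode`, `mc_update`, `td0_update` and `estimate` are made up for illustration. MC waits until the episode ends and moves each V(s) toward the full observed return; TD(0) moves V(s) toward the one-step target r + gamma * V(s').

```python
import random

N_STATES = 5  # non-terminal states 0..4; stepping off either end terminates


def run_episode(rng):
    """Simulate one random-walk episode.

    Returns a list of (state, reward, next_state) transitions, where
    next_state is None once the walk has left the chain.
    """
    state = N_STATES // 2
    transitions = []
    while state is not None:
        nxt = state + rng.choice((-1, 1))
        reward = 1.0 if nxt == N_STATES else 0.0  # reward only on the right exit
        if nxt < 0 or nxt >= N_STATES:
            nxt = None
        transitions.append((state, reward, nxt))
        state = nxt
    return transitions


def mc_update(values, episode, alpha, gamma=1.0):
    """Every-visit, constant-alpha Monte Carlo: target is the full return."""
    g = 0.0
    for state, reward, _ in reversed(episode):
        g = reward + gamma * g  # return observed from this visit onward
        values[state] += alpha * (g - values[state])


def td0_update(values, episode, alpha, gamma=1.0):
    """TD(0): target is the reward plus the current estimate of the next state."""
    for state, reward, nxt in episode:
        bootstrap = 0.0 if nxt is None else values[nxt]
        values[state] += alpha * (reward + gamma * bootstrap - values[state])


def estimate(update, n_episodes, alpha, seed=0):
    """Run `update` over freshly simulated episodes, starting from V = 0.5."""
    rng = random.Random(seed)
    values = [0.5] * N_STATES
    for _ in range(n_episodes):
        update(values, run_episode(rng), alpha)
    return values


if __name__ == "__main__":
    true_values = [(i + 1) / (N_STATES + 1) for i in range(N_STATES)]
    print("true:", [round(v, 3) for v in true_values])
    print("MC:  ", [round(v, 3) for v in estimate(mc_update, 20000, 0.002)])
    print("TD0: ", [round(v, 3) for v in estimate(td0_update, 20000, 0.002)])
```

Both estimators head toward the same values here; the difference is in the target each one uses, which is why TD can learn before an episode finishes while MC cannot.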
Sunday, May 19, 2019
There is a deep relation between reinforcement learning and recurring games, which we'll start exploring next - https://drive.google.com/file/d/10FF4i_uURnzfqt6XnjK2CDWRgtoWxw3s/view?usp=sharing
Sunday, May 12, 2019
We'll continue with reinforcement learning - https://drive.google.com/file/d/1xOytbzidb-oBRpRUqsfMmALQXpVkTcOs/view?usp=sharing - and discuss how to learn the value of one policy by simulating another (off-policy learning).
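One standard way to learn the value of one policy from another policy's simulations is importance sampling; the sketch below illustrates that idea and is not necessarily the method covered in the meeting. It assumes a toy single-state problem with two actions (action 1 pays reward 1, action 0 pays 0) and 3-step episodes. The names `sample_episode`, `off_policy_value`, `TARGET` and `BEHAVIOR` are made up for illustration. Episodes are generated by the behavior policy, and each return is reweighted by the product of probability ratios target/behavior along the episode.

```python
import random

HORIZON = 3            # steps per episode (assumed for this toy problem)
TARGET = (0.1, 0.9)    # target policy: P(action 0), P(action 1)
BEHAVIOR = (0.5, 0.5)  # behavior policy that actually generates the data


def sample_episode(policy, rng):
    """Return a list of (action, reward) pairs; reward equals the action taken."""
    episode = []
    for _ in range(HORIZON):
        action = 1 if rng.random() < policy[1] else 0
        episode.append((action, float(action)))
    return episode


def off_policy_value(episodes, target, behavior, weighted=True):
    """Estimate the target policy's expected return from behavior-policy episodes.

    weighted=True  -> weighted importance sampling (normalise by sum of ratios)
    weighted=False -> ordinary importance sampling (normalise by episode count)
    """
    numerator = 0.0
    denominator = 0.0
    for episode in episodes:
        rho = 1.0  # importance ratio for the whole episode
        g = 0.0    # undiscounted return of the episode
        for action, reward in episode:
            rho *= target[action] / behavior[action]
            g += reward
        numerator += rho * g
        denominator += rho if weighted else 1.0
    return numerator / denominator


if __name__ == "__main__":
    rng = random.Random(0)
    data = [sample_episode(BEHAVIOR, rng) for _ in range(50000)]
    naive = sum(sum(r for _, r in ep) for ep in data) / len(data)
    print("naive average (behavior policy's value):", round(naive, 3))
    print("weighted IS estimate of target value:  ",
          round(off_policy_value(data, TARGET, BEHAVIOR), 3))
```

In this toy problem the target policy's true value is 3 * 0.9 = 2.7, while the unweighted average of the simulated returns estimates the behavior policy's value, 3 * 0.5 = 1.5. The reweighting is what closes that gap.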
Our next ML study group meeting will take place on Monday the 8th of October. I'll cover the contraction theorem. See relevant s...
ML crash directory: Are you familiar with regression - https://m.youtube.com/watch?v=aq8VU5KLmkY ? One way to view ML is regression on ster...
We'll cover LDA in tw's meeting. Here is the slide - https://drive.google.com/open?id=1KRoCA4vo9H9oJOl3iD-qRqIHl9qQq9vf This is ...
Whiteboard from today's meeting on Bayesian ML: Cox: P(A, B) = P(A|B)P(B) = P(B|A)P(A) => P(A|B) = [P(B|A)P(A)] / P(B) (Bayes) ...