Artificial Intelligence Course: rb203 Lesson 3, Reinforcement Learning

Part 1

Part 2

Part 3

Preparing the work environment

As a first step, we create a dedicated work environment in conda:

conda create --name rf1 python=3.8

Markov decision process (MDP)

1. State (at a given time)

Example: in chess, the state is all the objects (pieces) on the board.

2. Action

The agent observes the state and, based on it, decides how to act.

3. Reward

Chess: a reward for winning the game, zero for losing.

Robot arm: the reward will be zero.

Pac-Man: every time the agent eats a yellow dot, it gets a reward.

4. Environment

Chess: the time.

Robot arm: gravity.
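
The four components above (state, action, reward, environment) can be sketched as a small interaction loop. This is only an illustrative toy, not code from the course: CorridorEnv, run_episode and always_right are hypothetical names, and the corridor task itself (walk right to reach the last cell, reward 1.0 on arrival, 0.0 otherwise) is an assumption chosen to keep the example short.

```python
class CorridorEnv:
    """Hypothetical toy environment: a corridor of cells 0..length-1.

    The state is the agent's current cell. The episode ends when the
    agent reaches the last cell.
    """

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        # Start every episode at the leftmost cell.
        self.state = 0
        return self.state

    def step(self, action):
        # Action: 1 moves right, anything else moves left.
        move = 1 if action == 1 else -1
        # The environment enforces the walls (its own "physics").
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        # Reward: 1.0 only when the goal cell is reached, otherwise 0.0.
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def always_right(state):
    # A trivial policy: ignore the state and always move right.
    return 1


def run_episode(env, policy, max_steps=100):
    # Agent-environment loop: observe state, act, receive reward.
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)
        state, reward, done = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward


if __name__ == "__main__":
    print(run_episode(CorridorEnv(), always_right))
```

The policy is passed in as a plain function so that a different decision rule can be tried without touching the environment.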

Installation:

conda create --name rf1 python=3.8

conda activate rf1

conda install pyopengl pytorch==1.6 matplotlib pandas numpy tqdm jupyter seaborn scikit-learn

pip install gym
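
After the installation, a quick way to confirm that the packages are visible inside the rf1 environment is to look them up without importing them. The sketch below uses only the standard library. check_packages is a hypothetical helper name, and the import names in the list (for example OpenGL for pyopengl, torch for pytorch, sklearn for scikit-learn) are my assumption about how the packages above are imported.

```python
import importlib.util


def check_packages(names):
    """Return a dict mapping each top-level module name to True/False,
    depending on whether Python can locate it in the current environment."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed import names for the packages installed above.
    wanted = ["OpenGL", "torch", "matplotlib", "pandas", "numpy",
              "tqdm", "seaborn", "sklearn", "gym"]
    for name, found in check_packages(wanted).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

Run it with the rf1 environment activated. Any package reported as MISSING was either not installed or was installed into a different environment.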

https://www.udemy.com/course/beginner-master-rl-1/learn/lecture/26548364#questions