[인공지능] Chapter 8. Adversarial Search (2)

Uncertain outcomes controlled by chance, not an adversary!

Expectimax Search

어떠한 행동의 결과가 어떻게 될지 알 수 없는 이유는?
- Explicit randomness
- Unpredictable opponents
- Actions can fail
Values should now reflect average-case (expectimax) outcomes, not worst-case (minimax) outcomes
Expectimax search
- Compute the average score under optimal play
  - Minimax search와 같은 max node
  - Chance node
    - are like min nodes
    - but the outcome is uncertain
  - Calculate their expected utilities
  - take weighted average (expectation) on children

Ideas
- Evaluation by rollouts
  - 간단하고 빠른 rollout policy를 사용하여 여러 게임을 플레이 하며 승패를 계산
- Selective search
  - 깊이에 관계없이 뿌리 노드에서 결정을 개선하는데 도움이 되는 트리의 부분을 탐색

Games require decisions when optimality is impossible
- Bounded-depth search and approximate evaluation functions
Games force efficient use of computation
- Alpha-beta pruning, MCTS
Game playing has produced important research ideas
- Reinforcement learning (checkers)
- Iterative deepening (chess)
- Rational metareasoning (Othello)
- Monte Carlo tree search (chess, Go)
Video games present much greater challenges – lots to do!

[인공지능] Chapter 10. Markov Decision Process (2) (2)	2023.11.27
[인공지능] Chapter 9. Markov Decision Process (1) (1)	2023.10.25
[인공지능] Chapter 7. Adversarial search (1) (1)	2023.10.25
[인공지능] Chapter 6. Constraint Satisfaction problems (2)	2023.10.24
[인공지능] Chapter 5. Propositional logic (0)	2023.10.24