At the risk of being off-topic (and a novice in this domain to boot): is AlphaGo from DeepMind curve fitting as it teaches itself?
https://www.wired.com/story/this-more-powerful-version-of-alphago-learns-on-its-own/
Three main components of AlphaGo:
Policy Network -
Trained on high-level human games to imitate strong players' moves, so it can suggest promising moves for a given position.
Value Network -
Evaluates board positions and estimates the probability of winning from each one.
Tree Search -
Looks ahead through the different variations of the game from the current position to estimate how likely each line of play is to lead to a win.
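To make the interplay of the three pieces a bit more concrete, here is a toy sketch of my own (not DeepMind's code, and nowhere near the real thing). It swaps Go for a tiny Nim game: one pile of stones, each player takes 1 to 3, and whoever takes the last stone wins. The `policy` and `value` functions are hypothetical stand-ins for the two neural networks (a uniform prior and a "no opinion" evaluator), and the tree search uses a simplified PUCT-style selection rule. All names and the constant `c_puct=1.5` are assumptions made for illustration.

```python
import math

def legal_moves(stones):
    # Toy Nim: a player may take 1, 2, or 3 stones.
    return [m for m in (1, 2, 3) if m <= stones]

def policy(stones):
    # Stand-in for the policy network: a uniform prior over legal moves.
    moves = legal_moves(stones)
    return {m: 1.0 / len(moves) for m in moves}

def value(stones):
    # Stand-in for the value network, from the view of the player to move.
    # No stones left means the previous player took the last one and won.
    return -1.0 if stones == 0 else 0.0

class Node:
    def __init__(self, prior):
        self.prior = prior        # prior probability from the policy
        self.visits = 0           # how often the search passed through here
        self.value_sum = 0.0      # sum of backed-up values
        self.children = {}        # move -> Node

    def mean_value(self):
        return self.value_sum / self.visits if self.visits else 0.0

def select_child(node, c_puct=1.5):
    # Pick the move with the best mix of estimated value and exploration.
    total = sum(child.visits for child in node.children.values())

    def score(item):
        _, child = item
        q = -child.mean_value()   # child's value is from the opponent's view
        u = c_puct * child.prior * math.sqrt(total + 1) / (1 + child.visits)
        return q + u

    return max(node.children.items(), key=score)

def simulate(node, stones):
    # One search pass; returns the value for the player to move at `stones`.
    if stones == 0:
        v = value(stones)                      # terminal position
    elif not node.children:
        for move, prior in policy(stones).items():
            node.children[move] = Node(prior)  # expand using the policy
        v = value(stones)                      # evaluate using the value fn
    else:
        move, child = select_child(node)
        v = -simulate(child, stones - move)    # flip sign: players alternate
    node.visits += 1
    node.value_sum += v
    return v

def best_move(stones, n_simulations=2000):
    # Run many simulations, then play the most-visited move at the root.
    root = Node(1.0)
    for _ in range(n_simulations):
        simulate(root, stones)
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]
```

With a pile of 5 stones, `best_move(5)` should settle on taking 1, which leaves the opponent with 4 (a losing pile in this game). In the real system the uniform prior and the flat evaluator would be replaced by trained networks, which is where the "fitting" part of the question comes in.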
Netflix currently has a documentary on the match between AlphaGo and Lee Sedol, one of the world’s best Go players.
What was surprising was the crowd’s reaction when confronted with the idea of humans being ‘out-thought’ by a machine, and the huge slice of humble pie that was served.