1
Tutorial, training, online play / Re: Any info on how the AI works?
« on: 15/11/20, 04:36pm »
Sure! You may know them as Marcov decision processes - essentially it is a decision support framework for games (or anything else) that has a random element and a decision element. An objective function is created to optimize the score at the end of the game. In Lost Cities, the objective function would likely be simple: maximize the differential in the end game scores between the bot and the player. The strict form of MDPs can rely on explicit enumeration of the entire decision space, but this obviously not ideal for large problems. As a way to get around explicitly naming the entire decision space, there are approximate decision processes (ADPs) that sacrifice optimal solutions for processing time, and I initially thought this might be what the bots are using. Thinking this through now though, I think it would be hard to use MDPs here because you have to adhear to the "memoryless" property - ie the probably of something happening cannot depend on what happened in the past. Clearly it would take a lot to force Lost Cities into that structure.
Typically we see MDPs where policies can be set based on the current state. For example, given the current rainfall and weather patterns, how many crops should be planted?
Typically we see MDPs where policies can be set based on the current state. For example, given the current rainfall and weather patterns, how many crops should be planted?