Research and development - Seminars
The project aims to prove the convergence of Q-learning, an algorithm for finite Markov Decision Processes in discrete time in which the information about the model is incomplete. The algorithm seeks to solve the optimality equations (Bellman equations) under these conditions. With this purpose in mind, the project mainly discusses four things:
Applications to complete Markov Decision Processes, and methods for finding optimal strategies in games of chance.
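As a rough illustration of what convergence means here, the sketch below runs tabular Q-learning on a small MDP and compares the result with the solution of the Bellman optimality equations obtained by value iteration on the known model. Everything in it is an assumption made for illustration and is not taken from the project: the two-state MDP `P`, the discount factor `GAMMA`, the uniform exploration rule, and the step size 1/n^0.7 (chosen so that the step sizes sum to infinity while their squares have a finite sum).

```python
import random

# Hypothetical 2-state, 2-action MDP (illustrative numbers, not from the project).
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(0.8, 0, 0.0), (0.2, 1, 1.0)],
        1: [(0.5, 0, 0.2), (0.5, 1, 0.5)]},
    1: {0: [(1.0, 0, 0.0)],
        1: [(0.6, 1, 1.0), (0.4, 0, 0.0)]},
}
GAMMA = 0.5  # assumed discount factor


def value_iteration(P, gamma, sweeps=200):
    """Solve the Bellman optimality equations when the model P is known."""
    Q = {s: {a: 0.0 for a in P[s]} for s in P}
    for _ in range(sweeps):
        # Synchronous update: the right-hand side only reads the previous Q.
        Q = {s: {a: sum(p * (r + gamma * max(Q[s2].values()))
                        for p, s2, r in P[s][a])
                 for a in P[s]}
             for s in P}
    return Q


def sample(P, s, a, rng):
    """Draw (next_state, reward); the learner only ever sees such samples."""
    u, acc = rng.random(), 0.0
    for p, s2, r in P[s][a]:
        acc += p
        if u < acc:
            return s2, r
    return s2, r  # guard against floating-point round-off


def q_learning(P, gamma, steps=100_000, seed=0):
    """Tabular Q-learning with uniform exploration and decaying step sizes."""
    rng = random.Random(seed)
    Q = {s: {a: 0.0 for a in P[s]} for s in P}
    visits = {s: {a: 0 for a in P[s]} for s in P}
    s = 0
    for _ in range(steps):
        a = rng.choice(list(P[s]))         # explore every action with equal probability
        s2, r = sample(P, s, a, rng)
        visits[s][a] += 1
        alpha = 1.0 / visits[s][a] ** 0.7  # assumed step-size schedule
        target = r + gamma * max(Q[s2].values())
        Q[s][a] += alpha * (target - Q[s][a])
        s = s2
    return Q


if __name__ == "__main__":
    q_star = value_iteration(P, GAMMA)
    q_hat = q_learning(P, GAMMA)
    for s in P:
        for a in P[s]:
            print(s, a, round(q_star[s][a], 3), round(q_hat[s][a], 3))
```

Note that `q_learning` uses `P` only through `sample`, so it never reads the transition probabilities directly; `value_iteration` is there purely as a reference for the comparison.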
YouTube – Quantil Matemáticas Aplicadas
1. Presentation