Month: May 2022

  • Markov Decision Processes

    We start with a basic formula, expected value. It is the expected value when you follow some transition function with some start state . Eq. 1: Note that the initial state is included in the reward function . Bellman TheoremA policy is optimal if and only if it is greedy with respect to its induced…

  • Word2Vec

    It is a mapping of words to the vector space. Words will be represented as vectors to be used in machine learning domain. We can utilize this transformation and treat the words as mathematical objects to extract the required information. Granted we have the embeddings, we can do operations easily. Now it comes how to…