线上银河娱乐「诚|信」

    <dd id="0zz7k"></dd>

  1. <th id="0zz7k"></th>

    <rp id="0zz7k"><object id="0zz7k"><input id="0zz7k"></input></object></rp>

    1. Nonlinear Inverse Reinforcement Learning with Gaussian Processes

      Nonlinear Inverse Reinforcement Learning with Gaussian Processes

      Abstract

      We present a probabilistic algorithm for nonlinear inverse reinforcement learning. The goal of inverse reinforcement learning is to learn the reward function in a Markov decision process from expert demonstrations. While most prior inverse reinforcement learning algorithms represent the reward as a linear combination of a set of features, we use Gaussian processes to learn the reward as a nonlinear function, while also determining the relevance of each feature to the expert’s policy. Our probabilistic algorithm allows complex behaviors to be captured from suboptimal stochastic demonstrations, while automatically balancing the simplicity of the learned reward structure against its consistency with the observed actions.




      线上银河娱乐