Working on reinforcement learning / game-theoretic methods for reducing policy uncertainty in interactions with humans