Testing for Fault Diversity in Reinforcement Learning