A Review of Validation and Verification of Neural Network-based Policies for Sequential Decision Making