Statistical Inference for Policy Evaluation in Reinforcement Learning