Video-based crowd counting using a multi-scale optical flow pyramid network