Contents:
Returns an image tensor containing the portion of img that falls within box, where box is a tuple (cx, cy, width, height) in YOLO format: the box's center coordinates followed by its width and height.
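The cropping described above can be sketched as follows. This is only an illustration under stated assumptions, not the original implementation: the function names yolo_box_to_pixels and crop_to_box are hypothetical, the box values are assumed to be normalized to the range 0 to 1 (the usual YOLO label convention), and img is assumed to be laid out as (..., H, W), as a PyTorch or NumPy image in channel-first order would be.

```python
def yolo_box_to_pixels(box, img_w, img_h):
    """Convert a normalized YOLO box (cx, cy, w, h) to integer pixel bounds.

    Hypothetical helper: assumes all four values are fractions of the
    image size, which is the usual YOLO label convention.
    """
    cx, cy, w, h = box
    # YOLO stores the box center, so step half a side in each direction
    # to get the corners, then scale to pixel units.
    x0 = round((cx - w / 2) * img_w)
    y0 = round((cy - h / 2) * img_h)
    x1 = round((cx + w / 2) * img_w)
    y1 = round((cy + h / 2) * img_h)
    # Clamp to the image so a box that overhangs an edge yields only
    # the part that actually lies inside the image.
    x0, x1 = max(0, x0), min(img_w, x1)
    y0, y1 = max(0, y0), min(img_h, y1)
    return x0, y0, x1, y1


def crop_to_box(img, box):
    """Return the region of a (..., H, W) image tensor covered by box.

    Assumes img exposes .shape and supports slice indexing with an
    ellipsis, as torch.Tensor and numpy.ndarray both do.
    """
    img_h, img_w = img.shape[-2:]
    x0, y0, x1, y1 = yolo_box_to_pixels(box, img_w, img_h)
    # Rows are indexed by y and columns by x.
    return img[..., y0:y1, x0:x1]
```

Under these assumptions, calling crop_to_box on a tensor of shape (3, 80, 100) with box (0.5, 0.5, 0.5, 0.5) would return the central region of shape (3, 40, 50). If the original function expects pixel coordinates or a channel-last layout instead, the scaling step and the slice axes would need to change accordingly.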