Read the following extract and try to infer the information required to answer the questions. Rain lashed against the windows as Jane stamped up and down the room stopping only to check the time on ...
Supports single image or dual image (reference + inference) input modes 1. Reference Image {ref_bboxes}. 2. Target Image: The image to locate. Think through the reasoning process in your mind, induce ...