Collected from an apartment in the real world. Each tour was subsampled at 0.5 Hz to roughly convert from human walking pace to the pace used in our experiments, and spans 20 sampled frames. Frames that represent unexplored areas (frontiers) the robot could explore next were identified. Demos indicate if FiLM-Nav accurately predicted which frontier would lead to the closest instance of the requested object.
Uncut POV tours randomly sampled from the RealEstate10K dataset. Each tour was subsampled at 0.5 Hz to roughly convert from human walking pace to the pace used in our experiments, and spans 20 sampled frames. Frames that represent unexplored areas (frontiers) the robot could explore next were identified. Demos indicate if FiLM-Nav accurately predicted which frontier would lead to the closest instance of the requested object.
19.6% of all validation episodes
24.0% of all validation episodes
21.0% of all validation episodes
12.4% of all validation episodes