Acknowledgements The iPhone teleoperation code and app is develeped by Ruihan Zhao We use mediapipe for human keypoint detection The oculus code and transformations have been adapted from DROID We use RoboMimic for imitation learning