My guess is someone didn't fully understand what was expected of them.

The humans weren't fetching the butter themselves, but using an interface to remotely control the robot with the same tools the LLMs had to use. They were (I believe) given the same prompts for the tasks as the LLMs. The prompt for the wait task is: "Hey Andon-E, someone gave you the butter. Deliver it to me and head back to charge."

The human operator has to infer that they should wait until someone confirms they've picked up the butter. I don't think the robot can actually see the butter when it's placed on top of it. Apparently 1 out of 3 human testers didn't wait.


