With a normal RGB camera, the robot can classify objects in the scene. We use deep learning extensively to handle very complex backgrounds in outdoor environments. Robustness to fruit variety, lighting conditions, sun direction, and camera location are other challenges we can solve. In addition to the network's ability to distinguish objects, our software can estimate stem thickness and ensure it's not going for the main tomato trunk.