How do I display some images on kinect display screen so that the user selects the desired image by hand?
Go through this tutorial skeletal tracking. Once you are able to do this, you just have to measure the time which one hand was holding on a specific location. This can be done only with Kinect data, and if that time exceeds a certain threshold you can trigger it as a button click. Get an idea of what I'm trying to explain here from the below link virtual dressing room