Abstract: In one aspect there is provided an embodiment of an image capture device comprising a camera, an image processor, a storage device and an interface. The camera is configured to capture images in visible spectrum light of a human finger as part of a human hand in a field of view (FOV) of the camera. The image processor is configured to process a first one of the images to detect a presence of the finger. The image capture device is configured to detect the position of the presence of the finger tip, track movement of the finger tip within the FOV by processing at least a second one of the images and generate a command based on the tracked movement of the finger within the FOV. The method does not require any pre-detection training sequence with said finger prior to finger detection, and does not require the finger to be in specific relative angle or finger orientation in said FOV.