Abstract: Embodiments of the present invention provide a systems and methods for tracking features. In particular, some aspects of the present invention relate to a method and system for facial modelling and a method and system for determining facial features. Embodiments of the invention comprise receiving stereo image data comprising a set of corresponding first and second stereo-rectified image frames indicative of a target; annotating the stereo image data to determine a location of an image feature in the first and second stereo-rectified image frames, wherein the determined locations in the first and second corresponding stereo-rectified image frames are positionally constrained according to an epipolar constraint; and training a shape variation model corresponding to the target according to the determined image feature locations.