Abstract: The present disclosure provides a system and a method for voice based authentication, which involves receiving voice data of a user enunciating a predetermined sequence of speech elements, with the predetermined sequence of speech elements is configured to capture a defined range of voiced sounds produced by the user; extracting voice features from the received voice data; deriving a voice signature for the user based on the extracted voice features, wherein the voice signature is representative of at least one of tonal, timbral, and temporal characteristics of the user's voiced speech; storing the derived voice signature in a database; receiving a verification voice sample of the user enunciating a predetermined sub-set of speech elements; comparing the verification voice sample with the stored voice signature; and authenticating the user based on the comparison.