Abstract: A system and method for composing an audio message are disclosed, which may include a memory for storing control parameters identifying respective preconfigured audio segments, the preconfigured audio segments being emotones; a recorder for enabling a user of the recording system to introduce user voice input into an audio message; and command input means for enabling the user of the recording system to selectively add user voice input and emotones into the audio message.