Abstract: The technology described in this document can be embodied in a method that includes receiving, at one or more processing devices of a headset that includes a sidetone generation circuit, an input signal representing ambient audio, and determining, by the one or more processing devices of the headset, that at least a portion of the input signal represents voice activity that satisfies a threshold condition. The method also includes, responsive to determining that the voice activity in the input signal satisfies the threshold condition, a control signal configured to cause the sidetone generation circuit to generate sidetone signals, and generating, by an acoustic transducer of the headset, an audio signal that represents, at least in part, the sidetone signals generated in accordance with the control signal.