Patents Assigned to Cerence Operating Company
-
Patent number: 11069335
Abstract: Aspects of the disclosure are related to synthesizing speech or other audio based on input data. Additionally, aspects of the disclosure are related to using one or more recurrent neural networks. For example, a computing device may receive text input; may determine features based on the text input; may provide the features as input to a recurrent neural network; may determine embedded data from one or more activations of a hidden layer of the recurrent neural network; may determine speech data based on a speech unit search that attempts to select, from a database, speech units based on the embedded data; and may generate speech output based on the speech data.
Type: Grant
Filed: July 12, 2017
Date of Patent: July 20, 2021
Assignee: Cerence Operating Company
Inventors: Vincent Pollet, Enrico Zovato
-
Patent number: 11057734
Abstract: A method, computer program product, and computing system for receiving a request for information, concerning a geographically-proximate entity, on a consumer electronic device included within a vehicle. A location of the vehicle is determined; and the geographically-proximate entity is identified based, at least in part, upon the location of the vehicle.
Type: Grant
Filed: November 12, 2019
Date of Patent: July 6, 2021
Assignee: Cerence Operating Company
Inventors: Vincenzo A. Iannotti, Lior Ben-Gigi, Slawek Jarosz, David Ardman
-
Patent number: 10996327
Abstract: A system and method for detecting multi-tone sirens despite environmental noises that may be present obtains a microphone input signal, applies, in real time, a time-frequency analysis to the microphone input signal to determine a time-frequency representation, provides at least one multi-tone model that has a plurality of tone duration patterns, performs multi-tone siren detection on the time-frequency representation, the detection based on the at least one multi-tone model and factoring of Doppler shifts, and generates a detection result that can be used in systems for automated vehicles.
Type: Grant
Filed: July 19, 2019
Date of Patent: May 4, 2021
Assignee: Cerence Operating Company
Inventors: Markus Buck, Julien Premont, Friedrich Faubel
-
Patent number: 10991360
Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
Type: Grant
Filed: July 31, 2017
Date of Patent: April 27, 2021
Assignee: Cerence Operating Company
Inventors: Srinivas Bangalore, Junlan Feng, Mazin Gilbert, Juergen Schroeter, Ann K. Syrdal, David Schulz
-
Publication number: 20210082402
Abstract: A system and/or method receives speech input including an accent. The accent is classified with an accent classifier to yield an accent classification. Automatic speech recognition is performed based on the speech input and the accent classification to yield an automatic speech recognition output. Natural language understanding is performed on the speech recognition output and the accent classification, determining an intent of the speech recognition output. Natural language generation generates an output based on the speech recognition output, the intent, and the accent classification. An output is rendered using text to speech based on the natural language generation and the accent classification.
Type: Application
Filed: September 13, 2019
Publication date: March 18, 2021
Applicant: Cerence Operating Company
Inventors: Yang SUN, Junho PARK, Goujin WEI, Daniel WILLETT
-
Patent number: 10943400
Abstract: Some embodiments described herein relate to a multimodal user interface for use in an automobile. The multimodal user interface may display information on a windshield of the automobile, such as by projecting information on the windshield, and may accept input from a user via multiple modalities, which may include a speech interface as well as other interfaces. The other interfaces may include interfaces allowing a user to provide geometric input by indicating an angle. In some embodiments, a user may define a task to be performed using multiple different input modalities. For example, the user may provide via the speech interface speech input describing a task that the user is requesting be performed, and may provide via one or more other interfaces geometric parameters regarding the task. The multimodal user interface may determine the task and the geometric parameters from the inputs.
Type: Grant
Filed: January 7, 2019
Date of Patent: March 9, 2021
Assignee: Cerence Operating Company
Inventors: Mohammad Mehdi Moniri, Nils Lenke
-
Publication number: 20210043195
Abstract: There is provided an automated speech recognition system that applies weights to grapheme-to-phoneme models, and interpolates pronunciations from combinations of the models, to recognize utterances of foreign named entities for naive, informed, and in-between pronunciations.
Type: Application
Filed: August 6, 2019
Publication date: February 11, 2021
Applicant: Cerence Operating Company
Inventors: Stefan Christof HAHN, Efthymia GEORGALA, Olivier Stéphane Jérôme DIVAY, Eric Joseph MARSHALL
-
Patent number: 10783899
Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
Type: Grant
Filed: November 18, 2016
Date of Patent: September 22, 2020
Assignee: Cerence Operating Company
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Patent number: 10747497
Abstract: Provided are a system and method of mixing a second audio stream with a first audio stream in an audio output device. The system is configured to execute the method, comprising buffering and outputting the first audio stream via the audio output device as unmodified output, determining at least one insertion spot within the first audio stream, modifying the first audio stream at an insertion spot to avoid content loss, outputting the second audio stream at the insertion spot, and resuming unmodified output of the first audio stream at or near a completion of the second audio stream. Modifying the first audio stream can include pausing and/or warping the first audio stream at the insertion spot. The audio output device can be a vehicle head unit or a wireless device, such as a mobile phone.
Type: Grant
Filed: October 25, 2019
Date of Patent: August 18, 2020
Assignee: CERENCE OPERATING COMPANY
Inventors: Nils Lenke, Christophe Couvreur
-
Publication number: 20200154233
Abstract: A method, computer program product, and computing system for receiving a request for information, concerning a geographically-proximate entity, on a consumer electronic device included within a vehicle. A location of the vehicle is determined; and the geographically-proximate entity is identified based, at least in part, upon the location of the vehicle.
Type: Application
Filed: November 12, 2019
Publication date: May 14, 2020
Applicant: CERENCE OPERATING COMPANY
Inventors: Vincenzo A. IANNOTTI, Lior BEN-GIGI, Slawek JAROSZ, David ARDMAN
-
Patent number: 10650806
Abstract: A method, computer program product, and computer system for transforming, by a computing device, a speech signal into a speech signal representation. A regression deep neural network may be trained with a cost function to minimize a mean squared error between actual values of the speech signal representation and estimated values of the speech signal representation, wherein the cost function may include one or more discriminative terms. Bandwidth of the speech signal may be extended by extending the speech signal representation of the speech signal using the regression deep neural network trained with the cost function that includes the one or more discriminative terms.
Type: Grant
Filed: April 23, 2018
Date of Patent: May 12, 2020
Assignee: Cerence Operating Company
Inventors: Friedrich Faubel, Jonas Sautter
-
Patent number: 10636412
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for speech synthesis. A system practicing the method receives a set of ordered lists of speech units; for each respective speech unit in each ordered list in the set of ordered lists, constructs a sublist of speech units from a next ordered list which are suitable for concatenation; performs a cost analysis of paths through the set of ordered lists of speech units based on the sublist of speech units for each respective speech unit; and synthesizes speech using a lowest cost path of speech units through the set of ordered lists based on the cost analysis. The ordered lists can be ordered based on the respective pitch of each speech unit. In one embodiment, speech units which do not have an assigned pitch can be assigned a pitch.
Type: Grant
Filed: September 17, 2018
Date of Patent: April 28, 2020
Assignee: Cerence Operating Company
Inventor: Alistair D. Conkie
-
Publication number: 20200126562
Abstract: Systems and methods of validating transcriptions of natural language content using crowdsourced validation jobs are provided herein. In various implementations, a transcription pair comprising natural language content and text corresponding to a transcription of the natural language content may be gathered. A group of validation devices may be selected for reviewing the transcription pair. A crowdsourced validation job may be created for the group of validation devices. The crowdsourced validation job may be provided to the group of validation devices. One or more votes representing whether or not the text accurately represents the natural language content may be received from the group of validation devices. Based on the one or more votes received, the transcription pair may be stored in a validated transcription library, which may be used to process end-user voice data.
Type: Application
Filed: November 14, 2019
Publication date: April 23, 2020
Applicant: CERENCE OPERATING COMPANY
Inventors: Spencer John ROTHWELL, Daniela BRAGA, Ahmad Khamis ELSHENAWY, Stephen Steele CARTER
-
Patent number: 10536773
Abstract: Methods and apparatus for frequency selective signal mixing for speech enhancement. In one embodiment, frequency-based channel selection is performed for signal magnitude, signal energy, and noise estimate using speaker activity detection information, signal-to-noise ratio, and/or signal level. Frequency-based channel selection is performed for a dynamic spectral floor to adjust the noise estimate using speaker dominance information. Noise reduction is performed on the signal for the selected channel.
Type: Grant
Filed: October 30, 2013
Date of Patent: January 14, 2020
Assignee: Cerence Operating Company
Inventors: Timo Matheja, Markus Buck, Julien Premont
-
Patent number: 10523807
Abstract: The present invention relates to a method for selecting and downloading content from a content provider which is accessible via an IP/DNS/URL address to a mobile device, the content being any text information data, for converting the text information data to at least one audio message and for storing the at least one audio message as at least one audio file on the mobile device, wherein the at least one audio file is playable and discernable as a music file. Said method implemented on a mobile phone enables controlling and playing the audio messages as music files by determining a title associated with the audio message using word underline and size attributes of the text, for user selection on the mobile's graphical user interface, for instance also in a car environment with a car kit enabling a control and a selection of one or more of said at least one audio files for playing from the mobile phone.
Type: Grant
Filed: June 29, 2017
Date of Patent: December 31, 2019
Assignee: Cerence Operating Company
Inventor: Cuneyt Goktekin
-
Patent number: 10504510
Abstract: A method or associated system for motion adaptive speech processing includes dynamically estimating a motion profile that is representative of a user's motion based on data from one or more resources, such as sensors and non-speech resources, associated with the user. The method includes effecting processing of a speech signal received from the user, for example, while the user is in motion, the processing taking into account the estimated motion profile to produce an interpretation of the speech signal. Dynamically estimating the motion profile can include computing a motion weight vector using the data from the one or more resources associated with the user, and can further include interpolating a plurality of models using the motion weight vector to generate a motion adaptive model. The motion adaptive model can be used to enhance voice destination entry for the user and re-used for other users who do not provide motion profiles.
Type: Grant
Filed: June 10, 2015
Date of Patent: December 10, 2019
Assignee: Cerence Operating Company
Inventors: Munir Nikolai Alexander Georges, Josef Damianus Anastasiadis, Oliver Bender