Patents by Inventor Evan Clark
Evan Clark has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260073923
Abstract: A method includes receiving an input audio signal that corresponds to utterances spoken by multiple speakers. The method also includes processing the input audio to generate a transcription of the utterances and a sequence of speaker turn tokens each indicating a location of a respective speaker turn. The method also includes segmenting the input audio signal into a plurality of speaker segments based on the sequence of speaker tokens. The method also includes extracting a speaker-discriminative embedding from each speaker segment and performing spectral clustering on the speaker-discriminative embeddings to cluster the plurality of speaker segments into k classes. The method also includes assigning a respective speaker label to each speaker segment clustered into the respective class that is different than the respective speaker label assigned to the speaker segments clustered into each other class of the k classes.
Type: Application
Filed: November 13, 2025
Publication date: March 12, 2026
Applicant: Google LLC
Inventors: Quan Wang, Han Lu, Evan Clark, Ignacio Lopez Moreno, Hasim Sak, Wei Xia, Taral Joglekar, Anshuman Tripathi
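The segmentation and labeling steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Segment`, `split_at_turns`, and `assign_labels` are hypothetical, speaker turn locations are assumed to be given in seconds, and the cluster ids are assumed to come from a separate clustering step (spectral clustering in the abstract), which is not implemented here.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # segment start, in seconds (assumed unit)
    end: float    # segment end, in seconds
    label: str = ""


def split_at_turns(duration, turn_times):
    """Cut the interval [0, duration] at each speaker turn location."""
    inner = sorted(t for t in turn_times if 0.0 < t < duration)
    bounds = [0.0] + inner + [duration]
    # Skip zero-length pieces that duplicate turn locations would produce.
    return [Segment(a, b) for a, b in zip(bounds, bounds[1:]) if b > a]


def assign_labels(segments, cluster_ids):
    """Give segments in the same cluster one label, distinct across clusters.

    cluster_ids is assumed to be the output of an external clustering step
    over per-segment speaker embeddings; it is only consumed here.
    """
    names = {}
    for seg, cid in zip(segments, cluster_ids):
        if cid not in names:
            names[cid] = f"speaker_{len(names) + 1}"
        seg.label = names[cid]
    return segments


segments = split_at_turns(10.0, [4.0, 7.5])
labeled = assign_labels(segments, [0, 1, 0])
```

In this usage, the first and third segments share a cluster id and therefore receive the same label, while the second segment receives a different one.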
-
Publication number: 20250378286
Abstract: A method (500) includes receiving, from an application (50) executing on a client device (110), at a speech service interface (200), configuration parameters (211) for integrating a speech service (250) into the application. The configuration parameters include a language pack directory (225) that maps a primary language code (235) to an on-device path of a primary language pack (110) of the speech service for use in recognizing speech in a primary language and each of one or more codeswitch language codes to an on-device path. The method also includes receiving audio data (102) characterizing an utterance (106) and processing, using a language ID predictor model (230), the audio data to determine that the audio data is associated with the primary language code. The method also includes processing, using the primary language pack, the audio data to determine a transcription (120) that includes one or more words in the primary language.
Type: Application
Filed: November 23, 2022
Publication date: December 11, 2025
Applicant: Google LLC
Inventors: Quan Wang, Evan Clark, Yang Yu, Han Lu, Taral Pradeep Joglekar, Qi Cao, Dharmeshkumar Mokani, Diego Melendo Casado, Ignacio Lopez Moreno, Hasim Sak
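The language pack directory described in this abstract is essentially a mapping from language codes to on-device paths. A rough Python sketch of such a structure follows; the class name `LanguagePackDirectory`, its fields, the language codes, and the example paths are all hypothetical and are not taken from the application or from any real speech service API.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass(frozen=True)
class LanguagePackDirectory:
    # Primary language code and the on-device path of its language pack.
    primary_code: str
    primary_path: str
    # Codeswitch language codes mapped to their on-device paths.
    codeswitch_paths: Dict[str, str] = field(default_factory=dict)

    def path_for(self, language_code: str) -> Optional[str]:
        """Return the on-device path for a language code, or None if unknown."""
        if language_code == self.primary_code:
            return self.primary_path
        return self.codeswitch_paths.get(language_code)


# Hypothetical configuration; codes and paths are illustrative only.
directory = LanguagePackDirectory(
    primary_code="en-US",
    primary_path="/data/packs/en-US",
    codeswitch_paths={"es-US": "/data/packs/es-US"},
)

# A language ID predictor (not modeled here) would supply the detected code.
detected_code = "en-US"
pack_path = directory.path_for(detected_code)
```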
-
Patent number: 12482470
Abstract: A method includes receiving an input audio signal that corresponds to utterances spoken by multiple speakers. The method also includes processing the input audio to generate a transcription of the utterances and a sequence of speaker turn tokens each indicating a location of a respective speaker turn. The method also includes segmenting the input audio signal into a plurality of speaker segments based on the sequence of speaker tokens. The method also includes extracting a speaker-discriminative embedding from each speaker segment and performing spectral clustering on the speaker-discriminative embeddings to cluster the plurality of speaker segments into k classes. The method also includes assigning a respective speaker label to each speaker segment clustered into the respective class that is different than the respective speaker label assigned to the speaker segments clustered into each other class of the k classes.
Type: Grant
Filed: December 14, 2021
Date of Patent: November 25, 2025
Assignee: Google LLC
Inventors: Quan Wang, Han Lu, Evan Clark, Ignacio Lopez Moreno, Hasim Sak, Wei Xia, Taral Joglekar, Anshuman Tripathi
-
Publication number: 20250225998
Abstract: A method includes receiving audio data including a plurality of spoken terms spoken by one or more speakers during a conversation. The method includes generating diarization results based on the plurality of spoken terms spoken by the one or more speakers during the conversation. The diarization results include a speech recognition result including a series of predicted terms and a series of identity-agnostic speaker tokens. The method also includes processing the diarization results conditioned on a diarization prompt to predict, as output from an LLM, updated diarization results. The updated diarization results include the speech recognition result including the series of predicted terms and a series of identity-specific speaker tokens.
Type: Application
Filed: January 3, 2025
Publication date: July 10, 2025
Applicant: Google LLC
Inventors: Quan Wang, Wei Xia, Guanlong Zhao, Evan Clark, Yiling Huang, Hank Liao
-
Publication number: 20230089308
Abstract: A method includes receiving an input audio signal that corresponds to utterances spoken by multiple speakers. The method also includes processing the input audio to generate a transcription of the utterances and a sequence of speaker turn tokens each indicating a location of a respective speaker turn. The method also includes segmenting the input audio signal into a plurality of speaker segments based on the sequence of speaker tokens. The method also includes extracting a speaker-discriminative embedding from each speaker segment and performing spectral clustering on the speaker-discriminative embeddings to cluster the plurality of speaker segments into k classes. The method also includes assigning a respective speaker label to each speaker segment clustered into the respective class that is different than the respective speaker label assigned to the speaker segments clustered into each other class of the k classes.
Type: Application
Filed: December 14, 2021
Publication date: March 23, 2023
Applicant: Google LLC
Inventors: Quan Wang, Han Lu, Evan Clark, Ignacio Lopez Moreno, Hasim Sak, Wei Xia, Taral Joglekar, Anshuman Tripathi
-
Patent number: 10569849
Abstract: A system for automated rendezvous, docking, and capture of autonomous underwater vehicles at the conclusion of a mission, comprising a docking rod having a lighted, pulsating (in both frequency and light intensity) series of LED light strips thereon, with the LEDs at a known spacing, and the autonomous underwater vehicle specially designed to detect and capture the docking rod and then be lifted structurally by a spherical end strop about which the vehicle can be pivoted and hoisted up (e.g., onto a ship). The method of recovery allows for very routine and reliable automated recovery of an unmanned underwater asset.
Type: Grant
Filed: January 22, 2018
Date of Patent: February 25, 2020
Assignee: Stone Aerospace, Inc.
Inventors: William C. Stone, Evan Clark, Kristof Richmond, Jeremy Paulus, Jason Kapit, Mark Scully, Peter Kimball
-
Publication number: 20180154994
Abstract: A system for automated rendezvous, docking, and capture of autonomous underwater vehicles at the conclusion of a mission, comprising a docking rod having a lighted, pulsating (in both frequency and light intensity) series of LED light strips thereon, with the LEDs at a known spacing, and the autonomous underwater vehicle specially designed to detect and capture the docking rod and then be lifted structurally by a spherical end strop about which the vehicle can be pivoted and hoisted up (e.g., onto a ship). The method of recovery allows for very routine and reliable automated recovery of an unmanned underwater asset.
Type: Application
Filed: January 22, 2018
Publication date: June 7, 2018
Applicant: Stone Aerospace, Inc.
Inventors: William C. Stone, Evan Clark, Kristof Richmond, Jeremy Paulus, Jason Kapit, Mark Scully, Peter Kimball
-
Patent number: 9873495
Abstract: A system for automated rendezvous, docking, and capture of autonomous underwater vehicles at the conclusion of a mission, comprising a docking rod having a lighted, pulsating (in both frequency and light intensity) series of LED light strips thereon, with the LEDs at a known spacing, and the autonomous underwater vehicle specially designed to detect and capture the docking rod and then be lifted structurally by a spherical end strop about which the vehicle can be pivoted and hoisted up (e.g., onto a ship). The method of recovery allows for very routine and reliable automated recovery of an unmanned underwater asset.
Type: Grant
Filed: October 19, 2015
Date of Patent: January 23, 2018
Assignee: Stone Aerospace, Inc.
Inventors: William C. Stone, Evan Clark, Kristof Richmond, Jeremy Paulus, Jason Kapit, Mark Scully, Peter Kimball
-
Publication number: 20160176487
Abstract: A system for automated rendezvous, docking, and capture of autonomous underwater vehicles at the conclusion of a mission, comprising a docking rod having a lighted, pulsating (in both frequency and light intensity) series of LED light strips thereon, with the LEDs at a known spacing, and the autonomous underwater vehicle specially designed to detect and capture the docking rod and then be lifted structurally by a spherical end strop about which the vehicle can be pivoted and hoisted up (e.g., onto a ship). The method of recovery allows for very routine and reliable automated recovery of an unmanned underwater asset.
Type: Application
Filed: October 19, 2015
Publication date: June 23, 2016
Inventors: William C. Stone, Evan Clark, Kristof Richmond, Jeremy Paulus, Jason Kapit, Mark Scully, Peter Kimball
-
Publication number: 20070260742
Abstract: Software for storing and distributing media in a local area network is disclosed. The software comprises library software and a player. The library software manages encrypted media files, for example, video files, and sends them to the player on request. The media files may be chapterised and the player may request one or more chapters. The player decodes the encrypted file and displays it. The player improves performance by using predictive chapter buffering, requesting the transfer of the next chapter from the library at a time such that the file is completely received and ready to play when the currently playing media finishes. The player minimizes the number of requests to the library by automatically creating a public folder of received files so that requests for one of the received files in the folder are transferred from the library to the player.
Type: Application
Filed: October 12, 2004
Publication date: November 8, 2007
Inventor: Evan Clark
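The timing idea behind predictive chapter buffering can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the method claimed in the application: the function name `prefetch_start`, its parameters, and the constant-bandwidth model are hypothetical.

```python
def prefetch_start(chapter_end_s, next_size_bytes, bandwidth_bytes_per_s, margin_s=0.0):
    """Return the playback time (seconds) at which to request the next chapter.

    Assumes a constant transfer rate, so the transfer takes
    next_size_bytes / bandwidth_bytes_per_s seconds. The request is issued
    early enough that the transfer finishes margin_s seconds before the
    current chapter ends, and never earlier than the start of playback.
    """
    if bandwidth_bytes_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    transfer_s = next_size_bytes / bandwidth_bytes_per_s
    return max(0.0, chapter_end_s - transfer_s - margin_s)


# A 50 MB chapter over a 1 MB/s link takes 50 s to transfer, so for a
# chapter that ends at 600 s the request goes out at 550 s.
request_at = prefetch_start(600.0, 50_000_000, 1_000_000)
```

When the transfer would take longer than the current chapter lasts, the function returns 0.0, meaning the request should be issued as soon as playback begins.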