Patents by Inventor Yun-cheng Ju

Yun-cheng Ju has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for speaker role determination and scrubbing identifying information

Patent number: 11768961

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Grant

Filed: October 28, 2021

Date of Patent: September 26, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
Automated labeling of child objects within tagged parents

Patent number: 11544279

Abstract: Solutions for automated labeling of child objects within tagged parents include: receiving a plurality of parent objects, each having a tag and including a plurality of child objects; receiving a machine learning (ML) component operable to rank objects according to relevance to queries; for each parent object: generating a set of restricted objects, wherein each restricted object corresponds to each child object in the plurality of child objects; for each of a plurality of queries, ranking, with the ML component, the restricted objects according to relevance; based at least on the query and an inverse of the rank of the restricted objects, assigning a child object label.

Type: Grant

Filed: February 3, 2022

Date of Patent: January 3, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Yu Zhang, Lei Lu
AUTOMATED LABELING OF CHILD OBJECTS WITHIN TAGGED PARENTS

Publication number: 20220179872

Abstract: Solutions for automated labeling of child objects within tagged parents include: receiving a plurality of parent objects, each having a tag and including a plurality of child objects; receiving a machine learning (ML) component operable to rank objects according to relevance to queries; for each parent object: generating a set of restricted objects, wherein each restricted object corresponds to each child object in the plurality of child objects; for each of a plurality of queries, ranking, with the ML component, the restricted objects according to relevance; based at least on the query and an inverse of the rank of the restricted objects, assigning a child object label.

Type: Application

Filed: February 3, 2022

Publication date: June 9, 2022

Inventors: Yun-Cheng JU, Ashwarya PODDAR, Yu ZHANG, Lei LU
Automated labeling of child objects within tagged parents

Patent number: 11269895

Abstract: Solutions for automated labeling of child objects within tagged parents include: receiving a plurality of parent objects, each having a tag and including a plurality of child objects; receiving a machine learning (ML) component operable to rank the parent objects; for each parent object: generating a set of restricted objects, wherein each restricted object is missing a child object; for each of a plurality of queries, ranking, with the ML component, the restricted objects according to relevance; based at least on the query and an inverse of the rank of the restricted objects, assigning a child object label. In some examples, the object missing from the restricted object having the lowest relevance rank is assigned a label based on the query. Various advantageous uses of the assigned child labels include enhancing search result reporting, automatically generating ML training data, and automatically generating and/or modifying reports.

Type: Grant

Filed: September 6, 2019

Date of Patent: March 8, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Yu Zhang, Lei Lu
SYSTEM AND METHOD FOR SPEAKER ROLE DETERMINATION AND SCRUBBING IDENTIFYING INFORMATION

Publication number: 20220050922

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Application

Filed: October 28, 2021

Publication date: February 17, 2022

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
System and method for speaker role determination and scrubbing identifying information

Patent number: 11182504

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Grant

Filed: April 29, 2019

Date of Patent: November 23, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
System and method for speaker role determination and scrubbing identifying information

Patent number: 11062706

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Grant

Filed: April 29, 2019

Date of Patent: July 13, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
Performing an operation relative to tabular data based upon voice input

Patent number: 10956433

Abstract: Described herein are various technologies pertaining to performing an operation relative to tabular data based upon voice input. An ASR system includes a language model that is customized based upon content of the tabular data. The ASR system receives a voice signal that is representative of speech of a user. The ASR system creates a transcription of the voice signal based upon the ASR being customized with the content of the tabular data. The operation relative to the tabular data is performed based upon the transcription of the voice signal.

Type: Grant

Filed: May 21, 2014

Date of Patent: March 23, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Prabhdeep Singh, Kris Ganjam, Sumit Gulwani, Mark Marron, Yun-Cheng Ju, Kaushik Chakrabarti
AUTOMATED LABELING OF CHILD OBJECTS WITHIN TAGGED PARENTS

Publication number: 20210073236

Abstract: Solutions for automated labeling of child objects within tagged parents include: receiving a plurality of parent objects, each having a tag and including a plurality of child objects; receiving a machine learning (ML) component operable to rank the parent objects; for each parent object: generating a set of restricted objects, wherein each restricted object is missing a child object; for each of a plurality of queries, ranking, with the ML component, the restricted objects according to relevance; based at least on the query and an inverse of the rank of the restricted objects, assigning a child object label. In some examples, the object missing from the restricted object having the lowest relevance rank is assigned a label based on the query. Various advantageous uses of the assigned child labels include enhancing search result reporting, automatically generating ML training data, and automatically generating and/or modifying reports.

Type: Application

Filed: September 6, 2019

Publication date: March 11, 2021

Inventors: Yun-Cheng JU, Ashwarya PODDAR, Yu ZHANG, Lei LU
SYSTEM AND METHOD FOR SPEAKER ROLE DETERMINATION AND SCRUBBING IDENTIFYING INFORMATION

Publication number: 20200342860

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Application

Filed: April 29, 2019

Publication date: October 29, 2020

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
SYSTEM AND METHOD FOR SPEAKER ROLE DETERMINATION AND SCRUBBING IDENTIFYING INFORMATION

Publication number: 20200342138

Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.

Type: Application

Filed: April 29, 2019

Publication date: October 29, 2020

Inventors: Yun-Cheng Ju, Ashwarya Poddar, Royi Ronen, Oron Nir, Ami Turgman, Andreas Stolcke, Edan Hauon
Clustering user utterance intents with semantic parsing

Patent number: 10134389

Abstract: A system is provided that trains a spoken language understanding (SLU) classifier. A corpus of user utterances is received. For each of the user utterances in the corpus, the user utterance is semantically parsed, and the result of this semantic parsing is represented as a rooted semantic parse graph. The parse graphs representing all of the user utterances in the corpus are then combined into a single corpus graph that represents the semantic parses of the entire corpus. The user utterances in the corpus are then clustered into intent-wise homogeneous groups of user utterances, where this clustering includes finding subgraphs in the corpus graph that represent different groups of user utterances, and each of these different groups has a similar user intent. The intent-wise homogeneous groups of user utterances are then used to train the SLU classifier, and the trained SLU classifier is output.

Type: Grant

Filed: September 4, 2015

Date of Patent: November 20, 2018

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey G. Zweig, Gokhan Tur
Crowdsourced, grounded language for intent modeling in conversational interfaces

Patent number: 9754585

Abstract: Different advantageous embodiments provide a crowdsourcing method for modeling user intent in conversational interfaces. One or more stimuli are presented to a plurality of describers. One or more sets of describer data are captured from the plurality of describers using a data collection mechanism. The one or more sets of describer data are processed to generate one or more models. Each of the one or more models is associated with a specific stimulus from the one or more stimuli.

Type: Grant

Filed: April 3, 2012

Date of Patent: September 5, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Christopher John Brockett, Piali Choudhury, William Brennan Dolan, Yun-Cheng Ju, Patrick Pantel, Noelle Mallory Sophy, Svitlana Volkova
Pronunciation learning from user correction

Patent number: 9640175

Abstract: Systems and methods are described for adding entries to a custom lexicon used by a speech recognition engine of a speech interface in response to user interaction with the speech interface. In one embodiment, a speech signal is obtained when the user speaks a name of a particular item to be selected from among a finite set of items. If a phonetic description of the speech signal is not recognized by the speech recognition engine, then the user is presented with a means for selecting the particular item from among the finite set of items by providing input in a manner that does not include speaking the name of the item. After the user has selected the particular item via the means for selecting, the phonetic description of the speech signal is stored in association with a text description of the particular item in the custom lexicon.

Type: Grant

Filed: October 7, 2011

Date of Patent: May 2, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Wei-Ting Frank Liu, Andrew Lovitt, Stefanie Tomko, Yun-Cheng Ju
CLUSTERING USER UTTERANCE INTENTS WITH SEMANTIC PARSING

Publication number: 20170069310

Abstract: A system is provided that trains a spoken language understanding (SLU) classifier. A corpus of user utterances is received. For each of the user utterances in the corpus, the user utterance is semantically parsed, and the result of this semantic parsing is represented as a rooted semantic parse graph. The parse graphs representing all of the user utterances in the corpus are then combined into a single corpus graph that represents the semantic parses of the entire corpus. The user utterances in the corpus are then clustered into intent-wise homogeneous groups of user utterances, where this clustering includes finding subgraphs in the corpus graph that represent different groups of user utterances, and each of these different groups has a similar user intent. The intent-wise homogeneous groups of user utterances are then used to train the SLU classifier, and the trained SLU classifier is output.

Type: Application

Filed: September 4, 2015

Publication date: March 9, 2017

Inventors: Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey G. Zweig, Gokhan Tur
Automated data cleanup by substitution of words of the same pronunciation and different spelling in speech recognition

Patent number: 9460708

Abstract: The described implementations relate to automated data cleanup. One system includes a language model generated from language model seed text and a dictionary of possible data substitutions. This system also includes a transducer configured to cleanse a corpus utilizing the language model and the dictionary. The transducer can process speech recognition data in some cases by substituting a second word for a first word which shares pronunciation with the first word but is spelled differently. In some cases, this can be accomplished by establishing corresponding probabilities of the first word and second word based on a third word that appears in sequence with the first word.

Type: Grant

Filed: September 17, 2009

Date of Patent: October 4, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Geoffrey Zweig, Yun-Cheng Ju
Facilitating development of a spoken natural language interface

Patent number: 9330659

Abstract: A development system is described for facilitating the development of a spoken natural language (SNL) interface. The development system receives seed templates from a developer, each of which provides a command phrasing that can be used to invoke a function, when spoken by an end user. The development system then uses one or more development resources, such as a crowdsourcing system and a paraphrasing system, to provide additional templates. This yields an extended set of templates. A generation system then generates one or more models based on the extended set of templates. A user device may install the model(s) for use in interpreting commands spoken by an end user. When the user device recognizes a command, it may automatically invoke a function associated with that command. Overall, the development system provides an easy-to-use tool for producing an SNL interface.

Type: Grant

Filed: February 25, 2013

Date of Patent: May 3, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yun-Cheng Ju, Matthai Philipose, Seungyeop Han
Searching a database of listings

Patent number: 9218412

Abstract: A database having listings rather than long documents is searched using a term frequency-inverse document frequency (Tf/Idf) algorithm.

Type: Grant

Filed: May 10, 2007

Date of Patent: December 22, 2015

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alejandro Acero, Geoffrey G. Zweig
Spoken utterance classification training for a speech recognition system

Patent number: 9082403

Abstract: The subject disclosure is directed towards training a classifier for spoken utterances without relying on human-assistance. The spoken utterances may be related to a voice menu program for which a speech comprehension component interprets the spoken utterances into voice menu options. The speech comprehension component provides confirmations to some of the spoken utterances in order to accurately assign a semantic label. For each spoken utterance with a denied confirmation, the speech comprehension component automatically generates a pseudo-semantic label that is consistent with the denied confirmation and selected from a set of potential semantic labels and updates a classification model associated with the classifier using the pseudo-semantic label.

Type: Grant

Filed: December 15, 2011

Date of Patent: July 14, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yun-Cheng Ju, James Garnet Droppo, III
Telephony service interaction management

Patent number: 8990071

Abstract: A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The method also includes translating speech input to one or more DTMF tones and communicating the one or more DTMF tones to the communication partner, if the communication partner expects DTMF input.

Type: Grant

Filed: March 29, 2010

Date of Patent: March 24, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yun-Cheng Ju, Stefanie Tomko, Frank Liu, Ivan Tashev

1 2 3 4 5 … next