Patents by Inventor Ziad Al Bawab

Ziad Al Bawab has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Organizational-based language model generation

Patent number: 11676576

Abstract: Systems and methods are provided for acquiring training data and building an organizational-based language model based on the training data. In organizational data is generated via one or more applications associated with an organization, the collected organizational data is aggregated and filtered into training data that is used for training an organizational-based language model for speech processing based on the training data.

Type: Grant

Filed: August 11, 2021

Date of Patent: June 13, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Cem Aksoylar, Michael Levit, Xin Meng, Shuangyu Chang, Suyash Choudhury, Dhiresh Rawal, Tao Li, Rishi Girish, Marcus Jager, Ananth Rampura Sheshagiri Rao
Meeting-adapted language model for speech recognition

Patent number: 11636854

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Type: Grant

Filed: May 24, 2022

Date of Patent: April 25, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
Online language model interpolation for automatic speech recognition

Patent number: 11562738

Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.

Type: Grant

Filed: October 28, 2019

Date of Patent: January 24, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Veljko Miljanic, Aadyot Bhatnagar, Hosam Khalil, Christopher Basoglu
MEETING-ADAPTED LANGUAGE MODEL FOR SPEECH RECOGNITION

Publication number: 20220358912

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Type: Application

Filed: May 24, 2022

Publication date: November 10, 2022

Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Christopher H. BASOGLU, Nathan E. WOHLGEMUTH
Meeting-adapted language model for speech recognition

Patent number: 11430433

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Type: Grant

Filed: August 5, 2019

Date of Patent: August 30, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
Meeting-adapted language model for speech recognition

Patent number: 11348574

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Type: Grant

Filed: August 5, 2019

Date of Patent: May 31, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
ORGANIZATIONAL-BASED LANGUAGE MODEL GENERATION

Publication number: 20220013109

Abstract: Provided is a system and method for acquiring training data and building an organizational-based language model based on the training data. In one example, the method may include collecting organizational data that is generated via one or more applications associated with an organization, aggregating the collected organizational data with previously collected organizational data to generate aggregated organizational training data, training an organizational-based language model for speech processing based on the aggregated organizational training data, and storing the trained organizational-based language model.

Type: Application

Filed: August 11, 2021

Publication date: January 13, 2022

Inventors: Ziad AL BAWAB, Anand U. DESAI, Cem AKSOYLAR, Michael LEVIT, Xin MENG, Shuangyu CHANG, Suyash CHOUDHURY, Dhiresh RAWAL, Tao LI, Rishi GIRISH, Marcus JAGER, Ananth Rampura SHESHAGIRI RAO
Organizational-based language model generation

Patent number: 11120788

Abstract: Provided is a system and method for acquiring training data and building an organizational-based language model based on the training data. In one example, the method may include collecting organizational data that is generated via one or more applications associated with an organization, aggregating the collected organizational data with previously collected organizational data to generate aggregated organizational training data, training an organizational-based language model for speech processing based on the aggregated organizational training data, and storing the trained organizational-based language model.

Type: Grant

Filed: June 27, 2019

Date of Patent: September 14, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Cem Aksoylar, Michael Levit, Xin Meng, Shuangyu Chang, Suyash Choudhury, Dhiresh Rawal, Tao Li, Rishi Girish, Marcus Jager, Ananth Rampura Sheshagiri Rao
Hyperarticulation detection in repetitive voice queries using pairwise comparison for improved speech recognition

Patent number: 10847147

Abstract: Automatic speech recognition systems can benefit from cues in user voice such as hyperarticulation. Traditional approaches typically attempt to define and detect an absolute state of hyperarticulation, which is very difficult, especially on short voice queries. This disclosure provides for an approach for hyperarticulation detection using pair-wise comparisons and on a real-world speech recognition system. The disclosed approach uses delta features extracted from a pair of repetitive user utterances. The improvements provided by the disclosed systems and methods include improvements in word error rate by using hyperarticulation information as a feature in a second pass N-best hypotheses rescoring setup.

Type: Grant

Filed: May 24, 2019

Date of Patent: November 24, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ranjitha Gurunath Kulkarni, Ahmed Moustafa El Kholy, Ziad Al Bawab, Noha Alon, Imed Zitouni
ONLINE LANGUAGE MODEL INTERPOLATION FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20200349930

Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.

Type: Application

Filed: October 28, 2019

Publication date: November 5, 2020

Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Veljko MILJANIC, Aadyot BHATNAGAR, Hosam KHALIL, Christopher BASOGLU
MEETING-ADAPTED LANGUAGE MODEL FOR SPEECH RECOGNITION

Publication number: 20200349931

Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.

Type: Application

Filed: August 5, 2019

Publication date: November 5, 2020

Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Christopher H. BASOGLU, Nathan E. WOHLGEMUTH
ORGANIZATIONAL-BASED LANGUAGE MODEL GENERATION

Publication number: 20200349920

Abstract: Provided is a system and method for acquiring training data and building an organizational-based language model based on the training data. In one example, the method may include collecting organizational data that is generated via one or more applications associated with an organization, aggregating the collected organizational data with previously collected organizational data to generate aggregated organizational training data, training an organizational-based language model for speech processing based on the aggregated organizational training data, and storing the trained organizational-based language model.

Type: Application

Filed: June 27, 2019

Publication date: November 5, 2020

Inventors: Ziad AL BAWAB, Anand U DESAI, Cem AKSOYLAR, Michael LEVIT, Xin MENG, Shuangyu CHANG, Suyash CHOUDHURY, Dhiresh RAWAL, Tao LI, Rishi GIRISH, Marcus JAGER, Ananth Rampura SHESHAGIRI RAO
Confidence features for automated speech recognition arbitration

Patent number: 10706852

Abstract: The described technology provides arbitration between speech recognition results generated by different automatic speech recognition (ASR) engines, such as ASR engines trained according to different language or acoustic models. The system includes an arbitrator that selects between a first speech recognition result representing an acoustic utterance as transcribed by a first ASR engine and a second speech recognition result representing the acoustic utterance as transcribed by a second ASR engine. This selection is based on a set of confidence features that is initially used by the first ASR engine or the second ASR engine to generate the first and second speech recognition results.

Type: Grant

Filed: November 13, 2015

Date of Patent: July 7, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kshitiz Kumar, Hosam Khalil, Yifan Gong, Ziad Al-Bawab, Chaojun Liu
HYPERARTICULATION DETECTION IN REPETITIVE VOICE QUERIES USING PAIRWISE COMPARISON FOR IMPROVED SPEECH RECOGNITION

Publication number: 20190279612

Abstract: Automatic speech recognition systems can benefit from cues in user voice such as hyperarticulation. Traditional approaches typically attempt to define and detect an absolute state of hyperarticulation, which is very difficult, especially on short voice queries. This disclosure provides for an approach for hyperarticulation detection using pair-wise comparisons and on a real-world speech recognition system. The disclosed approach uses delta features extracted from a pair of repetitive user utterances. The improvements provided by the disclosed systems and methods include improvements in word error rate by using hyperarticulation information as a feature in a second pass N-best hypotheses rescoring setup.

Type: Application

Filed: May 24, 2019

Publication date: September 12, 2019

Inventors: Ranjitha Gurunath Kulkarni, Ahmed Moustafa El Kholy, Ziad Al Bawab, Noha Alon, Imed Zitouni
Hyperarticulation detection in repetitive voice queries using pairwise comparison for improved speech recognition

Patent number: 10354642

Abstract: Automatic speech recognition systems can benefit from cues in user voice such as hyperarticulation. Traditional approaches typically attempt to define and detect an absolute state of hyperarticulation, which is very difficult, especially on short voice queries. This disclosure provides for an approach for hyperarticulation detection using pair-wise comparisons and on a real-world speech recognition system. The disclosed approach uses delta features extracted from a pair of repetitive user utterances. The improvements provided by the disclosed systems and methods include improvements in word error rate by using hyperarticulation information as a feature in a second pass N-best hypotheses rescoring setup.

Type: Grant

Filed: June 15, 2017

Date of Patent: July 16, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ranjitha Gurunath Kulkarni, Ahmed Moustafa El Kholy, Ziad Al Bawab, Noha Alon, Imed Zitouni
HYPERARTICULATION DETECTION IN REPETITIVE VOICE QUERIES USING PAIRWISE COMPARISON FOR IMPROVED SPEECH RECOGNITION

Publication number: 20180254035

Abstract: Automatic speech recognition systems can benefit from cues in user voice such as hyperarticulation. Traditional approaches typically attempt to define and detect an absolute state of hyperarticulation, which is very difficult, especially on short voice queries. This disclosure provides for an approach for hyperarticulation detection using pair-wise comparisons and on a real-world speech recognition system. The disclosed approach uses delta features extracted from a pair of repetitive user utterances. The improvements provided by the disclosed systems and methods include improvements in word error rate by using hyperarticulation information as a feature in a second pass N-best hypotheses rescoring setup.

Type: Application

Filed: June 15, 2017

Publication date: September 6, 2018

Inventors: Ranjitha Gurunath Kulkarni, Ahmed Moustafa El Kholy, Ziad Al Bawab, Noha Alon, Imed Zitouni
Pronunciation learning through correction logs

Patent number: 9947317

Abstract: A new pronunciation learning system for dynamically learning new pronunciations assisted by user correction logs. The user correction logs provide a record of speech recognition events and subsequent user behavior that implicitly confirms or rejects the recognition result and/or shows the user's intended words by via subsequent input. The system analyzes the correction logs and distills them down to a set of words which lack acceptable pronunciations. Hypothetical pronunciations, constrained by spelling and other linguistic knowledge, are generated for each of the words. Offline recognition determines the hypothetical pronunciations with a good acoustical match to the audio data likely to contain the words. The matching pronunciations are aggregated and adjudicated to select new pronunciations for the words to improve general or personalized recognition models.

Type: Grant

Filed: February 13, 2017

Date of Patent: April 17, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Nicholas Kibre, Umut Ozertem, Sarangarajan Parthasarathy, Ziad Al Bawab
PRONUNCIATION LEARNING THROUGH CORRECTION LOGS

Publication number: 20170154623

Abstract: A new pronunciation learning system for dynamically learning new pronunciations assisted by user correction logs. The user correction logs provide a record of speech recognition events and subsequent user behavior that implicitly confirms or rejects the recognition result and/or shows the user's intended words by via subsequent input. The system analyzes the correction logs and distills them down to a set of words which lack acceptable pronunciations. Hypothetical pronunciations, constrained by spelling and other linguistic knowledge, are generated for each of the words. Offline recognition determines the hypothetical pronunciations with a good acoustical match to the audio data likely to contain the words. The matching pronunciations are aggregated and adjudicated to select new pronunciations for the words to improve general or personalized recognition models.

Type: Application

Filed: February 13, 2017

Publication date: June 1, 2017

Applicant: Microsoft Technology Licensing, LLC.

Inventors: Nicholas Kibre, Umut Ozertem, Sarangarajan Parthasarathy, Ziad Al Bawab
CONFIDENCE FEATURES FOR AUTOMATED SPEECH RECOGNITION ARBITRATION

Publication number: 20170140759

Abstract: The described technology provides arbitration between speech recognition results generated by different automatic speech recognition (ASR) engines, such as ASR engines trained according to different language or acoustic models. The system includes an arbitrator that selects between a first speech recognition result representing an acoustic utterance as transcribed by a first ASR engine and a second speech recognition result representing the acoustic utterance as transcribed by a second ASR engine. This selection is based on a set of confidence features that is initially used by the first ASR engine or the second ASR engine to generate the first and second speech recognition results.

Type: Application

Filed: November 13, 2015

Publication date: May 18, 2017

Inventors: Kshitiz Kumar, Hosam Khalil, Yifan Gong, Ziad Al-Bawab, Chaojun Liu
Pronunciation learning through correction logs

Patent number: 9589562

Abstract: A new pronunciation learning system for dynamically learning new pronunciations assisted by user correction logs. The user correction logs provide a record of speech recognition events and subsequent user behavior that implicitly confirms or rejects the recognition result and/or shows the user's intended words by via subsequent input. The system analyzes the correction logs and distills them down to a set of words which lack acceptable pronunciations. Hypothetical pronunciations, constrained by spelling and other linguistic knowledge, are generated for each of the words. Offline recognition determines the hypothetical pronunciations with a good acoustical match to the audio data likely to contain the words. The matching pronunciations are aggregated and adjudicated to select new pronunciations for the words to improve general or personalized recognition models.

Type: Grant

Filed: February 21, 2014

Date of Patent: March 7, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Nicholas Kibre, Umut Ozertem, Sarangarajan Parthasarathy, Ziad Al Bawab

1 2 next