Patents by Inventor Hosam A. Khalil

Hosam A. Khalil has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Adaptive batching to reduce recognition latency

Patent number: 11705117

Abstract: Acoustic features are batched into two different batches. The second batch of the two batches is made in response to a detection of a word hypothesis output by a speech recognition network that received the first batch. The number of acoustic feature frames of the second batch is equal to a second batch size greater than the first batch size. The second batch is also input to the speech recognition network for processing.

Type: Grant

Filed: October 13, 2021

Date of Patent: July 18, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Hosam A. Khalil, Emilian Y. Stoimenov, Yifan Gong, Chaojun Liu, Christopher H. Basoglu, Amit K. Agarwal, Naveen Parihar, Sayan Pathak
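
The batching idea in the abstract above can be illustrated with a minimal Python sketch. It is not taken from the patent: the function name `run_adaptive_batching`, the `recognize` callback (assumed to return True when the network outputs a word hypothesis), and the default batch sizes are all hypothetical.

```python
from typing import Callable, Iterable, List


def run_adaptive_batching(
    frames: Iterable[int],
    recognize: Callable[[List[int]], bool],
    first_batch_size: int = 4,
    second_batch_size: int = 16,
) -> List[int]:
    """Feed frames to `recognize` in batches and return the batch sizes used.

    Assumption: `recognize` returns True once the (hypothetical) speech
    recognition network has output a word hypothesis for the batch.
    """
    batch_size = first_batch_size  # start small to keep initial latency low
    batch_sizes: List[int] = []
    batch: List[int] = []
    for frame in frames:
        batch.append(frame)
        if len(batch) < batch_size:
            continue
        batch_sizes.append(len(batch))
        if recognize(batch):
            # A word hypothesis was detected: collect larger batches from now on.
            batch_size = second_batch_size
        batch = []
    if batch:
        # Flush the final partial batch.
        recognize(batch)
        batch_sizes.append(len(batch))
    return batch_sizes


if __name__ == "__main__":
    # Toy recognizer: pretend a word hypothesis appears once frame 5 is seen.
    print(run_adaptive_batching(range(30), lambda b: 5 in b))
```

In this toy run the first two batches use the smaller size, and every batch collected after the hypothesis is detected uses the larger one, apart from the final partial batch.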
Online language model interpolation for automatic speech recognition

Patent number: 11562738

Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.

Type: Grant

Filed: October 28, 2019

Date of Patent: January 24, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Veljko Miljanic, Aadyot Bhatnagar, Hosam Khalil, Christopher Basoglu
ADAPTIVE BATCHING TO REDUCE RECOGNITION LATENCY

Publication number: 20220068269

Abstract: Embodiments may include collection of a first batch of acoustic feature frames of an audio signal, the number of acoustic feature frames of the first batch equal to a first batch size, input of the first batch to a speech recognition network, collection, in response to detection of a word hypothesis output by the speech recognition network, of a second batch of acoustic feature frames of the audio signal, the number of acoustic feature frames of the second batch equal to a second batch size greater than the first batch size, and input of the second batch to the speech recognition network.

Type: Application

Filed: October 13, 2021

Publication date: March 3, 2022

Inventors: Hosam A. KHALIL, Emilian Y. STOIMENOV, Yifan GONG, Chaojun LIU, Christopher H. BASOGLU, Amit K. AGARWAL, Naveen PARIHAR, Sayan PATHAK
Adaptive batching to reduce recognition latency

Patent number: 11183178

Abstract: Embodiments may include collection of a first batch of acoustic feature frames of an audio signal, the number of acoustic feature frames of the first batch equal to a first batch size, input of the first batch to a speech recognition network, collection, in response to detection of a word hypothesis output by the speech recognition network, of a second batch of acoustic feature frames of the audio signal, the number of acoustic feature frames of the second batch equal to a second batch size greater than the first batch size, and input of the second batch to the speech recognition network.

Type: Grant

Filed: January 27, 2020

Date of Patent: November 23, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Hosam A. Khalil, Emilian Y. Stoimenov, Yifan Gong, Chaojun Liu, Christopher H. Basoglu, Amit K. Agarwal, Naveen Parihar, Sayan Pathak
ADAPTIVE BATCHING TO REDUCE RECOGNITION LATENCY

Publication number: 20210217410

Abstract: Embodiments may include collection of a first batch of acoustic feature frames of an audio signal, the number of acoustic feature frames of the first batch equal to a first batch size, input of the first batch to a speech recognition network, collection, in response to detection of a word hypothesis output by the speech recognition network, of a second batch of acoustic feature frames of the audio signal, the number of acoustic feature frames of the second batch equal to a second batch size greater than the first batch size, and input of the second batch to the speech recognition network.

Type: Application

Filed: January 27, 2020

Publication date: July 15, 2021

Inventors: Hosam A. KHALIL, Emilian Y. STOIMENOV, Yifan GONG, Chaojun LIU, Christopher H. BASOGLU, Amit K. AGARWAL, Naveen PARIHAR, Sayan PATHAK
Cushioned outdoor oval furniture piece that can swivel on a stand with a tinted lid

Patent number: 11051619

Abstract: A cushioned, swiveling outdoor oval furniture piece on a stand with a tinted lid is disclosed. This invention improves on existing furniture pieces because the stand is a wider base attached from the top of the furniture piece wrapping around to the bottom allowing for a more stable stand on outdoor surfaces, is an oval ostrich egg shape allowing for up to two persons to sit, and has a tinted lid to cover the opening to protect the interior and person or persons sitting from normal weather conditions and sunlight with an option to swivel.

Type: Grant

Filed: April 1, 2018

Date of Patent: July 6, 2021

Inventor: Hosam Khalil Smiley
ONLINE LANGUAGE MODEL INTERPOLATION FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20200349930

Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.

Type: Application

Filed: October 28, 2019

Publication date: November 5, 2020

Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Veljko MILJANIC, Aadyot BHATNAGAR, Hosam KHALIL, Christopher BASOGLU
Confidence features for automated speech recognition arbitration

Patent number: 10706852

Abstract: The described technology provides arbitration between speech recognition results generated by different automatic speech recognition (ASR) engines, such as ASR engines trained according to different language or acoustic models. The system includes an arbitrator that selects between a first speech recognition result representing an acoustic utterance as transcribed by a first ASR engine and a second speech recognition result representing the acoustic utterance as transcribed by a second ASR engine. This selection is based on a set of confidence features that is initially used by the first ASR engine or the second ASR engine to generate the first and second speech recognition results.

Type: Grant

Filed: November 13, 2015

Date of Patent: July 7, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kshitiz Kumar, Hosam Khalil, Yifan Gong, Ziad Al-Bawab, Chaojun Liu
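
As a rough illustration of arbitration between two recognition results, the Python sketch below picks the result with the higher weighted sum of its confidence features. The abstract does not say how the features are combined, so the linear scoring, the feature names, the weights, and the names `RecognitionResult`, `score` and `arbitrate` are assumptions made for this example only.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class RecognitionResult:
    text: str
    # Hypothetical confidence features produced alongside the transcription.
    confidence_features: Dict[str, float]


def score(result: RecognitionResult, weights: Dict[str, float]) -> float:
    """Weighted sum of confidence features (an assumed scoring rule)."""
    return sum(
        weights.get(name, 0.0) * value
        for name, value in result.confidence_features.items()
    )


def arbitrate(
    first: RecognitionResult,
    second: RecognitionResult,
    weights: Dict[str, float],
) -> RecognitionResult:
    """Select the result with the higher score; ties go to the first engine."""
    return first if score(first, weights) >= score(second, weights) else second


if __name__ == "__main__":
    weights = {"acoustic": 0.5, "language": 0.5}
    a = RecognitionResult("call mom", {"acoustic": 0.5, "language": 0.5})
    b = RecognitionResult("call tom", {"acoustic": 1.0, "language": 0.5})
    print(arbitrate(a, b, weights).text)
```

A production arbitrator would more likely use a trained classifier over the features; the weighted sum only stands in for that decision step.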
Cushioned outdoor oval furniture piece that can swivel on a stand with a tinted lid

Publication number: 20190298065

Abstract: A cushioned, swiveling outdoor oval furniture piece on a stand with a tinted lid is disclosed. This invention improves on existing furniture pieces because the stand is a wider base attached from the top of the furniture piece wrapping around to the bottom allowing for a more stable stand on outdoor surfaces, is an oval ostrich egg shape allowing for up to two persons to sit, and has a tinted lid to cover the opening to protect the interior and person or persons sitting from normal weather conditions and sunlight with an option to swivel.

Type: Application

Filed: April 1, 2018

Publication date: October 3, 2019

Inventor: Hosam Khalil Smiley
Method for simultaneously peeling and cutting cooked vegetables

Patent number: 10362890

Abstract: A system, method and device for cutting and peeling a root vegetable. The system includes a frame to receive a cooked root vegetable, a surface slidably connected to the frame to move linearly and reciprocally along at least a portion of the length of the frame, a wire assembly fixed to the frame and oriented perpendicular to the movement of the surface, and power to enable the surface to move toward the wire assembly. When the surface is activated, it moves the vegetable through the wire assembly which simultaneously cuts and removes the skin therefrom.

Type: Grant

Filed: November 6, 2015

Date of Patent: July 30, 2019

Assignee: Hosam Khalil Smiley

Inventor: Hosam Khalil Smiley
CONFIDENCE FEATURES FOR AUTOMATED SPEECH RECOGNITION ARBITRATION

Publication number: 20170140759

Abstract: The described technology provides arbitration between speech recognition results generated by different automatic speech recognition (ASR) engines, such as ASR engines trained according to different language or acoustic models. The system includes an arbitrator that selects between a first speech recognition result representing an acoustic utterance as transcribed by a first ASR engine and a second speech recognition result representing the acoustic utterance as transcribed by a second ASR engine. This selection is based on a set of confidence features that is initially used by the first ASR engine or the second ASR engine to generate the first and second speech recognition results.

Type: Application

Filed: November 13, 2015

Publication date: May 18, 2017

Inventors: Kshitiz Kumar, Hosam Khalil, Yifan Gong, Ziad Al-Bawab, Chaojun Liu
Network jitter smoothing with reduced delay

Patent number: 9479276

Abstract: A method of compensating for jitter in a packet stream is described. The method comprises placing undecoded frames extracted from packets in the packet stream into a jitter buffer while decoding frames from the jitter buffer and placing the decoded frames into a sample buffer at a rate determined using an average playout delay. The average playout delay is the running average of the playout delay calculated for each packet as each packet becomes available. The playout delay for each packet is the sum of a sample buffer delay and a jitter buffer delay. As each packet is received, the average playout delay is adjusted based on a comparison of the playout delay associated with the received packet to the current average playout delay.

Type: Grant

Filed: July 9, 2013

Date of Patent: October 25, 2016

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Hosam A. Khalil, Guo-Wei Shieh, Tian Wang
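
The delay bookkeeping in the abstract above can be sketched in Python. The per-packet playout delay as the sum of the two buffer delays follows the abstract. The update rule is an assumption: the abstract only says the average is adjusted after comparing it with the packet's delay, so exponential smoothing with a hypothetical weight `alpha` is used here as one simple possibility.

```python
def playout_delay(sample_buffer_delay_ms: float, jitter_buffer_delay_ms: float) -> float:
    """Per-packet playout delay: sample buffer delay plus jitter buffer delay."""
    return sample_buffer_delay_ms + jitter_buffer_delay_ms


def update_average_playout_delay(
    average_ms: float, packet_delay_ms: float, alpha: float = 0.1
) -> float:
    """Move the running average toward the new packet's playout delay.

    Assumption: exponential smoothing; `alpha` is a hypothetical weight
    that is not specified in the abstract.
    """
    return average_ms + alpha * (packet_delay_ms - average_ms)


if __name__ == "__main__":
    average = 100.0
    # Hypothetical (sample buffer delay, jitter buffer delay) pairs in ms.
    for sample_ms, jitter_ms in [(20.0, 100.0), (20.0, 60.0), (20.0, 90.0)]:
        average = update_average_playout_delay(
            average, playout_delay(sample_ms, jitter_ms)
        )
        print(round(average, 2))
```

A decoder loop could then compare the current average with a target delay to decide how quickly to move frames from the jitter buffer into the sample buffer.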
Adaptive conference comfort noise

Patent number: 8589153

Abstract: A continuous comfort noise is provided that is overlaid for the entire duration of a conference call scenario. The comfort noise may be adapted to match the levels of the actual background noise detected on one or more of the conference call participants' devices on the transmitting end(s) of a conference call as well as the participants' speech levels. The comfort noise may also be adapted to the type of listening device employed on the receiving end of a conference call. The comfort noise level may be customized to an appropriate and comfortable level for the type of listening device being used, and the system may continuously mix the comfort noise with incoming audio signals for the entire duration of a conference call, lowering the comfort noise level gradually during speaking periods for additional user experience improvement.

Type: Grant

Filed: June 28, 2011

Date of Patent: November 19, 2013

Assignee: Microsoft Corporation

Inventors: Hosam Khalil, Xiaoqin Sun, Hong Wang Sodoma, Warren Lam
NETWORK JITTER SMOOTHING WITH REDUCED DELAY

Publication number: 20130294463

Abstract: A method of compensating for jitter in a packet stream is described. The method comprises placing undecoded frames extracted from packets in the packet stream into a jitter buffer while decoding frames from the jitter buffer and placing the decoded frames into a sample buffer at a rate determined using an average playout delay. The average playout delay is the running average of the playout delay calculated for each packet as each packet becomes available. The playout delay for each packet is the sum of a sample buffer delay and a jitter buffer delay. As each packet is received, the average playout delay is adjusted based on a comparison of the playout delay associated with the received packet to the current average playout delay.

Type: Application

Filed: July 9, 2013

Publication date: November 7, 2013

Inventors: Hosam A. Khalil, Guo-Wei Shieh, Tian Wang
Network jitter smoothing with reduced delay

Patent number: 8483243

Abstract: A method of compensating for jitter in a packet stream is described. The method comprises placing undecoded frames extracted from packets in the packet stream into a jitter buffer while decoding frames from the jitter buffer and placing the decoded frames into a sample buffer at a rate determined using an average playout delay. The average playout delay is the running average of the playout delay calculated for each packet as each packet becomes available. The playout delay for each packet is the sum of a sample buffer delay and a jitter buffer delay. As each packet is received, the average playout delay is adjusted based on a comparison of the playout delay associated with the received packet to the current average playout delay.

Type: Grant

Filed: September 15, 2006

Date of Patent: July 9, 2013

Assignee: Microsoft Corporation

Inventors: Hosam A Khalil, Guo-Wei Shieh, Tian Wang
Conference signal anomaly detection

Patent number: 8379800

Abstract: Detecting at least one of echo and noise based on analysis of audio streams transmitted to and received from each endpoint of a conference. When certain characteristics of the respective audio streams for a given endpoint are classified as significant against certain criteria, a determination is made as to whether that endpoint is a source of echo and/or noise. Subsequent actions are taken to alert users and/or prevent broadcast of impaired signals.

Type: Grant

Filed: March 29, 2011

Date of Patent: February 19, 2013

Assignee: Microsoft Corporation

Inventors: Warren M. Lam, Hosam A. Khalil, Vinod Prakash, Sang K. Choe, Hong Wang Sodoma
ADAPTIVE CONFERENCE COMFORT NOISE

Publication number: 20130006622

Abstract: A continuous comfort noise is provided that is overlaid for the entire duration of a conference call scenario. The comfort noise may be adapted to match the levels of the actual background noise detected on one or more of the conference call participants' devices on the transmitting end(s) of a conference call as well as the participants' speech levels. The comfort noise may also be adapted to the type of listening device employed on the receiving end of a conference call. The comfort noise level may be customized to an appropriate and comfortable level for the type of listening device being used, and the system may continuously mix the comfort noise with incoming audio signals for the entire duration of a conference call, lowering the comfort noise level gradually during speaking periods for additional user experience improvement.

Type: Application

Filed: June 28, 2011

Publication date: January 3, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Hosam Khalil, Xiaoqin Sun, Hong Wang Sodoma, Warren Lam
CONFERENCE SIGNAL ANOMALY DETECTION

Publication number: 20120250830

Abstract: Detecting at least one of echo and noise based on analysis of audio streams transmitted to and received from each endpoint of a conference. When certain characteristics of the respective audio streams for a given endpoint are classified as significant against certain criteria, a determination is made as to whether that endpoint is a source of echo and/or noise. Subsequent actions are taken to alert users and/or prevent broadcast of impaired signals.

Type: Application

Filed: March 29, 2011

Publication date: October 4, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Warren M. Lam, Hosam A. Khalil, Vinod Prakash, Sang K. Choe, Hong Wang Sodoma
Audio glitch reduction

Patent number: 8005670

Abstract: To reduce audio glitches, the rendering buffer of an audio application is pre-filled with natural-sounding audio rather than zeros. For every frame of audio sent for rendering, the rendering buffer is also pre-filled or the signal is stretched in the buffer in anticipation of a glitch. If the glitch does not occur, then the stretched signal is overwritten and the end user does not notice it. If the glitch does occur, then the rendering buffer is already filled with a stretched version of the previous audio and may result in sound that is acceptable. After recovery from the glitch, any new data is smoothly merged into the fake audio that was generated before.

Type: Grant

Filed: October 17, 2007

Date of Patent: August 23, 2011

Assignee: Microsoft Corporation

Inventors: Hosam A. Khalil, Guo-Wei Shieh
Robust decoder

Patent number: 7962335

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Grant

Filed: July 14, 2009

Date of Patent: June 14, 2011

Assignee: Microsoft Corporation

Inventors: Hosam A. Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
