Patents by Inventor Mostafa El-Khamy

Mostafa El-Khamy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10929665
    Abstract: A method for computing a dominant class of a scene includes: receiving an input image of a scene; generating a segmentation map of the input image, the segmentation map including a plurality of pixels, each of the pixels being labeled with a corresponding class of a plurality of classes; computing a plurality of area ratios based on the segmentation map, each of the area ratios corresponding to a different class of the plurality of classes of the segmentation map; applying inference to generate a plurality of ranked labels based on the area ratios; and outputting a detected dominant class of the scene based on the plurality of ranked labels.
    Type: Grant
    Filed: June 25, 2019
    Date of Patent: February 23, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Qingfeng Liu, Mostafa El-Khamy, Rama Mythili Vadali, Tae-Ui Kim, Andrea Kang, Dongwoon Bai, Jungwon Lee, Maiyuran Wijay, Jaewon Yoo
  • Publication number: 20210027480
    Abstract: A method of depth detection based on a plurality of video frames includes receiving a plurality of input frames including a first input frame, a second input frame, and a third input frame respectively corresponding to different capture times, convolving the first to third input frames to generate a first feature map, a second feature map, and a third feature map corresponding to the different capture times, calculating a temporal attention map based on the first to third feature maps, the temporal attention map including a plurality of weights corresponding to different pairs of feature maps from among the first to third feature maps, each weight of the plurality of weights indicating a similarity level of a corresponding pair of feature maps, and applying the temporal attention map to the first to third feature maps to generate a feature map with temporal attention.
    Type: Application
    Filed: April 6, 2020
    Publication date: January 28, 2021
    Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee
  • Patent number: 10886943
    Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: January 5, 2021
    Inventors: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
  • Publication number: 20200411030
    Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.
    Type: Application
    Filed: September 9, 2020
    Publication date: December 31, 2020
    Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
  • Publication number: 20200401870
    Abstract: Apparatuses and methods of manufacturing same, systems, and methods for generating a convolutional neural network (CNN) are described. In one aspect, a minimal CNN having, e.g., three or more layers is trained. Cascade training may be performed on the trained CNN to insert one or more intermediate layers until a training error is less than a threshold. When cascade training is complete, cascade network trimming of the CNN output from the cascade training may be performed to improve computational efficiency. To further reduce network parameters, convolutional filters may be replaced with dilated convolutional filters with the same receptive field, followed by additional training/fine-tuning.
    Type: Application
    Filed: August 31, 2020
    Publication date: December 24, 2020
    Inventors: Haoyu REN, Mostafa EL-KHAMY, Jungwon LEE
  • Publication number: 20200395955
    Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.
    Type: Application
    Filed: September 1, 2020
    Publication date: December 17, 2020
    Inventors: Yoo Jin CHOI, Mostafa EL-KHAMY, Jungwon LEE
  • Patent number: 10862630
    Abstract: Apparatuses (including user equipment (UE) and modem chips for UEs), systems, and methods for UE downlink Hybrid Automatic Repeat reQuest (HARQ) buffer memory management are described. In one method, the entire UE DL HARQ buffer memory space is pre-partitioned according to the number and capacities of the UE's active carrier components. In another method, the UE DL HARQ buffer is split between on-chip and off-chip memory so that each partition and sub-partition is allocated between the on-chip and off-chip memories in accordance with an optimum ratio.
    Type: Grant
    Filed: September 8, 2015
    Date of Patent: December 8, 2020
    Inventors: Mostafa El-Khamy, Arvind Yedla, SangHyuck Ha, Hyunsang Cho, Inyup Kang
  • Publication number: 20200357425
    Abstract: A method and system for providing Gaussian weighted self-attention for speech enhancement are herein provided. According to one embodiment, the method includes receiving a input noise signal, generating a score matrix based on the received input noise signal, and applying a Gaussian weighted function to the generated score matrix.
    Type: Application
    Filed: October 2, 2019
    Publication date: November 12, 2020
    Inventors: Jaeyoung KIM, Mostafa EL-KHAMY, Jungwon LEE
  • Patent number: 10810482
    Abstract: An apparatus and a method. The apparatus includes a plurality of long short term memory (LSTM) networks, wherein each of the plurality of LSTM networks is at a different network layer, wherein each of the plurality of LSTM networks is configured to determine a residual function, wherein each of the plurality of LSTM networks includes an output gate to control what is provided to a subsequent LSTM network, and wherein each of the plurality of LSTM networks includes at least one highway connection to compensate for the residual function of a previous LSTM network.
    Type: Grant
    Filed: November 4, 2016
    Date of Patent: October 20, 2020
    Assignee: Samsung Electronics Co., Ltd
    Inventors: JaeYoung Kim, Jungwon Lee, Mostafa El-Khamy
  • Publication number: 20200327685
    Abstract: A method and system for determining depth information of an image are herein provided. According to one embodiment, the method includes receiving an image input, classifying the input image into a depth range of a plurality of depth ranges, and determining a depth map of the image by applying depth estimation based on the depth range into which the input image is classified.
    Type: Application
    Filed: September 18, 2019
    Publication date: October 15, 2020
    Inventors: Haoyu REN, Mostafa El-Khamy, Jungwon Lee
  • Patent number: 10803378
    Abstract: Apparatuses and methods of manufacturing same, systems, and methods for generating a convolutional neural network (CNN) are described. In one aspect, a minimal CNN having, e.g., three or more layers is trained. Cascade training may be performed on the trained CNN to insert one or more intermediate layers until a training error is less than a threshold. When cascade training is complete, cascade network trimming of the CNN output from the cascade training may be performed to improve computational efficiency. To further reduce network parameters, convolutional filters may be replaced with dilated convolutional filters with the same receptive field, followed by additional training/fine-tuning.
    Type: Grant
    Filed: July 20, 2017
    Date of Patent: October 13, 2020
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee
  • Patent number: 10803881
    Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.
    Type: Grant
    Filed: September 17, 2019
    Date of Patent: October 13, 2020
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
  • Publication number: 20200312346
    Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.
    Type: Application
    Filed: January 23, 2020
    Publication date: October 1, 2020
    Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
  • Publication number: 20200312345
    Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.
    Type: Application
    Filed: September 17, 2019
    Publication date: October 1, 2020
    Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
  • Publication number: 20200304147
    Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.
    Type: Application
    Filed: September 19, 2019
    Publication date: September 24, 2020
    Inventors: Yoo Jin CHOI, Mostafa EL-KHAMY, Jungwon LEE
  • Publication number: 20200294217
    Abstract: A method and an apparatus are provided. The method includes receiving a video with a first plurality of frames having a first resolution; generating a plurality of warped frames from the first plurality of frames based on a first type of motion compensation; generating a second plurality of frames having a second resolution, wherein the second resolution is of higher resolution than the first resolution, wherein each of the second plurality of frames having the second resolution is derived from a subset of the plurality of warped frames using a convolutional network; and generating a third plurality of frames having the second resolution based on a second type of motion compensation, wherein each of the third plurality of frames having the second resolution is derived from a fusing a subset of the second plurality of frames.
    Type: Application
    Filed: May 29, 2020
    Publication date: September 17, 2020
    Inventors: Mostafa EL-KHAMY, Haoyu REN, Jungwon LEE
  • Publication number: 20200285894
    Abstract: A method and apparatus for providing a rotational invariant neural network is herein disclosed. According to one embodiment, a method includes receiving a first input of an image in a first orientation and training a kernel to be symmetric such that an output corresponding to the first input is the same as an output corresponding to a second input of the image in a second orientation.
    Type: Application
    Filed: June 25, 2019
    Publication date: September 10, 2020
    Inventors: Mostafa EL-KHAMY, Jungwon LEE, Yoo Jin CHOI, Haoyu REN
  • Patent number: 10733714
    Abstract: A method and an apparatus are provided. The method includes receiving a video with a first plurality of frames having a first resolution; generating a plurality of warped frames from the first plurality of frames based on a first type of motion compensation; generating a second plurality of frames having a second resolution, wherein the second resolution is of higher resolution than the first resolution, wherein each of the second plurality of frames having the second resolution is derived from a subset of the plurality of warped frames using a convolutional network; and generating a third plurality of frames having the second resolution based on a second type of motion compensation, wherein each of the third plurality of frames having the second resolution is derived from a fusing a subset of the second plurality of frames.
    Type: Grant
    Filed: April 5, 2018
    Date of Patent: August 4, 2020
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Mostafa El-Khamy, Haoyu Ren, Jungwon Lee
  • Patent number: 10726525
    Abstract: An image denoising neural network training architecture includes an image denoising neural network and a clean data neural network, and the image denoising neural network and clean data neural network share information between each other.
    Type: Grant
    Filed: March 20, 2018
    Date of Patent: July 28, 2020
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Mostafa El-Khamy, Igor Fedorov, Jungwon Lee
  • Publication number: 20200227070
    Abstract: A method and system for providing end-to-end multi-task denoising for joint signal distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) optimization is herein disclosed. According to one embodiment, an method includes receiving a noisy signal, generating a denoised output signal, determining a signal distortion ratio (SDR) loss function based on the denoised output signal, determining a perceptual evaluation of speech quality (PESQ) loss function based on the denoised output signal, and optimizing an overall loss function based on the PESQ loss function and the SDR loss function.
    Type: Application
    Filed: June 25, 2019
    Publication date: July 16, 2020
    Inventors: Jaeyoung KIM, Mostafa EL-KHAMY, Jungwon LEE