Patents by Inventor Mostafa El-Khamy

Mostafa El-Khamy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for providing dominant scene classification by semantic segmentation

Patent number: 10929665

Abstract: A method for computing a dominant class of a scene includes: receiving an input image of a scene; generating a segmentation map of the input image, the segmentation map including a plurality of pixels, each of the pixels being labeled with a corresponding class of a plurality of classes; computing a plurality of area ratios based on the segmentation map, each of the area ratios corresponding to a different class of the plurality of classes of the segmentation map; applying inference to generate a plurality of ranked labels based on the area ratios; and outputting a detected dominant class of the scene based on the plurality of ranked labels.

Type: Grant

Filed: June 25, 2019

Date of Patent: February 23, 2021

Assignee: Samsung Electronics Co., Ltd.

Inventors: Qingfeng Liu, Mostafa El-Khamy, Rama Mythili Vadali, Tae-Ui Kim, Andrea Kang, Dongwoon Bai, Jungwon Lee, Maiyuran Wijay, Jaewon Yoo
VIDEO DEPTH ESTIMATION BASED ON TEMPORAL ATTENTION

Publication number: 20210027480

Abstract: A method of depth detection based on a plurality of video frames includes receiving a plurality of input frames including a first input frame, a second input frame, and a third input frame respectively corresponding to different capture times, convolving the first to third input frames to generate a first feature map, a second feature map, and a third feature map corresponding to the different capture times, calculating a temporal attention map based on the first to third feature maps, the temporal attention map including a plurality of weights corresponding to different pairs of feature maps from among the first to third feature maps, each weight of the plurality of weights indicating a similarity level of a corresponding pair of feature maps, and applying the temporal attention map to the first to third feature maps to generate a feature map with temporal attention.

Type: Application

Filed: April 6, 2020

Publication date: January 28, 2021

Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee
Method and apparatus for variable rate compression with a conditional autoencoder

Patent number: 10886943

Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.

Type: Grant

Filed: September 19, 2019

Date of Patent: January 5, 2021

Inventors: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS

Publication number: 20200411030

Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.

Type: Application

Filed: September 9, 2020

Publication date: December 31, 2020

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
SYSTEM AND METHOD FOR DESIGNING EFFICIENT SUPER RESOLUTION DEEP CONVOLUTIONAL NEURAL NETWORKS BY CASCADE NETWORK TRAINING, CASCADE NETWORK TRIMMING, AND DILATED CONVOLUTIONS

Publication number: 20200401870

Abstract: Apparatuses and methods of manufacturing same, systems, and methods for generating a convolutional neural network (CNN) are described. In one aspect, a minimal CNN having, e.g., three or more layers is trained. Cascade training may be performed on the trained CNN to insert one or more intermediate layers until a training error is less than a threshold. When cascade training is complete, cascade network trimming of the CNN output from the cascade training may be performed to improve computational efficiency. To further reduce network parameters, convolutional filters may be replaced with dilated convolutional filters with the same receptive field, followed by additional training/fine-tuning.

Type: Application

Filed: August 31, 2020

Publication date: December 24, 2020

Inventors: Haoyu REN, Mostafa EL-KHAMY, Jungwon LEE
METHOD AND APPARATUS FOR VARIABLE RATE COMPRESSION WITH A CONDITIONAL AUTOENCODER

Publication number: 20200395955

Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.

Type: Application

Filed: September 1, 2020

Publication date: December 17, 2020

Inventors: Yoo Jin CHOI, Mostafa EL-KHAMY, Jungwon LEE
Method and system for contiguous HARQ memory management with memory splitting

Patent number: 10862630

Abstract: Apparatuses (including user equipment (UE) and modem chips for UEs), systems, and methods for UE downlink Hybrid Automatic Repeat reQuest (HARQ) buffer memory management are described. In one method, the entire UE DL HARQ buffer memory space is pre-partitioned according to the number and capacities of the UE's active carrier components. In another method, the UE DL HARQ buffer is split between on-chip and off-chip memory so that each partition and sub-partition is allocated between the on-chip and off-chip memories in accordance with an optimum ratio.

Type: Grant

Filed: September 8, 2015

Date of Patent: December 8, 2020

Inventors: Mostafa El-Khamy, Arvind Yedla, SangHyuck Ha, Hyunsang Cho, Inyup Kang
TRANSFORMER WITH GUASSIAN WEIGHTED SELF-ATTENTION FOR SPEECH ENHANCEMENT

Publication number: 20200357425

Abstract: A method and system for providing Gaussian weighted self-attention for speech enhancement are herein provided. According to one embodiment, the method includes receiving a input noise signal, generating a score matrix based on the received input noise signal, and applying a Gaussian weighted function to the generated score matrix.

Type: Application

Filed: October 2, 2019

Publication date: November 12, 2020

Inventors: Jaeyoung KIM, Mostafa EL-KHAMY, Jungwon LEE
System and method for residual long short term memories (LSTM) network

Patent number: 10810482

Abstract: An apparatus and a method. The apparatus includes a plurality of long short term memory (LSTM) networks, wherein each of the plurality of LSTM networks is at a different network layer, wherein each of the plurality of LSTM networks is configured to determine a residual function, wherein each of the plurality of LSTM networks includes an output gate to control what is provided to a subsequent LSTM network, and wherein each of the plurality of LSTM networks includes at least one highway connection to compensate for the residual function of a previous LSTM network.

Type: Grant

Filed: November 4, 2016

Date of Patent: October 20, 2020

Assignee: Samsung Electronics Co., Ltd

Inventors: JaeYoung Kim, Jungwon Lee, Mostafa El-Khamy
SYSTEM AND METHOD FOR PROVIDING SINGLE IMAGE DEPTH ESTIMATION BASED ON DEEP NEURAL NETWORK

Publication number: 20200327685

Abstract: A method and system for determining depth information of an image are herein provided. According to one embodiment, the method includes receiving an image input, classifying the input image into a depth range of a plurality of depth ranges, and determining a depth map of the image by applying depth estimation based on the depth range into which the input image is classified.

Type: Application

Filed: September 18, 2019

Publication date: October 15, 2020

Inventors: Haoyu REN, Mostafa El-Khamy, Jungwon Lee
System and method for designing efficient super resolution deep convolutional neural networks by cascade network training, cascade network trimming, and dilated convolutions

Patent number: 10803378

Abstract: Apparatuses and methods of manufacturing same, systems, and methods for generating a convolutional neural network (CNN) are described. In one aspect, a minimal CNN having, e.g., three or more layers is trained. Cascade training may be performed on the trained CNN to insert one or more intermediate layers until a training error is less than a threshold. When cascade training is complete, cascade network trimming of the CNN output from the cascade training may be performed to improve computational efficiency. To further reduce network parameters, convolutional filters may be replaced with dilated convolutional filters with the same receptive field, followed by additional training/fine-tuning.

Type: Grant

Filed: July 20, 2017

Date of Patent: October 13, 2020

Assignee: Samsung Electronics Co., Ltd

Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee
System and method for acoustic echo cancelation using deep multitask recurrent neural networks

Patent number: 10803881

Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.

Type: Grant

Filed: September 17, 2019

Date of Patent: October 13, 2020

Assignee: Samsung Electronics Co., Ltd.

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELLATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS

Publication number: 20200312346

Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.

Type: Application

Filed: January 23, 2020

Publication date: October 1, 2020

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS

Publication number: 20200312345

Abstract: A method for performing echo cancellation includes: receiving a far-end signal from a far-end device at a near-end device; recording a microphone signal at the near-end device including: a near-end signal; and an echo signal corresponding to the far-end signal; extracting far-end features from the far-end signal; extracting microphone features from the microphone signal; computing estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including: an echo estimator including a first stack of a recurrent neural network configured to compute estimated echo features based on the far-end features; and a near-end estimator including a second stack of the recurrent neural network configured to compute the estimated near-end features based on an output of the first stack and the microphone signal; computing an estimated near-end signal from the estimated near-end features; and transmitting the estimated near-end signal to the far-end device.

Type: Application

Filed: September 17, 2019

Publication date: October 1, 2020

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
METHOD AND APPARATUS FOR VARIABLE RATE COMPRESSION WITH A CONDITIONAL AUTOENCODER

Publication number: 20200304147

Abstract: A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.

Type: Application

Filed: September 19, 2019

Publication date: September 24, 2020

Inventors: Yoo Jin CHOI, Mostafa EL-KHAMY, Jungwon LEE
METHOD AND APPARATUS FOR VIDEO SUPER RESOLUTION USING CONVOLUTIONAL NEURAL NETWORK WITH TWO-STAGE MOTION COMPENSATION

Publication number: 20200294217

Abstract: A method and an apparatus are provided. The method includes receiving a video with a first plurality of frames having a first resolution; generating a plurality of warped frames from the first plurality of frames based on a first type of motion compensation; generating a second plurality of frames having a second resolution, wherein the second resolution is of higher resolution than the first resolution, wherein each of the second plurality of frames having the second resolution is derived from a subset of the plurality of warped frames using a convolutional network; and generating a third plurality of frames having the second resolution based on a second type of motion compensation, wherein each of the third plurality of frames having the second resolution is derived from a fusing a subset of the second plurality of frames.

Type: Application

Filed: May 29, 2020

Publication date: September 17, 2020

Inventors: Mostafa EL-KHAMY, Haoyu REN, Jungwon LEE
METHOD AND APPARATUS FOR PROVIDING ROTATIONAL INVARIANT NEURAL NETWORKS

Publication number: 20200285894

Abstract: A method and apparatus for providing a rotational invariant neural network is herein disclosed. According to one embodiment, a method includes receiving a first input of an image in a first orientation and training a kernel to be symmetric such that an output corresponding to the first input is the same as an output corresponding to a second input of the image in a second orientation.

Type: Application

Filed: June 25, 2019

Publication date: September 10, 2020

Inventors: Mostafa EL-KHAMY, Jungwon LEE, Yoo Jin CHOI, Haoyu REN
Method and apparatus for video super resolution using convolutional neural network with two-stage motion compensation

Patent number: 10733714

Abstract: A method and an apparatus are provided. The method includes receiving a video with a first plurality of frames having a first resolution; generating a plurality of warped frames from the first plurality of frames based on a first type of motion compensation; generating a second plurality of frames having a second resolution, wherein the second resolution is of higher resolution than the first resolution, wherein each of the second plurality of frames having the second resolution is derived from a subset of the plurality of warped frames using a convolutional network; and generating a third plurality of frames having the second resolution based on a second type of motion compensation, wherein each of the third plurality of frames having the second resolution is derived from a fusing a subset of the second plurality of frames.

Type: Grant

Filed: April 5, 2018

Date of Patent: August 4, 2020

Assignee: Samsung Electronics Co., Ltd

Inventors: Mostafa El-Khamy, Haoyu Ren, Jungwon Lee
Image denoising neural network architecture and method of training the same

Patent number: 10726525

Abstract: An image denoising neural network training architecture includes an image denoising neural network and a clean data neural network, and the image denoising neural network and clean data neural network share information between each other.

Type: Grant

Filed: March 20, 2018

Date of Patent: July 28, 2020

Assignee: Samsung Electronics Co., Ltd.

Inventors: Mostafa El-Khamy, Igor Fedorov, Jungwon Lee
END-TO-END MULTI-TASK DENOISING FOR JOINT SIGNAL DISTORTION RATIO (SDR) AND PERCEPTUAL EVALUATION OF SPEECH QUALITY (PESQ) OPTIMIZATION

Publication number: 20200227070

Abstract: A method and system for providing end-to-end multi-task denoising for joint signal distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) optimization is herein disclosed. According to one embodiment, an method includes receiving a noisy signal, generating a denoised output signal, determining a signal distortion ratio (SDR) loss function based on the denoised output signal, determining a perceptual evaluation of speech quality (PESQ) loss function based on the denoised output signal, and optimizing an overall loss function based on the PESQ loss function and the SDR loss function.

Type: Application

Filed: June 25, 2019

Publication date: July 16, 2020

Inventors: Jaeyoung KIM, Mostafa EL-KHAMY, Jungwon LEE

prev 1 2 3 4 5 6 7 8 9 … next