Patents by Inventor Mostafa El-Khamy

Mostafa El-Khamy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND APPARATUS FOR DATA-FREE POST-TRAINING NETWORK QUANTIZATION AND GENERATING SYNTHETIC DATA BASED ON A PRE-TRAINED MACHINE LEARNING MODEL

Publication number: 20250086457

Abstract: A method for training a generator, by a generator training system including a processor and memory, includes: extracting training statistical characteristics from a batch normalization layer of a pre-trained model, the training statistical characteristics including a training mean ? and a training variance ?2; initializing a generator configured with generator parameters; generating a batch of synthetic data using the generator; supplying the batch of synthetic data to the pre-trained model; measuring statistical characteristics of activations at the batch normalization layer and at the output of the pre-trained model in response to the batch of synthetic data, the statistical characteristics including a measured mean, and {circumflex over (?)}? measured variance {circumflex over (?)}?2; computing a training loss in accordance with a loss function L? based on ?, ?2, {circumflex over (?)}?, and {circumflex over (?)}?2; and iteratively updating the generator parameters in accordance with the training loss until

Type: Application

Filed: November 22, 2024

Publication date: March 13, 2025

Inventors: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
ATTENTIVE SENSING FOR EFFICIENT MULTIMODAL GESTURE RECOGNITION

Publication number: 20250087025

Abstract: A system and a method for performing gesture recognition are disclosed, the method comprising detecting a gesture using a primary modality; evaluating an expected accuracy gain (EAG) to identify a modality that yields a maximum relative EAG among the primary modality and one or more secondary modalities; and activating the one or more secondary modalities for detecting the gesture if the one or more secondary modalities correspond to the modality that yields the maximum relative EAG.

Type: Application

Filed: September 11, 2024

Publication date: March 13, 2025

Inventors: Mostafa EL-KHAMY, Soheil HOR, Yanlin ZHOU
Method and apparatus for reducing computational complexity of convolutional neural networks

Patent number: 12248866

Abstract: A convolutional neural network (CNN) system for generating a classification for an input image is presented. The CNN system comprises circuitry running on clock cycles and configured to compute a product of two received values, and at least one non-transitory computer-readable medium that stores instructions for the circuitry to derive a feature map based on at least the input image; puncture at least one selection among the feature map and a kernel by setting the value of an element at an index of the at least one selection to zero and cyclic shifting a puncture pattern to achieve a 1/d reduction in number of clock cycles, where d is an integer and puncture interval value >1. The feature map is convolved with the kernel to generate an output, and a classification of the input image is generated based on the output.

Type: Grant

Filed: September 13, 2021

Date of Patent: March 11, 2025

Assignee: Samsung Electronics Co., Ltd

Inventors: Mostafa El-Khamy, Yoo Jin Choi, Jungwon Lee
Method and apparatus for data efficient semantic segmentation

Patent number: 12242964

Abstract: A method and system for training a neural network are provided. The method includes receiving an input image, selecting at least one data augmentation method from a pool of data augmentation methods, generating an augmented image by applying the selected at least one data augmentation method to the input image, and generating a mixed image from the input image and the augmented image.

Type: Grant

Filed: June 26, 2023

Date of Patent: March 4, 2025

Assignee: Samsung Electronics Co., Ltd

Inventors: Qingfeng Liu, Mostafa El-Khamy, Jungwon Lee, Behnam Babagholami Mohamadabadi
TEMPORAL CONSISTENCY VIA BACKBONE FEATURE FUSION FOR PIXEL-LEVEL PREDICTION TASKS

Publication number: 20250069247

Abstract: Methods and systems for performing video prediction, including obtaining an input frame from among a plurality of frames included in an input video; extracting a first feature map by providing the input frame to a first plurality of feature extraction layers and a first strided convolutional layer included in an encoder; providing the first feature map and at least one neighboring first feature map corresponding to at least one neighboring frame to a first fusion module included in the encoder; fusing the first feature map with the at least one neighboring first feature map to generate a fused first feature map using the first fusion module; generating a prediction corresponding to the input frame based on the fused first feature map using a decoder; and performing a video prediction task using the prediction.

Type: Application

Filed: November 11, 2024

Publication date: February 27, 2025

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Haoyu REN, Mostafa EL-KHAMY, Jungwon LEE, Hai SU, Qingfeng LIU
Method and apparatus for federated learning

Patent number: 12236370

Abstract: Methods and devices are provided for performing federated learning. A global model is distributed from a server to a plurality of client devices. At each of the plurality of client devices: model inversion is performed on the global model to generate synthetic data; the global model is on an augmented dataset of collected data and the synthetic data to generate a respective client model; and the respective client model is transmitted to the server. At the server: client models are received from the plurality of client devices, where each client model is received from a respective client device of the plurality of client devices; model inversion is performed on each client model to generate a synthetic dataset; the client models are averaged to generate an averaged model; and the averaged model is trained using the synthetic dataset to generate an updated model.

Type: Grant

Filed: February 19, 2021

Date of Patent: February 25, 2025

Assignee: Samsung Electronics Co., Ltd

Inventors: Mostafa El-Khamy, Weituo Hao, Jungwon Lee
SYSTEM AND METHOD FOR HARDWARE-AWARE PRUNING OF CONFORMER NETWORKS

Publication number: 20250053811

Abstract: A system and a method are disclosed for hardware-aware pruning of conformer networks. In some embodiments, the method includes: training a neural network, the training including: performing a first pruning operation, on the neural network, after a first training epoch, and performing a second pruning operation, on the neural network, after a second training epoch and after the first pruning operation, wherein each of the pruning operations results in a respective pruning fraction, the respective pruning fraction being a function of an index of a training epoch preceding the pruning operation.

Type: Application

Filed: April 18, 2024

Publication date: February 13, 2025

Inventors: Ahmed ELKORDY, Behnam BABAGHOLAMI MOHAMADABADI, Mostafa EL-KHAMY
Method and apparatus for neural network quantization

Patent number: 12190231

Abstract: Apparatuses and methods of manufacturing same, systems, and methods for performing network parameter quantization in deep neural networks are described. In one aspect, multi-dimensional vectors representing network parameters are constructed from a trained neural network model. The multi-dimensional vectors are quantized to obtain shared quantized vectors as cluster centers, which are fine-tuned. The fine-tuned and shared quantized vectors/cluster centers are then encoded. Decoding reverses the process.

Type: Grant

Filed: September 6, 2017

Date of Patent: January 7, 2025

Assignee: Samsung Electronics Co., Ltd

Inventors: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
SYSTEM AND METHOD FOR ACOUSTIC ECHO CANCELATION USING DEEP MULTITASK RECURRENT NEURAL NETWORKS

Publication number: 20240420720

Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.

Type: Application

Filed: August 26, 2024

Publication date: December 19, 2024

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
Method and apparatus for data-free post-training network quantization and generating synthetic data based on a pre-trained machine learning model

Patent number: 12154030

Abstract: A method for training a generator, by a generator training system including a processor and memory, includes: extracting training statistical characteristics from a batch normalization layer of a pre-trained model, the training statistical characteristics including a training mean ? and a training variance ?2; initializing a generator configured with generator parameters; generating a batch of synthetic data using the generator; supplying the batch of synthetic data to the pre-trained model; measuring statistical characteristics of activations at the batch normalization layer and at the output of the pre-trained model in response to the batch of synthetic data, the statistical characteristics including a measured mean {circumflex over (?)}? and a measured variance {circumflex over (?)}?2; computing a training loss in accordance with a loss function L? based on ?, ?2, {circumflex over (?)}?, and {circumflex over (?)}?2; and iteratively updating the generator parameters in accordance with the training loss unti

Type: Grant

Filed: November 12, 2020

Date of Patent: November 26, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
SYSTEMS AND METHODS FOR TRAINING A MODEM ALGORITHM USING FEDERATED LEARNING

Publication number: 20240362494

Abstract: A system and a method are disclosed, the method including receiving, by a first local controller of a first edge device, an input associated with an environment in which the first edge device operates, using a first machine-learning algorithm, determining, by the first local controller, a parameter for a pre-trained modem algorithm of the first edge device based on the input, executing a task on the first edge device based on executing the pre-trained modem algorithm with the parameter, determining a result of executing the task, training the first machine-learning algorithm, generating a first update to the first machine-learning algorithm based on the training, sending the first update to a server, receiving, from the server, a server update to the first machine-learning algorithm, and based on the server update, updating the first machine-learning algorithm.

Type: Application

Filed: November 30, 2023

Publication date: October 31, 2024

Inventor: Mostafa El-Khamy
SYSTEMS AND METHODS FOR TUNING PARAMETERS OF A MACHINE LEARNING MODEL FOR FEDERATED LEARNING

Publication number: 20240362487

Abstract: A system and a method are disclosed for tuning parameters of a large language model. The method comprises identifying first weights of a machine learning (ML) model. Second weights are received from a client device. The second weights may be based on updating, by the client device, the first weights. An update matrix may be generated based on the second weights. The update matrix may be decomposed into first decomposition matrices. Singular values that satisfy a criterion may be identified based on the first decomposition matrices. Singular vectors may be identified based on the singular values. Second decomposition matrices may be identified based on the singular vectors. Updates may be received from the client device of third weights associated with the second decomposition matrices. An updated ML model may be generated based on the updates of the third weights. An inference may be generated based on the updated ML model.

Type: Application

Filed: April 25, 2024

Publication date: October 31, 2024

Inventors: Ahmed Roushdy ElKordy, Sara Babakniya, Qingfeng Liu, Mostafa El-Khamy, Yahya Hussain Ezzeldin Essa, Salman Avestimehr
SYSTEMS AND METHODS FOR TRAINING ARTIFICIAL NEURAL NETWORKS

Publication number: 20240355320

Abstract: A system including: one or more processors; and memory including instructions that, when executed by the one or more processors, cause the one or more processors to: generate augmented input data by mixing noise components of training data; train a first neural network based on the augmented input data and ground truth data of the training data to output a first prediction of clean speech; lock trainable parameters of the first neural network as a result of the training of the first neural network; and train a second neural network according to the augmented input data and predictions of the first neural network to output a second prediction of the clean speech.

Type: Application

Filed: September 21, 2023

Publication date: October 24, 2024

Inventors: Behnam Babagholami Mohamadabadi, Mostafa El-Khamy, Kee-Bong Song
VIDEO DEPTH ESTIMATION BASED ON TEMPORAL ATTENTION

Publication number: 20240346673

Abstract: A method of depth detection based on a plurality of video frames includes receiving a plurality of input frames including a first input frame, a second input frame, and a third input frame respectively corresponding to different capture times, convolving the first to third input frames to generate a first feature map, a second feature map, and a third feature map corresponding to the different capture times, calculating a temporal attention map based on the first to third feature maps, the temporal attention map including a plurality of weights corresponding to different pairs of feature maps from among the first to third feature maps, each weight of the plurality of weights indicating a similarity level of a corresponding pair of feature maps, and applying the temporal attention map to the first to third feature maps to generate a feature map with temporal attention.

Type: Application

Filed: May 28, 2024

Publication date: October 17, 2024

Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee
Transformer with Gaussian weighted self-attention for speech enhancement

Patent number: 12100412

Abstract: A method and system for providing Gaussian weighted self-attention for speech enhancement are herein provided. According to one embodiment, the method includes receiving an input noise signal, generating a score matrix based on the received input noise signal, and applying a Gaussian weighted function to the generated score matrix.

Type: Grant

Filed: December 6, 2021

Date of Patent: September 24, 2024

Assignee: Samsung Electronics Co., Ltd

Inventors: JaeYoung Kim, Mostafa El-Khamy, Jungwon Lee
System and method for acoustic echo cancelation using deep multitask recurrent neural networks

Patent number: 12073847

Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.

Type: Grant

Filed: May 27, 2022

Date of Patent: August 27, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
Multi-expert adversarial regularization for robust and data-efficient deep supervised learning

Patent number: 12051237

Abstract: A system and a method to train a neural network are disclosed. A first image is weakly and strongly augmented. The first image, the weakly and strongly augmented first images are input into a feature extractor to obtain augmented features. Each weakly augmented first image is input to a corresponding first expert head to determine a supervised loss for each weakly augmented first image. Each strongly augmented first image is input to a corresponding second expert head to determine a diversity loss for each strongly augmented first image. The feature extractor is trained to minimize the supervised loss on weakly augmented first images and to minimize a multi-expert consensus loss on strongly augmented first images. Each first expert head is trained to minimize the supervised loss for each weakly augmented first image, and each second expert head is trained to minimize the diversity loss for each strongly augmented first image.

Type: Grant

Filed: February 17, 2022

Date of Patent: July 30, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Behnam Babagholami Mohamadabadi, Qingfeng Liu, Mostafa El-Khamy, Jungwon Lee
System and method for acoustic echo cancelation using deep multitask recurrent neural networks

Patent number: 12033652

Abstract: A system for performing echo cancellation includes: a processor configured to: receive a far-end signal; record a microphone signal including: a near-end signal; and an echo signal corresponding to the far-end signal; extract far-end features from the far-end signal; extract microphone features from the microphone signal; compute estimated near-end features by supplying the microphone features and the far-end features to an acoustic echo cancellation module including a recurrent neural network including: an encoder including a plurality of gated recurrent units; and a decoder including a plurality of gated recurrent units; compute an estimated near-end signal from the estimated near-end features; and transmit the estimated near-end signal to the far-end device. The recurrent neural network may include a contextual attention module; and the recurrent neural network may take, as input, a plurality of error features computed based on the far-end features, the microphone features, and acoustic path parameters.

Type: Grant

Filed: May 27, 2022

Date of Patent: July 9, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Amin Fazeli, Mostafa El-Khamy, Jungwon Lee
System and method for deep machine learning for computer vision applications

Patent number: 12026627

Abstract: A computer vision (CV) training system, includes: a supervised learning system to estimate a supervision output from one or more input images according to a target CV application, and to determine a supervised loss according to the supervision output and a ground-truth of the supervision output; an unsupervised learning system to determine an unsupervised loss according to the supervision output and the one or more input images; a weakly supervised learning system to determine a weakly supervised loss according to the supervision output and a weak label corresponding to the one or more input images; and a joint optimizer to concurrently optimize the supervised loss, the unsupervised loss, and the weakly supervised loss.

Type: Grant

Filed: August 17, 2022

Date of Patent: July 2, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Haoyu Ren, Mostafa El-Khamy, Jungwon Lee, Aman Raj
TEMPORAL SEMANTIC BOUNDARY LOSS FOR VIDEO SEMANTIC SEGMENTATION NETWORKS

Publication number: 20240177318

Abstract: Disclosed is a method including receiving, in a semantic segmentation network, input data from a plurality of frames, computing a ground truth label on the plurality of frames, generating a ground truth temporal semantic boundary map from the ground truth label on the plurality of frames, generating a predicted temporal semantic boundary map based on an output of the input data, and determining a loss based on the ground truth temporal semantic boundary map and the predicted temporal semantic boundary map.

Type: Application

Filed: February 8, 2023

Publication date: May 30, 2024

Inventors: Mostafa EL-KHAMY, Hai SU, Qingfeng LIU

1 2 3 4 5 … next