Patents by Inventor Adam Marek

Adam Marek has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUGMENTING DATASETS FOR TRAINING AUDIO GENERATION MODELS

Publication number: 20250191573

Abstract: A target voice dataset may be augmented using speech prediction. Encoder and decoder models may be trained to encode audio data into encoded speech data and convert it back to audio. The encoded units may include semantic information (e.g., phonemes and/or words) as well as feature data indicating prosody, timbre, speaker identity, speech style, emotion, etc. of speech. An acoustic/semantic language model (ASLM) may be configured to predict encoded speech data in a manner analogous to a language model predicting words; for example, based on preceding encoded speech data. The models may be used to generate synthesized speech samples having voice characteristics (e.g., feature data) similar to those of the target voice dataset. The augmented dataset may be used to train a text-to-speech (TTS) model to reproduce the target voice characteristics, and may improve performance of the TTS model over training with only the original target voice dataset.

Type: Application

Filed: February 24, 2025

Publication date: June 12, 2025

Inventors: Mateusz Aleksander Lajszczak, Adam Marek Gabrys, Arent van Korlaar, Ruizhe Li, Elena Sergeevna Sokolova, Jaime Lorenzo Trueba, Arnaud Vincent Pierre Yves Joly, Marco Nicolis, Ekaterina Petrova
Augmenting datasets for training audio generation models

Patent number: 12254864

Abstract: A target voice dataset may be augmented using speech prediction. Encoder and decoder models may be trained to encode audio data into encoded speech data, and convert it back to audio. The encoded units may include semantic information (e.g., phonemes and/or words) as well as feature data indicating prosody, timbre, speaker identity, speech style, emotion, etc. of speech. An acoustic/semantic language model (ASLM) may be configured to predict encoded speech data in a manner analogous to a language model predicting words; for example, based on preceding encoded speech data. The models may be used to generate synthesized speech samples having voice characteristics (e.g., feature data) similar to those of the target voice dataset. The augmented dataset may be used to train a text-to-speech (TTS) model to reproduce the target voice characteristics, and may improve performance of the TTS model over training with only the original target voice dataset.

Type: Grant

Filed: June 30, 2022

Date of Patent: March 18, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Mateusz Aleksander Lajszczak, Adam Marek Gabrys, Arent van Korlaar, Ruizhe Li, Elena Sergeevna Sokolova, Jaime Lorenzo Trueba, Arnaud Vincent Pierre Yves Joly, Marco Nicolis, Ekaterina Petrova
Voice adaptation using synthetic speech processing

Patent number: 11915683

Abstract: A text-to-speech (TTS) system may be configured to imitate characteristics of a target voice based on a limited dataset. The TTS system may include a machine learning model pre-trained using a synthetic parallel dataset and fine-tuned using examples of the target voice. A TTS component trained using a large single-speaker dataset may be used to generate the synthetic parallel dataset based on a multi-speaker dataset. The synthetic parallel dataset may include target audio data representing speech in the multi-speaker dataset and predicted audio data generated by the TTS component based on transcripts of the speech. The machine learning model may be pre-trained using the synthetic parallel dataset and fine-tuned using audio data representing target voice speech and predicted audio generated by the TTS component based on transcripts of the target voice speech. The trained model may be used to modify synthetic speech to approximate the characteristics of the target speech.

Type: Grant

Filed: February 14, 2022

Date of Patent: February 27, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Adam Marek Gabrys, Jaime Lorenzo Trueba, Goeric Sydney Huybrechts
Optical feedback for visual recognition authentication

Patent number: 11836827

Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.

Type: Grant

Filed: April 23, 2021

Date of Patent: December 5, 2023

Assignee: McAfee, LLC

Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
VOICE ADAPTATION USING SYNTHETIC SPEECH PROCESSING

Publication number: 20230260502

Abstract: A text-to-speech (TTS) system may be configured to imitate characteristics of a target voice based on a limited dataset. The TTS system may include a machine learning model pre-trained using a synthetic parallel dataset and fine-tuned using examples of the target voice. A TTS component trained using a large single-speaker dataset may be used to generate the synthetic parallel dataset based on a multi-speaker dataset. The synthetic parallel dataset may include target audio data representing speech in the multi-speaker dataset and predicted audio data generated by the TTS component based on transcripts of the speech. The machine learning model may be pre-trained using the synthetic parallel dataset and fine-tuned using audio data representing target voice speech and predicted audio generated by the TTS component based on transcripts of the target voice speech. The trained model may be used to modify synthetic speech to approximate the characteristics of the target speech.

Type: Application

Filed: February 14, 2022

Publication date: August 17, 2023

Inventors: Adam Marek Gabrys, Jaime Lorenzo Trueba, Goeric Sydney Huybrechts
Apparatus and methods for determining multi-subject performance metrics in a three-dimensional space

Patent number: 11715213

Abstract: Apparatus and methods for extraction and calculation of multi-person performance metrics in a three-dimensional space. An example apparatus includes a detector to identify a first subject in a first image captured by a first image capture device based on a first set of two-dimensional kinematic keypoints in the first image, the two-dimensional kinematic keypoints corresponding to a joint of the first subject, the first image capture device associated with a first view of the first subject, a multi-view associator to verify the first subject using the first image and a second image captured by a second image capture device, the second image capture device associated with a second view of the first subject, the second view different than the first view, and a keypoint generator to generate three-dimensional keypoints for the first subject using the first set of two-dimensional kinematic keypoints.

Type: Grant

Filed: June 26, 2020

Date of Patent: August 1, 2023

Assignee: Intel Corporation

Inventors: Nelson Leung, Jonathan K. Lee, Bridget L. Williams, Sameer Sheorey, Amery Cong, Mehrnaz Khodam Hazrati, Sabar Mourad Souag, Adam Marek, Pawel Pieniazek, Bogna Bylicka, Jakub Powierza, Anna Banaszczyk-fiszer
Methods of modelling systems or performing predictive maintenance of lithographic systems

Patent number: 11543814

Abstract: Predictive maintenance methods and systems, including a method of applying transfer entropy techniques to find a causal link between parameters; a method of applying quality weighting to context data based on a priori knowledge of the accuracy of the context data; a method of detecting a maintenance action from parameter data by detecting a step and a process capability improvement; a method of managing unattended alerts by considering cost/benefit of attending to one or more alerts over time and assigning alert expiry time and/or ranking the alerts accordingly; a method of displaying components of a complex system in a functional way enabling improvements in system diagnostics; a method of determining the time of an event indicator in time series parameter data; a method of classifying an event associated with a fault condition occurring within a system; and a method of determining whether an event recorded in parameter data is attributable to an external factor.

Type: Grant

Filed: September 13, 2016

Date of Patent: January 3, 2023

Assignee: ASML NETHERLANDS B.V.

Inventors: David Evert Song Kook Sigtermans, René Fussenich, Adam Marek Kielczewski, Errol Arthur Zalmijn, Marcel Richard André Brunt, Stefan Lucian Voinea
OPTICAL FEEDBACK FOR VISUAL RECOGNITION AUTHENTICATION

Publication number: 20210241409

Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.

Type: Application

Filed: April 23, 2021

Publication date: August 5, 2021

Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
Optical feedback for visual recognition authentication

Patent number: 11004168

Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.

Type: Grant

Filed: March 29, 2019

Date of Patent: May 11, 2021

Assignee: MCAFEE, LLC

Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
APPARATUS AND METHODS FOR DETERMINING MULTI-SUBJECT PERFORMANCE METRICS IN A THREE-DIMENSIONAL SPACE

Publication number: 20200401793

Abstract: Apparatus and methods for extraction and calculation of multi-person performance metrics in a three-dimensional space. An example apparatus includes a detector to identify a first subject in a first image captured by a first image capture device based on a first set of two-dimensional kinematic keypoints in the first image, the two-dimensional kinematic keypoints corresponding to a joint of the first subject, the first image capture device associated with a first view of the first subject, a multi-view associator to verify the first subject using the first image and a second image captured by a second image capture device, the second image capture device associated with a second view of the first subject, the second view different than the first view, and a keypoint generator to generate three-dimensional keypoints for the first subject using the first set of two-dimensional kinematic keypoints.

Type: Application

Filed: June 26, 2020

Publication date: December 24, 2020

Inventors: Nelson Leung, Jonathan K. Lee, Bridget L. Williams, Sameer Sheorey, Amery Cong, Mehrnaz Khodam Hazrati, Mourad S. Souag, Adam Marek, Pawel Pieniazek, Bogna Bylicka, Jakub Powierza, Anna Banaszczyk-fiszer
Methods and apparatus to enhance security of authentication

Patent number: 10778667

Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.

Type: Grant

Filed: November 15, 2019

Date of Patent: September 15, 2020

Assignee: McAfee, LLC

Inventors: Igor Muttik, Adam Marek, Alex Nayshtut
Cepstral variance normalization for audio feature extraction

Patent number: 10629184

Abstract: Cepstral variance normalization is described for audio feature extraction.

Type: Grant

Filed: December 22, 2014

Date of Patent: April 21, 2020

Assignee: Intel Corporation

Inventors: Tobias Bocklet, Adam Marek
METHODS AND APPARATUS TO ENHANCE SECURITY OF AUTHENTICATION

Publication number: 20200092274

Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.

Type: Application

Filed: November 15, 2019

Publication date: March 19, 2020

Inventors: Igor Muttik, Adam Marek, Alex Nayshtut
Methods and apparatus to enhance security of authentication

Patent number: 10516657

Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.

Type: Grant

Filed: April 24, 2014

Date of Patent: December 24, 2019

Assignee: McAfee, LLC

Inventors: Igor Muttik, Adam Marek, Alex Nayshtut
OPTICAL FEEDBACK FOR VISUAL RECOGNITION AUTHENTICATION

Publication number: 20190228496

Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.

Type: Application

Filed: March 29, 2019

Publication date: July 25, 2019

Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
Optical feedback for visual recognition authentication

Patent number: 10296998

Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.

Type: Grant

Filed: November 10, 2016

Date of Patent: May 21, 2019

Assignee: MCAFEE, LLC

Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
CEPSTRAL VARIANCE NORMALIZATION FOR AUDIO FEATURE EXTRACTION

Publication number: 20180322863

Abstract: Cepstral variance normalization is described for audio feature extraction.

Type: Application

Filed: December 22, 2014

Publication date: November 8, 2018

Inventors: TOBIAS BOCKLET, ADAM MAREK
METHODS OF MODELLING SYSTEMS OR PERFORMING PREDICTIVE MAINTENANCE OF LITHOGRAPHIC SYSTEMS

Publication number: 20180267523

Abstract: Predictive maintenance methods and systems, including a method of applying transfer entropy techniques to find a causal link between parameters; a method of applying quality weighting to context data based on a priori knowledge of the accuracy of the context data; a method of detecting a maintenance action from parameter data by detecting a step and a process capability improvement; a method of managing unattended alerts by considering cost/benefit of attending to one or more alerts over time and assigning alert expiry time and/or ranking the alerts accordingly; a method of displaying components of a complex system in a functional way enabling improvements in system diagnostics; a method of determining the time of an event indicator in time series parameter data; a method of classifying an event associated with a fault condition occurring within a system; and a method of determining whether an event recorded in parameter data is attributable to an external factor.

Type: Application

Filed: September 13, 2016

Publication date: September 20, 2018

Applicant: ASML NETHERLANDS B.V.

Inventors: David Evert Song Kook SIGTERMANS, René FUSSENICH, Adam Marek KIELCZEWSKI, Errol Arthur ZALMIJN, Marcel Richard André BRUNT, Stefan Lucian VOINEA
Centralized secure device pairing

Patent number: 10038679

Abstract: Various embodiments are generally directed to pairing computing devices for collaborative interaction via a network through a centralized secure device pairing service. An apparatus comprises a controller processor circuit, and a controller storage communicatively coupled to the controller processor circuit to store an initial private key and to store instructions that when executed by the controller processor circuit cause the controller processor circuit to create a first signature using the initial private key, transmit the first signature to an issuing server via a network, receive a group public key and an associated member private key from the issuing server, create a second signature using the member private key, transmit the second signature to a member device via the network; receive a third signature from the member device; and authenticate the third signature using the group public key. Other embodiments are described and claimed herein.

Type: Grant

Filed: December 24, 2012

Date of Patent: July 31, 2018

Assignee: INTEL CORPORATION

Inventor: Adam Marek
Intermediate scoring and rejection loopback for improved key phrase detection

Patent number: 9972313

Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include intermediate scoring of a state or states of a key phrase model and/or a backward transition or rejection loopback from a state of the key phrase model to a rejection model to reduce false accepts based on received utterances.

Type: Grant

Filed: March 1, 2016

Date of Patent: May 15, 2018

Assignee: Intel Corporation

Inventors: Tobias Bocklet, Adam Marek, Tomasz Dorau, Przemyslaw Sobon

1 2 next