Patents by Inventor Adam Marek
Adam Marek has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12254864Abstract: A target voice dataset may be augmented using speech prediction. Encoder and decoder models may be trained to encode audio data into encoded speech data, and convert it back to audio. The encoded units may include semantic information (e.g., phonemes and/or words) as well as feature data indicating prosody, timbre, speaker identity, speech style, emotion, etc. of speech. An acoustic/semantic language model (ASLM) may be configured to predict encoded speech data in a manner analogous to a language model predicting words; for example, based on preceding encoded speech data. The models may be used to generate synthesized speech samples having voice characteristics (e.g., feature data) similar to those of the target voice dataset. The augmented dataset may be used to train a text-to-speech (TTS) model to reproduce the target voice characteristics, and may improve performance of the TTS model over training with only the original target voice dataset.Type: GrantFiled: June 30, 2022Date of Patent: March 18, 2025Assignee: Amazon Technologies, Inc.Inventors: Mateusz Aleksander Lajszczak, Adam Marek Gabrys, Arent van Korlaar, Ruizhe Li, Elena Sergeevna Sokolova, Jaime Lorenzo Trueba, Arnaud Vincent Pierre Yves Joly, Marco Nicolis, Ekaterina Petrova
-
Patent number: 11915683Abstract: A text-to-speech (TTS) system may be configured to imitate characteristics of a target voice based on a limited dataset. The TTS system may include a machine learning model pre-trained using a synthetic parallel dataset and fine-tuned using examples of the target voice. A TTS component trained using a large single-speaker dataset may be used to generate the synthetic parallel dataset based on a multi-speaker dataset. The synthetic parallel dataset may include target audio data representing speech in the multi-speaker dataset and predicted audio data generated by the TTS component based on transcripts of the speech. The machine learning model may be pre-trained using the synthetic parallel dataset and fine-tuned using audio data representing target voice speech and predicted audio generated by the TTS component based on transcripts of the target voice speech. The trained model may be used to modify synthetic speech to approximate the characteristics of the target speech.Type: GrantFiled: February 14, 2022Date of Patent: February 27, 2024Assignee: Amazon Technologies, Inc.Inventors: Adam Marek Gabrys, Jaime Lorenzo Trueba, Goeric Sydney Huybrechts
-
Patent number: 11836827Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: GrantFiled: April 23, 2021Date of Patent: December 5, 2023Assignee: McAfee, LLCInventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
-
Publication number: 20230260502Abstract: A text-to-speech (TTS) system may be configured to imitate characteristics of a target voice based on a limited dataset. The TTS system may include a machine learning model pre-trained using a synthetic parallel dataset and fine-tuned using examples of the target voice. A TTS component trained using a large single-speaker dataset may be used to generate the synthetic parallel dataset based on a multi-speaker dataset. The synthetic parallel dataset may include target audio data representing speech in the multi-speaker dataset and predicted audio data generated by the TTS component based on transcripts of the speech. The machine learning model may be pre-trained using the synthetic parallel dataset and fine-tuned using audio data representing target voice speech and predicted audio generated by the TTS component based on transcripts of the target voice speech. The trained model may be used to modify synthetic speech to approximate the characteristics of the target speech.Type: ApplicationFiled: February 14, 2022Publication date: August 17, 2023Inventors: Adam Marek Gabrys, Jaime Lorenzo Trueba, Goeric Sydney Huybrechts
-
Apparatus and methods for determining multi-subject performance metrics in a three-dimensional space
Patent number: 11715213Abstract: Apparatus and methods for extraction and calculation of multi-person performance metrics in a three-dimensional space. An example apparatus includes a detector to identify a first subject in a first image captured by a first image capture device based on a first set of two-dimensional kinematic keypoints in the first image, the two-dimensional kinematic keypoints corresponding to a joint of the first subject, the first image capture device associated with a first view of the first subject, a multi-view associator to verify the first subject using the first image and a second image captured by a second image capture device, the second image capture device associated with a second view of the first subject, the second view different than the first view, and a keypoint generator to generate three-dimensional keypoints for the first subject using the first set of two-dimensional kinematic keypoints.Type: GrantFiled: June 26, 2020Date of Patent: August 1, 2023Assignee: Intel CorporationInventors: Nelson Leung, Jonathan K. Lee, Bridget L. Williams, Sameer Sheorey, Amery Cong, Mehrnaz Khodam Hazrati, Sabar Mourad Souag, Adam Marek, Pawel Pieniazek, Bogna Bylicka, Jakub Powierza, Anna Banaszczyk-fiszer -
Patent number: 11543814Abstract: Predictive maintenance methods and systems, including a method of applying transfer entropy techniques to find a causal link between parameters; a method of applying quality weighting to context data based on a priori knowledge of the accuracy of the context data; a method of detecting a maintenance action from parameter data by detecting a step and a process capability improvement; a method of managing unattended alerts by considering cost/benefit of attending to one or more alerts over time and assigning alert expiry time and/or ranking the alerts accordingly; a method of displaying components of a complex system in a functional way enabling improvements in system diagnostics; a method of determining the time of an event indicator in time series parameter data; a method of classifying an event associated with a fault condition occurring within a system; and a method of determining whether an event recorded in parameter data is attributable to an external factor.Type: GrantFiled: September 13, 2016Date of Patent: January 3, 2023Assignee: ASML NETHERLANDS B.V.Inventors: David Evert Song Kook Sigtermans, René Fussenich, Adam Marek Kielczewski, Errol Arthur Zalmijn, Marcel Richard André Brunt, Stefan Lucian Voinea
-
Publication number: 20210241409Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: ApplicationFiled: April 23, 2021Publication date: August 5, 2021Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
-
Patent number: 11004168Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: GrantFiled: March 29, 2019Date of Patent: May 11, 2021Assignee: MCAFEE, LLCInventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
-
APPARATUS AND METHODS FOR DETERMINING MULTI-SUBJECT PERFORMANCE METRICS IN A THREE-DIMENSIONAL SPACE
Publication number: 20200401793Abstract: Apparatus and methods for extraction and calculation of multi-person performance metrics in a three-dimensional space. An example apparatus includes a detector to identify a first subject in a first image captured by a first image capture device based on a first set of two-dimensional kinematic keypoints in the first image, the two-dimensional kinematic keypoints corresponding to a joint of the first subject, the first image capture device associated with a first view of the first subject, a multi-view associator to verify the first subject using the first image and a second image captured by a second image capture device, the second image capture device associated with a second view of the first subject, the second view different than the first view, and a keypoint generator to generate three-dimensional keypoints for the first subject using the first set of two-dimensional kinematic keypoints.Type: ApplicationFiled: June 26, 2020Publication date: December 24, 2020Inventors: Nelson Leung, Jonathan K. Lee, Bridget L. Williams, Sameer Sheorey, Amery Cong, Mehrnaz Khodam Hazrati, Mourad S. Souag, Adam Marek, Pawel Pieniazek, Bogna Bylicka, Jakub Powierza, Anna Banaszczyk-fiszer -
Patent number: 10778667Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.Type: GrantFiled: November 15, 2019Date of Patent: September 15, 2020Assignee: McAfee, LLCInventors: Igor Muttik, Adam Marek, Alex Nayshtut
-
Patent number: 10629184Abstract: Cepstral variance normalization is described for audio feature extraction.Type: GrantFiled: December 22, 2014Date of Patent: April 21, 2020Assignee: Intel CorporationInventors: Tobias Bocklet, Adam Marek
-
Publication number: 20200092274Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.Type: ApplicationFiled: November 15, 2019Publication date: March 19, 2020Inventors: Igor Muttik, Adam Marek, Alex Nayshtut
-
Patent number: 10516657Abstract: A system is disclosed that includes a processor including watermark logic to output a first watermark to an output device that outputs a first watermark signal, based on the first watermark, to an acoustic transmission medium. The processor also includes recording logic to capture, at a first time period, an authentication submission comprising the first watermark signal convolved, via the acoustic transmission medium, with a first passphrase signal. The system also includes a dynamic random access memory (DRAM). Other embodiments are disclosed and claimed.Type: GrantFiled: April 24, 2014Date of Patent: December 24, 2019Assignee: McAfee, LLCInventors: Igor Muttik, Adam Marek, Alex Nayshtut
-
Publication number: 20190228496Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: ApplicationFiled: March 29, 2019Publication date: July 25, 2019Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
-
Patent number: 10296998Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: GrantFiled: November 10, 2016Date of Patent: May 21, 2019Assignee: MCAFEE, LLCInventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek
-
Publication number: 20180322863Abstract: Cepstral variance normalization is described for audio feature extraction.Type: ApplicationFiled: December 22, 2014Publication date: November 8, 2018Inventors: TOBIAS BOCKLET, ADAM MAREK
-
Publication number: 20180267523Abstract: Predictive maintenance methods and systems, including a method of applying transfer entropy techniques to find a causal link between parameters; a method of applying quality weighting to context data based on a priori knowledge of the accuracy of the context data; a method of detecting a maintenance action from parameter data by detecting a step and a process capability improvement; a method of managing unattended alerts by considering cost/benefit of attending to one or more alerts over time and assigning alert expiry time and/or ranking the alerts accordingly; a method of displaying components of a complex system in a functional way enabling improvements in system diagnostics; a method of determining the time of an event indicator in time series parameter data; a method of classifying an event associated with a fault condition occurring within a system; and a method of determining whether an event recorded in parameter data is attributable to an external factor.Type: ApplicationFiled: September 13, 2016Publication date: September 20, 2018Applicant: ASML NETHERLANDS B.V.Inventors: David Evert Song Kook SIGTERMANS, René FUSSENICH, Adam Marek KIELCZEWSKI, Errol Arthur ZALMIJN, Marcel Richard André BRUNT, Stefan Lucian VOINEA
-
Patent number: 10038679Abstract: Various embodiments are generally directed to pairing computing devices for collaborative interaction via a network through a centralized secure device pairing service. An apparatus comprises a controller processor circuit, and a controller storage communicatively coupled to the controller processor circuit to store an initial private key and to store instructions that when executed by the controller processor circuit cause the controller processor circuit to create a first signature using the initial private key, transmit the first signature to an issuing server via a network, receive a group public key and an associated member private key from the issuing server, create a second signature using the member private key, transmit the second signature to a member device via the network; receive a third signature from the member device; and authenticate the third signature using the group public key. Other embodiments are described and claimed herein.Type: GrantFiled: December 24, 2012Date of Patent: July 31, 2018Assignee: INTEL CORPORATIONInventor: Adam Marek
-
Patent number: 9972313Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include intermediate scoring of a state or states of a key phrase model and/or a backward transition or rejection loopback from a state of the key phrase model to a rejection model to reduce false accepts based on received utterances.Type: GrantFiled: March 1, 2016Date of Patent: May 15, 2018Assignee: Intel CorporationInventors: Tobias Bocklet, Adam Marek, Tomasz Dorau, Przemyslaw Sobon
-
Publication number: 20180130168Abstract: Providing optical watermark signals for a visual authentication session by performing at least the following: receive, at an anti-spoof engine, an instruction to perform visual authentication operations for a visual authentication session, generate, with the anti-spoof engine, an optical watermark signal based on receiving the instruction, wherein the optical watermark signal includes at least one optical identifier to authenticate images captured during the visual authentication session, obtain, with the anti-spoof engine, an image source that includes captured images of the visual authentication session, determine, with the anti-spoof engine, whether the image source includes a reflected optical watermark signal, and compare, with the anti-spoof engine, whether the reflected optical watermark signal matches the generated optical watermark signal based on the determination that the image source includes the reflected optical watermark signal.Type: ApplicationFiled: November 10, 2016Publication date: May 10, 2018Inventors: Alex Nayshtut, Igor Muttik, Oleg Pogorelik, Adam Marek