Patents by Inventor Dirk Padfield
Dirk Padfield has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250210038Abstract: An ASR transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript. Features are obtained using at least the suggested words. The features include a reasonableness score. The reasonableness score is obtained from an NLP model evaluating sentences including the suggested words in context of the ASR transcript. The features are input into a machine learning (ML) model to obtain a probabilistic determination of a validity of the suggested words. Responsive to the probabilistic determination exceeding a validity threshold, the suggested words are incorporated into the ASR transcript in place of the corrected words.Type: ApplicationFiled: March 13, 2025Publication date: June 26, 2025Inventors: Dirk Padfield, Noah Murad, Edward Lo, Bryan Huh
-
Patent number: 12254874Abstract: An automated speech recognition (ASR) transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript of the media content. Features are obtained using at least the suggested words or the corrected words. The features include features relating to sound similarities between the suggested words and the corrected words. The features are input into a machine learning (ML) model to obtain a determination regarding a validity of the suggested words. Responsive to the suggested words constituting a valid suggestion, the suggested words are incorporated into the ASR transcript. At least a portion of the ASR transcript is transmitted to a user device in conjunction with at least a portion of the media content.Type: GrantFiled: February 20, 2022Date of Patent: March 18, 2025Assignee: GOOGLE LLCInventors: Dirk Padfield, Noah Murad, Edward Lo, Bryan Huh
-
Publication number: 20230267926Abstract: An automated speech recognition (ASR) transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript of the media content. Features are obtained using at least the suggested words or the corrected words. The features include features relating to sound similarities between the suggested words and the corrected words. The features are input into a machine learning (ML) model to obtain a determination regarding a validity of the suggested words. Responsive to the suggested words constituting a valid suggestion, the suggested words are incorporated into the ASR transcript. At least a portion of the ASR transcript is transmitted to a user device in conjunction with at least a portion of the media content.Type: ApplicationFiled: February 20, 2022Publication date: August 24, 2023Inventors: Dirk Padfield, Noah Murad, Edward Lo, Bryan Huh
-
Patent number: 11710496Abstract: A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.Type: GrantFiled: July 1, 2019Date of Patent: July 25, 2023Assignee: Google LLCInventors: Aaron Donsbach, Dirk Padfield
-
Publication number: 20220310109Abstract: A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.Type: ApplicationFiled: July 1, 2019Publication date: September 29, 2022Inventors: Aaron Donsbach, Dirk Padfield
-
Patent number: 11250552Abstract: Machine learning techniques are disclosed for training a model to identify each of multiple different classes in images, based on training data where each training image may not be labeled in a complete manner with respect to the classes. The disclosed training techniques use a new label value to indicate when a ground truth value is unknown for a particular class, and do not penalize the machine learning model for output predictions that do not match the label value representing unknown ground truth. The disclosed processes may, for example, be used to train a model to detect each multiple types of image defects based on incomplete information provided by human reviewers who accept and reject images based on whether any of the types of image defects are found.Type: GrantFiled: May 22, 2020Date of Patent: February 15, 2022Assignee: Amazon Technologies, Inc.Inventors: Dirk Padfield, Suren Kumar
-
Patent number: 10769766Abstract: Aspects of the present disclosure relate to machine learning techniques for training a model to identify each of a number of different classes in images, based on training data where each training image may not be labeled in a complete manner with respect to the classes. The disclosed training techniques use a new label value to indicate when a ground truth value is unknown for a particular class, and do not penalize the machine learning network for output predictions that do not match the label value representing unknown ground truth. Some implementations of the training process can be regularized to impose sparsity on predicted classes in order to avoid false positive predictions.Type: GrantFiled: May 31, 2018Date of Patent: September 8, 2020Assignee: Amazon Technologies, Inc.Inventors: Dirk Padfield, Suren Kumar
-
Patent number: 8995740Abstract: Improved systems and methods for the analysis of digital images are provided. More particularly, the present disclosure provides for improved systems and methods for the analysis of digital images of biological tissue samples. Exemplary embodiments provide for: i) segmenting, ii) grouping, and iii) quantifying molecular protein profiles of individual cells in terms of sub cellular compartments (nuclei, membrane, and cytoplasm). The systems and methods of the present disclosure advantageously perform tissue segmentation at the sub-cellular level to facilitate analyzing, grouping and quantifying protein expression profiles of tissue in tissue sections globally and/or locally. Performing local-global tissue analysis and protein quantification advantageously enables correlation of spatial and molecular configuration of cells with molecular information of different types of cancer.Type: GrantFiled: April 17, 2013Date of Patent: March 31, 2015Assignee: General Electric CompanyInventors: Alberto Santamaria-Pang, Jens Rittscher, Dirk Padfield, Ali Can, Zhengyu Pang, Musodiq Bello, Fiona Ginty, Christopher Sevinsky, Qing Li, Megan Rothney, Brion Sarachan
-
Publication number: 20140314299Abstract: Improved systems and methods for the analysis of digital images are provided. More particularly, the present disclosure provides for improved systems and methods for the analysis of digital images of biological tissue samples. Exemplary embodiments provide for: i) segmenting, ii) grouping, and iii) quantifying molecular protein profiles of individual cells in terms of sub cellular compartments (nuclei, membrane, and cytoplasm). The systems and methods of the present disclosure advantageously perform tissue segmentation at the sub-cellular level to facilitate analyzing, grouping and quantifying protein expression profiles of tissue in tissue sections globally and/or locally. Performing local-global tissue analysis and protein quantification advantageously enables correlation of spatial and molecular configuration of cells with molecular information of different types of cancer.Type: ApplicationFiled: April 17, 2013Publication date: October 23, 2014Inventors: Alberto Santamaria-Pang, Jens Rittscher, Dirk Padfield, Ali Can, Zhengyu Pang, Musodiq Bello, Fiona Ginty, Christopher Sevinsky, Qing Li, Megan Rothney, Brion Sarachan
-
Publication number: 20070109874Abstract: The present invention provides a cell imaging technique for automatically tracking the progression of a cell through the cell cycle over time through segmentation of a volume of two-dimensional time-lapse images. The technique allows long-term tracking of the cell cycle progression of an individual cell or multiple cells. Further, the invention provides a unique display of cell cycle progression, allowing an end user to easily determine changes to cell cycle progression for a cell of interest.Type: ApplicationFiled: October 27, 2006Publication date: May 17, 2007Inventors: Dirk Padfield, Thomas Sebastian, Jens Rittscher, Nicholas Thomas
-
Publication number: 20060066911Abstract: An imaging system for correcting a bias in the location edges in an image is provided. The imaging system comprises an image processor configured to detect edges in an image of a given substructure, characterize a blurring factor in the image and correct a bias in the detected edges in the of a given substructure using the blurring factor.Type: ApplicationFiled: September 27, 2004Publication date: March 30, 2006Inventors: James Miller, Paulo Mendonca, Matthew Turek, Dirk Padfield