Patents by Inventor Harry Bratt

Harry Bratt has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220115001
    Abstract: A voice-based digital assistant (VDA) uses a conversation intelligence (CI) manager module having a rule-based engine on conversational intelligence to process information from one or more modules to make determinations on both i) understanding the human conversational cues and ii) generating the human conversational cues, including at least understanding and generating a backchannel utterance, in a flow and exchange of human communication in order to at least one of grab or yield a conversational floor between a user and the VDA. The CI manager module uses the rule-based engine to analyze and make a determination on a conversational cue of, at least, prosody in a user's flow of speech to generate the backchannel utterance to signal any of i) an understanding, ii) a correction, iii) a confirmation, and iv) a questioning of verbal communications conveyed by the user in the flow of speech during a time frame when the user still holds the conversational floor.
    Type: Application
    Filed: May 7, 2020
    Publication date: April 14, 2022
    Inventors: Harry Bratt, Kristin Precoda, Dimitra Vergyri
  • Patent number: 11152016
    Abstract: Embodiments of the disclosed technologies include finding content of interest in an RF spectrum by automatically scanning the RF spectrum; detecting, in a range of frequencies of the RF spectrum that includes one or more undefined channels, a candidate RF segment; where the candidate RF segment includes a frequency-bound time segment of electromagnetic energy; executing a machine learning-based process to determine, for the candidate RF segment, signal characterization data indicative of one or more of: a frequency range, a modulation type, a timestamp; using the signal characterization data to determine whether audio contained in the candidate RF segment corresponds to a search criterion; in response to determining that the candidate RF segment corresponds to the search criterion, outputting, through an electronic device, data indicative of the candidate RF segment; where the data indicative of the candidate RF segment is output in a real-time time interval after the candidate RF segment is detected.
    Type: Grant
    Filed: May 8, 2019
    Date of Patent: October 19, 2021
    Assignee: SRI International
    Inventors: Aaron D. Lawson, Harry Bratt, Mitchell L. McLaren, Martin Graciarena
  • Patent number: 11024291
    Abstract: In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes.
    Type: Grant
    Filed: March 27, 2019
    Date of Patent: June 1, 2021
    Assignee: SRI International
    Inventors: Diego Castan Lavilla, Harry Bratt, Mitchell Leigh McLaren
  • Publication number: 20200184997
    Abstract: Embodiments of the disclosed technologies include finding content of interest in an RF spectrum by automatically scanning the RF spectrum; detecting, in a range of frequencies of the RF spectrum that includes one or more undefined channels, a candidate RF segment; where the candidate RF segment includes a frequency-bound time segment of electromagnetic energy; executing a machine learning-based process to determine, for the candidate RF segment, signal characterization data indicative of one or more of: a frequency range, a modulation type, a timestamp; using the signal characterization data to determine whether audio contained in the candidate RF segment corresponds to a search criterion; in response to determining that the candidate RF segment corresponds to the search criterion, outputting, through an electronic device, data indicative of the candidate RF segment; where the data indicative of the candidate RF segment is output in a real-time time interval after the candidate RF segment is detected.
    Type: Application
    Filed: May 8, 2019
    Publication date: June 11, 2020
    Inventors: Aaron D. Lawson, Harry Bratt, Mitchell L. McLaren, Martin Graciarena
  • Publication number: 20200160845
    Abstract: In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes.
    Type: Application
    Filed: March 27, 2019
    Publication date: May 21, 2020
    Inventors: Diego Castan Lavilla, Harry Bratt, Mitchell Leigh McLaren
  • Patent number: 10133538
    Abstract: An audio file analyzer computing system includes technologies to, among other things, localize audio events of interest (such as speakers of interest) within an audio file that includes multiple different classes (e.g., different speakers) of audio. The illustrative audio file analyzer computing system uses a seed segment to perform a semi-supervised diarization of the audio file. The seed segment is pre-selected, such as by a human person using an interactive graphical user interface.
    Type: Grant
    Filed: March 27, 2015
    Date of Patent: November 20, 2018
    Assignee: SRI International
    Inventors: Mitchell Leigh McLaren, Aaron Dennis Lawson, Harry Bratt
  • Patent number: 9928832
    Abstract: A method for classifying lexical stress in an utterance includes generating a feature vector representing stress characteristics of a syllable occurring in the utterance, wherein the feature vector includes a plurality of features based on prosodic information and spectral information, computing a plurality of scores, wherein each of the plurality of scores is related to a probability of a given class of lexical stress, and classifying the lexical stress of the syllable based on the plurality of scores.
    Type: Grant
    Filed: June 30, 2014
    Date of Patent: March 27, 2018
    Assignee: SRI International
    Inventors: Horacio E. Franco, Luciana Ferrer, Harry Bratt, Colleen Richey, Kristin Precoda, Victor Abrash
  • Patent number: 9798799
    Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.
    Type: Grant
    Filed: November 15, 2012
    Date of Patent: October 24, 2017
    Assignee: SRI International
    Inventors: Michael J. Wolverton, William S. Mark, Harry Bratt, Douglas A. Bercow
  • Publication number: 20170061316
    Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
    Type: Application
    Filed: November 16, 2016
    Publication date: March 2, 2017
    Inventors: Gokhan Tur, Horacio E. Franco, Elizabeth Shriberg, Gregory K. Myers, William S. Mark, Norman D. Winarsky, Andreas Stolcke, Bart Peintner, Michael J. Wolverton, Luciana Ferrer, Martin Graciarena, Neil Yorke-Smith, Harry Bratt
  • Patent number: 9501743
    Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
    Type: Grant
    Filed: December 2, 2015
    Date of Patent: November 22, 2016
    Assignee: SRI International
    Inventors: Gokhan Tur, Horacio E. Franco, Elizabeth Shriberg, Gregory K. Myers, William S. Mark, Norman D. Winarsky, Andreas Stolcke, Bart Peintner, Michael J. Wolverton, Luciana Ferrer, Martin Graciarena, Neil Yorke-Smith, Harry Bratt
  • Publication number: 20160283185
    Abstract: An audio file analyzer computing system includes technologies to, among other things, localize audio events of interest (such as speakers of interest) within an audio file that includes multiple different classes (e.g., different speakers) of audio. The illustrative audio file analyzer computing system uses a seed segment to perform a semi-supervised diarization of the audio file. The seed segment is pre-selected, such as by a human person using an interactive graphical user interface.
    Type: Application
    Filed: March 27, 2015
    Publication date: September 29, 2016
    Inventors: Mitchell Leigh McLaren, Aaron Dennis Lawson, Harry Bratt
  • Publication number: 20160086090
    Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
    Type: Application
    Filed: December 2, 2015
    Publication date: March 24, 2016
    Inventors: Gokhan Tur, Horacio E. Franco, Elizabeth Shriberg, Gregory K. Myers, William S. Mark, Norman D. Winarsky, Andreas Stolcke, Bart Peintner, Michael J. Wolverton, Luciana Ferrer, Martin Graciarena, Neil Yorke-Smith, Harry Bratt
  • Patent number: 9213558
    Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
    Type: Grant
    Filed: September 1, 2010
    Date of Patent: December 15, 2015
    Assignee: SRI International
    Inventors: Gokhan Tur, Horacio E. Franco, Elizabeth Shriberg, Gregory K. Myers, William S. Mark, Norman D. Winarsky, Andreas Stolcke, Bart Peintner, Michael J. Wolverton, Luciana Ferrer, Martin Graciarena, Harry Bratt, Neil Yorke-Smith
  • Patent number: 9085303
    Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.
    Type: Grant
    Filed: November 15, 2012
    Date of Patent: July 21, 2015
    Assignee: SRI International
    Inventors: Michael J. Wolverton, William S. Mark, Harry Bratt, Douglas A. Bercow
  • Publication number: 20150170644
    Abstract: A method for classifying lexical stress in an utterance includes generating a feature vector representing stress characteristics of a syllable occurring in the utterance, wherein the feature vector includes a plurality of features based on prosodic information and spectral information, computing a plurality of scores, wherein each of the plurality of scores is related to a probability of a given class of lexical stress, and classifying the lexical stress of the syllable based on the plurality of scores.
    Type: Application
    Filed: June 30, 2014
    Publication date: June 18, 2015
    Inventors: Horacio E. Franco, Luciana Ferrer, Harry Bratt, Colleen Richey, Kristin Precoda, Victor Abrash
  • Publication number: 20140136187
    Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.
    Type: Application
    Filed: November 15, 2012
    Publication date: May 15, 2014
    Applicant: SRI International
    Inventors: Michael J. Wolverton, William S. Mark, Harry Bratt, Douglas A. Bercow
  • Publication number: 20140136013
    Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.
    Type: Application
    Filed: November 15, 2012
    Publication date: May 15, 2014
    Applicant: SRI International
    Inventors: Michael J. Wolverton, William S. Mark, Harry Bratt, Douglas A. Bercow
  • Patent number: 7533020
    Abstract: A method and apparatus are provided for performing speech recognition using observable and meaningful relationships between words within a single utterance and using a structured data source as a source of constraints on the recognition process. Results from a first constrained speech recognition pass can be combined with information about the observable and meaningful word relationships to constrain or simplify subsequent recognition passes. This iterative process greatly reduces the search space required for each recognition pass, making the speech recognition process more efficient, faster and accurate.
    Type: Grant
    Filed: February 23, 2005
    Date of Patent: May 12, 2009
    Assignee: Nuance Communications, Inc.
    Inventors: James F. Arnold, Michael W. Frandsen, Anand Venkataraman, Douglas A. Bercow, Gregory K. Myers, David J. Israel, Venkata Ramana Rao Gadde, Horacio Franco, Harry Bratt
  • Publication number: 20080010065
    Abstract: A method and apparatus for speaker recognition is provided. One embodiment of a method for determining whether a given speech signal is produced by an alleged speaker, where a plurality of statistical models (including at least one support vector machine) have been produced for the alleged speaker based on a previous speech signal received from the alleged speaker, includes receiving the given speech signal, the speech signal representing an utterance made by a speaker claiming to be the alleged speaker, scoring the given speech signal using at least two modeling systems, where at least one of the modeling systems is a support vector machine, combining scores produced by the modeling systems, with equal weights, to produce a final score, and determining, in accordance with the final score, whether the speaker is likely the alleged speaker.
    Type: Application
    Filed: June 5, 2007
    Publication date: January 10, 2008
    Inventors: Harry Bratt, Luciana Ferrer, Martin Graciarena, Sachin Kajarekar, Elizabeth Shriberg, Mustafa Sonmez, Andreas Stolcke, Gokhan Tur, Anand Venkataraman
  • Patent number: 7177810
    Abstract: A method and apparatus for finding endpoints in speech by utilizing information contained in speech prosody. Prosody denotes the way speakers modulate the timing, pitch and loudness of phones, words, and phrases to convey certain aspects of meaning; informally, prosody includes what is perceived as the “rhythm” and “melody” of speech. Because speakers use prosody to convey units of speech to listeners, the method and apparatus performs endpoint detection by extracting and interpreting the relevant prosodic properties of speech.
    Type: Grant
    Filed: April 10, 2001
    Date of Patent: February 13, 2007
    Assignee: SRI International
    Inventors: Elizabeth Shriberg, Harry Bratt, Mustafa K. Sonmez
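
The abstract of publication 20080010065 above describes combining scores from at least two modeling systems, with equal weights, into a final score that decides whether a speaker is likely the alleged speaker. A minimal Python sketch of that single step follows. It is an illustration under stated assumptions, not the patented implementation: the function names, the example scores, and the decision threshold are all hypothetical, and the individual modeling systems (including the support vector machine) are assumed to have already produced comparable numeric scores.

```python
from statistics import fmean
from typing import Sequence


def fuse_equal_weights(scores: Sequence[float]) -> float:
    """Combine per-system scores with equal weights (a plain average)."""
    if len(scores) < 2:
        # The abstract calls for at least two modeling systems.
        raise ValueError("expected scores from at least two modeling systems")
    return fmean(scores)


def is_likely_alleged_speaker(scores: Sequence[float], threshold: float = 0.0) -> bool:
    """Accept the identity claim when the fused score reaches the threshold.

    The default threshold is a hypothetical placeholder, not a value from the patent.
    """
    return fuse_equal_weights(scores) >= threshold


if __name__ == "__main__":
    # Hypothetical scores from two systems, for example an SVM and one other model.
    trial_scores = [1.5, -0.5]
    print(fuse_equal_weights(trial_scores))
    print(is_likely_alleged_speaker(trial_scores))
```

Averaging is the simplest reading of "equal weights"; in practice the scores from different systems would likely need to be normalized to a common scale before being combined, which this sketch assumes has already been done.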