Patents by Inventor William F. Ganong

William F. Ganong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8913722
    Abstract: A voicemail computer system transcribes a voicemail message into text that is presented to a calling party for approval. A calling party is able to approve, disapprove or edit a voicemail message prior to delivery to one or more called parties. The voicemail computer system may analyze a voicemail message to detect errors, omissions, or potentially offensive words. The voicemail computer may analyze a voicemail message to make suggestions as to tone, content or information contained within the voicemail message. The calling party can edit the voicemail message or approve it prior to providing a notification to one or more called parties that they have received the voicemail message.
    Type: Grant
    Filed: May 5, 2011
    Date of Patent: December 16, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Shreedhar Madhavapeddi, William F. Ganong, III, George Zavaliagkos, Uwe H. Jost, Gary B. Clayton
  • Patent number: 8909526
    Abstract: In some embodiments, a recognition result produced by a speech processing system based on an analysis of a speech input is evaluated for indications of potential errors. In some embodiments, sets of words/phrases that may be acoustically similar or otherwise confusable, the misrecognition of which can be significant in the domain, may be used together with a language model to evaluate a recognition result to determine whether the recognition result includes such an indication. In some embodiments, a word/phrase of a set that appears in the result is iteratively replaced with each of the other words/phrases of the set. The result of the replacement may be evaluated using a language model to determine a likelihood of the newly-created string of words appearing in a language and/or domain. The likelihood may then be evaluated to determine whether the result of the replacement is sufficiently likely for an alert to be triggered.
    Type: Grant
    Filed: July 9, 2012
    Date of Patent: December 9, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Raghu Vemula, Robert Fleming
  • Publication number: 20140278354
    Abstract: Described herein are techniques for determining corresponding positions between different representations of a textual work. In some of the techniques, portions of one or more representations may be processed. A determination of a corresponding position may be made in response to a request received from a user, such as a reader that desires to switch between representations. The request may indicate a position in one representation and the representation to which the user would like to switch. In response to receiving the request, one or more portions of one or more representations of a textual work may be processed. In some techniques, a corresponding position between different representations may be determined without processing the entirety of one or more representations of the textual work. For example, a corresponding position may be determined without processing an entire audio representation.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Inventor: William F. Ganong, III
  • Publication number: 20140278429
    Abstract: Described herein are techniques for determining corresponding positions between different representations of a textual work. In some of the techniques, portions of one or more representations may be processed. A determination of a corresponding position may be made in response to a request received from a user, such as a reader that desires to switch between representations. The request may indicate a position in one representation and the representation to which the user would like to switch. In response to receiving the request, one or more portions of one or more representations of a textual work may be processed. In some techniques, a corresponding position between different representations may be determined without processing the entirety of one or more representations of the textual work. For example, a corresponding position may be determined without processing an entire audio representation.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Inventor: William F. Ganong, III
  • Publication number: 20140278425
    Abstract: Training speech recognizers, e.g., their language or acoustic models, using actual user data is useful, but retaining personally identifiable information may be restricted in certain environments due to regulations. Accordingly, a method or system is provided for enabling training of a language model which includes producing segments of text in a text corpus and counts corresponding to the segments of text, the text corpus being in a depersonalized state. The method further includes enabling a system to train a language model using the segments of text in the depersonalized state and the counts. Because the data is depersonalized, actual data may be used, enabling speech recognizers to keep up-to-date with user trends in speech and usage, among other benefits.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Inventors: Uwe Helmut Jost, Philip Charles Woodland, Marcel Katz, Syed Raza Shahid, Paul J. Vozila, William F. Ganong, III
  • Publication number: 20140278401
    Abstract: Described herein are techniques for determining corresponding positions between different representations of a textual work. In some of the techniques, portions of one or more representations may be processed. A determination of a corresponding position may be made in response to a request received from a user, such as a reader that desires to switch between representations. The request may indicate a position in one representation and the representation to which the user would like to switch. In response to receiving the request, one or more portions of one or more representations of a textual work may be processed. In some techniques, a corresponding position between different representations may be determined without processing the entirety of one or more representations of the textual work. For example, a corresponding position may be determined without processing an entire audio representation.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Inventor: William F. Ganong, III
  • Publication number: 20140278426
    Abstract: Training speech recognizers, e.g., their language or acoustic models, using actual user data is useful, but retaining personally identifiable information may be restricted in certain environments due to regulations. Accordingly, a method or system is provided for enabling training of an acoustic model which includes dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments. The method further includes enabling a system to train an acoustic model using the text segments and the depersonalized audio features. Because the data is depersonalized, actual data may be used, enabling speech recognizers to keep up-to-date with user trends in speech and usage, among other benefits.
    Type: Application
    Filed: March 13, 2013
    Publication date: September 18, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: Uwe Helmut Jost, Philip Charles Woodland, Marcel Katz, Syed Raza Shahid, Paul J. Vozila, William F. Ganong, III
  • Publication number: 20140278435
    Abstract: Some aspects include a method of monitoring an acoustic environment of a mobile device operating in a low power mode, the mobile device having a first and second processor, the method comprises receiving acoustic input while the mobile device is operating in the low power mode, performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor, to evaluate whether the acoustic input includes a voice command, performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command, and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command.
    Type: Application
    Filed: March 12, 2013
    Publication date: September 18, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Paul Adrian Van Mulbregt, Vladimir Sejnoha, Glen Wilson
  • Publication number: 20140274211
    Abstract: According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method and/or a mobile device configured to perform such a method is provided. The method comprises receiving, by the mobile device, acoustic input from the environment of the mobile device, detecting whether the acoustic input includes a voice command from a user without requiring receipt of an explicit trigger from the user, and initiating responding to the detected voice command.
    Type: Application
    Filed: March 12, 2013
    Publication date: September 18, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: Vladimir Sejnoha, Paul Adrian Van Mulbregt, Glen Edward Wilson, William F. Ganong, III
  • Publication number: 20140274203
    Abstract: According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method and/or a mobile device configured to perform such a method is provided. The method comprises receiving acoustic input from the environment of the mobile device while the mobile device is operating in the low power mode, detecting whether the acoustic input includes a voice command based on performing a plurality of processing stages on the acoustic input, wherein at least one of the plurality of processing stages is performed while the mobile device is operating in the low power mode, and using at least one contextual cue to assist in detecting whether the acoustic input includes a voice command.
    Type: Application
    Filed: March 12, 2013
    Publication date: September 18, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Paul Adrian Van Mulbregt, Vladimir Sejnoha, Glen Wilson
  • Publication number: 20140207442
    Abstract: A mobile device is adapted for protecting private information on the mobile device in a hybrid automatic speech recognition arrangement. The mobile device includes a speech input component for receiving a speech input signal from a user. Additionally, the mobile device includes a local ASR arrangement for performing local ASR processing of the speech input signal and determining if private information is included within the speech input signal. A control unit on the mobile device obscures private information in the speech input signal if the local ASR arrangement identifies information within a speech recognition result as private information. The control unit releases the speech input signal with the obscured private information for transmission to a remote server for further ASR processing. Results from the remote server's ASR processing are integrated and combined with results from local ASR processing to display information on the mobile device.
    Type: Application
    Filed: January 24, 2013
    Publication date: July 24, 2014
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: William F. Ganong, III, Paul J. Vozila
  • Publication number: 20140163981
    Abstract: A speech transcription system is described for producing a representative transcription text from one or more different audio signals representing one or more different speakers participating in a speech session. A preliminary transcription module develops a preliminary transcription of the speech session using automatic speech recognition having a preliminary recognition accuracy performance. A speech selection module enables user selection of one or more portions of the preliminary transcription to receive higher accuracy transcription processing. A final transcription module is responsive to the user selection for developing a final transcription output for the speech session having a final recognition accuracy performance for the selected one or more portions which is higher than the preliminary recognition accuracy performance.
    Type: Application
    Filed: December 12, 2012
    Publication date: June 12, 2014
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Gary David Cook, William F. Ganong, III, Andrew Johnathon Daborn
  • Publication number: 20140164597
    Abstract: In many speech-enabled applications, adaptation of speech recognition and language understanding tools for different users are employed. With such adaptation, identifying the particular user precedes applying the speech recognition and language understanding tools. According to at least one example embodiment, a method and corresponding apparatus of identifying a user includes comparing personal information data received from a user network device against personal information accessible by the server; and identifying a speech profile specific to the user based on the results of comparing the personal information data retrieved from the first user network device against the personal information accessible by the server. The identified speech profile is used in processing a speech of the user. Through use of the method or corresponding apparatus, a user can proceed directly to the use of the speech recognition or other applications and bypassing a login sequence.
    Type: Application
    Filed: December 12, 2012
    Publication date: June 12, 2014
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Jean-Guy E. Dahan, William F. Ganong, III, Jianxiong Wu
  • Publication number: 20140143550
    Abstract: Methods and apparatus for reducing security vulnerabilities in a client/server speech recognition system including one or more client computers and one or more server computers connected via a network. Decryption of sensitive information, such as medical dictation information, is performed on designated servers to limit the attack surface of unencrypted data. Management of encryption and decryption keys to restrict the storage and/or use of decryption keys on the server side of the client/server speech recognition system, while maintaining encrypted data on the server side is also described.
    Type: Application
    Filed: November 16, 2012
    Publication date: May 22, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Maximilian Bisani
  • Publication number: 20140143533
    Abstract: Methods and apparatus for reducing security vulnerabilities in a client/server speech recognition system including one or more client computers and one or more server computers connected via a network. Decryption of sensitive information, such as medical dictation information, is performed on designated servers to limit the attack surface of unencrypted data. Management of encryption and decryption keys to restrict the storage and/or use of decryption keys on the server side of the client/server speech recognition system, while maintaining encrypted data on the server side is also described.
    Type: Application
    Filed: November 16, 2012
    Publication date: May 22, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Maximilian Bisani
  • Publication number: 20140129220
    Abstract: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.
    Type: Application
    Filed: January 10, 2014
    Publication date: May 8, 2014
    Inventors: Shilei Zhang, Shenghua Bao, Wen Liu, Yong Qin, Zhiwei Shuang, Jian Chen, Zhong Su, Qin Shi, William F. Ganong, III
  • Publication number: 20140081640
    Abstract: One aspect includes determining validity of an identity asserted by a speaker using a voice print associated with a user whose identity the speaker is asserting, the voice print obtained from characteristic features of at least one first voice signal obtained from the user uttering at least one enrollment utterance including at least one enrollment word by obtaining a second voice signal of the speaker uttering at least one challenge utterance that includes at least one word not in the at least one enrollment utterance, obtaining at least one characteristic feature from the second voice signal, comparing the at least one characteristic feature with at least a portion of the voice print to determine a similarity between the at least one characteristic feature and the at least a portion of the voice print, and determining whether the speaker is the user based, at least in part, on the similarity.
    Type: Application
    Filed: November 21, 2013
    Publication date: March 20, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: Kevin R. Farrell, David A. James, William F. Ganong, III, Jerry K. Carter
  • Patent number: 8666963
    Abstract: Some embodiments relate to a method of performing a search for content on the Internet, in which a user may speak a search query and speech recognition may be performed on the spoken query to generate a text search query to be provided to a plurality of search engines. This enables a user to speak the search query rather than having to type it, and also allows the user to provide the search query only once, rather than having to provide it separately to multiple different search engines.
    Type: Grant
    Filed: June 19, 2012
    Date of Patent: March 4, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Vladimir Sejnoha, William F. Ganong, III, Paul J. Vozila, Nathan M. Bodenstab, Yik-Cheung Tam
  • Patent number: 8630860
    Abstract: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.
    Type: Grant
    Filed: March 3, 2011
    Date of Patent: January 14, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Shilei Zhang, Shenghua Bao, Wen Liu, Yong Qin, Zhiwei Shuang, Jian Chen, Zhong Su, Qin Shi, William F. Ganong, III
  • Publication number: 20140012581
    Abstract: In some embodiments, the recognition results produced by a speech processing system (which may include two or more recognition results, including a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential significant errors. In some embodiments, the recognition results may be evaluated using one or more sets of words and/or phrases, such as pairs of words/phrases that may include words/phrases that are acoustically similar to one another and/or that, when included in a result, would change a meaning of the result in a manner that would be significant for a domain. The recognition results may be evaluated using the set(s) of words/phrases to determine, when the top result includes a word/phrase from a set of words/phrases, whether any of the alternative recognition results includes any of the other, corresponding words/phrases from the set.
    Type: Application
    Filed: July 9, 2012
    Publication date: January 9, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Raghu Vemula, Robert Fleming
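
The check described in the abstract of publication 20140012581 (flagging a top recognition result when an alternative result contains the corresponding word from a confusable set) can be illustrated with a minimal sketch. This is not the implementation from the application. The function name `find_potential_errors`, the example word sets, and the restriction to single words rather than phrases are all assumptions made for illustration.

```python
# Hypothetical confusable sets for a medical dictation domain.
# These are illustrative only and are not taken from the application.
CONFUSABLE_SETS = [
    {"hypertension", "hypotension"},
    {"fifteen", "fifty"},
]


def find_potential_errors(top_result, alternatives, confusable_sets=CONFUSABLE_SETS):
    """Return (word_in_top, word_in_alternative) pairs that may signal an error.

    A pair is reported when the top result contains one member of a
    confusable set and at least one alternative result contains a
    different member of the same set.
    """
    top_words = set(top_result.lower().split())
    flagged = []
    for group in confusable_sets:
        # Members of this set that appear in the top result.
        for word in sorted(group & top_words):
            others = group - {word}
            for alternative in alternatives:
                alt_words = set(alternative.lower().split())
                for other in sorted(others & alt_words):
                    pair = (word, other)
                    if pair not in flagged:  # report each pair once
                        flagged.append(pair)
    return flagged


if __name__ == "__main__":
    top = "patient has hypertension"
    alts = ["patient has hypotension", "patient has hypertension"]
    print(find_potential_errors(top, alts))
```

A non-empty return value corresponds to the situation the abstract describes, where the competing hypotheses disagree on a word whose misrecognition would matter in the domain. What to do with the flagged pair, such as alerting a reviewer, is left open here.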