Patents Examined by Shaun Roberts
  • Patent number: 9484032
    Abstract: The disclosed embodiments illustrate methods and systems for processing multimedia content. The method includes extracting one or more words from an audio stream associated with multimedia content. Each word has associated one or more timestamps indicative of temporal occurrences of said word in said multimedia content. The method further includes creating a word cloud of said one or more words in said multimedia content based on a measure of emphasis laid on each word in said multimedia content and said one or more timestamps associated with said one or more words. The method further includes presenting one or more multimedia snippets, of said multimedia content, associated with a word selected by a user from said word cloud. Each of said one or more multimedia snippets corresponds to said one or more timestamps associated with occurrences of said word in said multimedia content.
    Type: Grant
    Filed: October 27, 2014
    Date of Patent: November 1, 2016
    Assignee: Xerox Corporation
    Inventors: Kuldeep Yadav, Kundan Shrivastava, Om D Deshmukh
  • Patent number: 9466291
    Abstract: A voice retrieval device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: setting detection criteria for a retrieval word, based on a characteristic of the retrieval word, such that the higher the detection accuracy of the retrieval word or the lower the pronunciation difficulty of the retrieval word or the lower the appearance probability of the retrieval word, the stricter the detection criteria; performing first voice retrieval processing on voice data according to the detection criteria and detecting a section that possibly includes the retrieval word as a candidate section from the voice data; and performing second voice retrieval processing different from the first voice retrieval processing on each candidate section and determining whether or not the retrieval word is included in each candidate section.
    Type: Grant
    Filed: October 16, 2014
    Date of Patent: October 11, 2016
    Assignee: FUJITSU LIMITED
    Inventors: Masakiyo Tanaka, Hitoshi Iwamida, Nobuyuki Washio
  • Patent number: 9448996
    Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.
    Type: Grant
    Filed: April 14, 2016
    Date of Patent: September 20, 2016
    Assignee: Machine Zone, Inc.
    Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
  • Patent number: 9442918
    Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective data with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.
    Type: Grant
    Filed: March 24, 2015
    Date of Patent: September 13, 2016
    Assignee: International Business Machines Corporation
    Inventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
  • Patent number: 9431025
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank (101) configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit (102) configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Grant
    Filed: October 13, 2014
    Date of Patent: August 30, 2016
    Assignee: Dolby International AB
    Inventor: Lars Villemoes
  • Patent number: 9406307
    Abstract: A method, device, and apparatus provide the ability to predict a portion of a polyphonic audio signal for compression and networking applications. The solution involves a framework of a cascade of long term prediction filters, which by design is tailored to account for all periodic components present in a polyphonic signal. This framework is complemented with a design method to optimize the system parameters. Specialization may include specific techniques for coding and networking scenarios, where the potential of each enhanced prediction is realized to considerably improve the overall system performance for that application. One specific technique provides enhanced inter-frame prediction for the compression of polyphonic audio signals, particularly at low delay. Another specific technique provides improved frame loss concealment capabilities to combat packet loss in audio communications.
    Type: Grant
    Filed: August 19, 2013
    Date of Patent: August 2, 2016
    Assignee: The Regents of the University of California
    Inventors: Kenneth Rose, Tejaswi Nanjundaswamy
  • Patent number: 9400780
    Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective data with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.
    Type: Grant
    Filed: October 17, 2014
    Date of Patent: July 26, 2016
    Assignee: International Business Machines Corporation
    Inventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
  • Patent number: 9384446
    Abstract: A management system for guiding an agent in a media-specific dialog has a conversion engine for instantiating ongoing dialog as machine-readable text, if the dialog is in voice media, a context analysis engine for determining facts from the text, a rules engine for asserting rules based on fact input, and a presentation engine for presenting information to the agent to guide the agent in the dialog. The context analysis engine passes determined facts to the rules engine, which selects and asserts to the presentation engine rules based on the facts, and the presentation engine provides periodically updated guidance to the agent based on the rules asserted.
    Type: Grant
    Filed: July 27, 2015
    Date of Patent: July 5, 2016
    Assignee: GENESYS TELECOMMUNICATIONS LABORATORIES INC.
    Inventors: Dave Sneyders, Brian Galvin, S. Michael Perlmutter
  • Patent number: 9384189
    Abstract: An apparatus and a method for predicting the pleasantness-unpleasantness index of words are disclosed. The disclosed apparatus includes: a computing unit configured to compute an emotion correlation between a word and one or more comparison words, compute emotion correlations between multiple reference words included in a reference word set and the one or more comparison words, compute multiple first absolute emotion similarity values between the word and the multiple reference words, and compute at least one second absolute emotion similarity value between a reference word and another reference word for all of the reference words included in the reference word set; and a prediction unit configured to predict the pleasantness-unpleasantness index of the word by using the multiple first absolute emotion similarity values, the at least one second absolute emotion similarity value, and a preset pleasantness-unpleasantness index of the multiple reference words.
    Type: Grant
    Filed: October 21, 2014
    Date of Patent: July 5, 2016
    Assignee: Foundation of Soongsil University—Industry Corporation
    Inventors: Soo Won Lee, Kang Bok Lee
  • Patent number: 9380009
    Abstract: Embodiments are directed towards providing word-by-word message completion for an incomplete response message, wherein the response message is composed in response to a received stimulus message. The message completion is based on a Response Completion Model (RCM) that may model both the language used in the incomplete response message and the contextual information in the received stimulus message. The RCM may be determined based on conversational stimulus-response data including stimulus-response message pairs. The RCM may be a mixture model and include a generic response language model based on an N-gram model, a Stimulus Model based on a Selection Model or a Topic Model, and a mixture parameter. In some embodiments, at least one candidate next word for the incomplete response message is determined based on the RCM. The at least one candidate next word may be selected and included in the incomplete response message. A complete response message may be generated and provided to a user.
    Type: Grant
    Filed: July 12, 2012
    Date of Patent: June 28, 2016
    Assignee: Yahoo! Inc.
    Inventors: Sujith Ravi, Bo Pang
  • Patent number: 9363104
    Abstract: Various approaches enable automatic communication generation based on patterned behavior in a particular context. For example, a computing device can monitor behavior of a user to determine patterns of communication behavior in certain situations. In response to detecting multiple occurrences of the certain situation, a computing device can prompt a user to perform an action corresponding to the pattern of behavior. In some embodiments, a set of speech models corresponding to a type of contact is generated. The speech models include language consistent with patterns of speech between a user and the type of contact. Based on context and on the contact, a message using language consistent with past communications between the user and contact is generated from a speech model associated with the type of contact.
    Type: Grant
    Filed: April 21, 2014
    Date of Patent: June 7, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Isaac Scott Noble, Gabrielle M. Halberg, Kenneth Mark Karakotsios, Yuzo Watanabe
  • Patent number: 9361893
    Abstract: Provided are, among other things, systems, methods and techniques for detecting whether a transient exists within an audio signal. According to one representative embodiment, a segment of a digital audio signal is divided into blocks, and a norm value is calculated for each of a number of the blocks, resulting in a set of norm values for such blocks, each such norm value representing a measure of signal strength within a corresponding block. A maximum norm value is then identified across such blocks, and a test criterion is applied to the norm values. If the test criterion is not satisfied, a first signal indicating that the segment does not include any transient is output, and if the test criterion is satisfied, a second signal indicating that the segment includes a transient is output. According to this embodiment, the test criterion involves a comparison of the maximum norm value to a different second maximum norm value, subject to a specified constraint, within the segment.
    Type: Grant
    Filed: July 5, 2014
    Date of Patent: June 7, 2016
    Assignee: Digital Rise Technology Co., Ltd.
    Inventor: Yuli You
  • Patent number: 9349375
    Abstract: According to an embodiment, a signal processing apparatus includes an estimation unit and an updating unit. The estimation unit is configured to estimate an auxiliary variable of a target section including first and second sections of input signals by using an approximating auxiliary function for approximating an auxiliary function having an auxiliary variable as an argument. The auxiliary function is determined according to an objective function that outputs a function value that is smaller as a statistical independence of separated signals into which input signals in time-series are separated by a demixing matrix is higher. The estimation unit is configured to estimate a value of the auxiliary variable of the target section based on the estimated auxiliary variable. The updating unit is configured to update the demixing matrix such that a function value of the approximating auxiliary function is minimized.
    Type: Grant
    Filed: August 15, 2013
    Date of Patent: May 24, 2016
    Assignees: Inter-University Research Institute Corporation, Research Organization of Information and Systems, KABUSHIKI KAISHA TOSHIBA
    Inventors: Toru Taniguchi, Nobutaka Ono
  • Patent number: 9349379
    Abstract: The embodiments of the present invention improve conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.
    Type: Grant
    Filed: November 20, 2013
    Date of Patent: May 24, 2016
    Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
    Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
  • Patent number: 9349368
    Abstract: A computer-implemented method of determining when an audio notification should be generated includes detecting receipt of a triggering event that occurs on a user device; generating, based on detecting, the audio notification for the triggering event; receiving, from the user device, a user voice command responding to the audio notification; and generating a response to the user voice command based on one or more of (i) information associated with the audio notification, and (ii) information associated with the user voice command.
    Type: Grant
    Filed: August 5, 2010
    Date of Patent: May 24, 2016
    Assignee: Google Inc.
    Inventors: Michael J. Lebeau, John Nicholas Jitkoff
  • Patent number: 9343064
    Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a visual demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the visual demeanor into the multimodal application.
    Type: Grant
    Filed: November 26, 2013
    Date of Patent: May 17, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Charles W. Cross, Jr., Hilary A. Pike
  • Patent number: 9336770
    Abstract: Provided is a pattern recognition apparatus for creating multiple systems and combining the multiple systems to improve the recognition performance, including a discriminative training unit for constructing model parameters of a second or subsequent system based on an output tendency of a previously-constructed model so as to be different from the output tendency of the previously-constructed model. Accordingly, when multiple systems are combined, the recognition performance can be improved without trial and error.
    Type: Grant
    Filed: August 13, 2013
    Date of Patent: May 10, 2016
    Assignees: MITSUBISHI ELECTRIC CORPORATION, MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC.
    Inventors: Yuki Tachioka, Shinji Watanabe
  • Patent number: 9336206
    Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations using word based and language based features are described.
    Type: Grant
    Filed: January 21, 2016
    Date of Patent: May 10, 2016
    Assignee: Machine Zone, Inc.
    Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
  • Patent number: 9323745
    Abstract: Disclosed are systems, methods, and computer-readable media for performing translations from a source language to a target language. The method comprises receiving a source phrase, generating a target bag of words based on a global lexical selection of words that loosely couples the source words/phrases and target words/phrases, and reconstructing a target phrase or sentence by considering all permutations of words with a conditional probability greater than a threshold.
    Type: Grant
    Filed: July 21, 2014
    Date of Patent: April 26, 2016
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
  • Patent number: 9305552
    Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.
    Type: Grant
    Filed: September 22, 2014
    Date of Patent: April 5, 2016
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Yeon-Jun Kim, David C. Gibbon, Horst J. Schroeter
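
The anchor-word captioning idea in the abstract of patent 9305552 above can be illustrated with a rough sketch. This is not the patented method: it assumes exact word matches in place of the abstract's similarity threshold, uses linear interpolation to time the words between two anchors, and relies on a hypothetical stop list and the hypothetical function names find_anchors and time_transcript. ASR output is assumed to be a list of (word, start_time_in_seconds) pairs.

```python
from difflib import SequenceMatcher

# Hypothetical stop list: common words that may not serve as anchors.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "in", "is"}


def find_anchors(asr, transcript):
    """Return (asr_index, transcript_index) pairs where the ASR output and
    the transcript agree on a word that is not on the stop list."""
    asr_words = [word.lower() for word, _ in asr]
    ref_words = [word.lower() for word in transcript]
    matcher = SequenceMatcher(a=asr_words, b=ref_words, autojunk=False)
    anchors = []
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            if asr_words[block.a + k] not in STOP_WORDS:
                anchors.append((block.a + k, block.b + k))
    return anchors


def time_transcript(asr, transcript):
    """Give each transcript word between the first and last anchor a start
    time, interpolating linearly between consecutive anchor times."""
    anchors = find_anchors(asr, transcript)
    timed = {}
    for (a0, t0), (a1, t1) in zip(anchors, anchors[1:]):
        start, end = asr[a0][1], asr[a1][1]
        span = t1 - t0  # always positive: matching blocks are ordered
        for j in range(t0, t1):
            timed[j] = start + (end - start) * (j - t0) / span
    if anchors:
        last_asr, last_ref = anchors[-1]
        timed[last_ref] = asr[last_asr][1]
    return [(transcript[j], round(timed[j], 2)) for j in sorted(timed)]


if __name__ == "__main__":
    # Illustrative data only: "meating" stands in for an ASR error.
    asr_output = [("welcome", 0.0), ("to", 0.4), ("the", 0.6),
                  ("annual", 0.8), ("meating", 1.3), ("today", 1.9)]
    human_transcript = ["Welcome", "to", "the", "annual", "meeting", "today"]
    for word, start in time_transcript(asr_output, human_transcript):
        print(f"{start:5.2f}  {word}")
```

In this sketch the misrecognized word gets a time from its neighboring anchors while the caption text comes from the human transcript. Words before the first anchor or after the last one are left untimed.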