Patents Examined by Shaun Roberts
-
Patent number: 9484032Abstract: The disclosed embodiments illustrate methods and systems for processing multimedia content. The method includes extracting one or more words from an audio stream associated with multimedia content. Each word has associated one or more timestamps indicative of temporal occurrences of said word in said multimedia content. The method further includes creating a word cloud of said one or more words in said multimedia content based on a measure of emphasis laid on each word in said multimedia content and said one or more timestamps associated with said one or more words. The method further includes presenting one or more multimedia snippets, of said multimedia content, associated with a word selected by a user from said word cloud. Each of said one or more multimedia snippets corresponds to said one or more timestamps associated with occurrences of said word in said multimedia content.Type: GrantFiled: October 27, 2014Date of Patent: November 1, 2016Assignee: Xerox CorporationInventors: Kuldeep Yadav, Kundan Shrivastava, Om D Deshmukh
-
Patent number: 9466291Abstract: A voice retrieval device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: setting detection criteria for a retrieval word, based on a characteristic of the retrieval word, such that the higher the detection accuracy of the retrieval word or the lower the pronunciation difficulty of the retrieval word or the lower the appearance probability of the retrieval word, the stricter the detection criteria; performing first voice retrieval processing on voice data according to the detection criteria and detecting a section that possibly includes the retrieval word as a candidate section from the voice data; and performing second voice retrieval processing different from the first voice retrieval processing on each candidate section and determining whether or not the retrieval word is included in each candidate section.Type: GrantFiled: October 16, 2014Date of Patent: October 11, 2016Assignee: FUJITSU LIMITEDInventors: Masakiyo Tanaka, Hitoshi Iwamida, Nobuyuki Washio
-
Patent number: 9448996Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.Type: GrantFiled: April 14, 2016Date of Patent: September 20, 2016Assignee: Machine Zone, Inc.Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
-
Patent number: 9442918Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.Type: GrantFiled: March 24, 2015Date of Patent: September 13, 2016Assignee: International Business Machines CorporationInventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
-
Patent number: 9431025Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion add brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank (101) configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit (102) configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.Type: GrantFiled: October 13, 2014Date of Patent: August 30, 2016Assignee: Dolby International ABInventor: Lars Villemoes
-
Patent number: 9406307Abstract: A method, device, and apparatus provide the ability to predict a portion of a polyphonic audio signal for compression and networking applications. The solution involves a framework of a cascade of long term prediction filters, which by design is tailored to account for all periodic components present in a polyphonic signal. This framework is complemented with a design method to optimize the system parameters. Specialization may include specific techniques for coding and networking scenarios, where the potential of each enhanced prediction is realized to considerably improve the overall system performance for that application. One specific technique provides enhanced inter-frame prediction for the compression of polyphonic audio signals, particularly at low delay. Another specific technique provides improved frame loss concealment capabilities to combat packet loss in audio communications.Type: GrantFiled: August 19, 2013Date of Patent: August 2, 2016Assignee: The Regents of the University of CaliforniaInventors: Kenneth Rose, Tejaswi Nanjundaswamy
-
Patent number: 9400780Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.Type: GrantFiled: October 17, 2014Date of Patent: July 26, 2016Assignee: International Business Machines CorporationInventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
-
Patent number: 9384446Abstract: A management system for guiding an agent in a media-specific dialog has a conversion engine for instantiating ongoing dialog as machine-readable text, if the dialog is in voice media, a context analysis engine for determining facts from the text, a rules engine for asserting rules based on fact input, and a presentation engine for presenting information to the agent to guide the agent in the dialog. The context analysis engine passes determined facts to the rules engine, which selects and asserts to the presentation engine rules based on the facts, and the presentation engine provides periodically updated guidance to the agent based on the rules asserted.Type: GrantFiled: July 27, 2015Date of Patent: July 5, 2016Assignee: GENESYS TELECOMMUNICATIONS LABORATORIES INC.Inventors: Dave Sneyders, Brian Galvin, S. Michael Perlmutter
-
Patent number: 9384189Abstract: An apparatus and a method for predicting the pleasantness-unpleasantness index of words are disclosed. The disclosed apparatus includes: a computing unit configured to compute an emotion correlation between a word and one or more comparison word, compute emotion correlations between multiple reference words included in a reference word set and the one or more comparison word, compute multiple first absolute emotion similarity values between the word and the multiple reference words, and compute at least one second absolute emotion similarity value between a reference word and another reference word for all of the reference words included in the reference word set; and a prediction unit configured to predict the pleasantness-unpleasantness index of the word by using the multiple number of first absolute emotion similarity values, the at least one second absolute emotion similarity value, and a preset pleasantness-unpleasantness index of the multiple number of reference words.Type: GrantFiled: October 21, 2014Date of Patent: July 5, 2016Assignee: Foundation of Soongsil University—Industry CorporationInventors: Soo Won Lee, Kang Bok Lee
-
Patent number: 9380009Abstract: Embodiments are directed towards providing word-by-word message completion for an incomplete response message, wherein the response message is composed in response to a received stimulus message. The message completion is based on a Response Completion Model (RCM) that may model both the language used in the incomplete response message and the contextual information in the received stimulus message. The RCM may be determined based on conversational stimulus-response data including stimulus-response message pairs. The RCM may be a mixture model and include a generic response language model based on an N-gram model, a Stimulus Model based on a Selection Model or a Topic. Model, and a mixture parameter. In some embodiments, at least one candidate next word for the incomplete response message is determined based on the RCM. The at least one candidate next word may be selected and included in the incomplete response message. A complete response message may be generated and provided to a user.Type: GrantFiled: July 12, 2012Date of Patent: June 28, 2016Assignee: Yahoo! Inc.Inventors: Sujith Ravi, Bo Pang
-
Patent number: 9363104Abstract: Various approaches enable automatic communication generation based on patterned behavior in a particular context. For example, a computing device can monitor behavior of a user to determine patterns of communication behavior in certain situations. In response to detecting multiple occurrences of the certain situation, a computing device can prompt a user to perform an action corresponding to the pattern of behavior. In some embodiments, a set of speech models corresponding to a type of contact is generated. The speech models include language consistent with patterns of speech between a user and the type of contact. Based on context and on the contact, a message using language consistent with past communications between the user and contact is generated from a speech model associated with the type of contact.Type: GrantFiled: April 21, 2014Date of Patent: June 7, 2016Assignee: Amazon Technologies, Inc.Inventors: Isaac Scott Noble, Gabrielle M. Halberg, Kenneth Mark Karakotsios, Yuzo Watanabe
-
Patent number: 9361893Abstract: Provided are, among other things, systems, methods and techniques for detecting whether a transient exists within an audio signal. According to one representative embodiment, a segment of a digital audio signal is divided into blocks, and a norm value is calculated for each of a number of the blocks, resulting in a set of norm values for such blocks, each such norm value representing a measure of signal strength within a corresponding block. A maximum norm value is then identified across such blocks, and a test criterion is applied to the norm values. If the test criterion is not satisfied, a first signal indicating that the segment does not include any transient is output, and if the test criterion is satisfied, a second signal indicating that the segment includes a transient is output. According to this embodiment, the test criterion involves a comparison of the maximum norm value to a different second maximum norm value, subject to a specified constraint, within the segment.Type: GrantFiled: July 5, 2014Date of Patent: June 7, 2016Assignee: Digital Rise Technology Co., Ltd.Inventor: Yuli You
-
Patent number: 9349375Abstract: According to an embodiment, a signal processing apparatus includes an estimation unit and an updating unit. The estimation unit is configured to estimate an auxiliary variable of a target section including first and second sections of input signals by using an approximating auxiliary function for approximating an auxiliary function having an auxiliary variable as an argument. The auxiliary function is determined according to an objective function that outputs a function value that is smaller as a statistical independence of separated signals into which input signals in time-series are separated by a demixing matrix is higher. The estimation unit is configured to estimate a value of the auxiliary variable of the target section based on the estimated auxiliary variable. The updating unit is configured to update the demixing matrix such that a function value of the approximating auxiliary function is minimized.Type: GrantFiled: August 15, 2013Date of Patent: May 24, 2016Assignees: Inter-University Research Institute Corporation, Research Organization of Information and Systems, KABUSHIKI KAISHA TOSHIBAInventors: Toru Taniguchi, Nobutaka Ono
-
Patent number: 9349379Abstract: The embodiments of the present invention improves conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.Type: GrantFiled: November 20, 2013Date of Patent: May 24, 2016Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
-
Patent number: 9349368Abstract: A computer-implemented method of determining when an audio notification should be generated includes detecting receipt of a triggering event that occurs on a user device; generating, based on detecting, the audio notification for the triggering event; receiving, from the user device, a user voice command responding to the audio notification; and generating a response to the user voice command based on one or more of (i) information associated with the audio notification, and (ii) information associated with the user voice command.Type: GrantFiled: August 5, 2010Date of Patent: May 24, 2016Assignee: Google Inc.Inventors: Michael J. Lebeau, John Nicholas Jitkoff
-
Patent number: 9343064Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a visual demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the visual demeanor into the multimodal application.Type: GrantFiled: November 26, 2013Date of Patent: May 17, 2016Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Hilary A. Pike
-
Patent number: 9336770Abstract: Provided is a pattern recognition apparatus for creating multiple systems and combining the multiple systems to improve the recognition performance, including a discriminative training unit for constructing model parameters of a second or subsequent system based on an output tendency of a previously-constructed model so as to be different from the output tendency of the previously-constructed model. Accordingly, when multiple systems are combined, the recognition performance can be improved without trials-and-errors.Type: GrantFiled: August 13, 2013Date of Patent: May 10, 2016Assignees: MITSUBISHI ELECTRIC CORPORATION, MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC.Inventors: Yuki Tachioka, Shinji Watanabe
-
Patent number: 9336206Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for accessing the accuracy of translations using word based and language based features are described.Type: GrantFiled: January 21, 2016Date of Patent: May 10, 2016Assignee: Machine Zone, Inc.Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
-
Patent number: 9323745Abstract: Disclosed are systems, methods, and computer-readable media for performing translations from a source language to a target language. The method comprises receiving a source phrase, generating a target bag of words based on a global lexical selection of words that loosely couples the source words/phrases and target words/phrases, and reconstructing a target phrase or sentence by considering all permutations of words with a conditional probability greater than a threshold.Type: GrantFiled: July 21, 2014Date of Patent: April 26, 2016Assignee: AT&T Intellectual Property II, L.P.Inventors: Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
-
Patent number: 9305552Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.Type: GrantFiled: September 22, 2014Date of Patent: April 5, 2016Assignee: AT&T Intellectual Property I, L.P.Inventors: Yeon-Jun Kim, David C. Gibbon, Horst J. Schroeter