Patents Examined by Shaun Roberts

Methods and systems for navigating through multimedia content

Patent number: 9484032

Abstract: The disclosed embodiments illustrate methods and systems for processing multimedia content. The method includes extracting one or more words from an audio stream associated with multimedia content. Each word has associated one or more timestamps indicative of temporal occurrences of said word in said multimedia content. The method further includes creating a word cloud of said one or more words in said multimedia content based on a measure of emphasis laid on each word in said multimedia content and said one or more timestamps associated with said one or more words. The method further includes presenting one or more multimedia snippets, of said multimedia content, associated with a word selected by a user from said word cloud. Each of said one or more multimedia snippets corresponds to said one or more timestamps associated with occurrences of said word in said multimedia content.

Type: Grant

Filed: October 27, 2014

Date of Patent: November 1, 2016

Assignee: Xerox Corporation

Inventors: Kuldeep Yadav, Kundan Shrivastava, Om D Deshmukh
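
Illustrative sketch (not taken from the patent): the abstract describes words that carry timestamps, a word cloud weighted by emphasis, and snippets returned for a selected word. The Python below shows one way that flow could look. It assumes each occurrence already has a numeric emphasis score; how emphasis is measured is not stated in the abstract. The function names and the `pad` window are hypothetical.

```python
from collections import defaultdict


def build_word_index(words):
    """Group (word, timestamp_seconds, emphasis) tuples by lowercased word."""
    index = defaultdict(lambda: {"timestamps": [], "emphasis": 0.0})
    for word, ts, emphasis in words:
        entry = index[word.lower()]
        entry["timestamps"].append(ts)
        entry["emphasis"] += emphasis  # assumed: emphasis accumulates per occurrence
    return dict(index)


def cloud_weights(index):
    """Scale accumulated emphasis to [0, 1]; a renderer could map this to font size."""
    if not index:
        return {}
    top = max(entry["emphasis"] for entry in index.values())
    return {w: (e["emphasis"] / top if top else 0.0) for w, e in index.items()}


def snippets_for(index, word, pad=2.0):
    """Return (start, end) windows, in seconds, around each occurrence of the word."""
    stamps = sorted(index[word.lower()]["timestamps"])
    return [(max(0.0, ts - pad), ts + pad) for ts in stamps]
```

Selecting a word in the cloud would then call `snippets_for` and play back each returned window.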
Voice retrieval device and voice retrieval method for detecting retrieval word from voice data

Patent number: 9466291

Abstract: A voice retrieval device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: setting detection criteria for a retrieval word, based on a characteristic of the retrieval word, such that the higher the detection accuracy of the retrieval word or the lower the pronunciation difficulty of the retrieval word or the lower the appearance probability of the retrieval word, the stricter the detection criteria; performing first voice retrieval processing on voice data according to the detection criteria and detecting a section that possibly includes the retrieval word as a candidate section from the voice data; and performing second voice retrieval processing different from the first voice retrieval processing on each candidate section and determining whether or not the retrieval word is included in each candidate section.

Type: Grant

Filed: October 16, 2014

Date of Patent: October 11, 2016

Assignee: FUJITSU LIMITED

Inventors: Masakiyo Tanaka, Hitoshi Iwamida, Nobuyuki Washio
Systems and methods for determining translation accuracy in multi-user multi-lingual communications

Patent number: 9448996

Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.

Type: Grant

Filed: April 14, 2016

Date of Patent: September 20, 2016

Assignee: Machine Zone, Inc.

Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
Perspective data management for common features of multiple items

Patent number: 9442918

Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective data with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.

Type: Grant

Filed: March 24, 2015

Date of Patent: September 13, 2016

Assignee: International Business Machines Corporation

Inventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
Subband block based harmonic transposition

Patent number: 9431025

Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank (101) configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit (102) configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.

Type: Grant

Filed: October 13, 2014

Date of Patent: August 30, 2016

Assignee: Dolby International AB

Inventor: Lars Villemoes
Method and apparatus for polyphonic audio signal prediction in coding and networking systems

Patent number: 9406307

Abstract: A method, device, and apparatus provide the ability to predict a portion of a polyphonic audio signal for compression and networking applications. The solution involves a framework of a cascade of long term prediction filters, which by design is tailored to account for all periodic components present in a polyphonic signal. This framework is complemented with a design method to optimize the system parameters. Specialization may include specific techniques for coding and networking scenarios, where the potential of each enhanced prediction is realized to considerably improve the overall system performance for that application. One specific technique provides enhanced inter-frame prediction for the compression of polyphonic audio signals, particularly at low delay. Another specific technique provides improved frame loss concealment capabilities to combat packet loss in audio communications.

Type: Grant

Filed: August 19, 2013

Date of Patent: August 2, 2016

Assignee: The Regents of the University of California

Inventors: Kenneth Rose, Tejaswi Nanjundaswamy
Perspective data management for common features of multiple items

Patent number: 9400780

Abstract: A computer-implemented method of managing perspective data associated with a common feature in items is disclosed. The method can include identifying a common feature in a first item and a second item, the first item having a set of perspective data and establishing a subset of perspective data associated with the common feature. The method can include associating the subset of perspective data with the second item. The method can include determining a set of relevancy scores for the subset of perspective data associated with the common feature and establishing a set of relevant perspective data from the subset of perspective data. The set of relevant perspective data can have relevancy scores outside of a relevancy threshold. The method can include associating the set of relevant perspective data with the second item.

Type: Grant

Filed: October 17, 2014

Date of Patent: July 26, 2016

Assignee: International Business Machines Corporation

Inventors: Adam T. Clark, Jeffrey K. Huebert, Aspen L. Payton, John E. Petri
Recursive adaptive interaction management system

Patent number: 9384446

Abstract: A management system for guiding an agent in a media-specific dialog has a conversion engine for instantiating ongoing dialog as machine-readable text, if the dialog is in voice media, a context analysis engine for determining facts from the text, a rules engine for asserting rules based on fact input, and a presentation engine for presenting information to the agent to guide the agent in the dialog. The context analysis engine passes determined facts to the rules engine, which selects and asserts to the presentation engine rules based on the facts, and the presentation engine provides periodically updated guidance to the agent based on the rules asserted.

Type: Grant

Filed: July 27, 2015

Date of Patent: July 5, 2016

Assignee: GENESYS TELECOMMUNICATIONS LABORATORIES INC.

Inventors: Dave Sneyders, Brian Galvin, S. Michael Perlmutter
Apparatus and method for predicting the pleasantness-unpleasantness index of words using relative emotion similarity

Patent number: 9384189

Abstract: An apparatus and a method for predicting the pleasantness-unpleasantness index of words are disclosed. The disclosed apparatus includes: a computing unit configured to compute an emotion correlation between a word and one or more comparison words, compute emotion correlations between multiple reference words included in a reference word set and the one or more comparison words, compute multiple first absolute emotion similarity values between the word and the multiple reference words, and compute at least one second absolute emotion similarity value between a reference word and another reference word for all of the reference words included in the reference word set; and a prediction unit configured to predict the pleasantness-unpleasantness index of the word by using the multiple first absolute emotion similarity values, the at least one second absolute emotion similarity value, and a preset pleasantness-unpleasantness index of the multiple reference words.

Type: Grant

Filed: October 21, 2014

Date of Patent: July 5, 2016

Assignee: Foundation of Soongsil University—Industry Corporation

Inventors: Soo Won Lee, Kang Bok Lee
Response completion in social media

Patent number: 9380009

Abstract: Embodiments are directed towards providing word-by-word message completion for an incomplete response message, wherein the response message is composed in response to a received stimulus message. The message completion is based on a Response Completion Model (RCM) that may model both the language used in the incomplete response message and the contextual information in the received stimulus message. The RCM may be determined based on conversational stimulus-response data including stimulus-response message pairs. The RCM may be a mixture model and include a generic response language model based on an N-gram model, a Stimulus Model based on a Selection Model or a Topic Model, and a mixture parameter. In some embodiments, at least one candidate next word for the incomplete response message is determined based on the RCM. The at least one candidate next word may be selected and included in the incomplete response message. A complete response message may be generated and provided to a user.

Type: Grant

Filed: July 12, 2012

Date of Patent: June 28, 2016

Assignee: Yahoo! Inc.

Inventors: Sujith Ravi, Bo Pang
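
Illustrative sketch (not taken from the patent): the abstract describes a mixture of a generic response language model and a stimulus model, combined through a mixture parameter. A minimal Python reading of that idea, assuming both models have already been reduced to next-word probability tables, is a linear interpolation. The function name, the `mix` default, and the tie-breaking rule are hypothetical.

```python
def rcm_next_words(lm_probs, stimulus_probs, mix=0.7, top_k=3):
    """Rank candidate next words by mix * P_lm(w) + (1 - mix) * P_stimulus(w).

    lm_probs:       next-word probabilities from the response language model
    stimulus_probs: next-word probabilities conditioned on the stimulus message
    mix:            assumed mixture parameter in [0, 1]
    """
    vocab = set(lm_probs) | set(stimulus_probs)
    scores = {
        w: mix * lm_probs.get(w, 0.0) + (1.0 - mix) * stimulus_probs.get(w, 0.0)
        for w in vocab
    }
    # Highest score first; ties broken alphabetically so the output is stable.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
```

With `mix=1.0` the ranking falls back to the language model alone; lower values let words suggested by the stimulus message move up.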
Customized speech generation

Patent number: 9363104

Abstract: Various approaches enable automatic communication generation based on patterned behavior in a particular context. For example, a computing device can monitor behavior of a user to determine patterns of communication behavior in certain situations. In response to detecting multiple occurrences of the certain situation, a computing device can prompt a user to perform an action corresponding to the pattern of behavior. In some embodiments, a set of speech models corresponding to a type of contact is generated. The speech models include language consistent with patterns of speech between a user and the type of contact. Based on context and on the contact, a message using language consistent with past communications between the user and contact is generated from a speech model associated with the type of contact.

Type: Grant

Filed: April 21, 2014

Date of Patent: June 7, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Isaac Scott Noble, Gabrielle M. Halberg, Kenneth Mark Karakotsios, Yuzo Watanabe
Detection of an audio signal transient using first and second maximum norms

Patent number: 9361893

Abstract: Provided are, among other things, systems, methods and techniques for detecting whether a transient exists within an audio signal. According to one representative embodiment, a segment of a digital audio signal is divided into blocks, and a norm value is calculated for each of a number of the blocks, resulting in a set of norm values for such blocks, each such norm value representing a measure of signal strength within a corresponding block. A maximum norm value is then identified across such blocks, and a test criterion is applied to the norm values. If the test criterion is not satisfied, a first signal indicating that the segment does not include any transient is output, and if the test criterion is satisfied, a second signal indicating that the segment includes a transient is output. According to this embodiment, the test criterion involves a comparison of the maximum norm value to a different second maximum norm value, subject to a specified constraint, within the segment.

Type: Grant

Filed: July 5, 2014

Date of Patent: June 7, 2016

Assignee: Digital Rise Technology Co., Ltd.

Inventor: Yuli You
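
Illustrative sketch (not taken from the patent): the abstract describes dividing a segment into blocks, computing a norm per block, and comparing the maximum norm with a second maximum found under a constraint. The Python below is one possible reading. The norm (sum of absolute values), the constraint (the second maximum is taken only over blocks before the peak), and the `ratio` threshold are all assumptions; the abstract does not specify them.

```python
def block_norms(segment, block_size):
    """Sum of absolute sample values for each full block (an assumed norm)."""
    last_start = len(segment) - block_size
    return [
        sum(abs(x) for x in segment[i:i + block_size])
        for i in range(0, last_start + 1, block_size)
    ]


def has_transient(segment, block_size=4, ratio=4.0):
    """Return True if the peak block norm exceeds `ratio` times the second maximum."""
    norms = block_norms(segment, block_size)
    if len(norms) < 2:
        return False
    peak_idx = max(range(len(norms)), key=norms.__getitem__)
    if peak_idx == 0:
        return False  # assumed constraint leaves no earlier block to compare against
    # Assumed constraint: second maximum is searched only before the peak block.
    second_max = max(norms[:peak_idx])
    return norms[peak_idx] > ratio * second_max
```

A quiet passage followed by a burst yields one block whose norm dwarfs the earlier ones, so the test passes; a steady signal has near-equal norms and does not.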
Apparatus, method, and computer program product for separating time series signals

Patent number: 9349375

Abstract: According to an embodiment, a signal processing apparatus includes an estimation unit and an updating unit. The estimation unit is configured to estimate an auxiliary variable of a target section including first and second sections of input signals by using an approximating auxiliary function for approximating an auxiliary function having an auxiliary variable as an argument. The auxiliary function is determined according to an objective function that outputs a function value that is smaller as a statistical independence of separated signals into which input signals in time-series are separated by a demixing matrix is higher. The estimation unit is configured to estimate a value of the auxiliary variable of the target section based on the estimated auxiliary variable. The updating unit is configured to update the demixing matrix such that a function value of the approximating auxiliary function is minimized.

Type: Grant

Filed: August 15, 2013

Date of Patent: May 24, 2016

Assignees: Inter-University Research Institute Corporation, Research Organization of Information and Systems, KABUSHIKI KAISHA TOSHIBA

Inventors: Toru Taniguchi, Nobutaka Ono
Method and a decoder for attenuation of signal regions reconstructed with low accuracy

Patent number: 9349379

Abstract: The embodiments of the present invention improve conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.

Type: Grant

Filed: November 20, 2013

Date of Patent: May 24, 2016

Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
Generating an audio notification based on detection of a triggering event

Patent number: 9349368

Abstract: A computer-implemented method of determining when an audio notification should be generated includes detecting receipt of a triggering event that occurs on a user device; generating, based on detecting, the audio notification for the triggering event; receiving, from the user device, a user voice command responding to the audio notification; and generating a response to the user voice command based on one or more of (i) information associated with the audio notification, and (ii) information associated with the user voice command.

Type: Grant

Filed: August 5, 2010

Date of Patent: May 24, 2016

Assignee: Google Inc.

Inventors: Michael J. Lebeau, John Nicholas Jitkoff
Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction

Patent number: 9343064

Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a visual demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the visual demeanor into the multimodal application.

Type: Grant

Filed: November 26, 2013

Date of Patent: May 17, 2016

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike
Pattern recognition apparatus for creating multiple systems and combining the multiple systems to improve recognition performance and pattern recognition method

Patent number: 9336770

Abstract: Provided is a pattern recognition apparatus for creating multiple systems and combining the multiple systems to improve the recognition performance, including a discriminative training unit for constructing model parameters of a second or subsequent system based on an output tendency of a previously-constructed model so as to be different from the output tendency of the previously-constructed model. Accordingly, when multiple systems are combined, the recognition performance can be improved without trial and error.

Type: Grant

Filed: August 13, 2013

Date of Patent: May 10, 2016

Assignees: MITSUBISHI ELECTRIC CORPORATION, MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC.

Inventors: Yuki Tachioka, Shinji Watanabe
Systems and methods for determining translation accuracy in multi-user multi-lingual communications

Patent number: 9336206

Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations using word based and language based features are described.

Type: Grant

Filed: January 21, 2016

Date of Patent: May 10, 2016

Assignee: Machine Zone, Inc.

Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
Machine translation using global lexical selection and sentence reconstruction

Patent number: 9323745

Abstract: Disclosed are systems, methods, and computer-readable media for performing translations from a source language to a target language. The method comprises receiving a source phrase, generating a target bag of words based on a global lexical selection of words that loosely couples the source words/phrases and target words/phrases, and reconstructing a target phrase or sentence by considering all permutations of words with a conditional probability greater than a threshold.

Type: Grant

Filed: July 21, 2014

Date of Patent: April 26, 2016

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

Patent number: 9305552

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.

Type: Grant

Filed: September 22, 2014

Date of Patent: April 5, 2016

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Yeon-Jun Kim, David C. Gibbon, Horst J. Schroeter
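
Illustrative sketch (not taken from the patent): the abstract describes choosing anchor words where the ASR output and the transcription agree, excluding stop-list words. The Python below finds candidate anchors with the standard library's `difflib.SequenceMatcher`. Exact word matching stands in here for the similarity threshold mentioned in the abstract, and the stop list and function name are hypothetical.

```python
from difflib import SequenceMatcher

# Illustrative stop list; the abstract does not enumerate one.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


def anchor_candidates(asr_words, transcript_words):
    """Return (asr_index, transcript_index, word) for matching non-stop words."""
    a = [w.lower() for w in asr_words]
    b = [w.lower() for w in transcript_words]
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    anchors = []
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            word = a[block.a + k]
            if word not in STOP_WORDS:
                anchors.append((block.a + k, block.b + k, word))
    return anchors
```

Consecutive anchors bound the spans in which the transcription would then be aligned to the ASR output to produce captions.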
