General Speech Analysis Without Concrete Application (epo) Patents (Class 704/E11.002)
-
Patent number: 11954149Abstract: Techniques of content unification are disclosed. In some example embodiments, a computer-implemented method comprises: determining clusters based a comparison of a plurality of audio content using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio content from the plurality of audio content; for each cluster of the plurality of clusters, determining a representative audio content for the cluster from the at least two audio content of the cluster; loading the corresponding representative audio content of each cluster into an index; matching the query audio content to one of the representative audio contents using a first matching criteria; determining the corresponding cluster of the matched representative audio content; and identifying a match between the query audio content and at least one of the audio content of the cluster of the matched representative audio content based on a comparison using a second matching criteria.Type: GrantFiled: October 27, 2022Date of Patent: April 9, 2024Assignee: Gracenote, Inc.Inventors: Peter C. DiMaria, Markus K. Cremer, Barnabas Mink, Tanji Koshio, Kei Tsuji
-
Patent number: 11900945Abstract: An information processing method, a system, an apparatus, an electronic device and a storage medium, where the method is applied to a client, and includes: receiving a transcript and a sentence identifier of the transcript sent by a service server; reading a local sentence identifier, and when the received sentence identifier is the same as the local sentence identifier, updating a displayed caption content corresponding to the local sentence identifier with the transcript. When the received sentence identifier of the client is the same as the local sentence identifier, the displayed caption content is replaced with the received transcript.Type: GrantFiled: March 21, 2022Date of Patent: February 13, 2024Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.Inventors: Li Zhao, Xiao Han, Kojung Chen, Jian Tong
-
Patent number: 11810546Abstract: Provided are a sample generation method and apparatus. The sample generation method comprises: acquiring a plurality of text-audio pairs, wherein each text-audio pair contains a text segment and an audio segment; calculating an audio feature of an audio segment of each of the plurality of text-audio pairs, and selecting, by means of screening and according to the audio feature, a target text-audio pair and a splicing text-audio pair corresponding to the target text-audio pair from among the plurality of text-audio pairs; splicing the target text-audio pair and the splicing text-audio pair into a text-audio pair to be tested, and testing the text-audio pair to be tested; and when the text-audio pair to be tested meets a preset test condition, writing the text-audio pair to be tested into a training database.Type: GrantFiled: November 12, 2021Date of Patent: November 7, 2023Assignee: Beijing Yuanli Weilai Science and Technology Co., Ltd.Inventors: Dongxiao Wang, Mingqi Yang, Nan Ma, Long Xia, Changzhen Guo
-
Patent number: 11775608Abstract: A system and method model a time series from missing data by imputing missing values, denoising measured but noisy values, and forecasting future values of a single time series. A time series of potentially noisy, partially-measured values of a physical process is represented as a non-overlapping matrix. For several classes of common model functions, it can be proved that the resulting matrix has a low rank or approximately low rank, allowing a matrix estimation technique, for example singular value thresholding, to be efficiently applied. Applying such a technique produces a mean matrix that estimates latent values, of the physical process at times or intervals corresponding to measurements, with less error than previously known methods. These latent values have been denoised (if noisy) and imputed (if missing). Linear regression of the estimated latent values permits forecasting with an error that decreases as more measurements are made.Type: GrantFiled: July 7, 2022Date of Patent: October 3, 2023Assignee: Massachusetts Institute of TechnologyInventors: Devavrat D. Shah, Anish Agarwal, Muhammad Amjad, Dennis Shen
-
Patent number: 11682385Abstract: A method for training hotword detection includes receiving a training input audio sequence including a sequence of input frames that define a hotword that initiates a wake-up process on a device. The method also includes feeding the training input audio sequence into an encoder and a decoder of a memorized neural network. Each of the encoder and the decoder of the memorized neural network include sequentially-stacked single value decomposition filter (SVDF) layers. The method further includes generating a logit at each of the encoder and the decoder based on the training input audio sequence. For each of the encoder and the decoder, the method includes smoothing each respective logit generated from the training input audio sequence, determining a max pooling loss from a probability distribution based on each respective logit, and optimizing the encoder and the decoder based on all max pooling losses associated with the training input audio sequence.Type: GrantFiled: June 15, 2021Date of Patent: June 20, 2023Assignee: Google LLCInventors: Raziel Alvarez Guevara, Hyun Jin Park, Patrick Violette
-
Patent number: 11568872Abstract: An information processing method, a system, an apparatus, an electronic device and a storage medium, where the method is applied to a client, and includes: receiving a transcript and a sentence identifier of the transcript sent by a service server; reading a local sentence identifier, and when the received sentence identifier is the same as the local sentence identifier, updating a displayed caption content corresponding to the local sentence identifier with the transcript. When the received sentence identifier of the client is the same as the local sentence identifier, the displayed caption content is replaced with the received transcript.Type: GrantFiled: March 21, 2022Date of Patent: January 31, 2023Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.Inventors: Li Zhao, Xiao Han, Kojung Chen, Jian Tong
-
Patent number: 11487814Abstract: Techniques of content unification are disclosed. In some example embodiments, a computer-implemented method comprises: determining clusters based a comparison of a plurality of audio content using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio content from the plurality of audio content; for each cluster of the plurality of clusters, determining a representative audio content for the cluster from the at least two audio content of the cluster; loading the corresponding representative audio content of each cluster into an index; matching the query audio content to one of the representative audio contents using a first matching criteria; determining the corresponding cluster of the matched representative audio content; and identifying a match between the query audio content and at least one of the audio content of the cluster of the matched representative audio content based on a comparison using a second matching criteria.Type: GrantFiled: January 4, 2021Date of Patent: November 1, 2022Assignee: Gracenote, Inc.Inventors: Peter C. DiMaria, Markus K. Cremer, Barnabas Mink, Tanji Koshio, Kei Tsuji
-
Patent number: 11340602Abstract: A method includes converting time-series data from a plurality of prognostic and health monitoring (PHM) sensors into frequency domain data. One or more portions of the frequency domain data are labeled as indicative of one or more target modes to form labeled target data. A model including a deep neural network is applied to the labeled target data. A result of applying the model is classified as one or more discretized PHM training indicators associated with the one or more target modes. The one or more discretized PHM training indicators are output.Type: GrantFiled: December 18, 2015Date of Patent: May 24, 2022Assignee: RAYTHEON TECHNOLOGIES CORPORATIONInventors: Michael J. Giering, Madhusudana Shashanka, Soumik Sarkar, Vivek Venugopalan
-
Patent number: 10726852Abstract: Methods and apparatus to perform windowed sliding transforms are disclosed. An example apparatus includes a transformer to transform a first block of time-domain samples of an input signal into a first frequency-domain representation based on a second frequency-domain representation of a second block of time-domain samples of the input signal, and a windower to apply a third frequency-domain representation of a time-domain window function to the first frequency-domain representation.Type: GrantFiled: February 19, 2018Date of Patent: July 28, 2020Assignee: The Nielsen Company (US), LLCInventor: Zafar Rafii
-
Patent number: 10692507Abstract: Methods and apparatus to perform windowed sliding transforms are disclosed. An example apparatus includes a transformer to transform a first block of time-domain samples of an input signal into a first frequency-domain representation based on a second frequency-domain representation of a second block of time-domain samples of the input signal, and a windower to apply a third frequency-domain representation of a time-domain window function to the first frequency-domain representation.Type: GrantFiled: February 19, 2018Date of Patent: June 23, 2020Assignee: The Nielsen Company (US), LLCInventor: Zafar Rafii
-
Patent number: 10571390Abstract: A method of detecting local material changes in a composite structure is presented. A pulsed laser beam is directed towards the composite structure comprised of a number of composite materials. Wide-band ultrasonic signals are formed in the composite structure when radiation of the pulsed laser beam is absorbed by the composite structure. The wide-band ultrasonic signals are detected to form data. The data is processed to identify a local frequency value for the composite structure. The local frequency value is used to determine if local material changes are present in the number of composite materials.Type: GrantFiled: March 15, 2016Date of Patent: February 25, 2020Assignee: The Boeing CompanyInventors: William P. Motzer, Gary Ernest Georgeson, Jill Paisley Bingham, Steven Kenneth Brady, Alan F. Stewart, James C. Kennedy, Ivan Pelivanov, Matthew O'Donnell, Jeffrey Reyner Kollgaard
-
Publication number: 20130030813Abstract: Methods and arrangements for improving quality of content in voice applications. A specification is provided for acceptable content for a voice application, and user generated audio content for the voice application is inputted. At least one test is applied to the user generated audio content, and it is thereupon determined as to whether the user generated audio content meets the provided specification.Type: ApplicationFiled: August 29, 2012Publication date: January 31, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Nitendra Rajput, Kundan Shrivastava
-
Publication number: 20120095753Abstract: A noise power estimation system for estimating noise power of each frequency spectral component includes a cumulative histogram generating section for generating a cumulative histogram for each frequency spectral component of a time series signal, in which the horizontal axis indicates index of power level and the vertical axis indicates cumulative frequency and which is weighted by exponential moving average; and a noise power estimation section for determining an estimated value of noise power for each frequency spectral component of the time series signal based on the cumulative histogram.Type: ApplicationFiled: September 14, 2011Publication date: April 19, 2012Applicant: HONDA MOTOR CO., LTD.Inventors: Hirofumi NAKAJIMA, Kazuhiro NAKADAI, Yuji HASEGAWA
-
Publication number: 20110264446Abstract: A method, a system, and a media gateway (MG) for reporting media instance information are disclosed. The method for reporting media instance information includes: detecting, by an MG, received media data according to a set media instance detection (MID) event; and reporting, by the MG, the MID event when the media instance information is detected. With the present invention, the MG reports the detected media instance information related to the media data to a media gateway controller (MGC) through a set MID event, so that the MG can detect media instance information related to the media data, and report the detected media instance information related to the media data to the MGC. In this way, the MGC can execute corresponding control operations according to the media instance information related to the media data, extending the applicable scope of media services.Type: ApplicationFiled: July 7, 2011Publication date: October 27, 2011Inventor: Weiwei YANG
-
Publication number: 20100191523Abstract: A method and an apparatus for recovering a line spectrum pair (LSP) parameter of a spectrum region when frame loss occurs during speech decoding and a speech decoding apparatus adopting the same are provided. The method of recovering an LSP parameter in speech decoding includes: if it is determined that a received speech packet has an erased frame, converting an LSP parameter of a previous good frame (PGF) of the erased frame or LSP parameters of the PGF and a next good frame (NGF) of the erased frame into a spectrum region and obtaining a spectrum envelope of the PGF or spectrum envelopes of the PGF and NGF; recovering a spectrum envelope of the erased frame using the spectrum envelope of the PGF or the spectrum envelopes of the PGF and NGF; and converting the recovered spectrum envelope of the erased frame into an LSP parameter of the erased frame.Type: ApplicationFiled: March 25, 2010Publication date: July 29, 2010Applicant: SAMSUNG ELECTRONIC CO., LTD.Inventors: Hosang Sung, Seungho Choi, Kihyun Choo
-
Publication number: 20100145690Abstract: A sound signal generating method includes: generating, using a computer, a plurality of unit waveform signals by dividing the original sound signal having a periodic length of repeating similar waveforms by the length of the waveform; generating, using a computer, a repetitive waveform signal for each of the generated unit waveform signals by repeating the waveform of the unit waveform signal a given number of times; and generating, using a computer, an outputsound signal by shifting each of the repetitive waveform signals in each length with a sequence in which the unit waveform signals form the original sound signal and then superimposing on one another.Type: ApplicationFiled: February 10, 2010Publication date: June 10, 2010Applicant: FUJITSU LIMITEDInventor: Kazuhiro Watanabe
-
Publication number: 20100094635Abstract: SYSTEM FOR VOICE-BASE INTERACTION ON WEB PAGES, of type that permits the incorporation of voice-handling functions on a Web page, in which from a Terminal (1) a Web page (3) of a Web site that is structured under the DOM (Domain Object Model), or any of its extensions, and a networked Voice Service Server (5), by means of a downloadable module (6) for further incorporation in a Web browser, the system including the operating procedures for enabling said module to act as a transparent gateway in a dialogue between said Voice Service Server (5) and said Web page (3), said Web browser permitting to handle said Voice Services of said Server (5) through script functions incorporated in said Web page (3).Type: ApplicationFiled: November 30, 2007Publication date: April 15, 2010Inventor: Juan Jose Bermudez Perez
-
Publication number: 20090099843Abstract: A method for determining a speech quality measure of an output speech signal with respect to an input speech signal, wherein the input signal passes through a signal path of a data transmission system resulting in the output signal, includes the steps of pre-processing the output signal; determining at least one of an interruption rate of the pre-processed output signal and a measure for an intensity of musical tones present in the pre-processed output signal; and determining the speech quality measure from at least one of the interruption rate and the measure for the intensity of the musical tones.Type: ApplicationFiled: September 11, 2008Publication date: April 16, 2009Applicants: DEUTSCHE TELEKOM AG, FRANCE TELECOM, TECHNISCHE UNIVERSITAET BERLINInventors: Vincent Barriac, Nicolas Cote, Valerie Gautier-Turbin, Sebastian Moeller, Alexander Raake, Marcel Waeltermann, Ulrich Heute, Kirstin Scholz
-
Publication number: 20080228491Abstract: The technical disclosure of my invention is the pre-recorded sayings inside a sound module housed in a collectable plush bear, and also the fact that both pre-recorded and recordable sound modules will be used with specific wording that will be used in the pre-recorded left hand of the plush bear. The pre-recorded sayings are as follows: I can do all things through Christ who strengthens me. When you believe you can achieve. Everything I touch turns to gold. What you dream about you bring about.Type: ApplicationFiled: March 12, 2007Publication date: September 18, 2008Inventor: Nancy C. Hale
-
Publication number: 20080215325Abstract: An apparatus, method and program for dividing a conversational dialog into utterance. The apparatus includes a computer processor; a word database for storing spellings and pronunciations of words; a grammar database for storing syntactic rules on words; a pause detecting section which detects a pause location in a channel making a main speech among conversational dialogs inputted in at least two channels; an acknowledgement detecting section which detects an acknowledgement location in a channel not making the main speech; a boundary-candidate extracting section which extracts boundary candidates in the main speech, by extracting pauses existing within a predetermined range before and after a base point that is the acknowledgement location; and a recognizing unit which outputs a word string of the main speech segmented by one of the extracted boundary candidates after dividing the segmented speech into optimal utterance in reference to the word database and grammar database.Type: ApplicationFiled: December 27, 2007Publication date: September 4, 2008Inventors: Hiroshi Horii, Hideki Tai, Gaku Yamamoto
-
Publication number: 20080162146Abstract: A method and device are provided for classifying at least two languages in an automatic dialogue system, which processes digitized speech input. At least one speech recognition method and at least one language identification method are used on the digitized speech input in order, by logical evaluation of the results of the method, to identify the language of the speech input.Type: ApplicationFiled: December 3, 2007Publication date: July 3, 2008Applicant: Deutsche Telekom AGInventors: Martin Eckert, Roman Englert, Wiebke Johannsen, Fred Runge, Markus Van Ballegooy
-
Publication number: 20080120115Abstract: In one embodiment, the methods and apparatuses detect an original audio signal;detect a sound model wherein the sound model includes a sound parameter; transform the original audio signal based on the parameter whereby forming a transformed audio signal; and compare the transformed audio signal with the original audio signal.Type: ApplicationFiled: November 16, 2006Publication date: May 22, 2008Inventor: Xiao Dong Mao