Patents Examined by Martin Lerner

Speaker template update with embedding vectors based on distance metric

Patent number: 11017783

Abstract: A device includes a processor configured to determine a feature vector based on an utterance and to determine a first embedding vector by processing the feature vector using a trained embedding network. The processor is configured to determine a first distance metric based on distances between the first embedding vector and each embedding vector of a speaker template. The processor is configured to determine, based on the first distance metric, that the utterance is verified to be from a particular user. The processor is configured to, based on a comparison of a first particular distance metric associated with the first embedding vector to a second distance metric associated with a first test embedding vector of the speaker template, generate an updated speaker template by adding the first embedding vector as a second test embedding vector and removing the first test embedding vector from test embedding vectors of the speaker template.

Type: Grant

Filed: March 8, 2019

Date of Patent: May 25, 2021

Assignee: QUALCOMM Incorporated

Inventors: Sunkuk Moon, Bicheng Jiang, Erik Visser
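
Illustrative note on patent 11017783: the abstract above describes verifying an utterance by its distance to a speaker template and then swapping the new embedding in for an existing template member. The following is a minimal sketch of that idea, not the patented method. The Euclidean distance, the mean-distance metric, the rule for choosing which member to replace, and all function names are assumptions made for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean_distance(vec, others):
    """Assumed distance metric: mean distance from vec to each vector in others."""
    return sum(euclidean(vec, o) for o in others) / len(others)

def verify_and_update(template, embedding, threshold):
    """Return (verified, template), replacing the worst member if the new one fits better.

    template must hold at least two embedding vectors.
    """
    # Verification: the utterance is accepted when it is close enough to the template.
    if mean_distance(embedding, template) > threshold:
        return False, template

    # Find the template member that sits farthest from the remaining members.
    worst_idx, worst_score = 0, -1.0
    for i, vec in enumerate(template):
        rest = template[:i] + template[i + 1:]
        score = mean_distance(vec, rest)
        if score > worst_score:
            worst_idx, worst_score = i, score

    # Compare the new embedding against that member using the same metric.
    rest = template[:worst_idx] + template[worst_idx + 1:]
    if mean_distance(embedding, rest) < worst_score:
        return True, rest + [embedding]  # add the new vector, remove the old one
    return True, template
```

With a template of `[[0, 0], [0, 1], [5, 5]]`, a new embedding `[0, 0.5]`, and a threshold of 4, this sketch verifies the utterance and replaces the outlying `[5, 5]` member.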
Method and device for recognizing text segmentation position

Patent number: 11004448

Abstract: The present disclosure provides a method and a device for recognizing a text segmentation position. The method includes: receiving a continuous voice message inputted by a user, and recognizing the continuous voice message to generate a text message corresponding to the continuous voice message; analyzing the text message to determine an interval position, and sequentially inserting a sentence end and sentence begin sign at each interval position; calculating a segmentation value corresponding to the sentence end and sentence begin sign inserted at a present interval position according to a preset algorithm; and determining whether the segmentation value is greater than a preset threshold, and determining the present interval position as a segmentation position when the segmentation value is greater than the preset threshold.

Type: Grant

Filed: June 20, 2018

Date of Patent: May 11, 2021

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Sheng Qian, Qiang Cheng
Hybrid concealment method: combination of frequency and time domain packet loss concealment in audio codecs

Patent number: 10984804

Abstract: Embodiments of the invention relate to an error concealment unit for providing an error concealment audio information for concealing a loss of an audio frame in an encoded audio information. The error concealment unit provides a first error concealment audio information component for a first frequency range using a frequency domain concealment. The error concealment unit also provides a second error concealment audio information component for a second frequency range, which includes lower frequencies than the first frequency range, using a time domain concealment. The error concealment unit also combines the first error concealment audio information component and the second error concealment audio information component, to obtain the error concealment audio information. Other embodiments of the invention relate to a decoder including the error concealment unit, as well as related encoders, methods, and computer programs for decoding and/or concealing.

Type: Grant

Filed: September 7, 2018

Date of Patent: April 20, 2021

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Jérémie Lecomte, Adrian Tomasek
Coordinated audiovisual montage from selected crowd-sourced content with alignment to audio baseline

Patent number: 10971191

Abstract: A generally diverse set of audiovisual clips is sourced from one or more repositories for use in preparing a coordinated audiovisual work. In some cases, audiovisual clips are retrieved using tags such as user-assigned hashtags or metadata. Pre-existing associations of such tags can be used as hints that certain audiovisual clips are likely to share correspondence with an audio signal encoding of a particular song or other audio baseline. Clips are evaluated for computationally determined correspondence with an audio baseline track. In general, comparisons of audio power spectra, of rhythmic features, tempo, pitch sequences and other extracted audio features may be used to establish correspondence. For clips exhibiting a desired level of correspondence, computationally determined temporal alignments of individual clips with the baseline audio track are used to prepare a coordinated audiovisual work that mixes the selected audiovisual clips with the audio track.

Type: Grant

Filed: June 15, 2015

Date of Patent: April 6, 2021

Inventors: Mark T. Godfrey, Turner Evan Kirk, Ian S. Simon, Nick Kruge
Lyrics analyzer

Patent number: 10957290

Abstract: A lyrics analyzer generates tags and explicitness indicators for a set of tracks. These tags may indicate the genre, mood, occasion, or other features of each track. The lyrics analyzer does so by generating an n-dimensional vector relating to a set of topics extracted from the lyrics and then using those vectors to train a classifier to determine whether each tag applies to each track. The lyrics analyzer may also generate playlists for a user based on a single seed song by comparing the lyrics vector or the lyrics and acoustics vectors of the seed song to other songs to select songs that closely match the seed song. Such a playlist generator may also take into account the tags generated for each track.

Type: Grant

Filed: August 24, 2018

Date of Patent: March 23, 2021

Assignee: Spotify AB

Inventors: Tahora H. Nazer, Tristan Jehan
Proactive assistance based on dialog communication between devices

Patent number: 10942703

Abstract: Systems and processes for proactive assistance based on dialog communication between devices are provided. In one example process, while voice communication between an electronic device and a second electronic device is established, a stream of audio data associated with the second electronic device can be received. In response to detecting a user input, a text representation of speech contained in a portion of the stream of audio data can be generated. The process can determine whether the text representation contains information corresponding to one of a plurality of types of information. In response to determining that the text representation contains information corresponding to one of a plurality of types of information, one or more tasks based on the information can be performed.

Type: Grant

Filed: January 16, 2019

Date of Patent: March 9, 2021

Assignee: Apple Inc.

Inventors: Mathieu Jean Martel, Thomas Deniau
Conversational interface determining lexical personality score for response generation with synonym replacement

Patent number: 10943605

Abstract: The present disclosure involves systems, software, and computer implemented methods for personalizing interactions within a conversational interface based on an input context. One example system performs operations including receiving a conversational input received via a conversational interface. The conversational input is analyzed to determine an intent and lexical personality score based on the input's characteristics. A set of responsive content is determined and includes a set of initial tokens representing an initial response. A set of synonym tokens associated with at least some of the initial tokens are identified, and at least one synonym token associated with a similar lexical personality score to the input is determined. At least one of the initial tokens is replaced with the determined synonym token to generate a modified version of the set of response content. The modified version of the response is then transmitted to a device in response to the input.

Type: Grant

Filed: September 13, 2019

Date of Patent: March 9, 2021

Assignee: The Toronto-Dominion Bank

Inventors: Dean C. N. Tseretopoulos, Robert Alexander McCarter, Sarabjit Singh Walia, Vipul Kishore Lalka, Nadia Moretti, Paige Elyse Dickie, Denny Devasia Kuruvilla, Milos Dunjic, Dino Paul D'Agostino, Arun Victor Jagga, John Jong-Suk Lee, Rakesh Thomas Jethwa
Scoring entity names of devices in a building management system

Patent number: 10936818

Abstract: A controller for classifying devices of a building management system (BMS). The controller may be configured to obtain an entity name for a device, extract a core name from the entity name, compare the core name to candidate core names, determine scores for each comparison, identify a highest score, identify a class of a candidate core name, and classify the device in the class.

Type: Grant

Filed: November 30, 2018

Date of Patent: March 2, 2021

Assignee: Honeywell International Inc.

Inventors: Bose Falk, Jiri Vass, Paul Kleinhans, Jakub Malanik, Cuong Huynh, Patrick Brisbine
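
Illustrative note on patent 10936818: the abstract above lists a pipeline of extracting a core name, scoring it against candidate core names, and classifying the device by the best match. A minimal sketch of that pipeline follows. The candidate table, the separator and digit-stripping rule, and the use of `difflib.SequenceMatcher` as the scoring function are all assumptions for illustration; the abstract does not specify them.

```python
import re
from difflib import SequenceMatcher

# Hypothetical candidate core names mapped to device classes.
CANDIDATES = {
    "supplyairtemp": "temperature sensor",
    "fanstatus": "fan",
    "dampercmd": "damper",
}

def extract_core_name(entity_name):
    """Assumed rule: keep the last path segment, lowercase it, drop non-letters."""
    last_segment = re.split(r"[./:]", entity_name)[-1]
    return re.sub(r"[^a-z]", "", last_segment.lower())

def classify(entity_name, candidates=CANDIDATES):
    """Return (device_class, score) for the best-scoring candidate core name."""
    core = extract_core_name(entity_name)
    scores = {
        name: SequenceMatcher(None, core, name).ratio()
        for name in candidates
    }
    best = max(scores, key=scores.get)  # highest similarity score wins
    return candidates[best], scores[best]
```

For example, `classify("AHU1.SupplyAirTemp_2")` reduces the entity name to the core name `supplyairtemp` and assigns the device to the class of the matching candidate.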
Dialogue system and method to identify service from state and input information

Patent number: 10937420

Abstract: The present disclosure provides a dialogue system and a method for controlling thereof. The dialogue system may include: an input processor configured to authenticate a user and receive new input information and new state information of the user; a storage configured to store existing state information of the user, existing input information, and available services; a controller configured to identify a service based on the new input information and the existing input information, and to identify the service based on the new state information of the user and the existing state information of the user, wherein the service is configured to fit needs of the user; and an output processor configured to determine a service format based on the new input information and the new state information of the user, wherein the service format is regarding ways to provide the service to the user.

Type: Grant

Filed: June 11, 2018

Date of Patent: March 2, 2021

Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION

Inventors: Jung Mi Park, Jimin Han, Jia Lee, Kye Yoon Kim
Determining a category of a request by word vector representation of a natural language text string with a similarity value

Patent number: 10929448

Abstract: A computer-implemented method determines a category of a request provided by a user by means of a user device. The user device includes connection means and means for receiving a request description relating to said request from said user. The method includes receiving, from the user, the request description, by means of the device, and uploading the request description to a server. The server has access to a database which includes a number of previously categorized requests each including a category and a vocabulary, which includes a number of word vector representations. The method further includes identifying, by the server, a number of component words belonging to a natural language text string included in the request description; obtaining, for at least one of the component words, an associated word vector representation from the vocabulary, and determining a request vector, based on at least one obtained word vector representation.

Type: Grant

Filed: August 10, 2018

Date of Patent: February 23, 2021

Assignee: KBC GROEP NV

Inventors: Hans Verstraete, Pieter Van Hertum, Rahul Maheshwari, Jeroen D'Haen, Michaël Mariën, Barak Chizi, Frank Fripon, Sven Evens
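
Illustrative note on patent 10929448: the abstract above describes looking up word vectors for the component words of a request and deriving a request vector, and the title mentions a similarity value. The sketch below shows one way this could look. Averaging the word vectors, using cosine similarity, and the toy vocabulary and category vectors are assumptions for illustration only.

```python
import math

def request_vector(text, vocab):
    """Average the vectors of the words in text that appear in vocab (assumed rule)."""
    words = [w for w in text.lower().split() if w in vocab]
    if not words:
        return None  # no known component words, so no request vector
    dim = len(next(iter(vocab.values())))
    return [sum(vocab[w][i] for w in words) / len(words) for i in range(dim)]

def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def categorize(text, categorized, vocab):
    """Return the category whose stored vector is most similar to the request vector.

    categorized is a list of (category, vector) pairs for previously
    categorized requests.
    """
    vec = request_vector(text, vocab)
    if vec is None:
        return None
    return max(categorized, key=lambda item: cosine(vec, item[1]))[0]
```

A request such as "I lost my card" would use only the words found in the vocabulary ("lost" and "card" in a toy vocabulary) and be assigned to whichever stored category vector it most resembles.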
Method for supporting translation of global languages and mobile phone

Patent number: 10922497

Abstract: The present disclosure provides a method for supporting translation of global languages, and the product thereof. The method includes the following steps: receiving, by a smart phone, a calling request sent by a terminal, connecting the calling request, and establishing a calling connection; receiving, by the smart phone, first voice information transmitted through the calling connection, identifying a first language and a first dialect that correspond to the first voice information, obtaining a translation model corresponding to the first dialect, and translating the first voice information of the first dialect into second voice information of a second dialect; and playing, by the smart phone, the second voice information of the second dialect by using a speaker device.

Type: Grant

Filed: March 8, 2019

Date of Patent: February 16, 2021

Assignee: WING TAK LEE SILICONE RUBBER TECHNOLOGY (SHENZHEN) CO., LTD

Inventor: Tak Nam Liu
Combining results from first and second speaker recognition processes

Patent number: 10877727

Abstract: A received signal represents a user's speech. A first speaker recognition process is performed on a first portion of the received signal, to obtain a first output result. A second speaker recognition process is performed on a second portion of the received signal that is different from the first portion of the received signal, to obtain a second output result. The second speaker recognition process is different from the first speaker recognition process. The first and second output results are combined to obtain a combined output result indicating a likelihood that the user is a registered user.

Type: Grant

Filed: June 28, 2019

Date of Patent: December 29, 2020

Assignee: Cirrus Logic, Inc.

Inventors: Carlos Vaquero Avilés-Casco, Marta García Gomar, David Martínez González
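
Illustrative note on patent 10877727: the abstract above describes combining the outputs of two different speaker recognition processes, each run on a different portion of the signal, into one likelihood. A minimal sketch of one possible combination rule follows. The weighted average, the score range of 0 to 1, the decision threshold, and the function names are assumptions; the abstract does not state how the results are combined.

```python
def combine_scores(first_score, second_score, weight_first=0.5):
    """Weighted average of two recognition scores (assumed fusion rule)."""
    if not 0.0 <= weight_first <= 1.0:
        raise ValueError("weight_first must be between 0 and 1")
    return weight_first * first_score + (1.0 - weight_first) * second_score

def is_registered_user(first_score, second_score, threshold=0.5, weight_first=0.5):
    """Accept the speaker when the combined score reaches the threshold."""
    return combine_scores(first_score, second_score, weight_first) >= threshold
```

Here `first_score` might come from a process run on one portion of the speech and `second_score` from a different process run on another portion; the weight lets one process count for more than the other.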
Adaptive transition frequency between noise fill and bandwidth extension

Patent number: 10878829

Abstract: A method for spectrum recovery in spectral decoding of an audio signal comprises obtaining an initial set of spectral coefficients representing the audio signal, and determining a transition frequency. The transition frequency is adapted to a spectral content of the audio signal. Spectral holes in the initial set of spectral coefficients below the transition frequency are noise filled and the initial set of spectral coefficients are bandwidth extended above the transition frequency. Decoders and encoders being arranged for performing part of or the entire method are also illustrated.

Type: Grant

Filed: December 21, 2018

Date of Patent: December 29, 2020

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Gustaf Ullberg, Manuel Briand, Anisse Taleb
Systems and methods for providing searchable customer call indexes

Patent number: 10872068

Abstract: A system and method are provided for providing searchable customer call indexes. Consistent with disclosed embodiments, a system may receive call information associated with telephone conversations between callers and a vendor, the call information including an audio recording or transcript for each telephone conversation. The system may also identify one or more keywords from the audio recordings or transcripts and index the call information into one or more indexes based on the identified keywords. Finally, the system may determine search results responsive to a search query based on the indexing. In some embodiments, changes to customer service may be identified based on the search results.

Type: Grant

Filed: July 19, 2019

Date of Patent: December 22, 2020

Assignee: Capital One Services, LLC

Inventor: Nikhil Murgai
Method, apparatus, and program of dialog presentation steps for agents

Patent number: 10872609

Abstract: A dialog method carried out by a dialog system includes an agent that performs a dialog with a user. The dialog method carried out by the dialog system includes a speech receiving step in which the dialog system receives input of a user speech which is a speech of the user, a first presentation step in which when the dialog system cannot obtain any recognition result of a desired level corresponding to the user speech, the dialog system presents a speech which does not include any content words as a first agent speech which is a speech of the agent uttered immediately after the user speech and a second speech step in which the dialog system presents a speech generated or selected not based on the user speech as a second agent speech which is a speech of an agent uttered after uttering the first agent speech.

Type: Grant

Filed: May 19, 2017

Date of Patent: December 22, 2020

Assignees: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, OSAKA UNIVERSITY

Inventors: Hiroaki Sugiyama, Toyomi Meguro, Junji Yamato, Yuichiro Yoshikawa, Hiroshi Ishiguro
Determining translation similarity of reverse translations for a plurality of languages

Patent number: 10872207

Abstract: A translation device includes an input unit configured to receive an input sentence in an original language, a controller configured to generate a first translation sentence obtained by translation of the input sentence into a first language, and a display. The controller generates a second translation sentence obtained by translation of the input sentence into a second language different from the first language, a first reverse translation sentence obtained by reverse translation of the first translation sentence into the original language, and a second reverse translation sentence obtained by reverse translation of the second translation sentence into the original language.

Type: Grant

Filed: December 18, 2018

Date of Patent: December 22, 2020

Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.

Inventors: Tetsuji Mochida, He Cai
Text analysis of morphemes by syntax dependency relationship with determination rules

Patent number: 10839155

Abstract: A morpheme analysis unit sets beforehand a meaning-candidate tag and a sentimental theme tag for a morpheme required to be input as a text. A syntax analysis unit generates an index that records each clause including a meaning-candidate tag or a sentimental theme tag, together with the type of each tag. A meaning attribute extraction unit recognizes a clause including a meaning-candidate tag and the type of the tag with reference to the index, and then applies a meaning attribute rule, sets a meaning attribute tag for a necessary clause, and updates the index. A sentimental analysis unit also recognizes a clause including a sentimental theme tag and a clause including a meaning attribute tag with reference to the index, and then applies a sentimental analysis rule and sets a sentimental attribute tag for a necessary clause.

Type: Grant

Filed: September 21, 2018

Date of Patent: November 17, 2020

Assignee: NOMURA RESEARCH INSTITUTE, LTD.

Inventors: Osamu Oshima, Morio Watanabe
Speech recognition system with interactive spelling function

Patent number: 10832675

Abstract: An interactive speech recognition system is provided for interactively interpreting a spoken phrase. The speech recognition system includes a phrase interpretation module which attempts to accurately interpret a spoken phrase by interpreting each individual term of the spoken phrase. A term interpretation module attempts to accurately interpret each individual term of the spoken phrase not accurately interpreted by the phrase interpretation module, by using a spoken spelling of the term provided by a user. An interactive spelling module attempts to interactively spell at least a portion of an individual term of the spoken phrase not accurately interpreted by the term interpretation module, by enabling a user to interactively select at least one individual character of the term of the spoken phrase from a plurality of characters.

Type: Grant

Filed: August 24, 2018

Date of Patent: November 10, 2020

Assignee: Denso International America, Inc.

Inventor: Yu Zhang
Sound synthesis for data sonification employing a human auditory perception eigenfunction model in Hilbert space

Patent number: 10832693

Abstract: A numerical sound synthesis method for representing data as audio for use in data sonification employing a Hilbert Space eigenfunction model of human auditory perception is described. The synthesis method comprises approximating an eigenfunction equation representing a model of human hearing, calculating the approximation to each of a plurality of eigenfunctions from at least one aspect of the eigenfunction equation, and storing the approximation to each of a plurality of eigenfunctions. The approximation to each of a plurality of eigenfunctions represents a perception-oriented basis function for mathematically representing audio information in a Hilbert-space representation of an audio signal space. The model of human hearing can include a bandpass operation with a bandwidth having the frequency range of human hearing and a time-limiting operation approximating the time duration correlation window of human hearing.

Type: Grant

Filed: June 4, 2018

Date of Patent: November 10, 2020

Inventor: Lester F. Ludwig
Method and electronic device for predicting a response from context with a language model

Patent number: 10831283

Abstract: An electronic device and a method for predicting a response are provided. The electronic device includes a display and a processor configured to receive at least one message, identify at least one contextual category of the at least one message, predict at least one response for the at least one message from a language model based on the at least one contextual category, and control the display to display the at least one predicted response.

Type: Grant

Filed: June 2, 2017

Date of Patent: November 10, 2020

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Barath Raj Kandur Raja, Balaji Vijayanagaram Ramalingam, Harshavardhana Poojari, Raju Suresh Dixit, Sreevatsa Dwaraka Bhamidipati, Srinivasa Rao Siddi, Kalyan Kakani, Vibhav Agarwal, Yashwant Saini
