Patents Examined by Martin Lerner
-
Patent number: 11017783
Abstract: A device includes a processor configured to determine a feature vector based on an utterance and to determine a first embedding vector by processing the feature vector using a trained embedding network. The processor is configured to determine a first distance metric based on distances between the first embedding vector and each embedding vector of a speaker template. The processor is configured to determine, based on the first distance metric, that the utterance is verified to be from a particular user. The processor is configured to, based on a comparison of a first particular distance metric associated with the first embedding vector to a second distance metric associated with a first test embedding vector of the speaker template, generate an updated speaker template by adding the first embedding vector as a second test embedding vector and removing the first test embedding vector from test embedding vectors of the speaker template.
Type: Grant
Filed: March 8, 2019
Date of Patent: May 25, 2021
Assignee: QUALCOMM Incorporated
Inventors: Sunkuk Moon, Bicheng Jiang, Erik Visser
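The verify-then-update flow in this abstract can be illustrated with a minimal sketch. It is not the patented method: the choice of mean Euclidean distance as the distance metric, the rule of replacing the worst-fitting stored embedding, and the names `mean_distance` and `verify_and_update` are all assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_distance(vec, vectors):
    """Average distance from vec to every vector in a non-empty list."""
    return sum(euclidean(vec, v) for v in vectors) / len(vectors)


def verify_and_update(embedding, template, threshold):
    """Return (verified, template), swapping in the new embedding if it fits better.

    Assumes the template holds at least two embeddings (hypothetical setup).
    """
    # Assumed metric: mean distance from the new embedding to the template.
    if mean_distance(embedding, template) > threshold:
        return False, template

    def fit(i):
        # How far stored embedding i sits from the rest of the template.
        others = template[:i] + template[i + 1:]
        return mean_distance(template[i], others)

    # Assumed update rule: compare against the worst-fitting stored embedding.
    worst = max(range(len(template)), key=fit)
    others = template[:worst] + template[worst + 1:]
    if mean_distance(embedding, others) < fit(worst):
        return True, others + [embedding]
    return True, template
```

For example, with a template of `[[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]]` and a threshold of 5.0, the embedding `[0.5, 0.0]` is accepted and takes the place of the outlier `[5.0, 5.0]`.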
-
Patent number: 11004448
Abstract: The present disclosure provides a method and a device for recognizing a text segmentation position. The method includes: receiving a continuous voice message inputted by a user, and recognizing the continuous voice message to generate a text message corresponding to the continuous voice message; analyzing the text message to determine an interval position, and sequentially inserting a sentence end and sentence begin sign at each interval position; calculating a segmentation value corresponding to the sentence end and sentence begin sign inserted at a present interval position according to a preset algorithm; and determining whether the segmentation value is greater than a preset threshold, and determining the present interval position as a segmentation position when the segmentation value is greater than the preset threshold.
Type: Grant
Filed: June 20, 2018
Date of Patent: May 11, 2021
Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Inventors: Sheng Qian, Qiang Cheng
-
Patent number: 10984804
Abstract: Embodiments of the invention relate to an error concealment unit for providing an error concealment audio information for concealing a loss of an audio frame in an encoded audio information. The error concealment unit provides a first error concealment audio information component for a first frequency range using a frequency domain concealment. The error concealment unit also provides a second error concealment audio information component for a second frequency range, which includes lower frequencies than the first frequency range, using a time domain concealment. The error concealment unit also combines the first error concealment audio information component and the second error concealment audio information component, to obtain the error concealment audio information. Other embodiments of the invention relate to a decoder including the error concealment unit, as well as related encoders, methods, and computer programs for decoding and/or concealing.
Type: Grant
Filed: September 7, 2018
Date of Patent: April 20, 2021
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Jérémie Lecomte, Adrian Tomasek
-
Coordinated audiovisual montage from selected crowd-sourced content with alignment to audio baseline
Patent number: 10971191
Abstract: A generally diverse set of audiovisual clips is sourced from one or more repositories for use in preparing a coordinated audiovisual work. In some cases, audiovisual clips are retrieved using tags such as user-assigned hashtags or metadata. Pre-existing associations of such tags can be used as hints that certain audiovisual clips are likely to share correspondence with an audio signal encoding of a particular song or other audio baseline. Clips are evaluated for computationally determined correspondence with an audio baseline track. In general, comparisons of audio power spectra, of rhythmic features, tempo, pitch sequences and other extracted audio features may be used to establish correspondence. For clips exhibiting a desired level of correspondence, computationally determined temporal alignments of individual clips with the baseline audio track are used to prepare a coordinated audiovisual work that mixes the selected audiovisual clips with the audio track.
Type: Grant
Filed: June 15, 2015
Date of Patent: April 6, 2021
Inventors: Mark T. Godfrey, Turner Evan Kirk, Ian S. Simon, Nick Kruge
-
Patent number: 10957290
Abstract: A lyrics analyzer generates tags and explicitness indicators for a set of tracks. These tags may indicate the genre, mood, occasion, or other features of each track. The lyrics analyzer does so by generating an n-dimensional vector relating to a set of topics extracted from the lyrics and then using those vectors to train a classifier to determine whether each tag applies to each track. The lyrics analyzer may also generate playlists for a user based on a single seed song by comparing the lyrics vector or the lyrics and acoustics vectors of the seed song to other songs to select songs that closely match the seed song. Such a playlist generator may also take into account the tags generated for each track.
Type: Grant
Filed: August 24, 2018
Date of Patent: March 23, 2021
Assignee: Spotify AB
Inventors: Tahora H. Nazer, Tristan Jehan
-
Patent number: 10942703
Abstract: Systems and processes for proactive assistance based on dialog communication between devices are provided. In one example process, while voice communication between an electronic device and a second electronic device is established, a stream of audio data associated with the second electronic device can be received. In response to detecting a user input, a text representation of speech contained in a portion of the stream of audio data can be generated. The process can determine whether the text representation contains information corresponding to one of a plurality of types of information. In response to determining that the text representation contains information corresponding to one of a plurality of types of information, one or more tasks based on the information can be performed.
Type: Grant
Filed: January 16, 2019
Date of Patent: March 9, 2021
Assignee: Apple Inc.
Inventors: Mathieu Jean Martel, Thomas Deniau
-
Patent number: 10943605
Abstract: The present disclosure involves systems, software, and computer implemented methods for personalizing interactions within a conversational interface based on an input context. One example system performs operations including receiving a conversational input received via a conversational interface. The conversational input is analyzed to determine an intent and lexical personality score based on the input's characteristics. A set of responsive content is determined and includes a set of initial tokens representing an initial response. A set of synonym tokens associated with at least some of the initial tokens are identified, and at least one synonym token associated with a similar lexical personality score to the input is determined. At least one of the initial tokens are replaced with the determined synonym token to generate a modified version of the set of response content. The modified version of the response is then transmitted to a device in response to the input.
Type: Grant
Filed: September 13, 2019
Date of Patent: March 9, 2021
Assignee: The Toronto-Dominion Bank
Inventors: Dean C. N. Tseretopoulos, Robert Alexander McCarter, Sarabjit Singh Walia, Vipul Kishore Lalka, Nadia Moretti, Paige Elyse Dickie, Denny Devasia Kuruvilla, Milos Dunjic, Dino Paul D'Agostino, Arun Victor Jagga, John Jong-Suk Lee, Rakesh Thomas Jethwa
-
Patent number: 10936818
Abstract: A controller for classifying devices of a building management system (BMS). The controller may be configured to obtain an entity name for a device, extract a core name from the entity name, compare the core name to candidate core names, determine scores for each comparison, identify a highest score, identify a class of a candidate core name, and classify the device in the class.
Type: Grant
Filed: November 30, 2018
Date of Patent: March 2, 2021
Assignee: Honeywell International Inc.
Inventors: Bose Falk, Jiri Vass, Paul Kleinhans, Jakub Malanik, Cuong Huynh, Patrick Brisbine
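The extract, compare, score, and classify sequence in this abstract can be sketched as follows. The abstract does not say how core names are extracted or scored, so this sketch assumes a simple rule (lowercase the entity name and keep only letters) and uses `difflib.SequenceMatcher` from the Python standard library as the similarity score. The candidate table and the function names are hypothetical.

```python
import re
from difflib import SequenceMatcher

# Hypothetical candidate core names mapped to device classes.
CANDIDATES = {
    "zonetemp": "temperature sensor",
    "supplyfan": "fan",
    "damperpos": "damper",
}


def extract_core_name(entity_name):
    """Assumed rule: lowercase the name and drop digits and separators."""
    return re.sub(r"[^a-z]", "", entity_name.lower())


def classify(entity_name, candidates=CANDIDATES):
    """Score the core name against each candidate and return the best class."""
    core = extract_core_name(entity_name)
    scores = {
        name: SequenceMatcher(None, core, name).ratio() for name in candidates
    }
    best = max(scores, key=scores.get)
    return candidates[best], scores[best]
```

Calling `classify("AHU1.Zone_Temp_03")` reduces the entity name to `ahuzonetemp`, which scores highest against `zonetemp`, so the device lands in the `temperature sensor` class.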
-
Patent number: 10937420
Abstract: The present disclosure provides a dialogue system and a method for controlling thereof. The dialogue system may include: an input processor configured to authenticate a user and receive new input information and new state information of the user; a storage configured to store existing state information of the user, existing input information, and available services; a controller configured to identify a service based on the new input information and the existing input information, and to identify the service based on the new state information of the user and the existing state information of the user, wherein the service is configured to fit needs of the user; and an output processor configured to determine a service format based on the new input information and the new state information of the user, wherein the service format is regarding ways to provide the service to the user.
Type: Grant
Filed: June 11, 2018
Date of Patent: March 2, 2021
Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
Inventors: Jung Mi Park, Jimin Han, Jia Lee, Kye Yoon Kim
-
Patent number: 10929448
Abstract: A computer-implemented method determines a category of a request provided by a user by means of a user device. The user device includes connection means and means for receiving a request description relating to said request from said user. The method includes receiving, from the user, the request description, by means of the device, and uploading the request description to a server. The server has access to a database which includes a number of previously categorized requests each including a category and a vocabulary, which includes a number of word vector representations. The method further includes identifying, by the server, a number of component words belonging to a natural language text string included in the request description; obtaining, for at least one of the component words, an associated word vector representation from the vocabulary, and determining a request vector, based on at least one obtained word vector representation.
Type: Grant
Filed: August 10, 2018
Date of Patent: February 23, 2021
Assignee: KBC GROEP NV
Inventors: Hans Verstraete, Pieter Van Hertum, Rahul Maheshwari, Jeroen D'Haen, Michaël Mariën, Barak Chizi, Frank Fripon, Sven Evens
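A request vector built from word vectors, as this abstract describes, might look like the sketch below. The abstract stops at determining the request vector. Averaging the word vectors and then picking the category of the most similar previously categorized request by cosine similarity are assumptions added here for illustration, as are the function names and the data layout.

```python
import math


def request_vector(text, vocab):
    """Average the word vectors of the component words found in the vocabulary."""
    words = [w for w in text.lower().split() if w in vocab]
    if not words:
        return None
    dim = len(vocab[words[0]])
    return [sum(vocab[w][i] for w in words) / len(words) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def categorize(text, vocab, categorized):
    """Assumed step: return the category of the closest stored request vector.

    `categorized` is a list of (vector, category) pairs (hypothetical layout).
    """
    vec = request_vector(text, vocab)
    if vec is None:
        return None
    return max(categorized, key=lambda item: cosine(vec, item[0]))[1]
```

With a toy two-dimensional vocabulary in which `card` and `lost` point along one axis and `loan` and `mortgage` along the other, a request such as "I lost my card" averages to a vector close to the first axis and is matched to whichever stored category lies nearest to it.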
-
Patent number: 10922497
Abstract: The present disclosure provides a method for supporting translation of global languages, and the product thereof. The method includes the following steps: receiving, by a smart phone, a calling request sent by a terminal, connecting the calling request, and establishing a calling connection; receiving, by the smart phone, first voice information transmitted through the calling connection, identifying a first language and a first dialect that correspond to the first voice information, obtaining a translation model corresponding to the first dialect, and translating the first voice information of the first dialect into second voice information of a second dialect; and playing, by the smart phone, the second voice information of the second dialect by using a speaker device.
Type: Grant
Filed: March 8, 2019
Date of Patent: February 16, 2021
Assignee: WING TAK LEE SILICONE RUBBER TECHNOLOGY (SHENZHEN) CO., LTD
Inventor: Tak Nam Liu
-
Patent number: 10877727
Abstract: A received signal represents a user's speech. A first speaker recognition process is performed on a first portion of the received signal, to obtain a first output result. A second speaker recognition process is performed on a second portion of the received signal that is different from the first portion of the received signal, to obtain a second output result. The second speaker recognition process is different from the first speaker recognition process. The first and second output results are combined to obtain a combined output result indicating a likelihood that the user is a registered user.
Type: Grant
Filed: June 28, 2019
Date of Patent: December 29, 2020
Assignee: Cirrus Logic, Inc.
Inventors: Carlos Vaquero Avilés-Casco, Marta García Gomar, David Martínez González
-
Patent number: 10878829
Abstract: A method for spectrum recovery in spectral decoding of an audio signal, comprises obtaining of an initial set of spectral coefficients representing the audio signal, and determining a transition frequency. The transition frequency is adapted to a spectral content of the audio signal. Spectral holes in the initial set of spectral coefficients below the transition frequency are noise filled and the initial set of spectral coefficients are bandwidth extended above the transition frequency. Decoders and encoders being arranged for performing part of or the entire method are also illustrated.
Type: Grant
Filed: December 21, 2018
Date of Patent: December 29, 2020
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Gustaf Ullberg, Manuel Briand, Anisse Taleb
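The split between noise filling below a transition frequency and bandwidth extension above it can be shown with a toy sketch on a list of spectral coefficients. Everything specific here is an assumption, not the patented procedure: the transition bin is taken as the bin just above the last non-zero coefficient, holes are filled with low-level uniform noise, and the upper band is produced by copying the lower band up with a fixed attenuation. The function name and its parameters are hypothetical.

```python
import random


def recover_spectrum(coeffs, noise_level=0.01, gain=0.5, seed=0):
    """Return (recovered coefficients, transition bin) for a coefficient list."""
    rng = random.Random(seed)
    nonzero = [i for i, c in enumerate(coeffs) if c != 0.0]
    if not nonzero:
        return list(coeffs), 0

    # Assumed adaptation to spectral content: first bin above the last coded one.
    transition = nonzero[-1] + 1
    out = list(coeffs)

    # Noise-fill spectral holes (zeroed bins) below the transition bin.
    for i in range(transition):
        if out[i] == 0.0:
            out[i] = rng.uniform(-noise_level, noise_level)

    # Bandwidth-extend above the transition bin by attenuated copy-up.
    for i in range(transition, len(out)):
        out[i] = gain * out[i - transition]

    return out, transition
```

For the input `[1.0, 0.0, 0.5, 0.0, 0.0, 0.0]` the transition bin is 3: bin 1 receives a small noise value, and bins 3 to 5 are attenuated copies of bins 0 to 2.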
-
Patent number: 10872068
Abstract: A system and method is provided for providing searchable customer call indexes. Consistent with disclosed embodiments, a system may receive call information associated with telephone conversations between callers and a vendor, the call information including an audio recording or transcript for each telephone conversation. The system may also identify one or more keywords from the audio recordings or transcripts and index the call information into one or more indexes based on the identified keywords. Finally, the system may determine search results responsive to a search query based on the indexing. In some embodiments, changes to customer service may be identified based on the search results.
Type: Grant
Filed: July 19, 2019
Date of Patent: December 22, 2020
Assignee: Capital One Services, LLC
Inventor: Nikhil Murgai
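Keyword indexing of call transcripts followed by search, as outlined in this abstract, is commonly done with an inverted index. The sketch below assumes transcripts are already available as text and that the keyword list is given. The function names, the tokenization rule, and the all-terms-must-match search behavior are illustrative choices, not details from the patent.

```python
import re
from collections import defaultdict


def build_index(calls, keywords):
    """Map each keyword to the set of call IDs whose transcript contains it.

    `calls` maps a call ID to its transcript text (hypothetical layout).
    """
    index = defaultdict(set)
    for call_id, transcript in calls.items():
        tokens = set(re.findall(r"[a-z']+", transcript.lower()))
        for keyword in keywords:
            if keyword in tokens:
                index[keyword].add(call_id)
    return index


def search(index, query):
    """Return the call IDs that match every term of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    return set.intersection(*(index.get(term, set()) for term in terms))
```

A query with several terms returns only the calls indexed under all of them, and a term that was never indexed yields an empty result.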
-
Patent number: 10872609
Abstract: A dialog method carried out by a dialog system includes an agent that performs a dialog with a user. The dialog method carried out by the dialog system includes a speech receiving step in which the dialog system receives input of a user speech which is a speech of the user, a first presentation step in which when the dialog system cannot obtain any recognition result of a desired level corresponding to the user speech, the dialog system presents a speech which does not include any content words as a first agent speech which is a speech of the agent uttered immediately after the user speech and a second speech step in which the dialog system presents a speech generated or selected not based on the user speech as a second agent speech which is a speech of an agent uttered after uttering the first agent speech.
Type: Grant
Filed: May 19, 2017
Date of Patent: December 22, 2020
Assignees: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, OSAKA UNIVERSITY
Inventors: Hiroaki Sugiyama, Toyomi Meguro, Junji Yamato, Yuichiro Yoshikawa, Hiroshi Ishiguro
-
Patent number: 10872207
Abstract: A translation device includes an input unit configured to receive an input sentence in an original language, a controller configured to generate a first translation sentence obtained by translation of the input sentence into a first language, and a display. The controller generates a second translation sentence obtained by translation of the input sentence into a second language different from the first language, a first reverse translation sentence obtained by reverse translation of the first translation sentence into the original language, and a second reverse translation sentence obtained by reverse translation of the second translation sentence into the original language.
Type: Grant
Filed: December 18, 2018
Date of Patent: December 22, 2020
Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.
Inventors: Tetsuji Mochida, He Cai
-
Patent number: 10839155
Abstract: A morpheme analysis unit sets beforehand a meaning-candidate tag and a sentimental theme tag for a morpheme required to be input as a text. A syntax analysis unit generates an index where a clause including a meaning-candidate tag and a sentimental theme tag and a type of each tag. A meaning attribute extraction unit recognizes a clause including a meaning-candidate and a type of tag with reference to the index, and then applies a meaning attribute rule, sets a meaning attribute tag for a necessary clause, and updates the index. A sentimental analysis unit also recognizes a clause including a sentimental theme tag and a clause including a meaning attribute tag with reference to the index, and then applies a sentimental analysis rule and sets a sentimental attribute tag for a necessary clause.
Type: Grant
Filed: September 21, 2018
Date of Patent: November 17, 2020
Assignee: NOMURA RESEARCH INSTITUTE, LTD.
Inventors: Osamu Oshima, Morio Watanabe
-
Patent number: 10832675
Abstract: An interactive speech recognition system is provided for interactively interpreting a spoken phrase. The speech recognition system includes a phrase interpretation module which attempts to accurately interpret a spoken phrase by interpreting each individual term of the spoken phrase. A term interpretation module attempts to accurately interpret each individual term of the spoken phrase not accurately interpreted by the phrase interpretation module, by using a spoken spelling of the term provided by a user. An interactive spelling module attempts to interactively spell at least a portion of an individual term of the spoken phrase not accurately interpreted by the term interpretation module, by enabling a user to interactively select at least one individual character of the term of the spoken phrase from a plurality of characters.
Type: Grant
Filed: August 24, 2018
Date of Patent: November 10, 2020
Assignee: Denso International America, Inc.
Inventor: Yu Zhang
-
Patent number: 10832693
Abstract: A numerical sound synthesis method for representing data as audio for use in data sonification employing a Hilbert Space eigenfunction model of human auditory perception is described. The synthesis method comprises approximating an eigenfunction equation representing a model of human hearing, calculating the approximation to each of a plurality of eigenfunctions from at least one aspect of the eigenfunction equation, and storing the approximation to each of a plurality of eigenfunctions. The approximation to each of a plurality of eigenfunctions represents a perception-oriented basis functions for mathematically representing audio information in a Hilbert-space representation of an audio signal space. The model of human hearing can include a bandpass operation with a bandwidth having the frequency range of human hearing and a time-limiting operation approximating the time duration correlation window of human hearing.
Type: Grant
Filed: June 4, 2018
Date of Patent: November 10, 2020
Inventor: Lester F. Ludwig
-
Patent number: 10831283
Abstract: An electronic device and a method for predicting a response are provided. The electronic device includes a display and a processor configured to receive at least one message, identify at least one contextual category of the at least one message, predict at least one response for the at least one message from a language model based on the at least one contextual category, and control the display to display the at least one predicted response.
Type: Grant
Filed: June 2, 2017
Date of Patent: November 10, 2020
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Inventors: Barath Raj Kandur Raja, Balaji Vijayanagaram Ramalingam, Harshavardhana Poojari, Raju Suresh Dixit, Sreevatsa Dwaraka Bhamidipati, Srinivasa Rao Siddi, Kalyan Kakani, Vibhav Agarwal, Yashwant Saini