Patents Examined by Richa Mishra
-
Patent number: 11593558Abstract: In an example, a text sentence comprising a plurality of words is obtained. Each of the plurality of words is passed through a deep compositional character-to-word model to encode character-level information of each of the plurality of words into a character-to-word expression. The character-to-word expressions are combined with pre-trained word embeddings. The combined character-to-word expressions and pre-trained word embeddings are fed into one or more bidirectional long short-term memories to learn contextual information for each of the plurality of words. Then, sequential conditional random fields are applied to the contextual information for each of the plurality of words.Type: GrantFiled: August 31, 2017Date of Patent: February 28, 2023Assignee: eBay Inc.Inventors: Yingwei Xin, Jean-David Ruvini, Ethan J. Hart
-
Patent number: 11545132Abstract: Techniques regarding speech characterization are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a speech analysis component that can determine a condition of an origin of an audio signal based on a difference between a first feature of the audio signal and a second feature of a synthesized reference audio signal.Type: GrantFiled: August 28, 2019Date of Patent: January 3, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Avner Abrami, Mary Pietrowicz
-
Patent number: 11501084Abstract: In one example, a system can execute a first machine-learning model to determine an overall classification for a textual dataset. The system can also determine classification scores indicating the level of influence that each token in the textual dataset had on the overall classification. The system can select a first subset of the tokens based on their classification scores. The system can also execute a second machine-learning model to determine probabilities that the textual dataset falls into various categories. The system can determine category scores indicating the level of influence that each token had on a most-likely category determination. The system can select a second subset of the tokens based on their category scores. The system can then generate a first visualization depicting the first subset of tokens color-coded to indicate their classification scores and a second visualization depicting the second subset of tokens color-coded to indicate their category scores.Type: GrantFiled: May 18, 2022Date of Patent: November 15, 2022Assignee: SAS INSTITUTE INC.Inventors: Reza Soleimani, Samuel Paul Leeman-Munk, James Allen Cox, David Blake Styles
-
Patent number: 11482232Abstract: Concealing a lost audio frame of a received audio signal is provided by performing a sinusoidal analysis (81) of a part of a previously received or reconstructed audio signal, wherein the sinusoidal analysis involves identifying frequencies of sinusoidal components of the audio signal, applying a sinusoidal model on a segment of the previously received or reconstructed audio signal, wherein said segment is used as a prototype frame in order to create a substitution frame for a lost audio frame, and creating the substitution frame (83) for the lost audio frame by time-evolving sinusoidal components of the prototype frame, up to the time instance of the lost audio frame, in response to the corresponding identified frequencies.Type: GrantFiled: May 16, 2019Date of Patent: October 25, 2022Assignee: Telefonaktiebolaget LM Ericsson (publ)Inventor: Stefan Bruhn
-
Patent number: 11475909Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech separation. One of the methods includes obtaining a recording comprising speech from a plurality of speakers; processing the recording using a speaker neural network having speaker parameter values and configured to process the recording in accordance with the speaker parameter values to generate a plurality of per-recording speaker representations, each speaker representation representing features of a respective identified speaker in the recording; and processing the per-recording speaker representations and the recording using a separation neural network having separation parameter values and configured to process the recording and the speaker representations in accordance with the separation parameter values to generate, for each speaker representation, a respective predicted isolated audio signal that corresponds to speech of one of the speakers in the recording.Type: GrantFiled: February 8, 2021Date of Patent: October 18, 2022Assignee: Google LLCInventors: Neil Zeghidour, David Grangier
-
Patent number: 11462207Abstract: Disclosed are a method and an apparatus for editing audio, an electronic device and a storage medium. The method includes: acquiring a modified text obtained by modifying a known original text of an audio to be edited according to a known text for modification; predicting a duration of an audio corresponding to the text for modification; adjusting a region to be edited of the audio to be edited according to the duration of the audio corresponding to the text for modification, to obtain an adjusted audio to be edited; obtaining, based on a pre-trained audio editing model, an edited audio according to the adjusted audio to be edited and the modified text. In the present disclosure, the edited audio obtained by the audio editing model sounds natural in the context, and supports the function of synthesizing new words that do not appear in the corpus.Type: GrantFiled: May 5, 2022Date of Patent: October 4, 2022Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCESInventors: Jianhua Tao, Tao Wang, Jiangyan Yi, Ruibo Fu
-
Patent number: 11462236Abstract: The disclosure describes one or more embodiments of an acoustic improvement system that accurately and efficiently determines and provides actionable acoustic improvement suggestions to users for digital audio recordings via an interactive graphical user interface. For example, the acoustic improvement system can assist users in creating high-quality digital audio recordings by providing a combination of acoustic quality metrics and actionable acoustic improvement suggestions within the interactive graphical user interface customized to each digital audio recording. In this manner, all users can easily and intuitively utilize the acoustic improvement system to improve the quality of digital audio recordings.Type: GrantFiled: October 25, 2019Date of Patent: October 4, 2022Assignee: Adobe Inc.Inventor: Nick Bryan
-
Patent number: 11462233Abstract: An electronic device and method of recognizing an audio scene are provided.Type: GrantFiled: November 15, 2019Date of Patent: October 4, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Hoon Heo, Sunmin Kim, Kiwoong Kang, Kibeom Kim, Inwoo Hwang
-
Patent number: 11430444Abstract: The technology of the present application provides software as a service (SaaS) executing on a server in a cloud or network. The SaaS receives data from a mobile device of a user over the network. The SaaS processes the data and returns the processed data to a client application executing on a client device of the user, which user is the same as the user of the mobile device wherein there is no direct communication link, wireless or wired, between the mobile device and the client device. In one aspect, the technology of the present application provides the mobile device as a smartphone and a microphone application to be executed on the smartphone.Type: GrantFiled: September 10, 2019Date of Patent: August 30, 2022Assignee: nVoq IncorporatedInventors: David Mondragon, Michael Clark, Jarek Foltynski, Charles Corfield
-
Patent number: 11361749Abstract: A method, computer program product, and computing system for obtaining calibration information for a three-dimensional space incorporating an ACI system; and processing the calibration information to calibrate the ACI system.Type: GrantFiled: October 22, 2020Date of Patent: June 14, 2022Assignee: NUANCE COMMUNICATIONS, INC.Inventors: Dushyant Sharma, Patrick A. Naylor, Joel Praveen Pinto, Daniel Paulino Almendro Barreda
-
Patent number: 11361763Abstract: A speech-processing system capable of receiving and processing audio data to determine if the audio data includes speech that was intended for the system. Non-system directed speech may be filtered out while system-directed speech may be selected for further processing. A system-directed speech detector may use a trained machine learning model (such as a deep neural network or the like) to process a feature vector representing a variety of characteristics of the incoming audio data, including the results of automatic speech recognition and/or other data. Using the feature vector the model may output an indicator as to whether the speech is system-directed. The system may also incorporate other filters such as voice activity detection prior to speech recognition, or the like.Type: GrantFiled: September 1, 2017Date of Patent: June 14, 2022Assignee: Amazon Technologies, Inc.Inventors: Roland Maximilian Rolf Maas, Sri Harish Reddy Mallidi, Spyridon Matsoukas, Bjorn Hoffmeister
-
Patent number: 11329933Abstract: A method and computing platform to imitate human conversational response as a context transitions across multiple channels (e.g., chat, messaging, email, voice, third party communication, etc.) where inputs to the system are categorized into identified speech acts and physical acts, and a conversational bot is associated to the channels. In this approach, a data model associated with a multi-turn conversation is provided. The data model comprises an observation history, wherein an observation in the observation history includes an identification of a channel in which the observation originates. As turns are added to the multi-turn conversation, a conversational context across multiple channels is persisted using the data model. Using this approach, an AI-supported conversation started in one channel can move to another conversation channel while maintaining the context of the conversation intact and coherent.Type: GrantFiled: December 28, 2020Date of Patent: May 10, 2022Assignee: Drift.com, Inc.Inventors: Bernard N. Kiyanda, Jeffrey D. Orkin, Christopher M. Ward, Elias Torres
-
Patent number: 11302303Abstract: A method and device for training an acoustic model are provided. The method comprises determining a plurality of tasks for training an acoustic model, obtaining resource occupancies of nodes participating in the training of the acoustic model, and distributing the tasks to the nodes according to the resource occupancies of the nodes and complexities of the tasks. By using computational resources distributed at multiple nodes, tasks for training an acoustic model are performed in parallel in a distributed manner, so as to improve training efficiency.Type: GrantFiled: September 13, 2019Date of Patent: April 12, 2022Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.Inventors: Yunfeng Li, Qingchang Hao, Yutao Gai, Chenxi Sun, Zhiping Zhou
-
Patent number: 11288455Abstract: Computer implemented systems and methods of processing clinical documentation for a multi-axial coding scheme include inputting clinical documentation from memory operatively coupled with a computer system, and executing a natural language processor configured to process narrative text in the clinical documentation. The processor segments the narrative text based on boundaries defined in the clinical documentation, sequences words in the narrative text based on the segmentation, and maps the sequenced words to semantic objects in an ontology database. The ontology defines classes of semantic objects and relationships between them, corresponding to the multi-axial coding scheme. The semantic objects are converted into characters and output into slots in a medical code, with the characters positioned in the slots based on the multi-axial coding scheme.Type: GrantFiled: October 20, 2018Date of Patent: March 29, 2022Assignee: Optum360, LLCInventors: George Karres, Destinee Tormey, Christopher Miller, Brian Potter, Mark L. Morsch
-
Patent number: 11250841Abstract: A method and method for natural language generation employ a natural language generation model which has been trained to assign an utterance label to a new text sequence, based on features extracted from the text sequence, such as parts-of-speech. The model assigns an utterance label to the new text sequence, based on the extracted features. The utterance label is used to guide the generation of a natural language utterance, such as a question, from the new text sequence. The system and method find application in dialog systems for generating utterances, to be sent to a user, from brief descriptions of problems or solutions in a knowledge base.Type: GrantFiled: June 10, 2016Date of Patent: February 15, 2022Assignee: CONDUENT BUSINESS SERVICES, LLCInventors: Claude Roux, Julien Perez
-
Patent number: 11238854Abstract: Methods, apparatus, and computer readable media are described related to recording, organizing, and making audio files available for consumption by voice-activated products. In various implementations, in response to receiving an input from a first user indicating that the first user intends to record audio content, audio content may be captured and stored. Input may be received from the first user indicating at least one identifier for the audio content. The stored audio content may be associated with the at least one identifier. A voice input may be received from a subsequent user. In response to determining that the voice input has particular characteristics, speech recognition may be biased in respect of the voice input towards recognition of the at least one identifier. In response to recognizing, based on the biased speech recognition, presence of the at least one identifier in the voice input, the stored audio content may be played.Type: GrantFiled: December 14, 2016Date of Patent: February 1, 2022Assignee: Google LLCInventors: Vikram Aggarwal, Barnaby James
-
Patent number: 11237794Abstract: An information processing device and information processing method capable of outputting an action based on an intention of the user. The information processing device including an action deciding unit that determines an action for a user on a basis of a distance from the user and an output control unit that outputs the action.Type: GrantFiled: December 13, 2016Date of Patent: February 1, 2022Assignee: SONY CORPORATIONInventor: Reiko Kirihara
-
Patent number: 11217252Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.Type: GrantFiled: August 28, 2019Date of Patent: January 4, 2022Assignee: VERINT SYSTEMS INC.Inventors: Roni Romano, Yair Horesh, Jeremie Dreyfuss
-
Patent number: 11200379Abstract: Computer implemented systems and methods of processing clinical documentation for a multi-axial coding scheme include inputting clinical documentation from memory operatively coupled with a computer system, and executing a natural language processor configured to process narrative text in the clinical documentation. The processor segments the narrative text based on boundaries defined in the clinical documentation, sequences words in the narrative text based on the segmentation, and maps the sequenced words to semantic objects in an ontology database. The ontology defines classes of semantic objects and relationships between them, corresponding to the multi-axial coding scheme. The semantic objects are converted into characters and output into slots in a medical code, with the characters positioned in the slots based on the multi-axial coding scheme.Type: GrantFiled: May 29, 2019Date of Patent: December 14, 2021Assignee: Optum360, LLCInventors: George Karres, Destinee Tormey, Christopher Miller, Brian Potter, Mark L. Morsch
-
Patent number: 11176931Abstract: A computer-implemented technique is described for enabling a user to create a conversational bookmark in the course of the user's interaction with a BOT. The bookmark designates a particular juncture in the user's interaction with the BOT. When the user later invokes the bookmark, the computer-implemented technique resumes the user's interaction with the BOT, starting at the particular juncture. The technique can accomplish the above functions in a BOT-independent manner (which does not involve changes to the BOT) or a BOT-dependent manner (which involves changes to the BOT). The technique can also be extended to a task of creating and activating bookmarks in the course of a conversation among two or more humans.Type: GrantFiled: September 23, 2016Date of Patent: November 16, 2021Assignee: Microsoft Technology Licensing, LLCInventors: Benny Schlesinger, Keren Damari, Avichai Cohen, Yuval Pinchas Borsutsky