Word Recognition Patents (Class 704/251)

Preliminary matching (Class 704/252)

Endpoint detection (Class 704/253)

Subportions (Class 704/254)

Specialized models (Class 704/255)

Markov (Class 704/256)

Hidden Markov Model (HMM) (EPO) (Class 704/256.1)

Training of HMM (EPO) (Class 704/256.2)

With insufficient amount of training data, e.g., state sharing, tying, deleted interpolation (EPO) (Class 704/256.3)

Continuous density, e.g, Gaussian distribution, Lapalce (EPO) (Class 704/256.7)
Discrete density, e.g., Vector Quantization preprocessor, look up tables (EPO) (Class 704/256.8)

Natural language (Class 704/257)

Crowdsourced on-boarding of digital assistant operations

Patent number: 11520610

Abstract: Embodiments described herein are generally directed towards systems and methods relating to a crowd-sourced digital assistant and system. In particular, embodiments facilitate the intuitive creation and distribution of action datasets that include computing events or tasks that can be reproduced when an associated command, stored in an action dataset, is determined received by a digital assistant device. The digital assistant device described herein can generate new action datasets, on-board new action datasets, and receive new action datasets or updates to existing action datasets. Each digital assistant device in the described system can participate in the building of action datasets, so as to crowd-source a dialect that can be understood by a digital assistant device.

Type: Grant

Filed: May 18, 2018

Date of Patent: December 6, 2022

Assignee: PELOTON INTERACTIVE INC.

Inventors: Rajat Mukherjee, Kiran Bindhu Hemaraj, Matan Levi
Server that supports speech recognition of device, and operation method of the server

Patent number: 11514916

Abstract: A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.

Type: Grant

Filed: August 13, 2020

Date of Patent: November 29, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Chanwoo Kim, Sichen Jin, Kyungmin Lee, Dhananjaya N. Gowda, Kwangyoun Kim
Multi-computer processing system with machine learning engine for optimized forecasting

Patent number: 11507907

Abstract: Systems for optimized forecasting are provided. In some examples, data associated with strategy of one or more business units may be received. The strategy data may include identification of projects or goals. In some examples, industry trend data may be received and may include data associated with in-demand job skills and the like. An instruction to capture user data may be transmitted to one or more user devices of an employee user. The instruction may cause activation of one or more sensors or data capture devices. The captured user data may be received and analyzed to determine a competency of the user. Based on the strategy data, industry data and determined competency, one or more deficiencies between the resources needed to meet the business unit strategy data and the available resources may be identified. Based on the identified deficiency, one or more actions for execution may be identified and executed.

Type: Grant

Filed: December 9, 2020

Date of Patent: November 22, 2022

Assignee: Bank of America Corporation

Inventors: Sandeep Kumar Chauhan, Madhuri Aniruddha Deshpande, Moses Salagala, Jagadish Reddy
Apparatuses and methods for facilitating an insertion of markers in content

Patent number: 11503383

Abstract: Aspects of the subject disclosure may include, for example, applying first data associated with a first content item to a model to generate first classification characteristics, analyzing the first classification characteristics to generate a first marker, wherein the first marker delineates a first location of inventory within the first content item, selecting a first creative to populate a portion of the inventory, and populating, based on the selecting, the portion of the inventory with the first creative. Other embodiments are disclosed.

Type: Grant

Filed: May 13, 2021

Date of Patent: November 15, 2022

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Binny Asarikuniyil, Megha Venugopal
Virtual assistant operation in multi-device environments

Patent number: 11495218

Abstract: Systems and processes for providing a virtual assistant service are provided. In accordance with one or more examples, a method includes receiving, from an accessory device communicatively coupled to the first electronic device, a representation of a speech input representing a user request. The method further includes detecting a second electronic device and transmitting, from the first electronic device, a representation of the user request and data associated with the detected second electronic device to a third electronic device. The method further includes receiving, from the third electronic device, a determination of whether a task is to be performed by the second electronic device in accordance with the user request; and in accordance with a determination that a task is to be performed by the second electronic device, requesting the second electronic device to performed the task in accordance with the user request.

Type: Grant

Filed: August 31, 2018

Date of Patent: November 8, 2022

Assignee: Apple Inc.

Inventors: Brandon J. Newendorp, Anumita Biswas, Gagan A. Gupta, Benjamin S. Phipps, Kisun You
Detecting potential significant errors in speech recognition results

Patent number: 11495208

Abstract: In some embodiments, recognition results produced by a speech processing system (which may include two or more recognition results, including a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential errors. In some embodiments, the indications of potential errors may include discrepancies between recognition results that are meaningful for a domain, such as medically-meaningful discrepancies. The evaluation of the recognition results may be carried out using any suitable criteria, including one or more criteria that differ from criteria used by an ASR system in determining the top recognition result and the alternative recognition results from the speech input. In some embodiments, a recognition result may additionally or alternatively be processed to determine whether the recognition result includes a word or phrase that is unlikely to appear in a domain to which speech input relates.

Type: Grant

Filed: October 23, 2017

Date of Patent: November 8, 2022

Assignee: Nuance Communications, Inc.

Inventors: William F. Ganong, III, Raghu Vemula, Robert Fleming
Regional features based speech recognition method and system

Patent number: 11488587

Abstract: Disclosed is a regional-features-based speech recognition method, including learning speech features by region using speech data classified by region category, and recognizing input speech using an acoustic model and a language model generated through classification of a region category for the input speech and the learning. A user may use a dialect recognition service that is improved using learning based on artificial intelligence (AI) and enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine-type communications (mMTC) techniques of 5G mobile communication.

Type: Grant

Filed: March 18, 2020

Date of Patent: November 1, 2022

Assignee: LG ELECTRONICS INC.

Inventor: Seonyeong Park
Dynamically assigning wake words

Patent number: 11482222

Abstract: A method and apparatus for determining a unique wake word for devices within an incident. One system includes an electronic computing device comprising a transceiver and an electronic processor communicatively coupled to the transceiver. The electronic processor is configured to receive a notification indicative of an occurrence of an incident and one or more communication devices present at the incident, determine contextual information associated with the incident and the one or more communication devices, and identify one or more wake words based on the contextual information. The electronic processor is further configured to determine a phonetic distance for each pair of wake words included in the one or more wake words, and select a unique wake word from the one or more wake words for each communication device of the one or more communication devices based on the determined phonetic distance.

Type: Grant

Filed: March 12, 2020

Date of Patent: October 25, 2022

Assignee: MOTOROLA SOLUTIONS, INC.

Inventors: Sean Regan, Maryam Eneim, Melanie King, Manoj Prasad Nagendra Prasad
Method and apparatus for providing feedback information based on audio input

Patent number: 11475894

Abstract: This application discloses a method and apparatus for processing audio information, a storage medium, and an electronic apparatus. The method includes: detecting that a segment of audio information is being received on a client, a first portion of audio information in the segment of audio information having been currently received on the client; obtaining first information, second information, and third information based on the first portion of audio information that has been currently received, the first information including text information corresponding to the first portion of audio information, the second information including information that meets a target condition and that corresponds to the first information, and the third information including information to be pushed to the client, which is obtained based on a keyword in the first information; and displaying the first information, the second information, and the third information on the client.

Type: Grant

Filed: June 19, 2020

Date of Patent: October 18, 2022

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor: Longbin Li
Method and system for implementing language neutral virtual assistant

Patent number: 11475875

Abstract: In one aspect, a computerized method useful for implementing a language neutral virtual assistant including the step of providing a language detector. The language detector comprises one or more trained language classifiers. With language detector identifying a language of an incoming message from a user to an artificially intelligent (AI) personal assistant. The method includes the step of receiving an incoming message to the AI personal assistant. The method includes the step of normalizing the incoming message, wherein the normalizing the incoming message comprises a set of spelling corrections and a set of grammar corrections. The method includes the step of translating the incoming message to a specified language with a specified encoding process and a specified decoding process. The method includes the step of providing an AI personal assistant engine that comprise an artificial intelligence which conducts a conversation via auditory or textual methods.

Type: Grant

Filed: October 27, 2019

Date of Patent: October 18, 2022

Inventors: Sriram Chakravarthy, Madhav Vodnala, Balakota Srinivas Vinnakota, Ram Menon
Robot capable of conversation with another robot and method of controlling the same

Patent number: 11465290

Abstract: A robot capable of conversation with another robot and a method of controlling the same are disclosed. The robot includes a main body having a first region corresponding to a human face and rotatable in left-right direction directions, a signal generator generating a first data signal to be transmitted to a listener robot and a first robot voice signal corresponding to the first data signal, a communication unit transmitting the first data signal to an external server, a speaker outputting the first robot voice signal, and a controller controlling a rotation direction of the main body such that the first region is directed toward the listener robot at a time point adjacent to a transmission time of the first data signal and controlling the speaker to output the first robot voice signal after the rotation direction of the robot is controlled, wherein the listener robot receives the first data signal transmitted from the external server and is controlled to operate based on the first data signal.

Type: Grant

Filed: August 29, 2019

Date of Patent: October 11, 2022

Assignee: LG ELECTRONICS INC.

Inventors: Ji Yoon Park, Jungkwan Son
Electronic device and control method therefor

Patent number: 11455990

Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.

Type: Grant

Filed: November 23, 2018

Date of Patent: September 27, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jaesung Kwon
Noise reduction in shared workspaces

Patent number: 11455984

Abstract: A method and system of reducing noise associated with telephony-based activities occurring in shared workspaces is provided. An end-user may lower their own voice to a whisper or other less audible or intelligible utterances and submit such low-quality audio signals to an automated speech recognition system via a microphone. The words identified by the automated speech recognition system are provided to a speech synthesizer, and a synthesized audio signal is created artificially that carries the content of the original human-produced utterances. The synthesized audio signal is significantly more audible and intelligible than the original audio signal. The method allows customer support agents to speak at barely audible levels yet be heard clearly by their customers.

Type: Grant

Filed: October 28, 2020

Date of Patent: September 27, 2022

Assignee: United Services Automobile Association (USAA)

Inventors: Justin Dax Haslam, Donnette L. Moncrief Brown, Eric David Schroeder, Ravi Durairaj, Deborah Janette Schulz
Coding of quantization matrices using parametric models

Patent number: 11457214

Abstract: Quantization matrix can be used to adjust quantization of transform coefficients at different frequencies. In one embodiment, a single fixed parametric model, such as a polynomial is used to represent a quantization matrix. Modulation of bit cost and complexity is achieved by specifying only the n first polynomial coefficients, the remaining ones being implicitly set to zero or other default values. One form of the single fixed polynomial is a fully developed polynomial in (x, y), where x, y indicate the coordinates of a given coefficient in a quantization matrix, with terms ordered by increasing exponent. Since higher exponents are the last ones, reducing the number of polynomial coefficients reduces the degree of the polynomial, hence its complexity. The polynomial coefficients can be symmetrical in x and y, and thus reducing the number of polynomial coefficients that need to be signaled in the bitstream.

Type: Grant

Filed: August 8, 2019

Date of Patent: September 27, 2022

Assignee: InterDigital VC Holdings France, SAS

Inventors: Philippe De Lagrange, Ya Chen, Edouard Francois
Presentation support system for displaying keywords for a voice presentation

Patent number: 11443736

Abstract: [Problem] Provided is a presentation support system that makes it possible to give effective presentations, for both presentations by machines and normal presenters. [Solution] The presentation support system included: a display unit 3; a material storage unit 5 that stores a presentation material and a plurality of keywords; an audio storage unit 7; an audio analysis unit 9 that analyzes a term contained in a presentation; a keyword order adjustment unit 11 that analyzes an order of appearance of a plurality of keywords contained in the audio analyzed by the audio analysis unit and changes the order of the plurality of keywords on the basis of the order of appearance; and a display control unit 13 that controls content displayed in the display unit 3.

Type: Grant

Filed: September 9, 2020

Date of Patent: September 13, 2022

Assignee: Interactive Solutions Corp.

Inventor: Kiyoshi Sekine
Variable latency device coordination

Patent number: 11431642

Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, an event associated with an audio input is detected with a first process. In accordance with a detection of the event, a delay value associated with an electronic device is determined. The delay value corresponds to a time required to determine, with a second process, whether the audio input includes a spoken trigger. In accordance with a determination that the delay value exceeds a threshold, the delay value is broadcast during a first advertising session, and determination is made, during a second advertising session, whether the electronic device is to respond to the audio input. In accordance with a determination that the threshold is not exceeded, a determination is made, during the first advertising session, whether the electronic device is to respond to the audio input or wait for the second advertising session.

Type: Grant

Filed: October 13, 2020

Date of Patent: August 30, 2022

Assignee: Apple Inc.

Inventor: Kurt Piersol
Contextual hotwords

Patent number: 11430442

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Grant

Filed: October 12, 2020

Date of Patent: August 30, 2022

Assignee: Google LLC

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
Vehicular function control using sensing device with backlit icons

Patent number: 11427156

Abstract: A method for generating an output for controlling a vehicular function of a vehicle includes providing (i) a vehicular sensing device having at least one illumination source operable to backlight a plurality of icons, each icon representative of a respective vehicle function, and (ii) a plurality of sensors, each sensor having a respective field of sensing associated with a respective icon of the plurality of icons. With the vehicular sensing device disposed at a vehicle, and with the at least one illumination source activated to backlight the plurality of icons, the backlit icons are viewable at an exterior portion of the vehicle, and the sensors sense movement of a person's hand or foot in a field of sensing of one of the sensors, and a controller generates an output to control the vehicular function that is represented by the respective backlit icon associated with that sensor.

Type: Grant

Filed: January 11, 2021

Date of Patent: August 30, 2022

Assignee: MAGNA MIRRORS OF AMERICA, INC.

Inventors: Justin E. Sobecki, David P. O'Connell, Kenneth C. Peterson
Electronic device and control method therefor

Patent number: 11423892

Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.

Type: Grant

Filed: November 23, 2018

Date of Patent: August 23, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jaesung Kwon
Method and device for controlling terminal, and computer readable storage medium

Patent number: 11417331

Abstract: The present disclosure provides a method for controlling a terminal, including the following operations: obtaining recognition results corresponding to control signals after receiving the control signals, and determining whether control instructions corresponding to the recognition results conflict, each control signal comprising at least one of a voice signal or a gesture signal; determining a credibility of each control instruction in response to a determination that there exists conflict among control instructions; and sending the control instruction with highest credibility to a control terminal. The present disclosure further provides a device for controlling a terminal and a computer readable storage medium. When control instructions are received and there exists conflict among control instructions, the control instruction with the highest credibility is sent to the control terminal after the credibility of each control instructions is determined, thereby avoiding settings from conflict.

Type: Grant

Filed: March 6, 2020

Date of Patent: August 16, 2022

Assignees: GD MIDEA AIR-CONDITIONING EQUIPMENT CO., LTD., MIDEA GROUP CO., LTD.

Inventors: Zhicai Ou, Weiying Li
Controlling voice recognition sensitivity for voice recognition

Patent number: 11417321

Abstract: A device for changing a speech recognition sensitivity for speech recognition can include a memory and a processor configured to obtain a first plurality of speech data input at different times, apply a pre-trained speech recognition model to the first plurality of speech data at a plurality of different speech recognition sensitivities, obtain a first speech recognition sensitivity from among the plurality of different speech recognition sensitivities based on the pre-trained speech recognition model and the plurality of different speech recognition sensitivities, the first speech recognition sensitivity corresponding to an optimal speech recognition sensitivity at which a speech recognition success rate of the speech recognition model satisfies a set first recognition success rate criterion, and change a setting of the speech recognition sensitivity based on the first speech recognition sensitivity obtained from among the plurality of different speech recognition sensitivities.

Type: Grant

Filed: April 24, 2020

Date of Patent: August 16, 2022

Assignee: LG ELECTRONICS INC.

Inventors: Sang Won Kim, Joonbeom Lee
Automatic speaker identification in calls using multiple speaker-identification parameters

Patent number: 11417343

Abstract: A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.

Type: Grant

Filed: July 2, 2018

Date of Patent: August 16, 2022

Assignee: ZOOMINFO CONVERSE LLC

Inventors: Raphael Cohen, Erez Volk, Russell Levy, Micha Yochanan Breakstone, Orgad Keller, Ilana Tuil, Amit Ashkenazi
Voice recognition system

Patent number: 11410660

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.

Type: Grant

Filed: April 1, 2020

Date of Patent: August 9, 2022

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
System for minimizing repetition in intelligent virtual assistant conversations

Patent number: 11409961

Abstract: This disclosure describes techniques and architectures for evaluating conversations. In some instances, conversations with users, virtual assistants, and others may be analyzed to identify potential risks within a language model that is employed by the virtual assistants and other entities. The potential risks may be evaluated by administrators, users, systems, and others to identify potential issues with the language model that need to be addressed. This may allow the language model to be improved and enhance user experience with the virtual assistants and others that employ the language model.

Type: Grant

Filed: October 10, 2019

Date of Patent: August 9, 2022

Inventors: Cynthia Freeman, Ian Beaver
Multiple digital assistant coordination in vehicular environments

Patent number: 11410648

Abstract: The present disclosure is generally related to a data processing system to selectively invoke applications for execution. A data processing system can receive an input audio signal and can parse the input audio signal to identify a command. The data processing system can identify a first functionality of a first digital assistant application hosted on the data processing system in the vehicle and a second functionality of a second digital assistant application accessible via a client device. The data processing system can determine that one of the first functionality or the second functionality supports the command. The data processing system can select one of the first digital assistant application or the second digital assistant application based on the determination. The data processing system invoke one of the first digital assistant application or the second digital assistant application based on the selection.

Type: Grant

Filed: October 3, 2017

Date of Patent: August 9, 2022

Assignee: GOOGLE LLC

Inventors: Haris Ramic, Vikram Aggarwal, Moises Morgenstern Gali, Brandon Stuut
Information processing apparatus and information processing method

Patent number: 11405522

Abstract: The present technology relates to an information processing device, and an information processing method, each of which enables to reduce a confirmation load put on a user before a task is executed. The information processing device according to one embodiment of the present technology has the feature of, on the basis of relationship between a first cost required in a case where execution of a predetermined task has been a mistake and a second cost that is allowed by a user for the predetermined task that has been executed by mistake, calculating a confirmation degree of confirming the user as to whether or not to execute the predetermined task, and performing the confirmation by contents corresponding to the calculated degree. The present technology can be applied to an agent apparatus that operates using a voice UI.

Type: Grant

Filed: April 13, 2018

Date of Patent: August 2, 2022

Assignee: SONY CORPORATION

Inventor: Katsuyoshi Kanemoto
Interactive augmentation and integration of real-time speech-to-text

Patent number: 11404049

Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for integrating speech-to-text transcription in a productivity application are presented. A request to access a real-time speech-to-text transcription of an audio signal that is being received by a second device is sent by a first device. The real-time speech-to-text transcription may be surfaced in a transcription pane of a productivity application on the first device. A request to translate the transcription to a different language may be received. The transcription may be translated in real-time and surfaced in the transcription pane. A selection of a word in the surfaced transcription may be received. A request to drag the word from the transcription pane and drop it in a window in the productivity application outside of the transcription pane may be received. The word may be surfaced in the window in the productivity application outside of the transcription pane.

Type: Grant

Filed: December 9, 2019

Date of Patent: August 2, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Dana Minh Nguyen, Rohail Mustafa Syed, Alisa Marilyn Bacon, William Duncan Lewis, Michael Tholfsen, Carly Larsson
Sentence generation device, model learning device, sentence generation method, model learning method, and program

Patent number: 11403469

Abstract: The present invention makes it possible to generate a paraphrastic sentence that has a similar meaning to the original sentence despite a local word/phrase difference, or a non-paraphrastic sentence that is not a paraphrase despite having a similar meaning to the original sentence in terms of the entire sentence. An estimation unit 22 estimates a word deletion probability for each of words constituting an input sentence, by using a positive example model that has been trained based on a positive example constituted by a sentence and a paraphrastic sentence of the sentence, and is used to generate a paraphrastic sentence by deleting a word, or by using a negative example model that has been trained based on a negative example constituted by the sentence and a non-paraphrastic sentence of the sentence, and is used to generate a non-paraphrastic sentence by deleting a word.

Type: Grant

Filed: July 23, 2019

Date of Patent: August 2, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Itsumi Saito, Kyosuke Nishida, Hisako Asano, Junji Tomita
Phonetic patterns for fuzzy matching in natural language processing

Patent number: 11397856

Abstract: A token is extracted from a Natural Language input. A phonetic pattern is computed corresponding to the token, the phonetic pattern including a sound pattern that represents a part of the token when the token is spoken. New data is created from data of the phonetic pattern, the new data including a syllable sequence corresponding to the phonetic pattern. A state of a data storage device is changed by storing the new data in a matrix of syllable sequences corresponding to the token. An option is selected that corresponds to the token by executing a fuzzy matching algorithm using a processor and a memory, the selecting of the option is based on a syllable sequence in the matrix.

Type: Grant

Filed: November 26, 2019

Date of Patent: July 26, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sean M. Fuoco, John M. Ganci, Jr., Craig M. Trim, Jie Zeng
Natural machine conversing method and apparatus

Patent number: 11393464

Abstract: Apparatuses, methods and storage medium associated with a spoken dialogue system are disclosed herein. In embodiments, an apparatus for natural machine conversing with a user may comprise a listening component to detect a keyword that denotes start of a conversation; a dialogue engine to converse with the user during the conversation; and a controller to selectively activate or cause to be activated one of the listening component or the dialogue component, and to pass control to the activated listening component or the activated dialogue engine, based at least in part on a state of the conversation. Other embodiments may be disclosed or claimed.

Type: Grant

Filed: June 6, 2019

Date of Patent: July 19, 2022

Assignee: Intel Corporation

Inventors: Lavinia A. Danielescu, Shawn C. Nikkila, Robert J. Firby, Beth Ann Hockey
Analyzing readability of communications

Patent number: 11392665

Abstract: A computer-implemented method, system, and computer program product for analyzing readability of a communication intended for a target audience includes: analyzing the communication to determine a first readability measure associated with the communication; determining a second readability measure associated with the target audience based on one or more historical communications previously transmitted or received by the target audience; and generating a readability feedback signal for the communication based on the first readability measure and the second readability measure.

Type: Grant

Filed: August 1, 2019

Date of Patent: July 19, 2022

Assignee: International Business Machines Corporation

Inventors: Adam Pilkington, Graham Charters, Gordon Hutchison, Tim Mitchell
Systems and methods for active noise cancellation for interior of autonomous vehicle

Patent number: 11386910

Abstract: Various technologies described herein pertain to active noise cancellation in the interior of a vehicle. In exemplary embodiments, a microphone mounted on the vehicle outputs an audio signal indicative of noise emitted by a noise source. A computing system of the vehicle determines a position of the noise source based upon sensor signals output by sensors mounted on the vehicle. The computing system further determines a position of a passenger in the vehicle based upon a sensor mounted inside the vehicle. The computing system generates a complementary signal that is configured to attenuate the noise based upon the audio signal, the position of the noise source, and the position of the passenger. The complementary signal is then output by way of a speaker in the interior of the vehicle.

Type: Grant

Filed: May 11, 2020

Date of Patent: July 12, 2022

Assignee: GM CRUISE HOLDINGS LLC

Inventors: Marko Tintor, Matt Fornero
Methods and systems for audio sample quality control

Patent number: 11386919

Abstract: The present disclosure provides methods and systems that may be used for providing quality control for audio samples. The audio samples may be speech samples of a user. The user may be participating in an audio interview.

Type: Grant

Filed: December 31, 2020

Date of Patent: July 12, 2022

Assignee: AC Global Risk, Inc.

Inventor: James A. Kane
Natural language understanding

Patent number: 11386890

Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.

Type: Grant

Filed: February 11, 2020

Date of Patent: July 12, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Xing Fan, Zheng Chen, Yuan Ling, Lambert Leo Mathias, Chenlei Guo
Data bundle generation and deployment

Patent number: 11386233

Abstract: The present disclosure provides a method, system, and device for distributing a software release. To illustrate, based on one or more files for distribution as a software release, a release bundle is generated that includes release bundle information, such as, for each file of the one or more files, a checksum, meta data, or both. One or more other aspects of the present disclosure further provide sending the release bundle to a node device. After receiving the release bundle at the node device, the node device receives and stores at least one file at a transaction directory. After verification that each of the one or more files is present/available at the node device, the one or more files may be provided to a memory of a node device and meta data included in the release bundle information may be applied to the one or more files transferred to the memory.

Type: Grant

Filed: April 30, 2019

Date of Patent: July 12, 2022

Assignee: JFrog, Ltd.

Inventor: Yoav Landman
Iterative intent building utilizing dynamic scheduling of batch utterance expansion methods

Patent number: 11380306

Abstract: Expansion of intent classification data utilizing batch utterance scheduling, by a processor in a computing environment. A set of unlabeled examples for intent processing is received by an intent builder iteratively defining an intent. The set of examples are separated into a first subset processed according to a first model and a second subset processed according to a second model. The first subset is incorporated into the intent builder during a building iteration and scheduling a first batch processing of the second subset processed according to the second model based on a scheduling criteria. The first batch processing of the second subset is initiated once the scheduling criteria is satisfied. Upon completion of the first batch processing, results of the completion are used to influence additional examples retrieved from the first subset and the second subset during a subsequent building iteration by the intent builder.

Type: Grant

Filed: October 31, 2019

Date of Patent: July 5, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Neil Rohit Mallinar, Rajendra G Ugrani, Ayush Gupta
System and method for call classification

Patent number: 11380303

Abstract: A method for voice call analysis and classification includes intercepting a voice call session between an initiating device and a recipient device. Voice call data exchanged between the initiating device and the recipient device during the voice call session is transformed into a predefined data format. The transformed voice call data is analyzed to determine one or more attributes of the intercepted voice call. One or more features associated with the intercepted voice call session are identified based on the determined one or more attributes. The intercepted voice call is classified using the identified one or more features.

Type: Grant

Filed: January 22, 2021

Date of Patent: July 5, 2022

Assignee: AO Kaspersky Lab

Inventors: Nikolay A. Churaev, Andrey I. Golubev
Wakeword detection

Patent number: 11373644

Abstract: Techniques for implementing multiple wakeword detectors on a single device are described. A digital signal processor (DSP) of the device may implement a wakeword detection component to detect when captured speech includes a wakeword. A companion application installed on the device may implement a wakeword detection component trained using speech of a user of the device. In response to determining that the user spoke the wakeword, the companion application may send audio data representing the speech and data corresponding to the user to at least one server(s) for processing. Further, the device may receive captured speech and captured image data corresponding to the captured speech and determine a representation of a user in the captured image data. If the device determines the user is represented in the image data, audio data representing the speech may be sent to at least one server(s) for processing.

Type: Grant

Filed: July 21, 2020

Date of Patent: June 28, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Deepak Yavagal, Ajith Prabhakara, John Gray
Hotword suppression

Patent number: 11373652

Abstract: A method includes obtaining, by data processing hardware, a plurality of non-watermarked speech samples. Each non-watermarked speech does not include an audio watermark sample. The method includes, from each non-watermarked speech sample of the plurality of non-watermarked speech samples, generating one or more corresponding watermarked speech samples that each include at least one audio watermark. The method includes training, using the plurality of non-watermarked speech samples and corresponding watermarked speech samples, a model to determine whether a given audio data sample includes an audio watermark, and after training the model, transmitting the trained model to a user computing device.

Type: Grant

Filed: May 14, 2020

Date of Patent: June 28, 2022

Assignee: Google LLC

Inventors: Alexander H. Gruenstein, Taral Pradeep Joglekar, Vijayaditya Peddinti, Michiel A. u. Bacchiani
Generation of audience appropriate content

Patent number: 11358063

Abstract: Multimedia content to be played on a multimedia player device can be received. Whether the multimedia content contains audience-inappropriate content can be determined. Replacement content corresponding to the audience-inappropriate content can be generated. The generated replacement content can be caused to play on the multimedia player device in lieu of the audience-inappropriate content.

Type: Grant

Filed: March 6, 2020

Date of Patent: June 14, 2022

Assignee: International Business Machines Corporation

Inventors: Maryam Ashoori, Anamika Dayaram Singh, Priti Ashvin Shah
Conditional wake word eventing based on environment

Patent number: 11361756

Abstract: In one aspect, a playback device includes at least one microphone configured to detect sound. The playback detects sound via the one or more microphones and determines whether (i) the detected sound includes a voice input, (ii) the detected sound excludes background speech, and (iii) the voice input includes a command keyword. In response to the determining, the playback device performs a playback function corresponding to the command keyword.

Type: Grant

Filed: June 12, 2019

Date of Patent: June 14, 2022

Assignee: Sonos, Inc.

Inventors: Connor Smith, John Tolomei, Kurt Soto
Speech filtering in a vehicle

Patent number: 11355136

Abstract: A computer includes a processor and a memory storing instructions executable by the processor to identify an occupant in a passenger cabin of a vehicle, detect a position of a head of the occupant relative to the passenger cabin, apply a first filter to speech from the occupant based on the position of the head, generate a second filter, apply the second filter to the speech, adjust the second filter based on a difference between the speech of the occupant filtered by the second filter and a prestored profile of the occupant, and perform an operation using the speech filtered by the first filter and the second filter.

Type: Grant

Filed: January 11, 2021

Date of Patent: June 7, 2022

Assignee: Ford Global Technologies, LLC

Inventors: Scott Andrew Amman, Cynthia M. Neubecker, Pietro Buttolo, Joshua Wheeler, Brian Bennie
Automated ordering system

Patent number: 11355120

Abstract: In some examples, a software agent executing on a server receives a communication comprising a first utterance from a customer and predicts, using an intent classifier, a first intent of the first utterance. Based on determining that the first intent is order-related, the software agent predicts, using a dish classifier, a cart delta vector based at least in part on the first utterance and modifies a cart associated with the customer based on the cart delta vector. The software agent predicts, using a dialog model, a first dialog response based at least in part on the first utterance and provides the first dialog response to the customer using a text-to-speech converter.

Type: Grant

Filed: October 1, 2021

Date of Patent: June 7, 2022

Assignee: ConverseNowAI

Inventors: Zubair Talib, Rahul Aggarwal, Vinay Kumar Shukla, Pranav Nirmal Mehra, Vrajesh Navinchandra Sejpal, Akshay Labh Kayastha, Yuganeshan A J, German Kurt Grin, Fernando Ezequiel Gonzalez, Julia Milanese, Matias Grinberg
Generating self-support metrics based on paralinguistic information

Patent number: 11354754

Abstract: Certain aspects of the present disclosure provide techniques for selecting a response to a self-support query. One example method generally includes receiving an audio stream query including spoken content from a user recorded by a mobile device and determining a set of paralinguistic features from the spoken content. The method further includes estimating an emotional state of the user based on the set of paralinguistic features and identifying subject matter of the spoken content in the audio stream query. The method further includes determining two or more query responses corresponding to the subject matter to present to the user and transmitting at least one query response to the mobile device.

Type: Grant

Filed: February 12, 2020

Date of Patent: June 7, 2022

Assignee: INTUIT, INC.

Inventors: Benjamin Indyk, Igor A. Podgorny, Raymond Chan
Query generation using natural language input

Patent number: 11347802

Abstract: Methods and systems for generation of a database schema compliant search query based on a natural language input are described herein. Natural language input may be received from a computing device. The natural language input may be associated with multiple search requests to a database. The natural language input may be parsed into a plurality of segments. The plurality of segments may be, for example, one or more words of a text string. At least one identifier for the plurality of segments may be associated with one or more confidence values. The natural language input may be converted into a single search query based on the confidence values and/or on a set of rules. The single search query may be initiated with respect to the database. The single search query may fetch content more efficiently than the multiple search requests.

Type: Grant

Filed: July 24, 2019

Date of Patent: May 31, 2022

Assignee: Citrix Systems, Inc.

Inventors: Shiv Prasad Khillar, Saifulla Shaik, Nagendra Tank
Method for controlling remote controller to avoid loss of function through a low voltage condition, remote controller device, and non-transitory storage medium

Patent number: 11348578

Abstract: A method of controlling a battery-powered remote controller to decrease a duty cycle to allow continued operations despite the quantity of the battery is bad determines a drop in voltage of the battery in standby mode as voltage of the battery is being read. When receiving a command to activate a voice function, determining whether the drop in voltage in standby mode is greater than or equal to a preset value. If yes, the method then determines whether the drop in voltage falls in a preset range. If yes, the method regulates a duty cycle of the pulse signal activating the voice function, and activates the voice function as required. A remote controller and a non-transitory storage medium are also provided.

Type: Grant

Filed: July 22, 2019

Date of Patent: May 31, 2022

Assignee: Nanning FuLian FuGui Precision Industrial Co., Ltd.

Inventors: Huang-Yu Chiang, Chung-Chih Yeh
Method and apparatus for recognizing speaker by using a resonator

Patent number: 11341973

Abstract: Provided are a method and device for recognizing a speaker by using a resonator. The method of recognizing the speaker includes receiving a plurality of electrical signals corresponding to a speech of the speaker from a plurality of resonators having different resonance bands; obtaining a difference of magnitudes of the plurality of electrical signals; and recognizing the speaker based on the difference of magnitudes of the plurality of electrical signals.

Type: Grant

Filed: December 19, 2017

Date of Patent: May 24, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Cheheung Kim, Sungchan Kang, Sangha Park, Yongseop Yoon, Choongho Rhee
Information processing apparatus and game image distributing method

Patent number: 11338211

Abstract: An application execution unit 110 generates a game image. A message generation unit 112 generates a notification message. An image processing unit 118 generates a distribution image including the game image. A distribution processing unit 126 distributes the distribution image to one or more information processing terminals through a shared server. A setting unit 114 allows a user to set whether or not the notification message is included in the distribution image so as to be visually recognizable, and registers setting contents in a storage apparatus.

Type: Grant

Filed: November 22, 2018

Date of Patent: May 24, 2022

Assignee: SONY INTERACTIVE ENTERTAINMENT INC.

Inventors: Masahiro Fujihara, Kiyobumi Matsunaga
Multiple classifications of audio data

Patent number: 11335347

Abstract: Described herein is a system for sentiment detection in audio data. The system is trained using acoustic information and lexical information to determine a sentiment corresponding to an utterance. In some cases when lexical information is not available, the system (trained on acoustic and lexical information) is configured to determine a sentiment using only acoustic information.

Type: Grant

Filed: June 3, 2019

Date of Patent: May 17, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Gustavo Alfonso Aguilar Alas, Viktor Rozgic, Chao Wang
System, method, and apparatus for virtualizing digital assistants

Patent number: 11337061

Abstract: A system and method for providing anonymous communications from a user to a called party includes obtaining a dedicated phone number and creating a user account for the user and assigning the dedicated phone number to the user account. A provider account is created for a digital assistant using the dedicated phone number and the digital assistant is preprogrammed with the user account. The digital assistant is also preprogrammed with a skill for recognizing a specific utterance (e.g. “Call”). Connectivity is provided between the digital assistant and the Internet, for example, using a wireless access point. The digital assistant listens for the specific utterance and, upon recognizing the specific utterance followed by an identification of the called party, the digital assistant initiates a voice call through the Internet to the called party.

Type: Grant

Filed: November 6, 2020

Date of Patent: May 17, 2022

Assignee: Ways Investments, LLC

Inventor: Mark Edward Gray

prev 1 2 3 4 5 6 7 … next