Word Recognition Patents (Class 704/251)
  • Patent number: 11520610
    Abstract: Embodiments described herein are generally directed towards systems and methods relating to a crowd-sourced digital assistant and system. In particular, embodiments facilitate the intuitive creation and distribution of action datasets that include computing events or tasks that can be reproduced when an associated command, stored in an action dataset, is determined received by a digital assistant device. The digital assistant device described herein can generate new action datasets, on-board new action datasets, and receive new action datasets or updates to existing action datasets. Each digital assistant device in the described system can participate in the building of action datasets, so as to crowd-source a dialect that can be understood by a digital assistant device.
    Type: Grant
    Filed: May 18, 2018
    Date of Patent: December 6, 2022
    Assignee: PELOTON INTERACTIVE INC.
    Inventors: Rajat Mukherjee, Kiran Bindhu Hemaraj, Matan Levi
  • Patent number: 11514916
    Abstract: A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.
    Type: Grant
    Filed: August 13, 2020
    Date of Patent: November 29, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo Kim, Sichen Jin, Kyungmin Lee, Dhananjaya N. Gowda, Kwangyoun Kim
  • Patent number: 11507907
    Abstract: Systems for optimized forecasting are provided. In some examples, data associated with strategy of one or more business units may be received. The strategy data may include identification of projects or goals. In some examples, industry trend data may be received and may include data associated with in-demand job skills and the like. An instruction to capture user data may be transmitted to one or more user devices of an employee user. The instruction may cause activation of one or more sensors or data capture devices. The captured user data may be received and analyzed to determine a competency of the user. Based on the strategy data, industry data and determined competency, one or more deficiencies between the resources needed to meet the business unit strategy data and the available resources may be identified. Based on the identified deficiency, one or more actions for execution may be identified and executed.
    Type: Grant
    Filed: December 9, 2020
    Date of Patent: November 22, 2022
    Assignee: Bank of America Corporation
    Inventors: Sandeep Kumar Chauhan, Madhuri Aniruddha Deshpande, Moses Salagala, Jagadish Reddy
  • Patent number: 11503383
    Abstract: Aspects of the subject disclosure may include, for example, applying first data associated with a first content item to a model to generate first classification characteristics, analyzing the first classification characteristics to generate a first marker, wherein the first marker delineates a first location of inventory within the first content item, selecting a first creative to populate a portion of the inventory, and populating, based on the selecting, the portion of the inventory with the first creative. Other embodiments are disclosed.
    Type: Grant
    Filed: May 13, 2021
    Date of Patent: November 15, 2022
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Binny Asarikuniyil, Megha Venugopal
  • Patent number: 11495218
    Abstract: Systems and processes for providing a virtual assistant service are provided. In accordance with one or more examples, a method includes receiving, from an accessory device communicatively coupled to the first electronic device, a representation of a speech input representing a user request. The method further includes detecting a second electronic device and transmitting, from the first electronic device, a representation of the user request and data associated with the detected second electronic device to a third electronic device. The method further includes receiving, from the third electronic device, a determination of whether a task is to be performed by the second electronic device in accordance with the user request; and in accordance with a determination that a task is to be performed by the second electronic device, requesting the second electronic device to performed the task in accordance with the user request.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: November 8, 2022
    Assignee: Apple Inc.
    Inventors: Brandon J. Newendorp, Anumita Biswas, Gagan A. Gupta, Benjamin S. Phipps, Kisun You
  • Patent number: 11495208
    Abstract: In some embodiments, recognition results produced by a speech processing system (which may include two or more recognition results, including a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential errors. In some embodiments, the indications of potential errors may include discrepancies between recognition results that are meaningful for a domain, such as medically-meaningful discrepancies. The evaluation of the recognition results may be carried out using any suitable criteria, including one or more criteria that differ from criteria used by an ASR system in determining the top recognition result and the alternative recognition results from the speech input. In some embodiments, a recognition result may additionally or alternatively be processed to determine whether the recognition result includes a word or phrase that is unlikely to appear in a domain to which speech input relates.
    Type: Grant
    Filed: October 23, 2017
    Date of Patent: November 8, 2022
    Assignee: Nuance Communications, Inc.
    Inventors: William F. Ganong, III, Raghu Vemula, Robert Fleming
  • Patent number: 11488587
    Abstract: Disclosed is a regional-features-based speech recognition method, including learning speech features by region using speech data classified by region category, and recognizing input speech using an acoustic model and a language model generated through classification of a region category for the input speech and the learning. A user may use a dialect recognition service that is improved using learning based on artificial intelligence (AI) and enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine-type communications (mMTC) techniques of 5G mobile communication.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: November 1, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Seonyeong Park
  • Patent number: 11482222
    Abstract: A method and apparatus for determining a unique wake word for devices within an incident. One system includes an electronic computing device comprising a transceiver and an electronic processor communicatively coupled to the transceiver. The electronic processor is configured to receive a notification indicative of an occurrence of an incident and one or more communication devices present at the incident, determine contextual information associated with the incident and the one or more communication devices, and identify one or more wake words based on the contextual information. The electronic processor is further configured to determine a phonetic distance for each pair of wake words included in the one or more wake words, and select a unique wake word from the one or more wake words for each communication device of the one or more communication devices based on the determined phonetic distance.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: October 25, 2022
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventors: Sean Regan, Maryam Eneim, Melanie King, Manoj Prasad Nagendra Prasad
  • Patent number: 11475894
    Abstract: This application discloses a method and apparatus for processing audio information, a storage medium, and an electronic apparatus. The method includes: detecting that a segment of audio information is being received on a client, a first portion of audio information in the segment of audio information having been currently received on the client; obtaining first information, second information, and third information based on the first portion of audio information that has been currently received, the first information including text information corresponding to the first portion of audio information, the second information including information that meets a target condition and that corresponds to the first information, and the third information including information to be pushed to the client, which is obtained based on a keyword in the first information; and displaying the first information, the second information, and the third information on the client.
    Type: Grant
    Filed: June 19, 2020
    Date of Patent: October 18, 2022
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Longbin Li
  • Patent number: 11475875
    Abstract: In one aspect, a computerized method useful for implementing a language neutral virtual assistant including the step of providing a language detector. The language detector comprises one or more trained language classifiers. With language detector identifying a language of an incoming message from a user to an artificially intelligent (AI) personal assistant. The method includes the step of receiving an incoming message to the AI personal assistant. The method includes the step of normalizing the incoming message, wherein the normalizing the incoming message comprises a set of spelling corrections and a set of grammar corrections. The method includes the step of translating the incoming message to a specified language with a specified encoding process and a specified decoding process. The method includes the step of providing an AI personal assistant engine that comprise an artificial intelligence which conducts a conversation via auditory or textual methods.
    Type: Grant
    Filed: October 27, 2019
    Date of Patent: October 18, 2022
    Inventors: Sriram Chakravarthy, Madhav Vodnala, Balakota Srinivas Vinnakota, Ram Menon
  • Patent number: 11465290
    Abstract: A robot capable of conversation with another robot and a method of controlling the same are disclosed. The robot includes a main body having a first region corresponding to a human face and rotatable in left-right direction directions, a signal generator generating a first data signal to be transmitted to a listener robot and a first robot voice signal corresponding to the first data signal, a communication unit transmitting the first data signal to an external server, a speaker outputting the first robot voice signal, and a controller controlling a rotation direction of the main body such that the first region is directed toward the listener robot at a time point adjacent to a transmission time of the first data signal and controlling the speaker to output the first robot voice signal after the rotation direction of the robot is controlled, wherein the listener robot receives the first data signal transmitted from the external server and is controlled to operate based on the first data signal.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: October 11, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Ji Yoon Park, Jungkwan Son
  • Patent number: 11455990
    Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.
    Type: Grant
    Filed: November 23, 2018
    Date of Patent: September 27, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Jaesung Kwon
  • Patent number: 11455984
    Abstract: A method and system of reducing noise associated with telephony-based activities occurring in shared workspaces is provided. An end-user may lower their own voice to a whisper or other less audible or intelligible utterances and submit such low-quality audio signals to an automated speech recognition system via a microphone. The words identified by the automated speech recognition system are provided to a speech synthesizer, and a synthesized audio signal is created artificially that carries the content of the original human-produced utterances. The synthesized audio signal is significantly more audible and intelligible than the original audio signal. The method allows customer support agents to speak at barely audible levels yet be heard clearly by their customers.
    Type: Grant
    Filed: October 28, 2020
    Date of Patent: September 27, 2022
    Assignee: United Services Automobile Association (USAA)
    Inventors: Justin Dax Haslam, Donnette L. Moncrief Brown, Eric David Schroeder, Ravi Durairaj, Deborah Janette Schulz
  • Patent number: 11457214
    Abstract: Quantization matrix can be used to adjust quantization of transform coefficients at different frequencies. In one embodiment, a single fixed parametric model, such as a polynomial is used to represent a quantization matrix. Modulation of bit cost and complexity is achieved by specifying only the n first polynomial coefficients, the remaining ones being implicitly set to zero or other default values. One form of the single fixed polynomial is a fully developed polynomial in (x, y), where x, y indicate the coordinates of a given coefficient in a quantization matrix, with terms ordered by increasing exponent. Since higher exponents are the last ones, reducing the number of polynomial coefficients reduces the degree of the polynomial, hence its complexity. The polynomial coefficients can be symmetrical in x and y, and thus reducing the number of polynomial coefficients that need to be signaled in the bitstream.
    Type: Grant
    Filed: August 8, 2019
    Date of Patent: September 27, 2022
    Assignee: InterDigital VC Holdings France, SAS
    Inventors: Philippe De Lagrange, Ya Chen, Edouard Francois
  • Patent number: 11443736
    Abstract: [Problem] Provided is a presentation support system that makes it possible to give effective presentations, for both presentations by machines and normal presenters. [Solution] The presentation support system included: a display unit 3; a material storage unit 5 that stores a presentation material and a plurality of keywords; an audio storage unit 7; an audio analysis unit 9 that analyzes a term contained in a presentation; a keyword order adjustment unit 11 that analyzes an order of appearance of a plurality of keywords contained in the audio analyzed by the audio analysis unit and changes the order of the plurality of keywords on the basis of the order of appearance; and a display control unit 13 that controls content displayed in the display unit 3.
    Type: Grant
    Filed: September 9, 2020
    Date of Patent: September 13, 2022
    Assignee: Interactive Solutions Corp.
    Inventor: Kiyoshi Sekine
  • Patent number: 11431642
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, an event associated with an audio input is detected with a first process. In accordance with a detection of the event, a delay value associated with an electronic device is determined. The delay value corresponds to a time required to determine, with a second process, whether the audio input includes a spoken trigger. In accordance with a determination that the delay value exceeds a threshold, the delay value is broadcast during a first advertising session, and determination is made, during a second advertising session, whether the electronic device is to respond to the audio input. In accordance with a determination that the threshold is not exceeded, a determination is made, during the first advertising session, whether the electronic device is to respond to the audio input or wait for the second advertising session.
    Type: Grant
    Filed: October 13, 2020
    Date of Patent: August 30, 2022
    Assignee: Apple Inc.
    Inventor: Kurt Piersol
  • Patent number: 11430442
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.
    Type: Grant
    Filed: October 12, 2020
    Date of Patent: August 30, 2022
    Assignee: Google LLC
    Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
  • Patent number: 11427156
    Abstract: A method for generating an output for controlling a vehicular function of a vehicle includes providing (i) a vehicular sensing device having at least one illumination source operable to backlight a plurality of icons, each icon representative of a respective vehicle function, and (ii) a plurality of sensors, each sensor having a respective field of sensing associated with a respective icon of the plurality of icons. With the vehicular sensing device disposed at a vehicle, and with the at least one illumination source activated to backlight the plurality of icons, the backlit icons are viewable at an exterior portion of the vehicle, and the sensors sense movement of a person's hand or foot in a field of sensing of one of the sensors, and a controller generates an output to control the vehicular function that is represented by the respective backlit icon associated with that sensor.
    Type: Grant
    Filed: January 11, 2021
    Date of Patent: August 30, 2022
    Assignee: MAGNA MIRRORS OF AMERICA, INC.
    Inventors: Justin E. Sobecki, David P. O'Connell, Kenneth C. Peterson
  • Patent number: 11423892
    Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.
    Type: Grant
    Filed: November 23, 2018
    Date of Patent: August 23, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Jaesung Kwon
  • Patent number: 11417331
    Abstract: The present disclosure provides a method for controlling a terminal, including the following operations: obtaining recognition results corresponding to control signals after receiving the control signals, and determining whether control instructions corresponding to the recognition results conflict, each control signal comprising at least one of a voice signal or a gesture signal; determining a credibility of each control instruction in response to a determination that there exists conflict among control instructions; and sending the control instruction with highest credibility to a control terminal. The present disclosure further provides a device for controlling a terminal and a computer readable storage medium. When control instructions are received and there exists conflict among control instructions, the control instruction with the highest credibility is sent to the control terminal after the credibility of each control instructions is determined, thereby avoiding settings from conflict.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: August 16, 2022
    Assignees: GD MIDEA AIR-CONDITIONING EQUIPMENT CO., LTD., MIDEA GROUP CO., LTD.
    Inventors: Zhicai Ou, Weiying Li
  • Patent number: 11417321
    Abstract: A device for changing a speech recognition sensitivity for speech recognition can include a memory and a processor configured to obtain a first plurality of speech data input at different times, apply a pre-trained speech recognition model to the first plurality of speech data at a plurality of different speech recognition sensitivities, obtain a first speech recognition sensitivity from among the plurality of different speech recognition sensitivities based on the pre-trained speech recognition model and the plurality of different speech recognition sensitivities, the first speech recognition sensitivity corresponding to an optimal speech recognition sensitivity at which a speech recognition success rate of the speech recognition model satisfies a set first recognition success rate criterion, and change a setting of the speech recognition sensitivity based on the first speech recognition sensitivity obtained from among the plurality of different speech recognition sensitivities.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: August 16, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Sang Won Kim, Joonbeom Lee
  • Patent number: 11417343
    Abstract: A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.
    Type: Grant
    Filed: July 2, 2018
    Date of Patent: August 16, 2022
    Assignee: ZOOMINFO CONVERSE LLC
    Inventors: Raphael Cohen, Erez Volk, Russell Levy, Micha Yochanan Breakstone, Orgad Keller, Ilana Tuil, Amit Ashkenazi
  • Patent number: 11410660
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.
    Type: Grant
    Filed: April 1, 2020
    Date of Patent: August 9, 2022
    Assignee: Google LLC
    Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
  • Patent number: 11409961
    Abstract: This disclosure describes techniques and architectures for evaluating conversations. In some instances, conversations with users, virtual assistants, and others may be analyzed to identify potential risks within a language model that is employed by the virtual assistants and other entities. The potential risks may be evaluated by administrators, users, systems, and others to identify potential issues with the language model that need to be addressed. This may allow the language model to be improved and enhance user experience with the virtual assistants and others that employ the language model.
    Type: Grant
    Filed: October 10, 2019
    Date of Patent: August 9, 2022
    Inventors: Cynthia Freeman, Ian Beaver
  • Patent number: 11410648
    Abstract: The present disclosure is generally related to a data processing system to selectively invoke applications for execution. A data processing system can receive an input audio signal and can parse the input audio signal to identify a command. The data processing system can identify a first functionality of a first digital assistant application hosted on the data processing system in the vehicle and a second functionality of a second digital assistant application accessible via a client device. The data processing system can determine that one of the first functionality or the second functionality supports the command. The data processing system can select one of the first digital assistant application or the second digital assistant application based on the determination. The data processing system invoke one of the first digital assistant application or the second digital assistant application based on the selection.
    Type: Grant
    Filed: October 3, 2017
    Date of Patent: August 9, 2022
    Assignee: GOOGLE LLC
    Inventors: Haris Ramic, Vikram Aggarwal, Moises Morgenstern Gali, Brandon Stuut
  • Patent number: 11405522
    Abstract: The present technology relates to an information processing device, and an information processing method, each of which enables to reduce a confirmation load put on a user before a task is executed. The information processing device according to one embodiment of the present technology has the feature of, on the basis of relationship between a first cost required in a case where execution of a predetermined task has been a mistake and a second cost that is allowed by a user for the predetermined task that has been executed by mistake, calculating a confirmation degree of confirming the user as to whether or not to execute the predetermined task, and performing the confirmation by contents corresponding to the calculated degree. The present technology can be applied to an agent apparatus that operates using a voice UI.
    Type: Grant
    Filed: April 13, 2018
    Date of Patent: August 2, 2022
    Assignee: SONY CORPORATION
    Inventor: Katsuyoshi Kanemoto
  • Patent number: 11404049
    Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for integrating speech-to-text transcription in a productivity application are presented. A request to access a real-time speech-to-text transcription of an audio signal that is being received by a second device is sent by a first device. The real-time speech-to-text transcription may be surfaced in a transcription pane of a productivity application on the first device. A request to translate the transcription to a different language may be received. The transcription may be translated in real-time and surfaced in the transcription pane. A selection of a word in the surfaced transcription may be received. A request to drag the word from the transcription pane and drop it in a window in the productivity application outside of the transcription pane may be received. The word may be surfaced in the window in the productivity application outside of the transcription pane.
    Type: Grant
    Filed: December 9, 2019
    Date of Patent: August 2, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Dana Minh Nguyen, Rohail Mustafa Syed, Alisa Marilyn Bacon, William Duncan Lewis, Michael Tholfsen, Carly Larsson
  • Patent number: 11403469
    Abstract: The present invention makes it possible to generate a paraphrastic sentence that has a similar meaning to the original sentence despite a local word/phrase difference, or a non-paraphrastic sentence that is not a paraphrase despite having a similar meaning to the original sentence in terms of the entire sentence. An estimation unit 22 estimates a word deletion probability for each of words constituting an input sentence, by using a positive example model that has been trained based on a positive example constituted by a sentence and a paraphrastic sentence of the sentence, and is used to generate a paraphrastic sentence by deleting a word, or by using a negative example model that has been trained based on a negative example constituted by the sentence and a non-paraphrastic sentence of the sentence, and is used to generate a non-paraphrastic sentence by deleting a word.
    Type: Grant
    Filed: July 23, 2019
    Date of Patent: August 2, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Itsumi Saito, Kyosuke Nishida, Hisako Asano, Junji Tomita
  • Patent number: 11397856
    Abstract: A token is extracted from a Natural Language input. A phonetic pattern is computed corresponding to the token, the phonetic pattern including a sound pattern that represents a part of the token when the token is spoken. New data is created from data of the phonetic pattern, the new data including a syllable sequence corresponding to the phonetic pattern. A state of a data storage device is changed by storing the new data in a matrix of syllable sequences corresponding to the token. An option is selected that corresponds to the token by executing a fuzzy matching algorithm using a processor and a memory, the selecting of the option is based on a syllable sequence in the matrix.
    Type: Grant
    Filed: November 26, 2019
    Date of Patent: July 26, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sean M. Fuoco, John M. Ganci, Jr., Craig M. Trim, Jie Zeng
  • Patent number: 11393464
    Abstract: Apparatuses, methods and storage medium associated with a spoken dialogue system are disclosed herein. In embodiments, an apparatus for natural machine conversing with a user may comprise a listening component to detect a keyword that denotes start of a conversation; a dialogue engine to converse with the user during the conversation; and a controller to selectively activate or cause to be activated one of the listening component or the dialogue component, and to pass control to the activated listening component or the activated dialogue engine, based at least in part on a state of the conversation. Other embodiments may be disclosed or claimed.
    Type: Grant
    Filed: June 6, 2019
    Date of Patent: July 19, 2022
    Assignee: Intel Corporation
    Inventors: Lavinia A. Danielescu, Shawn C. Nikkila, Robert J. Firby, Beth Ann Hockey
  • Patent number: 11392665
    Abstract: A computer-implemented method, system, and computer program product for analyzing readability of a communication intended for a target audience includes: analyzing the communication to determine a first readability measure associated with the communication; determining a second readability measure associated with the target audience based on one or more historical communications previously transmitted or received by the target audience; and generating a readability feedback signal for the communication based on the first readability measure and the second readability measure.
    Type: Grant
    Filed: August 1, 2019
    Date of Patent: July 19, 2022
    Assignee: International Business Machines Corporation
    Inventors: Adam Pilkington, Graham Charters, Gordon Hutchison, Tim Mitchell
  • Patent number: 11386910
    Abstract: Various technologies described herein pertain to active noise cancellation in the interior of a vehicle. In exemplary embodiments, a microphone mounted on the vehicle outputs an audio signal indicative of noise emitted by a noise source. A computing system of the vehicle determines a position of the noise source based upon sensor signals output by sensors mounted on the vehicle. The computing system further determines a position of a passenger in the vehicle based upon a sensor mounted inside the vehicle. The computing system generates a complementary signal that is configured to attenuate the noise based upon the audio signal, the position of the noise source, and the position of the passenger. The complementary signal is then output by way of a speaker in the interior of the vehicle.
    Type: Grant
    Filed: May 11, 2020
    Date of Patent: July 12, 2022
    Assignee: GM CRUISE HOLDINGS LLC
    Inventors: Marko Tintor, Matt Fornero
  • Patent number: 11386919
    Abstract: The present disclosure provides methods and systems that may be used for providing quality control for audio samples. The audio samples may be speech samples of a user. The user may be participating in an audio interview.
    Type: Grant
    Filed: December 31, 2020
    Date of Patent: July 12, 2022
    Assignee: AC Global Risk, Inc.
    Inventor: James A. Kane
  • Patent number: 11386890
    Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.
    Type: Grant
    Filed: February 11, 2020
    Date of Patent: July 12, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Xing Fan, Zheng Chen, Yuan Ling, Lambert Leo Mathias, Chenlei Guo
  • Patent number: 11386233
    Abstract: The present disclosure provides a method, system, and device for distributing a software release. To illustrate, based on one or more files for distribution as a software release, a release bundle is generated that includes release bundle information, such as, for each file of the one or more files, a checksum, meta data, or both. One or more other aspects of the present disclosure further provide sending the release bundle to a node device. After receiving the release bundle at the node device, the node device receives and stores at least one file at a transaction directory. After verification that each of the one or more files is present/available at the node device, the one or more files may be provided to a memory of a node device and meta data included in the release bundle information may be applied to the one or more files transferred to the memory.
    Type: Grant
    Filed: April 30, 2019
    Date of Patent: July 12, 2022
    Assignee: JFrog, Ltd.
    Inventor: Yoav Landman
  • Patent number: 11380306
    Abstract: Expansion of intent classification data utilizing batch utterance scheduling, by a processor in a computing environment. A set of unlabeled examples for intent processing is received by an intent builder iteratively defining an intent. The set of examples are separated into a first subset processed according to a first model and a second subset processed according to a second model. The first subset is incorporated into the intent builder during a building iteration and scheduling a first batch processing of the second subset processed according to the second model based on a scheduling criteria. The first batch processing of the second subset is initiated once the scheduling criteria is satisfied. Upon completion of the first batch processing, results of the completion are used to influence additional examples retrieved from the first subset and the second subset during a subsequent building iteration by the intent builder.
    Type: Grant
    Filed: October 31, 2019
    Date of Patent: July 5, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Neil Rohit Mallinar, Rajendra G Ugrani, Ayush Gupta
  • Patent number: 11380303
    Abstract: A method for voice call analysis and classification includes intercepting a voice call session between an initiating device and a recipient device. Voice call data exchanged between the initiating device and the recipient device during the voice call session is transformed into a predefined data format. The transformed voice call data is analyzed to determine one or more attributes of the intercepted voice call. One or more features associated with the intercepted voice call session are identified based on the determined one or more attributes. The intercepted voice call is classified using the identified one or more features.
    Type: Grant
    Filed: January 22, 2021
    Date of Patent: July 5, 2022
    Assignee: AO Kaspersky Lab
    Inventors: Nikolay A. Churaev, Andrey I. Golubev
  • Patent number: 11373644
    Abstract: Techniques for implementing multiple wakeword detectors on a single device are described. A digital signal processor (DSP) of the device may implement a wakeword detection component to detect when captured speech includes a wakeword. A companion application installed on the device may implement a wakeword detection component trained using speech of a user of the device. In response to determining that the user spoke the wakeword, the companion application may send audio data representing the speech and data corresponding to the user to at least one server(s) for processing. Further, the device may receive captured speech and captured image data corresponding to the captured speech and determine a representation of a user in the captured image data. If the device determines the user is represented in the image data, audio data representing the speech may be sent to at least one server(s) for processing.
    Type: Grant
    Filed: July 21, 2020
    Date of Patent: June 28, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Deepak Yavagal, Ajith Prabhakara, John Gray
  • Patent number: 11373652
    Abstract: A method includes obtaining, by data processing hardware, a plurality of non-watermarked speech samples. Each non-watermarked speech does not include an audio watermark sample. The method includes, from each non-watermarked speech sample of the plurality of non-watermarked speech samples, generating one or more corresponding watermarked speech samples that each include at least one audio watermark. The method includes training, using the plurality of non-watermarked speech samples and corresponding watermarked speech samples, a model to determine whether a given audio data sample includes an audio watermark, and after training the model, transmitting the trained model to a user computing device.
    Type: Grant
    Filed: May 14, 2020
    Date of Patent: June 28, 2022
    Assignee: Google LLC
    Inventors: Alexander H. Gruenstein, Taral Pradeep Joglekar, Vijayaditya Peddinti, Michiel A. u. Bacchiani
  • Patent number: 11358063
    Abstract: Multimedia content to be played on a multimedia player device can be received. Whether the multimedia content contains audience-inappropriate content can be determined. Replacement content corresponding to the audience-inappropriate content can be generated. The generated replacement content can be caused to play on the multimedia player device in lieu of the audience-inappropriate content.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: June 14, 2022
    Assignee: International Business Machines Corporation
    Inventors: Maryam Ashoori, Anamika Dayaram Singh, Priti Ashvin Shah
  • Patent number: 11361756
    Abstract: In one aspect, a playback device includes at least one microphone configured to detect sound. The playback detects sound via the one or more microphones and determines whether (i) the detected sound includes a voice input, (ii) the detected sound excludes background speech, and (iii) the voice input includes a command keyword. In response to the determining, the playback device performs a playback function corresponding to the command keyword.
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: June 14, 2022
    Assignee: Sonos, Inc.
    Inventors: Connor Smith, John Tolomei, Kurt Soto
  • Patent number: 11355136
    Abstract: A computer includes a processor and a memory storing instructions executable by the processor to identify an occupant in a passenger cabin of a vehicle, detect a position of a head of the occupant relative to the passenger cabin, apply a first filter to speech from the occupant based on the position of the head, generate a second filter, apply the second filter to the speech, adjust the second filter based on a difference between the speech of the occupant filtered by the second filter and a prestored profile of the occupant, and perform an operation using the speech filtered by the first filter and the second filter.
    Type: Grant
    Filed: January 11, 2021
    Date of Patent: June 7, 2022
    Assignee: Ford Global Technologies, LLC
    Inventors: Scott Andrew Amman, Cynthia M. Neubecker, Pietro Buttolo, Joshua Wheeler, Brian Bennie
  • Patent number: 11355120
    Abstract: In some examples, a software agent executing on a server receives a communication comprising a first utterance from a customer and predicts, using an intent classifier, a first intent of the first utterance. Based on determining that the first intent is order-related, the software agent predicts, using a dish classifier, a cart delta vector based at least in part on the first utterance and modifies a cart associated with the customer based on the cart delta vector. The software agent predicts, using a dialog model, a first dialog response based at least in part on the first utterance and provides the first dialog response to the customer using a text-to-speech converter.
    Type: Grant
    Filed: October 1, 2021
    Date of Patent: June 7, 2022
    Assignee: ConverseNowAI
    Inventors: Zubair Talib, Rahul Aggarwal, Vinay Kumar Shukla, Pranav Nirmal Mehra, Vrajesh Navinchandra Sejpal, Akshay Labh Kayastha, Yuganeshan A J, German Kurt Grin, Fernando Ezequiel Gonzalez, Julia Milanese, Matias Grinberg
  • Patent number: 11354754
    Abstract: Certain aspects of the present disclosure provide techniques for selecting a response to a self-support query. One example method generally includes receiving an audio stream query including spoken content from a user recorded by a mobile device and determining a set of paralinguistic features from the spoken content. The method further includes estimating an emotional state of the user based on the set of paralinguistic features and identifying subject matter of the spoken content in the audio stream query. The method further includes determining two or more query responses corresponding to the subject matter to present to the user and transmitting at least one query response to the mobile device.
    Type: Grant
    Filed: February 12, 2020
    Date of Patent: June 7, 2022
    Assignee: INTUIT, INC.
    Inventors: Benjamin Indyk, Igor A. Podgorny, Raymond Chan
  • Patent number: 11347802
    Abstract: Methods and systems for generation of a database schema compliant search query based on a natural language input are described herein. Natural language input may be received from a computing device. The natural language input may be associated with multiple search requests to a database. The natural language input may be parsed into a plurality of segments. The plurality of segments may be, for example, one or more words of a text string. At least one identifier for the plurality of segments may be associated with one or more confidence values. The natural language input may be converted into a single search query based on the confidence values and/or on a set of rules. The single search query may be initiated with respect to the database. The single search query may fetch content more efficiently than the multiple search requests.
    Type: Grant
    Filed: July 24, 2019
    Date of Patent: May 31, 2022
    Assignee: Citrix Systems, Inc.
    Inventors: Shiv Prasad Khillar, Saifulla Shaik, Nagendra Tank
  • Patent number: 11348578
    Abstract: A method of controlling a battery-powered remote controller to decrease a duty cycle to allow continued operations despite the quantity of the battery is bad determines a drop in voltage of the battery in standby mode as voltage of the battery is being read. When receiving a command to activate a voice function, determining whether the drop in voltage in standby mode is greater than or equal to a preset value. If yes, the method then determines whether the drop in voltage falls in a preset range. If yes, the method regulates a duty cycle of the pulse signal activating the voice function, and activates the voice function as required. A remote controller and a non-transitory storage medium are also provided.
    Type: Grant
    Filed: July 22, 2019
    Date of Patent: May 31, 2022
    Assignee: Nanning FuLian FuGui Precision Industrial Co., Ltd.
    Inventors: Huang-Yu Chiang, Chung-Chih Yeh
  • Patent number: 11341973
    Abstract: Provided are a method and device for recognizing a speaker by using a resonator. The method of recognizing the speaker includes receiving a plurality of electrical signals corresponding to a speech of the speaker from a plurality of resonators having different resonance bands; obtaining a difference of magnitudes of the plurality of electrical signals; and recognizing the speaker based on the difference of magnitudes of the plurality of electrical signals.
    Type: Grant
    Filed: December 19, 2017
    Date of Patent: May 24, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Cheheung Kim, Sungchan Kang, Sangha Park, Yongseop Yoon, Choongho Rhee
  • Patent number: 11338211
    Abstract: An application execution unit 110 generates a game image. A message generation unit 112 generates a notification message. An image processing unit 118 generates a distribution image including the game image. A distribution processing unit 126 distributes the distribution image to one or more information processing terminals through a shared server. A setting unit 114 allows a user to set whether or not the notification message is included in the distribution image so as to be visually recognizable, and registers setting contents in a storage apparatus.
    Type: Grant
    Filed: November 22, 2018
    Date of Patent: May 24, 2022
    Assignee: SONY INTERACTIVE ENTERTAINMENT INC.
    Inventors: Masahiro Fujihara, Kiyobumi Matsunaga
  • Patent number: 11335347
    Abstract: Described herein is a system for sentiment detection in audio data. The system is trained using acoustic information and lexical information to determine a sentiment corresponding to an utterance. In some cases when lexical information is not available, the system (trained on acoustic and lexical information) is configured to determine a sentiment using only acoustic information.
    Type: Grant
    Filed: June 3, 2019
    Date of Patent: May 17, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Gustavo Alfonso Aguilar Alas, Viktor Rozgic, Chao Wang
  • Patent number: 11337061
    Abstract: A system and method for providing anonymous communications from a user to a called party includes obtaining a dedicated phone number and creating a user account for the user and assigning the dedicated phone number to the user account. A provider account is created for a digital assistant using the dedicated phone number and the digital assistant is preprogrammed with the user account. The digital assistant is also preprogrammed with a skill for recognizing a specific utterance (e.g. “Call”). Connectivity is provided between the digital assistant and the Internet, for example, using a wireless access point. The digital assistant listens for the specific utterance and, upon recognizing the specific utterance followed by an identification of the called party, the digital assistant initiates a voice call through the Internet to the called party.
    Type: Grant
    Filed: November 6, 2020
    Date of Patent: May 17, 2022
    Assignee: Ways Investments, LLC
    Inventor: Mark Edward Gray