Voice Recognition Patents (Class 704/246)
  • Patent number: 10997976
    Abstract: An authentication system prevents leakage of a key-reading speech during user authentication based on the key-reading speech of a user reading an authentication key. For each user ID, a storage stores a voiceprint of a user in association with a recorded sound including speech spoken previously by the user. A specifier specifies the user ID of a user attempting to receive authorization. An outputter outputs a masking sound that includes the recorded sound recorded in association with the specified user ID. An acquirer acquires a key-reading speech of the user reading the authentication key and the output masking sound. A remover acquires a second sound by removing the masking sound from the acquired first sound. A determiner determines whether the user has authority pertaining to the specified user ID based on the acquired second sound.
    Type: Grant
    Filed: April 16, 2019
    Date of Patent: May 4, 2021
    Assignee: Passlogy Co., Ltd.
    Inventors: Motohiko Mitsuno, Hideharu Ogawa
  • Patent number: 10992666
    Abstract: An identity verification method performed at a terminal includes playing in an audio form action guide information including mouth shape guide information selected from a preset action guide information library at a speed corresponding to the action guide information, and collecting a corresponding set of action images within a preset time window; performing matching detection on the collected set of action images and the action guide information, to obtain a living body detection result indicating whether a living body exists in the collected set of action images; according to the living body detection result that indicates that a living body exists in the collected set of action images: collecting user identity information and performing verification according to the collected user identity information, to obtain a user identity information verification result; and determining the identity verification result according to the user identity information verification result.
    Type: Grant
    Filed: August 15, 2019
    Date of Patent: April 27, 2021
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Feiyue Huang, Jilin Li, Guofu Tan, Xiaoli Jiang, Dan Wu, Junwu Chen, Jianguo Xie, Wei Guo, Yihui Liu, Jiandong Xie
  • Patent number: 10984083
    Abstract: The present invention relates to methods, apparatus and systems for authentication of a user based on ear biometric data, and voice biometric data or other authentication data. The ear biometric data may be combined with voice biometric data or with a security question and response.
    Type: Grant
    Filed: July 6, 2018
    Date of Patent: April 20, 2021
    Assignee: Cirrus Logic, Inc.
    Inventors: John Paul Lesso, Thomas Lorenz
  • Patent number: 10984268
    Abstract: Detecting a replay attack on a voice biometrics system comprises: receiving a speech signal; generating an ultrasound signal; detecting a reflection of the generated ultrasound signal; detecting Doppler shifts in the reflection of the generated ultrasound signal; and identifying whether the received speech signal is indicative of the liveness of a speaker based on the detected Doppler shifts. Identifying whether the received speech signal is indicative of liveness based on the detected Doppler shifts comprises determining whether the detected Doppler shifts correspond to a speech articulation rate.
    Type: Grant
    Filed: October 11, 2018
    Date of Patent: April 20, 2021
    Assignee: Cirrus Logic, Inc.
    Inventor: John Paul Lesso
  • Patent number: 10984269
    Abstract: Detecting liveness of a speaker comprises: generating an ultrasound signal; receiving an audio signal comprising a reflection of the ultrasound signal; using the received audio signal comprising the reflection of the ultrasound signal to detect the liveness of a speaker; monitoring ambient ultrasound noise; and adjusting the operation of a system receiving the audio signal, based on a level of the reflected ultrasound and the monitored ambient ultrasound noise. The method can be used in a voice biometrics system, in which case detecting the liveness of a speaker comprises determining whether a received speech signal may be a product of a replay attack. The operation of the voice biometrics system may be adjusted based on a level of the reflected ultrasound and the monitored ambient ultrasound noise.
    Type: Grant
    Filed: October 11, 2018
    Date of Patent: April 20, 2021
    Assignee: Cirrus Logic, Inc.
    Inventor: John Paul Lesso
  • Patent number: 10978061
    Abstract: A method, a computer system, and a computer program product for detecting voice commands. Audio is recorded by the computer system to form a recorded audio. The computer system then determines whether a voice command spoken by a first person is present in the recorded audio. If the voice command is present in the recorded audio, the computer system determines whether the voice command is directed to a second person by the first person. If the voice command is not being directed to the second person, the computer system processes the voice command, wherein processing of the voice command occurs without a wake word.
    Type: Grant
    Filed: March 9, 2018
    Date of Patent: April 13, 2021
    Assignee: International Business Machines Corporation
    Inventors: Gregory J. Boss, Jeremy R. Fox, Andrew R. Jones, John E. Moore, Jr.
  • Patent number: 10972606
    Abstract: A process generates, at a computer-implemented service provider platform, a simulated user request for a service. Further, the process sends, from the computer-implemented service provider platform to a computing device associated with an agent, the simulated user request for a service. Additionally, the process performs, with a processor at the computer-implemented service provider platform, an assessment of agent responsiveness to the simulated user request for the service. Finally, the process automatically generates, with the processor at the computer-implemented service provider platform, one or more actions based on the assessment.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: April 6, 2021
    Assignee: Language Line Services, Inc.
    Inventors: Adam Caldwell, James Boutcher, Jeffrey Cordell, Jordy Boom
  • Patent number: 10970573
    Abstract: A method for user authentication based on keystroke dynamics is provided. The user authentication method includes receiving a keystroke input implemented by a user; separating a sequence of pressed keys into a sequence of bigrams having bigram names simultaneously with the user typing free text; collecting a timing information for each bigram of the sequence of bigrams; extracting a feature vector for each bigram based on the timing information; separating feature vectors into subsets according to the bigram names; estimating a GMM user model using subsets of feature vectors for each bigram; providing real time user authentication using the estimated GMM user model for each bigram and bigram features from current real time user keystroke input. The corresponding system is also provided. The GMM based analysis of the keystroke data separated by bigrams provides strong authentication using free text input, while user additional actions (to be verified) are kept at a minimum.
    Type: Grant
    Filed: April 26, 2019
    Date of Patent: April 6, 2021
    Assignee: ID R&D, INC.
    Inventors: Alexey Khitrov, Konstantin Simonchik
  • Patent number: 10963498
    Abstract: Methods and systems are provided for generating automatic program recommendations based on user interactions. In some embodiments, control circuitry processes verbal data received during an interaction between a user of a user device and a person with whom the user is interacting. The control circuitry analyzes the verbal data to automatically identify a media asset referred to during the interaction by at least one of the user and the person with whom the user is interacting. The control circuitry adds the identified media asset to a list of media assets associated with the user of the user device. The list of media assets is transmitted to a second user device of the user.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: March 30, 2021
    Assignee: Rovi Guides, Inc.
    Inventors: Brian Fife, Jason Braness, Michael Papish, Thomas Steven Woods
  • Patent number: 10960540
    Abstract: Implementations directed to providing a computer-implemented system for performing an action with a robot comprising receiving command information indicating a command related to performance of an action with a robot, identifying state information for a plurality of active routines that are actively running for the robot, the state information indicating a state for each of the active routines, determining contextual information for the command based on the accessed state information for the plurality of active routines, selecting one of the active routines as a handling routine to service the command based on the contextual information, determining an output module of the robot to perform the action based on the state of the handling routine and the contextual information, and executing one or more instructions to perform the action with the output module.
    Type: Grant
    Filed: April 11, 2018
    Date of Patent: March 30, 2021
    Assignee: Accenture Global Solutions Limited
    Inventors: Carl Matthew Dukatz, Nicholas Akiona
  • Patent number: 10957316
    Abstract: An electronic apparatus is provided. The electronic apparatus includes a memory, a microphone and a processor configured to compare a volume of a voice input through the microphone and a standard voice volume stored in the memory, corresponding to a space in which the electronic apparatus is located, and identify whether to perform a voice recognition on the voice based on the comparison.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: March 23, 2021
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Gwi-rang Park
  • Patent number: 10958747
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for adjusting an eligibility value for transmitting a digital component. In one aspect, a computing system includes a server for identifying opportunities to transmit digital components to client devices. The server determines a first probability of a given outcome occurring following user interaction with the digital component when the digital component is transmitted to the client device. The server determines a second probability of the given outcome occurring if the digital component is not transmitted to the client device. The server generates an outcome incrementality factor for the digital component, including determining a ratio of the first probability relative to the second probability, and triggers adjustment of an eligibility value based on the outcome incrementality factor. The server then controls transmission of the digital component to the client device using the adjusted eligibility value.
    Type: Grant
    Filed: August 24, 2017
    Date of Patent: March 23, 2021
    Assignee: Google LLC
    Inventors: Justin Lewis, Thomas Graham Price
  • Patent number: 10957313
    Abstract: Techniques for performing command processing are described. A system receives, from a device, input data corresponding to a command. The input data may originate as audio data, as text data, or as other data. The system determines NLU processing results corresponding to the input data. The NLU processing results may be associated with multiple speechlets. The system also determines NLU confidences for the NLU processing results for each speechlet. The system sends NLU processing results and an indication to provide potential results to a portion of the multiple speechlets, and receives potential results from the portion of the speechlets. The system also receives indications whether the speechlets need to be re-called if the speechlets are selected to execute with respect to the command. The system ranks the portion of the speechlets based at least in part on the NLU processing results as well as the potential results provided by the portion of the speechlets.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: March 23, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Ruhi Sarikaya, Zheng Ma, Simon Peter Reavely, Kerry Hammil, Huinan Ren, Bradford Jason Snow, Jerrin Thomas Elanjikal
  • Patent number: 10956117
    Abstract: A method, system and computer program product includes detecting a volume level for audio input of a first user in a multi-user conference call, and automatically adjusting a volume level for a second user receiving audio output of the first user based on at least one of preferences of the second user, historic data between the first and the second user, and geographic characteristics of the audio input of the first user.
    Type: Grant
    Filed: December 4, 2018
    Date of Patent: March 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Gregory J. Boss, Tamer E. Abuelsaad, John E. Moore, Jr., Randy A. Rendahl
  • Patent number: 10950244
    Abstract: A system and method for enrolling a speaker in a speaker authentication and identification system (AIS), the method comprising: generating a user account, the user account comprising: a user identifier based on one or more metadata elements associated with an audio input received from an end device; generating a first i-vector from an audio frame of the audio input, a trained T-matrix, and a Universal Background Model (UBM), wherein the first i-vector generation comprises an optimized computation; and associating the user account with the first i-vector.
    Type: Grant
    Filed: April 16, 2019
    Date of Patent: March 16, 2021
    Assignee: ILLUMA Labs LLC.
    Inventor: Milind Borkar
  • Patent number: 10950221
    Abstract: A keyword confirmation method and apparatus are provided. A keyword confirmation method includes: obtaining first audio data, the first audio data being recognized as a keyword; obtaining a pronunciation similarity probability of a similar pronunciation unit corresponding to at least one fragment of the first audio data and second audio data; determining that multiple contiguous silence fragments exist in second audio data contiguous in time with the first audio data; utilizing the silence probability, as well as a pronunciation similarity probability corresponding to fragment(s) of the first audio data and/or a pronunciation similarity probability corresponding to fragment(s) of the second audio data, evaluating whether the second audio data is silence; and confirming the first audio data as an effective keyword.
    Type: Grant
    Filed: December 7, 2018
    Date of Patent: March 16, 2021
    Assignee: Alibaba Group Holding Limited
    Inventors: Yong Liu, Haitao Yao
  • Patent number: 10942703
    Abstract: Systems and processes for proactive assistance based on dialog communication between devices are provided. In one example process, while voice communication between an electronic device and a second electronic device is established, a stream of audio data associated with the second electronic device can be received. In response to detecting a user input, a text representation of speech contained in a portion of the stream of audio data can be generated. The process can determine whether the text representation contains information corresponding to one of a plurality of types of information. In response to determining that the text representation contains information corresponding to one of a plurality of types of information, one or more tasks based on the information can be performed.
    Type: Grant
    Filed: January 16, 2019
    Date of Patent: March 9, 2021
    Assignee: Apple Inc.
    Inventors: Mathieu Jean Martel, Thomas Deniau
  • Patent number: 10943099
    Abstract: A computer-implemented method for classifying an input data set within a data category using multiple data representation modes. The method includes identifying at least a first data representation source mode and a second data representation source mode; classifying the at least first data representation source mode via at least a first data recognition tool and the at least second data representation source mode via at least a second data recognition tool, the classifying including allocating a confidence factor for each data representation source mode in the data category; and combining outputs of the classifying into a single output confidence score by using a weighted fusion of the allocated confidence factors.
    Type: Grant
    Filed: February 14, 2020
    Date of Patent: March 9, 2021
    Assignee: BOOZ ALLEN HAMILTON INC.
    Inventors: Nathaniel Jackson Short, Srinivasan Rajaraman, Jonathan M. Levitt
  • Patent number: 10929596
    Abstract: A method and system for using vocal patterns of a user for modifying an electronic dictionary is provided. The method includes continuously retrieving vocal communications of a user and converting the vocal communications into text data. Common terms communicated by the user are selected from the text data and resulting linguistic patterns are determined. In response, a weighted prioritization list of the common terms is generated and electronic dictionary software is modified accordingly. A specified electronic communication currently being entered into the electronic device is monitored and each term of the specified electronic communication is analyzed. In response to the analysis, suggested terms for entering within the specific electronic communication are presented via a graphical user interface of the electronic device.
    Type: Grant
    Filed: May 15, 2019
    Date of Patent: February 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Jill Dhillon, Michael Bender, Jeremy R. Fox, Kulvir Singh Bhogal
  • Patent number: 10922433
    Abstract: Systems and methods for interrupting disclosure of sensitive information are described. Sensitive information data associated with a user is maintained. A primary device detects commencement of a voice input to a secondary device. As the voice input is detected by the primary device, the voice input is analyzed to determine the content of the voice input. The content is compared to the sensitive information data to determine whether the voice input contains sensitive information. When the primary device determines the voice input contains sensitive information, a speaker of the primary device is controlled to generate a noise canceling signal which interrupts receipt of further sensitive information by the secondary device.
    Type: Grant
    Filed: November 26, 2018
    Date of Patent: February 16, 2021
    Assignee: Wells Fargo Bank, N.A.
    Inventors: Richard Barge, Lila Fakhraie, Tammy C. Fleming, Chris Kalaboukis, Kristine Ing Kushner, Lane Mortensen, Karen L. Shahoian
  • Patent number: 10915614
    Abstract: A method for authenticating a user of an electronic device is disclosed. The method comprises: responsive to detection of a trigger event indicative of a user interaction with the electronic device, generating an audio probe signal to play through an audio transducer of the electronic device; receiving a first audio signal comprising a response of the user's ear to the audio probe signal; receiving a second audio signal comprising speech of the user; and applying an ear biometric algorithm to the first audio signal and a voice biometric algorithm to the second audio signal to authenticate the user as an authorised user.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: February 9, 2021
    Assignee: Cirrus Logic, Inc.
    Inventor: John Paul Lesso
  • Patent number: 10909240
    Abstract: A building management system (BMS) includes a user access point configured to receive a user input corresponding to the BMS. The system includes at least one building subsystem in communication with the user access point and configured to control subsystem equipment in response to the user input. Additionally, the system includes a controller configured to: receive the user input, and receive access point data. The controller is further configured to compare the user input and access point data to a user profile and/or an equipment profile. Additionally, the controller is configured to determine a safety value using the comparison, and determine if the safety value is outside of a predetermined safety range. If the safety value is outside of the predetermined safety range, the controller is configured to initiate a verification process.
    Type: Grant
    Filed: December 18, 2017
    Date of Patent: February 2, 2021
    Assignee: Johnson Controls Technology Company
    Inventor: Justin D. Eltoft
  • Patent number: 10909644
    Abstract: Disclosed herein are methods, systems, and apparatus, including computer programs encoded on computer storage media. One method includes: receiving a request associated with an account of a blockchain-based application for collecting a monetary award issued in an order of a court; determining a creditor, a debtor, and an amount of the monetary award; determining that the account is associated with the creditor based on data recorded on the blockchain; identifying, based on the data, a payment account of the creditor and one or more payment accounts of the debtor with an aggregated balance greater than or equal to the amount of the monetary award; transferring the amount of the monetary award from the one or more payment accounts of the debtor to the payment account of the creditor; and recording a verified time stamp representing a time the amount of the monetary award is transferred.
    Type: Grant
    Filed: July 6, 2020
    Date of Patent: February 2, 2021
    Assignee: Advanced New Technologies Co., Ltd.
    Inventor: Zhiguo Li
  • Patent number: 10902853
    Abstract: A voice command identification method for an electronic device having a microphone matrix is provided. The method includes: obtaining a plurality of sound signals from the microphone matrix; executing a voice purify operation on the sound signals to obtain a purified sound signal and identifying a target voice signal from the purified sound signal; calculating a compound speech feature data corresponding to the target voice signal through a compound speech recognition model; comparing the compound speech feature data with a plurality of reference speech feature data in the speech feature database, so as to determine a target command mapped to the target voice signal; and executing the target command.
    Type: Grant
    Filed: April 17, 2019
    Date of Patent: January 26, 2021
    Assignee: Wistron Corporation
    Inventors: Yuan-Han Liu, Yi-Wen Chen, Yong-Jie Hong, Ru-Feng Liu, Rong-Huei Wang
  • Patent number: 10893047
    Abstract: Methods and systems for providing security and verifying a human user and/or an authorized user are described. A system may include a processor and a non-transitory, processor-readable storage medium. The non-transitory, processor-readable storage medium may include one or more programming instructions that, when executed, cause the processor to receive a request to access a secured resource, provide a verification challenge to a user via a user interface, receive at least one input from the user in response to the verification challenge, and determine that the at least one input corresponds to at least one parameter indicative of a human user. The verification challenge may include a game.
    Type: Grant
    Filed: September 12, 2018
    Date of Patent: January 12, 2021
    Assignee: GANALILA, LLC
    Inventors: Shreedhar Natarajan, Jaisree Moorthy
  • Patent number: 10878840
    Abstract: A method for recognising at least one of a non-verbal sound event and a scene in an audio signal comprising a sequence of frames of audio data, the method comprising: for each frame of the sequence: receiving at least one sound class score, wherein each sound class score is representative of a degree of affiliation of the frame with a sound class of a plurality of sound classes; for a sound class score of the at least one sound class scores: determining a confidence that the sound class score is representative of a degree of affiliation of the frame with the sound class by processing a value for a property associated with the frame, wherein the value is processed using a learned model for the property; adjusting the sound class score for the frame based at least on the determined confidence.
    Type: Grant
    Filed: October 15, 2019
    Date of Patent: December 29, 2020
    Inventors: Christopher James Mitchell, Sacha Krstulovic, Cagdas Bilen, Juan Azcarreta Ortiz, Giacomo Ferroni, Arnoldas Jasonas, Francesco Tuveri
  • Patent number: 10880643
    Abstract: A sound-source-direction determining apparatus includes a processor that updates a reference threshold such that the reference threshold increases as a sound pressure difference increases, the sound pressure difference being a difference between sound pressure of a certain frequency component of sound acquired by the first microphone and sound pressure of the certain frequency component of the sound acquired by the second microphone when the synthesized sound is output from the speaker and determines a direction in which a sound source of sound is located, based on comparison between the reference threshold and a sound pressure difference between sound pressure of a certain frequency component of the sound acquired by the first microphone and sound pressure of the certain frequency component of the sound acquired by the second microphone when the synthesized sound is not output from the speaker.
    Type: Grant
    Filed: September 3, 2019
    Date of Patent: December 29, 2020
    Assignee: FUJITSU LIMITED
    Inventors: Chisato Shioda, Nobuyuki Washio, Masanao Suzuki
  • Patent number: 10873461
    Abstract: Disclosed herein are embodiments of systems and methods for zero-knowledge multiparty secure sharing of voiceprints. In an embodiment, an illustrative computer may receive, through a remote server, a plurality of encrypted voiceprints. When the computer receives an incoming call, the computer may generate a plaintext i-vector of the incoming call. Using the plaintext i-vector and the encrypted voiceprints, the computer may generate one or more encrypted comparison models. The remote server may decrypt the encrypted comparison model to generate similarity scores between the plaintext i-vector and the plurality of encrypted voiceprints.
    Type: Grant
    Filed: July 13, 2018
    Date of Patent: December 22, 2020
    Assignee: Pindrop Security, Inc.
    Inventors: Payas Gupta, Terry Nelms
  • Patent number: 10863971
    Abstract: A method and apparatus are disclosed herein for controlling an ultrasound machine using one or more touchless inputs. In one embodiment, the method for controlling operation of the ultrasound machine comprises obtaining one or more touchless inputs; determining one or more operations to control the ultrasound machine based on the one or more touchless inputs and machine state of the ultrasound machine; and controlling the ultrasound machine using the one or more operations.
    Type: Grant
    Filed: November 30, 2018
    Date of Patent: December 15, 2020
    Assignee: FUJIFILM SONOSITE, INC.
    Inventor: Davinder S. Dhatt
  • Patent number: 10847150
    Abstract: A dialogue system is provided to assist a user while minimizing distraction and achieve safe driving by adjusting a level of a dialogue service based on a dialogue with the user in a vehicle driving environment and multiple kinds of information including vehicle state information, driving environment information, and user information, and a vehicle having the dialogue system and a dialogue service processing method is provided.
    Type: Grant
    Filed: December 6, 2017
    Date of Patent: November 24, 2020
    Assignees: Hyundai Motor Company, Kia Motors Corporation
    Inventors: Donghee Seok, Dongsoo Shin, Jeong-Eom Lee, Ga Hee Kim, Seona Kim, Jung Mi Park, HeeJin Ro, Kye Yoon Kim
  • Patent number: 10839806
    Abstract: An electronic device and method are disclosed herein. The electronic device includes a network interface and processor. The processor implements the method, including receiving a voice input through a network interface as transmitted from a first external device, including a request to execute a function using at least one application which is not indicated in the voice input, extracting a first text from the voice input by executing automatic speech recognition (ASR), when the at least one application is identified based on the first text, transmitting, through the network interface to the first external device, second data associated with the identified at least one application for display by the first external device, and when the at least one application is not identified based at least in part on the first text, reattempting identification of the at least one application by executing natural language understanding (NLU) on the first text.
    Type: Grant
    Filed: July 9, 2018
    Date of Patent: November 17, 2020
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Joo Hyuk Jeon, Woo Up Kwon, Jin Woo Park, Kyoung Gu Woo, Eun Taek Lim, Kyung Hak Hyun, Dong Ho Jang
  • Patent number: 10839392
    Abstract: Systems and methods are provided for use in enhancing security associated with services related to payment or banking accounts, in connection with communication between consumers and consumer service call centers associated with the accounts. One exemplary method includes receiving, at a computing device, a request to authenticate a consumer to a payment or banking account from a call center associated with consumer services for the account, and soliciting a biometric from the consumer. The method also includes receiving biometric data from a communication device associated with the consumer relating to the solicited biometric, and confirming the received biometric data based on reference biometric data for the consumer. The method then further includes transmitting an authentication confirmation to the call center, when the received biometric data is confirmed, whereby the call center is able to proceed in providing the consumer services to the consumer with or without security questions.
    Type: Grant
    Filed: February 21, 2017
    Date of Patent: November 17, 2020
    Assignee: MASTERCARD INTERNATIONAL INCORPORATED
    Inventors: Laurie Ann Nicoletti, Elisabeth Lea Rode, Janet Marie Smith, Brandon Craig Bryson, Sameer Tare, Brian Piel, Steve Hubbard
  • Patent number: 10832678
    Abstract: A computer-implemented method, according to one embodiment, includes: receiving a complex audio signal which includes an intended audio signal and at least one interfering audio signal. Moreover, the intended audio signal is a voice-based command originating from a user. Information which corresponds to the at least one interfering audio signal is also received. The received information is used to identify portions of the complex audio signal as being the at least one interfering audio signal. Furthermore, the identified portion of the complex audio signal is removed from the complex audio signal, and a remaining portion of the complex audio signal is output.
    Type: Grant
    Filed: June 8, 2018
    Date of Patent: November 10, 2020
    Assignee: International Business Machines Corporation
    Inventors: Su Liu, Eric J. Rozner, Inseok Hwang, Chungkuk Yoo
  • Patent number: 10832686
    Abstract: The present disclosure discloses a method and apparatus for pushing information.
    Type: Grant
    Filed: August 20, 2018
    Date of Patent: November 10, 2020
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventor: Wenyu Wang
  • Patent number: 10825457
    Abstract: An information processing apparatus that detects a voice command via a microphone in order to activate the device and execute certain applications. The apparatus comprises a digital signal processor (DSP) and a host controller which are responsible for processing the voice commands. The DSP recognizes and processes voice commands intermittently while the host processor is in a sleep state, thereby reducing the overall power consumption of the apparatus. Further, when the DSP is configured to recognize voice commands intended only to activate the device, a memory having a sufficiently lower storage capacity suffices.
    Type: Grant
    Filed: July 9, 2019
    Date of Patent: November 3, 2020
    Assignee: Sony Corporation
    Inventor: Kenji Tokutake
  • Patent number: 10825275
    Abstract: Blockchain-controlled and location-validated locking systems and methods are described. A method includes maintaining state information for a lock, where the first state of the lock corresponds to an open state and the second to a locked state. The method further includes receiving a current location of a device associated with a person, authorized to change a state of the lock, attempting to change a state of the lock and a current location of the lock. The method further includes receiving a digital signature from the device. The method further includes automatically transmitting a control signal to the lock to change the state of the lock only when the current location of the person is determined to be the same as the current location of the lock and a valid proof of work is performed by a miner associated with a blockchain configured to manage transactions corresponding to the lock.
    Type: Grant
    Filed: December 19, 2018
    Date of Patent: November 3, 2020
    Inventor: Ranjeev K. Singh
  • Patent number: 10818297
    Abstract: A dialogue system, a vehicle and a method for controlling the vehicle is disclosed. The method for controlling the vehicle includes: acquiring an utterance and a speech pattern by recognizing a speech when a speech of a plurality of speakers is input through a speech input device; classifying dialogue contents for each speaker based on the acquired utterance and speech pattern; acquiring a relationship between the speakers based on the acquired utterance; understanding an intention and a context for each speaker based on the acquired relationship between the speakers and the acquired dialogue content for each speaker determining an action corresponding to the acquired relationship and the acquired intention and context for each speaker, and outputting an utterance corresponding to the determined action; generating a control command corresponding to the determined action; and controlling a load based on the generated control command.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: October 27, 2020
    Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
    Inventors: Kye Yoon Kim, Donghee Seok, Dongsoo Shin, Jeong-Eom Lee, Ga Hee Kim, Seona Kim, Jung Mi Park, HeeJin Ro
  • Patent number: 10803873
    Abstract: Hardware and/or software systems, devices, networks, and methods of the present invention enable increased levels of security and increase resistance to unauthorized access to secure systems by performing identity recognition and verification based on vocal spectrum analysis. Enrollment and verification processes enable a score to be ascribed to access attempts by person, provide spoof identification, and associate potential relatives of enrolled speakers. The present invention may be employed across a wide range of applications including voice login for mobile phone, tablets, laptops, etc., smartcards for various systems and devices, and software applications running on the devices.
    Type: Grant
    Filed: September 19, 2018
    Date of Patent: October 13, 2020
    Assignee: Lingual Information System Technologies, Inc.
    Inventor: Paul J. Warner
  • Patent number: 10803865
    Abstract: Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.
    Type: Grant
    Filed: June 5, 2018
    Date of Patent: October 13, 2020
    Assignee: Voicify, LLC
    Inventors: Robert T. Naughton, Nicholas G. Laidlaw, Alexander M. Dunn, Jeffrey K. McMahon
  • Patent number: 10796688
    Abstract: An electronic apparatus is provided. The electronic apparatus according to an embodiment includes an audio input unit configured to receive sound sources from different positions and generate a plurality of voice signals, a pre-processor configured to perform pre-processing of the plurality of voice signals, and a voice recognition unit configured to perform voice recognition using the plurality of voice signals pre-processed by the pre-processor, and in response to a predetermined trigger being detected as a result of the voice recognition, generate trigger information, wherein the pre-processor is further configured to receive feedback on the trigger information generated by the voice recognition unit, change a pre-processing method according to the trigger information, process the plurality of voice signals using the changed pre-processing method, and generate enhanced voice signals.
    Type: Grant
    Filed: October 21, 2016
    Date of Patent: October 6, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Myung-suk Song
  • Patent number: 10789343
    Abstract: An audio/video stream generated by a target object to be authenticated is obtained. The target object is associated with a user. A determination is made whether a lip reading component and voice component in the audio/video stream are consistent. In response to determining that the lip reading component and voice component are consistent, voice recognition is performed on an audio stream in the audio/video stream to obtain voice content. The voice content is used as an object identifier of the target object. A model physiological feature corresponding to the object identifier is obtained from object registration information. Physiological recognition is performed on the audio/video stream to obtain a physiological feature of the target object. The physiological feature of the target object is compared with the model physiological feature to obtain a comparison result. If the comparison result satisfies an authentication condition, the target object is authenticated.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: September 29, 2020
    Assignee: Alibaba Group Holding Limited
    Inventors: Peng Li, Yipeng Sun, Yongxiang Xie, Liang Li
  • Patent number: 10789283
    Abstract: Systems and methods are described to notify an author that suggested content is available. An author-assistance tool is instantiated with a document processor to perform research to suggest content for a document being edited at the document processor. A user interaction relating to a document is received via the document processor, and the author-assistance tool generates suggested content for the document when the author has intent for content suggestion or the document has a document type that is on a list of document types for which a content suggestion should be made. The author-assistance tool then determines that the suggested content meets a pre-determined quality threshold, and generates, via the user interface of the document processor, a notification to the author that the suggested content is available.
    Type: Grant
    Filed: June 14, 2017
    Date of Patent: September 29, 2020
    Assignee: Google LLC
    Inventors: Jayakumar Hoskere Gireesha, Shyam Parikkal Krishnamurthy, Shruti Gupta, Anmol Gulati, Luiz Do Amaral De Franca Pereira Filho, Andrea Zvinakis, Kishore Papineni
  • Patent number: 10791188
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for adjusting an eligibility value for transmitting a digital component. In one aspect, a computing system includes a server for identifying opportunities to transmit digital components to client devices. The server determines a first probability of a given outcome occurring following user interaction with the digital component when the digital component is transmitted to the client device. The server determines a second probability of the given outcome occurring if the digital component is not transmitted to the client device. The server generates an outcome incrementality factor for the digital component, including determining a ratio of the first probability relative to the second probability, and triggers adjustment of an eligibility value based on the outcome incrementality factor. The server then controls transmission of the digital component to the client device using the adjusted eligibility value.
    Type: Grant
    Filed: August 24, 2017
    Date of Patent: September 29, 2020
    Assignee: Google LLC
    Inventors: Justin Lewis, Thomas Graham Price
  • Patent number: 10777192
    Abstract: A method and apparatus of recognizing a field of semantic parsing information, a device and a readable medium. The method includes: obtaining at least one preset keyword extracting pattern which is in a preset field and used to parse user-input speech data to generate semantic parsing information, each of the at least one preset keyword extracting pattern; obtaining subject weights of keywords according to importance degree identifiers of the keywords in the preset keyword extracting patterns; calculating a subject score of the speech parsing information according to the subject weights of the keywords; recognizing whether the speech parsing information belongs to the preset field according to the subject score of the speech parsing information. The method recognizes the field to which the speech parsing information belongs to ensure correctness of the recognized field, and thereby ensure correctness of operations performed by the App according to the semantic parsing information.
    Type: Grant
    Filed: May 15, 2018
    Date of Patent: September 15, 2020
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Qin Qu, Zejin Hu
  • Patent number: 10755700
    Abstract: A system for voice-based process management is provided. The system includes a microphone, a speaker, and a computer device in communication with the microphone and the speaker. The computer device includes at least one processor in communication with at least one memory device. The computer device is programmed to (i) receive, via the microphone, one or more audible statements from an individual; (ii) parse the one or more audible statements into discrete data elements to allow normalized semantic definition of the meaning of conveyed information; (iii) compare the conveyed information with stored data; (iv) determine whether there is a discrepancy based on the comparison; and (v) if the determination is that there is a discrepancy, request, via the speaker, a clarification.
    Type: Grant
    Filed: June 22, 2018
    Date of Patent: August 25, 2020
    Assignee: Ascension Health Alliance
    Inventors: Gerry X. Lewis, Juan Sanchez, Christine K. McCoy, John Pirolo, Fahad Tahir
  • Patent number: 10754425
    Abstract: An information processing apparatus is disclosed which includes a hardware processor configured to, analyze an attention degree of a gaze of a user, on the basis of gaze data in which the gaze of the user is detected, the gaze data being input externally, and assign an important degree according to the attention degree that is analyzed with respect to speech data of the user, the speech data being input externally, and is associated with a time axis that is a same as that of the gaze data to be recorded in a memory.
    Type: Grant
    Filed: May 15, 2019
    Date of Patent: August 25, 2020
    Assignee: OLYMPUS CORPORATION
    Inventors: Kazuhito Horiuchi, Nobuyuki Watanabe, Yoshioki Kaneko, Hidetoshi Nishimura
  • Patent number: 10755727
    Abstract: A system configured to perform directional speech separation. The system may dynamically associate direction-of-arrivals with one or more audio sources in order to generate output audio data that separates each of the audio sources. The system identifies a target direction for each audio source, dynamically determines directions that are correlated with the target direction, and generates output signals for each audio source. The system may associate individual frequency bands with specific directions based on a time delay detected by two or more microphones. The system may determine a cross-correlation between each direction and the target direction and select directions with strong correlation. The system may generate time-frequency mask data indicating frequency bands corresponding to the directions associated with a particular audio source. Using the mask data, the system generates output audio data specific to the audio source, resulting in directional speech separation between different audio sources.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: August 25, 2020
    Assignee: Amazon Technologies, Inc.
    Inventor: Wai Chung Chu
  • Patent number: 10749864
    Abstract: A speaker recognition system for authenticating a mobile device user includes an enrollment and learning software module, a voice biometric authentication software module, and a secure software application. Upon request by a user of the mobile device, the enrollment and learning software module displays text prompts to the user, receives speech utterances from the user, and produces a voice biometric print. The enrollment and training software module determines when a voice biometric print has met at least a quality threshold before storing it on the mobile device. The secure software application prompts a user requiring authentication to repeat an utterance based at least on an attribute of a selected voice biometric print, receives a corresponding utterance, requests the voice biometric authentication software module to verify the identity of the second user using the utterance, and, if the user is authenticated, imports the voice biometric print.
    Type: Grant
    Filed: January 25, 2018
    Date of Patent: August 18, 2020
    Assignee: Cirrus Logic, Inc.
    Inventor: Marta Garcia Gomar
  • Patent number: 10747498
    Abstract: An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
    Type: Grant
    Filed: May 5, 2016
    Date of Patent: August 18, 2020
    Assignee: Apple Inc.
    Inventors: William F. Stasior, David A. Carson, Rohit Dasari, Yoon Kim
  • Patent number: 10748554
    Abstract: Embodiments facilitating audio source identification are provided. A computer-implemented method comprises: receiving, by a device operatively coupled to one or more processors, an audio signal under inspection; generating, by the device, an image of time-frequency spectrum of low frequency component and high frequency component of the audio signal; and identifying, by the device, a source of the audio signal based on the generated image and one or more patterns of time-frequency spectrum, wherein each of the one or more patterns is corresponding to low frequency feature and high frequency feature of a specific audio source.
    Type: Grant
    Filed: January 16, 2019
    Date of Patent: August 18, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Jing Chang Huang, Guo Qiang Hu, Peng Ji, Jun Zhu