Voice Recognition Patents (Class 704/246)
  • Patent number: 11450320
    Abstract: A dialogue system, a dialogue system control method, and an electronic apparatus are configured to process a user speech to generate a system response before the user's speech ends by recognizing the user's intention When the user's speech is finished, the system response is output to continue a natural dialogue flow in real time. The dialogue system includes: a Speech to Text engine to convert a user speech into text; an intermediate dialogue engine configured to process an intermediate speech before user speech is terminated; a final dialogue engine configured to process a final speech after the user speech is terminated; and a controller. The controller is configured to input the converted text to the intermediate dialogue engine when user speech is not terminated, and to input the converted text to the final dialogue engine when user speech is terminated The dialogue system also includes a Text to Speech engine configured to convert the system response into a speech signal.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: September 20, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
    Inventors: Seona Kim, Youngmin Park, Jeong-Eom Lee
  • Patent number: 11437036
    Abstract: The present disclosure discloses a smart speaker wake-up method, a smart speaker wake-up device, a smart speaker and a storage medium, relates to the technical field of speech recognition. The method of the present disclosure is applied to a wireless network including two or more smart speakers, and a specific implementation thereof is: receiving, speech information including a wake-up word; performing a recognition processing to the speech information to obtain identification information corresponding to the wake-up word; and waking up one smart speaker in the wireless network to enter listening state according to the identification information. The present disclosure may be applied to a scenario where multiple smart speakers coexist, so as to quickly select one smart speaker that is most likely to be wakened, avoiding a chaotic speech interaction caused by multiple smart speakers being wakened simultaneously, improving efficiency and quality of speech interaction and achieving better user experience.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: September 6, 2022
    Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co., Ltd.
    Inventors: Xiangdang Zhang, Xing Luo, Xiangdong Xue, Guohui Zhou, Wenjie Liao
  • Patent number: 11430448
    Abstract: A method and apparatus for processing voice data of a speech received from a speaker are provided. The method includes extracting a speaker feature vector from the voice data of the speech received from a speaker, generating a speaker feature map by positioning the extracted speaker feature vector at a specific position on a multi-dimensional vector space, forming a plurality of clusters indicating features of voices of a plurality of speakers by grouping at least one speaker feature vector positioned on the speaker feature map, and classifying the plurality of speakers according to the plurality of clusters.
    Type: Grant
    Filed: November 22, 2019
    Date of Patent: August 30, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jaeyoung Roh, Keunseok Cho, Jiwon Hyung, Donghan Jang, Jaewon Lee
  • Patent number: 11431703
    Abstract: A biometric authentication system is disclosed that provides authentication capability using biometric data in connection with a challenge for parties engaging in digital communications such as digital text-oriented, interactive digital communications. End-user systems may be coupled to devices that include biometric data capture devices such as retina scanners, fingerprint recorders, cameras, microphones, ear scanners, DNA profilers, etc., so that biometric data of a communicating party may be captured and used for authentication purposes.
    Type: Grant
    Filed: January 4, 2019
    Date of Patent: August 30, 2022
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Pradeep K. Bansal, Lee Begeja, Carroll W. Creswell, Jeffrey Farah, Benjamin J. Stern, Jay Wilpon
  • Patent number: 11430436
    Abstract: A voice interaction method and a vehicle using the same are disclosed. A voice interaction method according to an embodiment of the present invention activates a personal terminal through which a voice signal of an occupant is received as a voice interaction assisting device between the vehicle and the occupant and changes presence or absence of a voice interaction between the vehicle and the occupant and voice interaction settings according to states of the vehicle and the occupant.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: August 30, 2022
    Assignee: LG Electronics Inc.
    Inventor: Soryoung Kim
  • Patent number: 11404056
    Abstract: A drone system is configured to capture an audio stream that includes voice commands from an operator, to process the audio stream for identification of the voice commands, and to perform operations based on the identified voice commands. The drone system can identify a particular voice stream in the audio stream as an operator voice, and perform the command recognition with respect to the operator voice to the exclusion of other voice streams present in the audio stream. The drone can include a directional camera that is automatically and continuously focused on the operator to capture a video stream usable in disambiguation of different voice streams captured by the drone.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: August 2, 2022
    Assignee: Snap Inc.
    Inventors: David Meisenholder, Steven Horowitz
  • Patent number: 11393463
    Abstract: A system and method are disclosed for setting up a communication link between a device or application and a system with a controller. The controller can collect and send information to the application. A user interfaces with the controller to access the functionality of the application through providing commands to the controller. The system allows the user to interface with multiple applications.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: July 19, 2022
    Assignee: SoundHound, Inc.
    Inventors: Timothy P. Stonehocker, Kathleen Worthington McMahon
  • Patent number: 11367451
    Abstract: A speaker authentication method and apparatus may extract input speaker features corresponding to a plurality of frames of an input speech of an object, estimate discriminable speaker sections corresponding to the plurality of frames, and dynamically match the input speaker features to pre-enrolled enrolled speaker features based on the discriminable speaker section.
    Type: Grant
    Filed: July 23, 2019
    Date of Patent: June 21, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Kyuhong Kim, Insoo Kim, Dohwan Lee, Hana Lee
  • Patent number: 11366890
    Abstract: Various examples described herein are directed to systems and methods for managing an interface between a user and a user computing device. The user computing device may determine that an audio sensor in communication with the user computing device indicates a first command in a user voice of the user, where the first command instructs the user computing device to perform a first task. The user computing device may determine that the audio sensor also indicates a first ambient voice different than the user voice and match the first ambient voice to a first known voice. The user computing device may determine that a second computing device associated with the first known voice is within a threshold distance of the user computing device and select a first privacy level for the first task based at least in part on the first known voice.
    Type: Grant
    Filed: July 15, 2020
    Date of Patent: June 21, 2022
    Assignee: Wells Fargo Bank, N.A.
    Inventors: Tambra Nichols, Teresa Lynn Rench, Jonathan Austin Hartsell, John C. Brenner, Christopher James Williams
  • Patent number: 11355110
    Abstract: According to various embodiments of the disclosure, An electronic device according to various embodiments of the disclosure may include: a communication module; a display; a memory; and a processor electrically connected to the communication module, the display, and the memory, wherein the memory stores instructions that cause, when executed, the processor to: receive a voice recognition trigger command during a call while a call connection with an external electronic device is maintained; execute a voice recognition function, based on a voice received from the external electronic device; determine a function execution command corresponding to a recognized voice; and execute a function of the electronic device according to the determined function execution command.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: June 7, 2022
    Inventors: Kyung Tae Kim, Chang Ho Lee
  • Patent number: 11356714
    Abstract: Systems and methods of emulating a conversation about a thematic content event are disclosed. An exemplary embodiment receives a member dialogue video from a community member who is a member of a plurality of community members, wherein the member dialogue video includes video and audio portions, and wherein the member dialogue video expresses at least one of a personal opinion and a personal viewpoint about the thematic content event; generates dialogue text from the audio portion of each received member dialogue video, wherein the dialogue text comprises words and phrases spoken by the community member in the member dialogue video; receives a modified thematic content event; compares the words and phrases of the dialogue text with the plurality of keywords; and associates at least one portion of the member dialogue video having the words and phrases of the dialogue text that match with the matching keyword of the anchor point.
    Type: Grant
    Filed: August 20, 2020
    Date of Patent: June 7, 2022
    Assignee: DISH Broadcasting Corporation
    Inventors: Nicholas Brandon Newell, Omar Khan
  • Patent number: 11355127
    Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a communication interface comprising communication circuitry, a memory, and a processor. The processor is configured to control the electronic apparatus to: receive a user voice for controlling an external device connected to the electronic apparatus from a user terminal through the communication interface, perform user authentication by comparing feature information obtained from the user voice with feature information pre-stored in the memory, obtain a control command for controlling the external device by analyzing the user voice based on the user being authenticated, and control the communication interface to transmit the control command to the external device.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: June 7, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Sungjun Lee, Seongwook Chung
  • Patent number: 11341839
    Abstract: A system for facilitating automated response to an event notifying signal, the system including a network monitoring module, an assessment module, a resource monitoring module, and a resource response module. The network monitoring module is configured for monitoring signals received over a data network for a presence of one or more event notifying signals indicative of a relevant incident. The assessment module is configured for assessing a response of the one or more event notifying signals and a resource tasked to the response. The resource monitoring module is configured for monitoring a status of a resource tasked to the response of the event notifying signal. The resource response module is configured for communicating the response to the resource tasked to the response.
    Type: Grant
    Filed: May 4, 2020
    Date of Patent: May 24, 2022
    Assignee: ALERT MEDIA, INC.
    Inventors: Brian Cruver, Matthew Miller
  • Patent number: 11341961
    Abstract: A multi-lingual speech recognition and theme-semanteme analysis method comprises steps executed by a speech recognizer: obtaining an alphabet string corresponding to a voice input signal according to a pronunciation-alphabet table, determining that the alphabet string corresponds to original words according to a multi-lingual vocabulary, and forming a sentence according to the multi-lingual vocabulary and the original words, and comprises steps executed by a sematic analyzer: according to the sentence and a theme vocabulary-semantic relationship data set, selectively executing a correction procedure to generate a corrected sentence, an analysis state determining procedure or a procedure of outputting the sentence, outputting the corrected sentence when the correction procedure successes, and executing the analysis state determining procedure to selectively output a determined result when the correction procedure fails.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: May 24, 2022
    Assignee: NATIONAL CHENG KUNG UNIVERSITY
    Inventors: Wen-Hsiang Lu, Chun-Yu Chien, Shao-Chuan Shen, Wei-Cheng Yeh
  • Patent number: 11341971
    Abstract: A computing device includes a processor and a memory. The processor is configured to acquire a voice instruction through at least two voice receiving devices, analyze the voice instruction to determine at least one display device controlled by the voice instruction, generate a control instruction according to the voice instruction, and send the control instruction to the at least one display device to cause the at least one display device to display corresponding contents according to the voice instruction.
    Type: Grant
    Filed: July 2, 2020
    Date of Patent: May 24, 2022
    Assignee: HON HAI PRECISION INDUSTRY CO., LTD.
    Inventors: Jung-Yi Lin, Chin-Pin Kuo
  • Patent number: 11342003
    Abstract: Disclosed are various embodiments for segmenting and classifying video content using sounds. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to music segments is selected based at least in part on an audio characteristic of the subset of the plurality of segments. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
    Type: Grant
    Filed: December 12, 2019
    Date of Patent: May 24, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Christian Garcia Siagian, Christian Ciabattoni, David Niu, Lawrence Kyuil Chang, Gordon Zheng, Ritesh Pase, Shiva Krishnamurthy, Ramakanth Mudumba
  • Patent number: 11334590
    Abstract: A system may support multiple tier serverless data foundation creation to support large data set processing. At a data ingestion tier, data ingestion serverless tasks may receive source data for processing. The data integration serverless tasks may filter and group the source data into file-object stored items. Further, data integration serverless tasks may capture metadata that, when paired with the file-object stored items, establishes the data foundation. The data foundation facilitates database-like performance in data operations in a database-less system. At the processing tier, the processing serverless tasks access the data foundation by iterating across the file-object stored items to generate output-object stored items. At the directed storage tier, directed storage serverless tasks capture metadata for the output-object stored items to establish an output data foundation or prepare the output data for storage in a data warehouse.
    Type: Grant
    Filed: December 28, 2018
    Date of Patent: May 17, 2022
    Assignee: Accenture Global Solutions Limited
    Inventors: Madhan Kumar Srinivasan, Arun Purushothaman, Vijaya Tapaswi Achanta
  • Patent number: 11322137
    Abstract: A video camera, a computer-implemented method, and a computer-readable storage medium. The video camera including one or more microphones and a processor. The processor is configured to: acquire an output from the or each microphone; apply one or more pre-analysis filters to the or each acquired output, wherein the or each pre-analysis filter determines if the or each acquired output contains a corresponding predetermined feature of interest; and analyse the or each output, when it is determined by the or each pre-analysis filter that the corresponding output contains at least one predetermined feature of interest.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: May 3, 2022
    Assignee: Ava Video Security Limited
    Inventor: Haohai Sun
  • Patent number: 11323263
    Abstract: A solution is proposed for sharing secret information for accessing a wireless computing network. A corresponding method for distributing the secret information by a source (computing) device comprises receiving a public key of the a target (computing) device, transmitting a verification token to the target device, receiving an utterance of the verification token and transmitting the secret information encrypted with the public key in response to the utterance of the verification token. A corresponding method for obtaining the secret information by a target (computing) device comprises transmitting a public key of the target device, receiving a verification token, outputting the verification token and receiving the secret information encrypted with the public key in response to an utterance of the verification token. Corresponding computer programs and computer program products are also proposed. Moreover, a source computing device and a target computing device for implementing the methods are proposed.
    Type: Grant
    Filed: May 7, 2020
    Date of Patent: May 3, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gianluca Gargaro, Matteo Rogante, Paolo Ottaviano, Roberto Ragusa
  • Patent number: 11308199
    Abstract: A user authentication method using ultrasonic waves is disclosed. The user authentication method using ultrasonic waves, according to an embodiment of the present invention, comprises the steps of: receiving a sound wave signal which includes analog data; sampling the sound wave signal at a preset sampling rate; generating a block by selecting a preset number of pieces of sampling data; converting sampled data included in the block into frequency components; and determining, as digital data in the block, a letter or number corresponding to the frequency component having the largest magnitude from among the frequency components.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: April 19, 2022
    Assignee: MUZLIVE INC.
    Inventor: Jong Sung Park
  • Patent number: 11302334
    Abstract: The present disclosure proposes a solution to associate a device with a user by capturing a voice of a speaker by a microphone connected to the network device (e.g. a residential or home gateway), monitoring the IP traffic of the network device and detecting the device contributing to this IP traffic in order to establish a link between the speaker and his device(s) and associate the device with the speaker.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: April 12, 2022
    Assignee: INTERDIGITAL CE PATENT HOLDINGS
    Inventors: Christopher Howson, Philippe Gilberton, Patrick Fontaine, Christoph Neumann
  • Patent number: 11295744
    Abstract: A voice assistance device includes a microphone picking up and transmitting a first signal to a detection unit; the detection unit routing, in case of detection of the wakeup word in the first signal, the first signal to an analysis unit; the analysis unit processing the first signal and generating an output signal. The detection unit includes a first module detecting the wakeup word in the first signal, a second module detecting the wakeup word in a second signal received from at least one external audio source and a control module routing the first signal to the analysis unit when the wakeup word is detected solely by the first module of the detection unit.
    Type: Grant
    Filed: December 4, 2018
    Date of Patent: April 5, 2022
    Assignee: SAGEMCOM BROADBAND SAS
    Inventor: Gilles Bourgoin
  • Patent number: 11282514
    Abstract: Embodiments of the present disclosure disclose a method and apparatus for recognizing a voice. A specific implementation of the method includes: acquiring an audio signal; determining a signal-to-noise ratio of the audio signal; and selecting a voice recognition model from a pre-trained voice recognition model group to perform voice recognition on the audio signal according to the determined signal-to-noise ratio. This embodiment improves the robustness of a voice recognition product for recognizing voices in different application scenarios.
    Type: Grant
    Filed: September 11, 2019
    Date of Patent: March 22, 2022
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventor: Jianwei Sun
  • Patent number: 11276397
    Abstract: A system and method for improving the performance of a hands-free voice user interface system while minimizing the computational complexity without sacrificing performance. Specifically, when estimating the location of the talker for the purpose of steering a directional beam in the direction of the active talker. A hands-free voice user interface system requires a clean signal to be streamed to the cloud for recognition. One way to improve the speech signal is to estimate where the talker is and steer a beam in the direction of the active talker. To locate the talker to a localized position, a direction of arrival estimator (DOA) algorithm is used. DoA generally requires noise and echo free signal for optimal estimation, but it is computationally expensive to run audio pre-processing such as an acoustic echo cancellation for each microphone in microphone array.
    Type: Grant
    Filed: March 1, 2019
    Date of Patent: March 15, 2022
    Assignee: DSP Concepts, Inc.
    Inventors: Ke Li, Paul Beckmann
  • Patent number: 11272328
    Abstract: A method and apparatus for changing a talkgroup icon is provided herein. During operation a current public-safety incident is determined. Based on the current public safety incident, a talkgroup icon will be determined and pushed to the various radios that are members of the talkgroup. When a radio displays a list of talkgroups (or a single talkgroup), each talkgroup will be accompanied by the unique icon that identifies a public-safety incident related to the talkgroup. This allows a user of the radio to identify a current conversation on a particular talkgroup without having to monitor the particular talkgroup.
    Type: Grant
    Filed: January 6, 2021
    Date of Patent: March 8, 2022
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventors: Anoop Sehgal Paras Ram, Chong Keat Chua, Chun Meng Tan, Kim Koon Neoh
  • Patent number: 11259127
    Abstract: A hearing device adapted to be worn by a user and for picking up sound containing the user's own voice is provided.
    Type: Grant
    Filed: March 20, 2020
    Date of Patent: February 22, 2022
    Assignee: OTICON A/S
    Inventors: Jan M. De Haan, Mirjana Adnadjevic, Svend Feldt
  • Patent number: 11244675
    Abstract: An output-content control device includes a voice classifying unit configured to analyze a voice spoken by a user and acquired by a voice acquiring unit to determine whether the voice is a predetermined voice; an intention analyzing unit configured to analyze the voice acquired by the voice acquiring unit to detect intention information indicating what kind of information is wished to be acquired by the user; a notification-information acquiring unit configured to acquire notification information to be notified to the user based on the intention information; and an output-content generating unit configured to generate an output sentence as sentence data to be output to the user based on the notification information and also configured to generate the output sentence in which at least one word selected among words included in the notification information is replaced with another word when the voice is determined to be the predetermined voice.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: February 8, 2022
    Assignee: JVCKENWOOD Corporation
    Inventor: Tatsumi Naganuma
  • Patent number: 11244688
    Abstract: Hardware and/or software systems, devices, networks, and methods for identity recognition and verification based on vocal spectrum analysis. The system including one or more processors coupled to a memory/storage to collect audio samples sufficient to generate a speaker identification reference pattern and a speaker identification verification pattern, generate a speaker identification reference pattern from the audio samples and a speaker identification verification pattern from other audio samples, compare the speaker identification verification pattern with the speaker identification reference pattern; and provide a response indicating whether the speaker identification verification pattern and the speaker identification reference pattern were generated based on audio samples from the same person. The system may be employed on a mobile phone in near field communication with a control system and may include a management platform.
    Type: Grant
    Filed: September 4, 2020
    Date of Patent: February 8, 2022
    Assignee: Lingual Information System Technologies, Inc.
    Inventor: Paul J. Warner
  • Patent number: 11238242
    Abstract: Some implementations are directed to translating chatspeak to a normalized form, where the chatspeak is included in natural language input formulated by a user via a user interface input device of a computing device—such as input provided by the user to an automated assistant. The normalized form of the chatspeak may be utilized by the automated assistant in determining reply content that is responsive to the natural language input, and that reply content may be presented to the user via one or more user interface output devices of the computing device of the user. Some implementations are additionally and/or alternatively directed to providing, for presentation to a user, natural language output that includes chatspeak in lieu of a normalized form of the chatspeak, based at least in part on a “chatspeak measure” that is determined based on past usage of chatspeak by the user and/or by additional users.
    Type: Grant
    Filed: March 21, 2019
    Date of Patent: February 1, 2022
    Assignee: Google LLC
    Inventors: Wan Fen Nicole Quah, Bryan Horling, Maryam Garrett, Brian Roark, Richard Sproat
  • Patent number: 11238848
    Abstract: In some implementations, authentication tokens corresponding to known users of a device are stored on the device. An utterance from a speaker is received. The speaker of the utterance is classified as not a known user of the device. A query that includes the authentication tokens that correspond to known users of the device, a representation of the utterance, and an indication that the speaker was classified as not a known user of the device is provided to the server. A response to the query is received at the device and from the server based on the query.
    Type: Grant
    Filed: December 10, 2019
    Date of Patent: February 1, 2022
    Assignee: Google LLC
    Inventors: Meltem Oktem, Taral Pradeep Joglekar, Fnu Heryandi, Pu-sen Chao, Ignacio Lopez Moreno, Salil Rajadhyaksha, Alexander H. Gruenstein, Diego Melendo Casado
  • Patent number: 11233756
    Abstract: The present disclosure provides method and apparatus for voice forwarding in automated chatting. A first request for transmitting a voice segment may be received from a first entity in a service group. The voice segment may be received from the first entity. A voice message may be generated based on the voice segment. The voice message may be transmitted based on the first request.
    Type: Grant
    Filed: April 7, 2017
    Date of Patent: January 25, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xianchao Wu, Kazushige Ito
  • Patent number: 11227605
    Abstract: A management of user profiles comprises calculating, for each speaker model of at least one speaker model, a confidence measure representing a probability that the speaker model represents a speaker of a cluster of audio segments. A user profile associated with the speaker model is updated based on a user preference assigned to the cluster of audio segments if the confidence measure calculated for the speaker model represents a probability that is higher than a target probability. The embodiments achieve an efficient user profile management in a voice-controlled context but without the need for any dedicated enrollment sessions to train speaker models.
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: January 18, 2022
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth
  • Patent number: 11227591
    Abstract: Described are techniques for tracking where user sensitive data has been sent (and optionally stored). Also described are techniques for ensuring user sensitive data is deleted, from all applicable locations, in response to a user command to delete its sensitive data. In at least some embodiments, a natural language processing system may cause a skill, in communication with but not implemented by the natural language processing system, to delete sensitive data.
    Type: Grant
    Filed: June 4, 2019
    Date of Patent: January 18, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Lawrence Ockene, Gregory Chappell, Fausto Rafael Betances, Marissa Mierow
  • Patent number: 11227592
    Abstract: The present disclosure describes techniques for dynamically determining when information is to be output to a user, as well as what information is to be output to a user. A natural language processing system may receive, from a first device, first data representing information to be output at a first point during a skill session. The natural language processing system may also receive, from a second device, second data representing a natural language input. The natural language processing system may determine a skill component is to execute with respect to the natural language input. The natural language processing system may send, to the skill component, second data representing the natural language input. The natural language processing system may receive, from the skill component, an indication that an ongoing first skill session with the second device has reached the first point.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: January 18, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Mark Conrad Kockerbeck, Muhammad Yahia, Jordan Michael Hughes, Kevin Boehm, Rohit Sauhta
  • Patent number: 11222325
    Abstract: The present disclosure generally relates to user interfaces for managing peer-to-peer transfers. In some examples, a device provides user interfaces for initiating and managing transfers. In some examples, a device provides user interfaces corresponding to completed transfers. In some examples, a device provides user interfaces for providing visually distinguishable message object appearances based on message designation. In some examples, a device provides user interfaces for activating accounts for accepting and sending transfers. In some examples, a device provides user interfaces for exchanging accounts for use in a transfer. In some examples, a device provides user interfaces for splitting transfers between two or more accounts. In some examples, a device provides user interfaces for generating and displaying a transfers history list. In some examples, a device provides user interfaces for voice-activation of transfers.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: January 11, 2022
    Assignee: Apple Inc.
    Inventors: Marcel Van Os, Peter D. Anton, Allison Dryer, Cas Lemmens, Glen W. Steele
  • Patent number: 11216545
    Abstract: Briefly, a portable intelligent device is provided that has an audio input for receiving a voice input from a user and an event manager for detecting that an event has occurred. The intelligent device also stores a passcode and a voice-code indicative of the passcode that is unique to a particular user. The intelligent device presents the passcode to a user, for example, from a display on the device, or from smart phone or tablet wirelessly connected to the intelligent device. The user speaks the passcode into an input transducer (microphone) on the intelligent device, and a processor generates a voiceprint that reflects the spoken passcode. The processor then can use the stored voice-code and the generated voiceprint to determine if a specific user was speaking, and if the user spoke the correct passcode. In this way the intelligent device is able to authenticate or authorize a remote user simply by having the user anonymously speak a passcode into the intelligent device.
    Type: Grant
    Filed: September 18, 2019
    Date of Patent: January 4, 2022
    Inventors: Paul Atkinson, Jack Donner
  • Patent number: 11205418
    Abstract: Examples of the present disclosure describe systems and methods for detecting monotone speech. In aspects, audio data provided by a user may be received a device. Pitch values may be calculated and/or extracted from the audio data. The non-zero pitch values may be divided into clusters. For each cluster, a Pitch Variation Quotient (PVQ) value may be calculated. The weighted average of PVQ values across the clusters may be calculated and compared to a threshold for determining monotone speech. Based on the comparison, the audio data may be classified as monotone or non-monotone and an indication of the classification may be provided to the user in real-time via a user interface. Upon the completion of the audio session in which the audio data is received, feedback for the audio data may be provided to the user via the user interface.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: December 21, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: John Christian Leone, Amit Srivastava
  • Patent number: 11200904
    Abstract: An electronic apparatus is provided. The electronic apparatus includes an inputter comprising input circuitry, a voice receiver comprising voice receiving circuitry, a storage, and a processor configured to: provide a guide prompting a user utterance based on user authentication being performed according to user information input through the inputter, generate a speaker recognition model corresponding to the user information based on a voice corresponding to the guide being received through the voice receiver, store the speaker recognition model in the storage, and identify a user corresponding to a voice received through the voice receiver based on the speaker recognition model updated by comparing a voice received through the voice receiver with the speaker recognition model.
    Type: Grant
    Filed: May 10, 2019
    Date of Patent: December 14, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Chanhee Choi
  • Patent number: 11200894
    Abstract: In one aspect, a playback device includes a voice assistant service (VAS) wake-word engine and a command keyword engine. The playback device detects, via the command keyword engine, a first command keyword of in voice input of sound detected by one or more microphones of the playback device. The playback device determines an intent based on at least one keyword in the voice input via a local natural language unit (NLU). After detecting the first command keyword event and determining the intent, the playback device performs a first playback command corresponding to the first command keyword and according to the determined intent. When the playback device detects, via the wake-word engine, a wake-word in voice input, the playback device streams sound data corresponding to at least a portion of the voice input to one or more remote servers associated with the VAS.
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: December 14, 2021
    Assignee: Sonos, Inc.
    Inventors: Connor Smith, John Tolomei, Kurt Soto
  • Patent number: 11195524
    Abstract: Systems and methods for contextual search query revision are disclosed. A user utterance including at least one semantic component is received and a plurality of candidate n-grams including the at least one semantic component and at least one additional semantic component selected from a set of prior semantic components is generated. A probability that each of the plurality of candidate n-grams is an intended n-gram is calculated and a selected one of the plurality of candidate n-grams is output based on the probability.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: December 7, 2021
    Assignee: Walmart Apollo, LLC
    Inventors: Snehasish Mukherjee, Phani Ram Sayapaneni
  • Patent number: 11194893
    Abstract: The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
    Type: Grant
    Filed: January 5, 2018
    Date of Patent: December 7, 2021
    Assignee: Google LLC
    Inventors: Ken Krieger, Andrew Joseph Alexander Gildfind, Nicholas Salvatore Arini, Simon Michael Rowe, Raimundo Mirisola, Gaurav Bhaya, Robert Stets
  • Patent number: 11190155
    Abstract: A system for audio control in a vehicle includes a speaker designed to output vehicle audio data in a cabin of the vehicle at a volume. The system further includes a microphone designed to detect microphone data in the cabin of the vehicle. The system further includes a memory designed to store an audio profile corresponding to desirable operation of the volume of the speaker. The system also includes an electronic control unit (ECU) coupled to the speaker, the microphone, and the memory and designed to control the volume of the speaker based on the detected microphone data and the audio profile.
    Type: Grant
    Filed: September 3, 2019
    Date of Patent: November 30, 2021
    Assignee: TOYOTA MOTOR NORTH AMERICA, INC.
    Inventors: Sai Prithvi Gadde, Harjot Singh, Ethan Pomish
  • Patent number: 11189294
    Abstract: An electronic device and method are disclosed herein. The electronic device includes a speaker, microphone, processor and memory storing instructions, which implement the method, including: determining whether registration of a first user is required based on a first voice signal obtained through a microphone of the electronic device, when registration of the first user is required, requesting authentication of the first user by a second user preregistered at the electronic device, and when information authenticating the first user by the second user is received, registering the first user based on the received information.
    Type: Grant
    Filed: August 7, 2019
    Date of Patent: November 30, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Minjung Sohn
  • Patent number: 11189263
    Abstract: A voice data processing method includes acquiring historical voice data, acquiring historical voice feature vectors corresponding to the historical voice data, and performing clustering on the historical voice feature vectors to obtain a voice feature cluster, the voice feature cluster comprising at least one historical voice feature vector with a similar feature. The method also includes, when the voice feature cluster matches a high-frequency user condition, training a corresponding user voice model according to the historical voice feature vectors contained in the voice feature cluster; after a current voice feature vector of the current voice data matches the user voice model, initiating a user identity association request associated with the current voice data; and, after a response message corresponding to the user identity association request is received, binding user identity information in the response message to the user voice model.
    Type: Grant
    Filed: October 11, 2019
    Date of Patent: November 30, 2021
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Long Ma, Jun Li, Li Zhang
  • Patent number: 11182594
    Abstract: A face image retrieval method includes: obtaining to-be-retrieved face information corresponding to a to-be-retrieved image by a convolutional neural network, the convolutional neural network being configured with corresponding convolution calculation configuration information by a processor, the convolutional neural network including at least one convolutional layer, the convolution calculation configuration information including a data bit width value corresponding to each convolutional layer in the convolutional neural network, and the to-be-retrieved image including at least one face region; searching a database for matched preset face image information that matches the to-be-retrieved face information, the database storing at least one piece of preset face image information; and outputting the preset face image information that matches the to-be-retrieved face information.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: November 23, 2021
    Assignee: SHENZHEN SENSETIME TECHNOLOGY CO., LTD.
    Inventors: Haibin Lai, Ningyuan Mao, Qingzheng Li, Wenzhi Liu
  • Patent number: 11182828
    Abstract: A fixed-wing aircraft advertisement method, system, and non-transitory computer readable medium for a fixed-wing aircraft, include advertising from samples of speech heard by the fixed-wing aircraft at a given location.
    Type: Grant
    Filed: August 22, 2019
    Date of Patent: November 23, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Kuntal Dey, Seema Nagar, Roman Vaculin
  • Patent number: 11176543
    Abstract: The invention provides systems, methods and computer program products for secure electronic payment transactions based on voice generated currency tokens. The invention comprises implementing at a voice currency platform server, the steps of (i) receiving from a payor terminal device a request for generation of a voice currency token, (ii) performing voice based biometric authentication by matching the voice data received from the payor terminal device against one or more voice based biometric templates associated with the payor voice currency platform account, (iii) performing speech analysis to extract at least the currency amount identified within the voice data received from the payor terminal device and (iv) generating an encrypted voice currency token.
    Type: Grant
    Filed: September 5, 2019
    Date of Patent: November 16, 2021
    Assignee: Mastercard International Incorporated
    Inventors: Harsh Piparsaniya, Sudhir Gupta, Rahul Agrawal
  • Patent number: 11170775
    Abstract: Disclosed are a display apparatus and a method for operating the display apparatus, the display apparatus being operated by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm in a 5G environment connected for Internet of Things. The method for operating the display apparatus includes the acts of receiving utterance information of a user who is watching the display apparatus, selecting an utterance intention corresponding to the user's utterance information according to a predefined rule, switching operation of the display apparatus on the basis of the selected utterance intention, collecting reaction information of the user corresponding to the switched operation of the display apparatus, and reconstructing the predefined rule by using the user's utterance information, the selected utterance intention, and the user's reaction information.
    Type: Grant
    Filed: August 15, 2019
    Date of Patent: November 9, 2021
    Assignee: LG ELECTRONICS INC.
    Inventor: Yi Reun Kim
  • Patent number: 11170789
    Abstract: To generate substantially domain-invariant and speaker-discriminative features, embodiments are associated with a feature extractor to receive speech frames and extract features from the speech frames based on a first set of parameters of the feature extractor, a senone classifier to identify a senone based on the received features and on a second set of parameters of the senone classifier, an attention network capable of determining a relative importance of features extracted by the feature extractor to domain classification, based on a third set of parameters of the attention network, a domain classifier capable of classifying a domain based on the features and the relative importances, and on a fourth set of parameters of the domain classifier; and a training platform to train the first set of parameters of the feature extractor and the second set of parameters of the senone classifier to minimize the senone classification loss, train the first set of parameters of the feature extractor to maximize the dom
    Type: Grant
    Filed: July 26, 2019
    Date of Patent: November 9, 2021
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Zhong Meng, Jinyu Li, Yifan Gong
  • Patent number: 11170786
    Abstract: The present disclosure proposes a federated speaker verification method based on differential privacy, including: 1. performing, by a server, UBM pre-training to obtain an initial UBM; 2. receiving, by the client, the pre-trained initial UBM, and performing initial UBM learning based on local private speech data; 3. performing, by the client, differential privacy protection based on learned statistics; 4. aggregating, by the server, statistics uploaded by multiple clients, and updating the initial UBM; and 5. receiving, by the client, the updated UBM, performing adjustment based on the local private speech data to obtain a GMM for a user of the client, and determining, based on the updated UBM and the GMM, whether a to-be-verified speech is generated by the user of the client.
    Type: Grant
    Filed: May 30, 2021
    Date of Patent: November 9, 2021
    Assignee: Harbin Institute of Technology (Shenzhen) (Shenzhen Institute of Science and Technology Innovation, Harbin Institute of Technology)
    Inventors: Qing Liao, Yangqian Wang, Yang Liu, Lin Jiang, Xuan Wang, Ye Wang