Systems Using Speech Recognizers (epo) Patents (Class 704/E15.045)

Method, apparatus, and computer program product for providing a search feedback system

Patent number: 12260421

Abstract: Provided herein are systems, methods and computer readable media for receiving consumer search data, aggregating by consumer and location, and utilizing the aggregated consumer search data in demand forecasting and relevance determination. An example method may include receiving consumer search data, the consumer search data indicative of search performed by a consumer, the consumer search data comprising one or more search terms and at least one of a consumer location or consumer identification information, storing the consumer search data for a predetermined time interval, and providing at least one of consumer aggregated search data to a relevance module for determining which of a plurality of promotions to present to a consumer at a second time or providing location aggregated search data to a demand forecasting module for utilization in forecasting promotion demand in a particular location.
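
Illustrative sketch (not taken from the patent): the aggregation step described in the abstract could be pictured as grouping search terms by consumer and by location. The `SearchEvent` type, its field names, and the `aggregate` function are hypothetical; the abstract does not say how the data is stored or counted.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class SearchEvent:
    # Hypothetical record; the abstract only requires search terms plus
    # at least one of a location or consumer identification.
    terms: Tuple[str, ...]
    consumer_id: Optional[str] = None
    location: Optional[str] = None


def aggregate(events):
    """Return (per-consumer, per-location) search term counts."""
    by_consumer = defaultdict(Counter)
    by_location = defaultdict(Counter)
    for event in events:
        if event.consumer_id is not None:
            by_consumer[event.consumer_id].update(event.terms)
        if event.location is not None:
            by_location[event.location].update(event.terms)
    return by_consumer, by_location


events = [
    SearchEvent(("pizza",), consumer_id="c1", location="Chicago"),
    SearchEvent(("pizza", "spa"), consumer_id="c1", location="Chicago"),
    SearchEvent(("spa",), location="Austin"),
]
by_consumer, by_location = aggregate(events)
```

In this sketch the per-consumer counts would feed a relevance step and the per-location counts a demand forecast; both downstream modules are outside the example.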

Type: Grant

Filed: April 1, 2022

Date of Patent: March 25, 2025

Assignee: BYTEDANCE INC.

Inventors: Greyson Gregory, Vincenzo Mannino, Alex Lester
Supporting multiple roles in voice-enabled navigation

Patent number: 12246676

Abstract: To determine whether a user is authorized to make a particular audio request during navigation, a client device receives a request for navigation directions from a starting location to a destination location. The client device provides a set of navigation directions for traversing to the destination location along a route. During a navigation session, an audio request related to the route is received from a user. The client device determines an authorization level of the user based on the audio request, and provides a response to the request based on the authorization level of the user.

Type: Grant

Filed: June 23, 2021

Date of Patent: March 11, 2025

Assignee: GOOGLE LLC

Inventor: Matthew Sharifi
Policy layers for machine control

Patent number: 12240112

Abstract: Apparatuses, systems, and techniques provide a policy that can be executed to cause a machine to move. In at least one embodiment, a first policy layer is provided to cause the machine to execute a first motion that causes the machine to accelerate to reach an unbiased state. A second policy layer is provided to cause the machine to execute a second motion without influencing the unbiased state to be reached by the machine. The policy can comprise the first and second policy layers.

Type: Grant

Filed: April 26, 2022

Date of Patent: March 4, 2025

Assignee: NVIDIA Corporation

Inventors: Nathan Donald Ratliff, Karl Van Wyk, Man Xie, Anqi Li, Muhammad Asif Rana
Privacy mode based on speaker identifier

Patent number: 12243532

Abstract: Techniques for configuring a speech processing system with a privacy mode that is associated with the identity of a user that activated the privacy mode are described. A user may speak an indication to have the speech processing system activate a privacy mode. When such an indication is detected by the speech processing system, the speech processing system determines an identity of the user, determines a unique system identifier associated with the user, and generates a privacy mode flag. The speech processing system then associates the privacy mode flag with the user's unique system identifier. The privacy mode flag indicates to components of the speech processing system that any data related to processing of the user's utterances should not be sent to long term storage, thus causing various components of the system to delete data once the respective component is finished processing with respect to an utterance of the user.
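
Illustrative sketch (not taken from the patent): one way to picture the privacy mode flag is a registry keyed by user identifier that each component consults before persisting anything. `PrivacyRegistry`, `Component`, and the uppercase "processing" step are hypothetical stand-ins.

```python
class PrivacyRegistry:
    """Tracks which user identifiers have a privacy mode flag set."""

    def __init__(self):
        self._private_users = set()

    def activate(self, user_id):
        self._private_users.add(user_id)

    def is_private(self, user_id):
        return user_id in self._private_users


class Component:
    """A hypothetical processing component that honours the flag."""

    def __init__(self, registry):
        self.registry = registry
        self.working = {}    # data held only while processing
        self.long_term = []  # stand-in for long term storage

    def process(self, user_id, utterance):
        self.working[user_id] = utterance
        result = utterance.upper()  # placeholder for real processing
        # Processing is finished: drop the working copy in every case.
        data = self.working.pop(user_id)
        # Persist only when the user has no privacy mode flag.
        if not self.registry.is_private(user_id):
            self.long_term.append((user_id, data))
        return result


registry = PrivacyRegistry()
registry.activate("user-1")
component = Component(registry)
component.process("user-1", "turn off the lights")
component.process("user-2", "what time is it")
```

After these calls only the second user's utterance remains in `long_term`, and `working` is empty.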

Type: Grant

Filed: November 22, 2023

Date of Patent: March 4, 2025

Assignee: Amazon Technologies, Inc.

Inventor: Zhenhua Wang
Method of editing speech recognition result

Patent number: 12243537

Abstract: Disclosed is a method of editing a speech recognition result, the method being performed by a computing device. The method may include: displaying a word list satisfying a predetermined condition based on text information generated by speech recognition; determining a target word within the word list; and displaying a region corresponding to the target word within the text information, in which the predetermined condition includes at least one of predetermined word information for each user account and predetermined threshold information associated with a frequency of occurrence of a word.
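
Illustrative sketch (not taken from the patent): the word list and region lookup could be approximated with a frequency count and a whole-word search. The function names, the "at least `min_count` occurrences" reading of the threshold, and the `user_words` parameter are assumptions.

```python
import re
from collections import Counter


def candidate_words(text, min_count, user_words=frozenset()):
    """Words on the account's list or occurring at least min_count times."""
    counts = Counter(w.lower() for w in re.findall(r"\w+", text))
    return sorted(
        w for w, n in counts.items() if n >= min_count or w in user_words
    )


def regions(text, target):
    """(start, end) character offsets of each whole-word match of target."""
    pattern = re.compile(rf"\b{re.escape(target)}\b", re.IGNORECASE)
    return [m.span() for m in pattern.finditer(text)]


transcript = "the model heard the word twice and the model was right"
words = candidate_words(transcript, min_count=2)
spans = regions(transcript, "model")
```

Here `words` holds the frequent words, and `spans` gives the offsets an editor could highlight for the chosen target word.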

Type: Grant

Filed: August 22, 2023

Date of Patent: March 4, 2025

Assignee: ActionPower Corp.

Inventors: Jihwa Lee, Hwanseok Choi, Jinsuk Park, Yunseop Kim, Woochan Jeong
Semantic search based on a graph database

Patent number: 12242477

Abstract: In order to perform a semantic search based on a graph database, sets of nodes are selected from a plurality of nodes in a graph database. A set of nodes semantically matches a keyword in a natural language query. At least one target node is identified in the sets of nodes. A path is selected from candidate paths based on similarities between the candidate paths and a plurality of paths in the graph database. A graph query for retrieving information from the graph database is generated based on the selected path and the query target.

Type: Grant

Filed: September 7, 2021

Date of Patent: March 4, 2025

Assignee: International Business Machines Corporation

Inventors: Teng Sun, Tong Liu, Si Tong Zhao, XueLiang Zhao, Frank Feng, Yu Zui Wy You, Zhong Fang Yuan
Videochat

Patent number: 12235898

Abstract: The present disclosure provides a technical solution of multi-modal chatting, which may provide response to user query by using multi-modal response in the interaction between chatbot and human beings, so that the expressing ways and the expressed content by the chatbot could be richer by using such response in a multi-modal way.

Type: Grant

Filed: February 20, 2024

Date of Patent: February 25, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Nan Duan, Lei Ji, Ming Zhou
Camera control systems and methods for a computer-assisted surgical system

Patent number: 12220181

Abstract: A camera control system may access surgical session data for a surgical session, the surgical session including performance of one or more operations by a computer-assisted surgical system. The camera control system may identify, based on the surgical session data, an event associated with the surgical session, and may determine, based on the surgical session data, a location associated with the event. In response to the determination of the location of the event, the camera control system may direct an automatic adjustment of a view of a camera to capture a specific view of the location associated with the event.

Type: Grant

Filed: January 28, 2020

Date of Patent: February 11, 2025

Assignee: Intuitive Surgical Operations, Inc.

Inventors: Govinda Payyavula, Anthony M. Jarc
Information processing device and information processing method

Patent number: 12220805

Abstract: An information processing device including: an output control unit that controls an output from an interaction device to a user; an action evaluation unit that determines an action of the user performed in correspondence with an output of the interaction device; an emotion estimation unit that estimates an emotion of the user corresponding to the action of the user; and an information accumulation unit that accumulates the output of the interaction device, the action of the user, and the emotion of the user in association with each other as interaction information, in which the output control unit controls the output from the interaction device to the user based on the interaction information accumulated.

Type: Grant

Filed: February 14, 2020

Date of Patent: February 11, 2025

Assignee: SONY GROUP CORPORATION

Inventors: Fumihiko Iida, Ryuichi Suzuki, Kuniaki Torii, Emika Kaneko
Learning to extract entities from conversations with neural networks

Patent number: 12216999

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for extracting entities from conversation transcript data. One of the methods includes obtaining a conversation transcript sequence, processing the conversation transcript sequence using a span detection neural network configured to generate a set of text token spans; and for each text token span: processing a span representation using an entity name neural network to generate an entity name probability distribution over a set of entity names, each probability in the entity name probability distribution representing a likelihood that a corresponding entity name is a name of the entity referenced by the text token span; and processing the span representation using an entity status neural network to generate an entity status probability distribution over a set of entity statuses.

Type: Grant

Filed: February 19, 2020

Date of Patent: February 4, 2025

Assignee: Google LLC

Inventors: Nan Du, Linh Mai Tran, Yu-Hui Chen, Izhak Shafran
Hyperparameter optimization in file compression using sequence alignment

Patent number: 12216621

Abstract: Compressing files is disclosed. An input file to be compressed is first aligned. During or prior to aligning the input file, hyperparameters are set, determined, or configured. The hyperparameters may be set, determined, or configured to achieve a particular performance characteristic. Aligning the file includes splitting the file into sequences that can be aligned. The result is a compression matrix, where each row of the matrix corresponds to part of the file. A consensus sequence is determined from the compression matrix. Using the consensus sequence, pointer pairs are generated. Each pointer pair identifies a subsequence of the consensus matrix. The compressed file includes the pointer pairs and the consensus sequence.
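
Illustrative sketch (not taken from the patent): a toy version of "consensus sequence plus pointer pairs". It deliberately skips the alignment step and the compression matrix, and simply reuses identical fixed-length chunks; `chunk_len` stands in for a hyperparameter, and all names are hypothetical.

```python
def compress(data, chunk_len=4):
    """Return (consensus, pointer_pairs) for a string.

    Each pointer pair is a (start, end) slice into the consensus.
    """
    consensus = ""
    pointers = []
    for i in range(0, len(data), chunk_len):
        chunk = data[i:i + chunk_len]
        start = consensus.find(chunk)
        if start == -1:
            # Chunk not yet representable: extend the consensus.
            start = len(consensus)
            consensus += chunk
        pointers.append((start, start + len(chunk)))
    return consensus, pointers


def decompress(consensus, pointers):
    """Rebuild the original string from the consensus and pointer pairs."""
    return "".join(consensus[s:e] for s, e in pointers)


data = "ABCDABCDEFGHABCD"
consensus, pointers = compress(data)
restored = decompress(consensus, pointers)
```

Changing `chunk_len` trades consensus length against the number of pointer pairs, which is the kind of knob the abstract's hyperparameters would tune.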

Type: Grant

Filed: April 12, 2022

Date of Patent: February 4, 2025

Assignee: Dell Products L.P.

Inventors: Ofir Ezrielev, Ilan Buyum, Jehuda Shemer
Controller for a mobile drive, and method for controlling a mobile drive

Patent number: 12217746

Abstract: A controller for a furniture drive includes an operating device which includes a speech controller. The speech controller includes a speech control subunit operatively connected to an adjustment drive, and a microphone interacting with the speech control subunit. The speech controller includes three speech control subunits arranged in the operating unit, with two of the speech control subunits forming actuators of adjustment functions and one of the speech control units forming an actuator of stopping the adjustment drive.

Type: Grant

Filed: April 9, 2019

Date of Patent: February 4, 2025

Assignee: Dewertokin Technology Group Co., Ltd

Inventor: Armin Hille
Device targeting for content

Patent number: 12217749

Abstract: Devices and techniques are generally described for targeting of devices. In various examples, a first natural language input comprising a first request to output a response may be received by an input device. A first component may determine first data associated with the input device. A plurality of devices associated with the first data may be determined. First state data describing a state of each device of the plurality of devices may be determined. A first device of the plurality of devices may be determined as a target device for the first request based at least in part on the first state data. The first device may be different from the input device. First instructions may be sent to the first device effective to cause the first device to display the first visual content.

Type: Grant

Filed: December 10, 2021

Date of Patent: February 4, 2025

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Ratika Anand, Zhen Hua, Trisha Hajela, Evan Victor Chang, Tom Vasella
Multi-modal sensor fusion for content identification in applications of human-machine interfaces

Patent number: 12211308

Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.

Type: Grant

Filed: August 31, 2021

Date of Patent: January 28, 2025

Assignee: Nvidia Corporation

Inventors: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
Customer service system, server, control method, and storage medium

Patent number: 12205122

Abstract: A customer service system for providing customer service using a robot for customer service is provided. A first acquisition unit acquires first information regarding a topic provided by a person in charge of response in a negotiation between a visitor and the person in charge of response. A second acquisition unit acquires a detection result regarding speech and behavior of the visitor by the robot. An estimation unit estimates a reaction of the visitor on a basis of the detection result acquired by the second acquisition unit. An output unit outputs second information regarding the reaction estimated by the estimation unit.

Type: Grant

Filed: February 16, 2022

Date of Patent: January 21, 2025

Assignee: HONDA MOTOR CO., LTD.

Inventors: Leo Ito, Jingwen Han, Tomohiro Tsukamoto, Eriko Okabe, Yuichi Kawasaki, Misato Fukushima
Conferencing system and method for controlling the conferencing system

Patent number: 12205613

Abstract: A communication system and a method can be configured to facilitate the performance of a conference. The system can include a conference organizer terminal and at least two participants' terminals each assigned to respective conference participants who each log in to start a conference on the communication system. The communication system can be configured to calculate a decision situation at a particular point in time of the ongoing conference by analyzing the views expressed by the conference participants during the conference and send data relating to the decision situation for that point in time to the conference organizer's terminal and/or other conference participant terminals for use in facilitating the conference. In some embodiments, such data can be used to assist the conference participants in recognizing when there is a consensus made on at least one decision to be made during the conference.

Type: Grant

Filed: June 11, 2020

Date of Patent: January 21, 2025

Assignee: RINGCENTRAL, INC.

Inventors: Jurgen Totzke, Karl Klug
Structured video documents

Patent number: 12169522

Abstract: A method includes receiving a content feed that includes audio data corresponding to speech utterances and processing the content feed to generate a semantically-rich, structured document. The structured document includes a transcription of the speech utterances and includes a plurality of words each aligned with a corresponding audio segment of the audio data that indicates a time when the word was recognized in the audio data. During playback of the content feed, the method also includes receiving a query from a user requesting information contained in the content feed and processing, by a large language model, the query and the structured document to generate a response to the query. The response conveys the requested information contained in the content feed. The method also includes providing, for output from a user device associated with the user, the response to the query.

Type: Grant

Filed: March 2, 2023

Date of Patent: December 17, 2024

Assignee: Google LLC

Inventors: Johan Schalkwyk, Francoise Beaufays
Natural language interactions with interactive visual content

Patent number: 12159628

Abstract: Techniques for facilitating natural language interactions with visual interactive content are described. During a build time, a system analyzes various websites and applications relating to a particular user goal to understand website and application navigation and information relating to the user goal. The learned information is used to store configuration data. During runtime, when a user requests performance of an action, the system engages in a dialog with the user to complete the user's goal. The system uses the stored configuration data to determine actions to be performed at a website or application to complete the user's goal, and determines system responses to present to the user to facilitate completion of the goal. Such system responses may request information from the user, may inform the user of information displayed at the website or application, etc.

Type: Grant

Filed: December 10, 2021

Date of Patent: December 3, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Amitabh Saikia, Devesh Mohan Pandey, Tagyoung Chung, Shanchan Wu, Chien-Wei Lin, Govindarajan Sundaram Thattai, Aishwarya Naresh Reganti, Arindam Mandal, Prakash Krishnan, Raefer Christopher Gabriel, Meyyappan Sundaram
Man-machine dialogue method and apparatus, electronic device and storage medium

Patent number: 12154554

Abstract: A man-machine dialogue method includes: for each round of a plurality of rounds of dialogue wherein each round includes dialogue information input by a user, determining semantic information corresponding to the dialogue information; determining a target slot position corresponding to an item indicated by the semantic information, establishing a new pre-order data structure including the target slot position when there is no established pre-order data structure including the target slot position; outputting reply information responsive to the dialogue information, wherein the reply information is configured to guide the user to input new dialogue information in a subsequent round of dialogue; and in a case that the dialogue information input by the user in the subsequent round includes a keyword for indicating ordering, performing an ordering operation according to a finally-established pre-order data structure.

Type: Grant

Filed: February 25, 2022

Date of Patent: November 26, 2024

Assignees: Beijing Xiaomi Mobile Software Co., Ltd., Beijing Xiaomi Pinecone Electronics Co., Ltd.

Inventors: Zhennan Ming, Junjie Jiang
Assistance method and assistance system and assistance device using assistance method that execute processing relating to a behavior model

Patent number: 12145603

Abstract: A driving assistance device executes processing relating to a behavior model of a vehicle. Detected information from the vehicle is input to a detected information inputter. An acquirer derives at least one of a travel difficulty level of a vehicle, a wakefulness level of a driver, and a driving proficiency level of the driver on the basis of the detected information that is input to the detected information inputter. A determiner determines whether or not to execute processing on the basis of at least one information item derived by the acquirer. If the determiner has made a determination to execute the processing, a processor executes the processing relating to the behavior model. It is assumed that the processor does not execute the processing relating to the behavior model if the determiner has made a determination to not execute the processing.

Type: Grant

Filed: May 11, 2023

Date of Patent: November 19, 2024

Assignee: PANASONIC AUTOMOTIVE SYSTEMS CO., LTD.

Inventor: Koichi Emura
Determining unknown concepts from surrounding context

Patent number: 12141530

Abstract: A computer-implemented method for learning unknown concepts during natural language processing is disclosed, including identifying a sentence associated with an unknown concept, selecting a first sequential set of sentences from a first document, including the sentence associated with the unknown concept, one sentence prior, and subsequent to the sentence associated with the unknown concept, selecting a second sequential set of sentences from a second document, including a sentence associated with a known concept, and one sentence prior and subsequent to the sentence associated with the known concept, comparing concepts associated with the first sequential set of sentences and second sequential set of sentences, determining whether an inference can be made between the unknown concept associated with the sentence from the first document and the sentence associated with the known concept associated with the sentence from the second document, and tagging the unknown concept associated with the known concept.

Type: Grant

Filed: June 9, 2021

Date of Patent: November 12, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Maulana Bachtiar, Thi Thanh Thao Lai, Wen Rui Siow, Yida Lee, Ronny Syarif, Cheranellore Vasudevan
Methods, systems, and media for rights management of embedded sound recordings using composition clustering

Patent number: 12141882

Abstract: Methods, systems, and media for determining and presenting information related to embedded sound recordings are provided.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 12, 2024

Assignee: Google LLC

Inventors: Kevin Song Zhu, Thomas Bugnon, Keith Wedelich, George Huang, Jacob Levine, Sha Chang, Julian Bill, Arthur Gaudriot, Nicholas Bryan Johnson, Vishaal Prasad
Augmented display from conversational monitoring

Patent number: 12143673

Abstract: Systems and methods are provided for generating for display an indication of a segment of media content relevant to a voice communication. This may be accomplished by a media guidance application that monitors a voice communication between users. The media guidance application determines that a first user is describing media content. In response to determining that the first user is describing the media content, the media guidance application retrieves media asset viewing history of the first user. The media guidance application determines, based on metadata of each media asset in the media asset viewing history of the first user and the voice communication, a media asset that the first user is describing. The media guidance application determines, based on metadata of the media asset, a segment of the media asset that the first user is describing. The media guidance application generates, for display, an indication of the segment.

Type: Grant

Filed: July 25, 2023

Date of Patent: November 12, 2024

Assignee: Adeia Guides Inc.

Inventors: Michael K. McCarty, Glen E. Roe
Repository for quick retrieval of object(s) of a communication platform

Patent number: 12141100

Abstract: A repository for quick retrieval of object(s) of a communication platform is described. Server(s) of the communication platform can receive, in association with a user interface, a request to associate an object with a repository. The server(s) can store an object identifier of the object in the repository and cause display of an object user interface element representative of the object to be presented in association with a repository user interface element representative of the repository. In response to receiving a selection of the object user interface element, the server(s) can retrieve the object using the object identifier and cause the object to be presented, in the user interface with contextual data, wherein the contextual data comprises other object(s) associated with the object.

Type: Grant

Filed: April 9, 2021

Date of Patent: November 12, 2024

Assignee: Salesforce, Inc.

Inventors: Jason Hon-Son Wong, Julie Punturo, Elizabeth Anne Millikin, Zachery Floyd
Ring enabling its wearer to enter control commands

Patent number: 12130965

Abstract: Control systems and methods are provided that utilize a device, which can be worn by a user, to enable the user to enter control commands for causing a controller to control one or more electronic devices in a local network, such as a Wi-Fi system. A local control system, according to one implementation, includes a smart ring configured to obtain movement information related to one or more movements of the smart ring while a user is wearing the smart ring. The local control system also includes a controller device configured to communicate with the smart ring using Bluetooth or Wi-Fi signals. Characteristics of the movement information can be translated in order to obtain one or more control commands. The controller device is configured to control one or more aspects of one or more electronic devices based on the one or more control commands.

Type: Grant

Filed: July 7, 2022

Date of Patent: October 29, 2024

Assignee: PLUME DESIGN, INC.

Inventors: Zhicheng Qiu, William J. McFarland
Systems and methods for assisting an operator in operating vehicle controls

Patent number: 12128908

Abstract: Systems and methods for assisting an operator in operating vehicle controls are disclosed herein. One embodiment detects that an operator is touching a control in a vehicle and automatically takes, in response to the operator touching the control, one or more actions to assist the operator with regard to the vehicle being a left-hand-drive vehicle or a right-hand-drive vehicle.

Type: Grant

Filed: October 24, 2022

Date of Patent: October 29, 2024

Assignee: Woven by Toyota, Inc.

Inventors: Manuel Ludwig Kuehner, Hiroshi Yasuda
Audio frequency signal processing method and apparatus, terminal and storage medium

Patent number: 12121812

Abstract: An audio signal processing method, an audio signal processing apparatus, a terminal, and a storage medium are provided. In the method, an audio signal in a first period of time is collected. A reference sound volume is determined based on the collected audio signal. Audio signal sampling is performed on the audio signal at multiple audio sampling points in a second period of time to obtain multiple audio sampling signals. In a case of determining that the multiple audio sampling signals meet a predetermined condition based on the reference sound volume, a basis sound volume is determined based on the multiple audio sampling signals. The basis sound volume is used for controlling a display effect of a target object in a virtual scene, and different basis sound volumes correspond to different display effects of the target object.
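
Illustrative sketch (not taken from the patent): the two-stage volume logic could look like the following. The abstract does not define the reference volume, the predetermined condition, or the display mapping, so this sketch assumes an RMS reference, a "every sample exceeds `margin` times the reference" condition, and a stepped effect level; all names are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def basis_volume(first_period, sampled, margin=1.5):
    """Return a basis volume, or None if the samples fail the condition.

    Assumed condition: every sampled magnitude exceeds margin * reference,
    where the reference is the RMS of the first period.
    """
    reference = rms(first_period)
    magnitudes = [abs(s) for s in sampled]
    if all(m > margin * reference for m in magnitudes):
        return sum(magnitudes) / len(magnitudes)
    return None


def display_effect(volume, step=0.25):
    """Map a basis volume to a discrete effect level (assumed mapping)."""
    return int(volume // step)


background = [0.1, -0.1, 0.1, -0.1]
loud = basis_volume(background, [0.5, -0.6, 0.7])
quiet = basis_volume(background, [0.12, 0.5])
```

With these inputs `loud` is the mean magnitude of the three samples and `quiet` is `None`, because one sample does not clear the assumed margin over the background level.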

Type: Grant

Filed: March 11, 2020

Date of Patent: October 22, 2024

Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.

Inventor: Wei Zheng
Enhanced personalized gesture inputs at an electronic gaming machine

Patent number: 12118846

Abstract: Devices, systems and methods are provided. A device may include a gesture input device to detect gesture inputs performed by a user, a processor circuit, and a memory coupled to the processor circuit. The memory includes machine-readable instructions that, when executed by the processor circuit, cause the processor circuit to receive a first gesture input value from the first gesture input device and that corresponds to a user-specific gesture that the user performs, associate the first gesture input value with a first gaming device operation to be performed by the gaming device, receive the first gesture input value that is associated with the first gaming device operation, and responsive to receiving the first gesture input value that is associated with the first gaming device operation, cause the gaming device to perform the first gaming device operation.

Type: Grant

Filed: April 16, 2021

Date of Patent: October 15, 2024

Assignee: IGT

Inventors: David Small, David Froy, Jr., Michael Russ
Reducing the need for manual start/end-pointing and trigger phrases

Patent number: 12118999

Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
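
Illustrative sketch (not taken from the patent): a rule-based version of the "was this intended for the virtual assistant?" decision might score a few contextual signals. The `Context` fields, weights, and threshold are invented for illustration; the abstract names no specific signals.

```python
from dataclasses import dataclass


@dataclass
class Context:
    # Hypothetical contextual signals.
    seconds_since_last_response: float
    user_facing_device: bool
    other_speech_detected: bool


def intended_for_assistant(ctx, threshold=0.5):
    """Score the contextual signals and compare against a threshold."""
    score = 0.0
    if ctx.seconds_since_last_response < 10.0:
        score += 0.4  # likely a follow-up to a recent exchange
    if ctx.user_facing_device:
        score += 0.4
    if ctx.other_speech_detected:
        score -= 0.3  # may be a conversation between people
    return score >= threshold


follow_up = Context(3.0, True, False)
side_talk = Context(120.0, False, True)
respond = intended_for_assistant(follow_up)
ignore = not intended_for_assistant(side_talk)
```

A probabilistic variant, which the abstract also mentions, would replace the hand-set weights with a trained model that outputs a likelihood.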

Type: Grant

Filed: August 7, 2023

Date of Patent: October 15, 2024

Assignee: Apple Inc.

Inventors: Philippe P. Piernot, Justin G. Binder
Method for executing application and apparatus therefor

Patent number: 12112095

Abstract: Provided is a device for executing an application including a graphics user interface (GUI) for receiving an input value of an input field, the device including an audio output unit, a user input unit receiving a user input to request execution of the application, and a control unit configured to output, through the audio output unit, an audio signal indicating an induced inquiry corresponding to the input field, based on whether the user input is a voice input, to receive a voice input indicating a response to the induced inquiry, and to execute the application by setting an input value for the input field based on the voice input indicating the response to the induced inquiry.

Type: Grant

Filed: February 28, 2018

Date of Patent: October 8, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Dong-hyeon Lee, Se-chun Kang, Yu-bin Seo, He-jung Yang
Curriculum architecture tool

Patent number: 12112651

Abstract: Delivering multimedia content for academic curriculum, over a network, to a user. The method includes a multimedia content delivery system identifying user attributes for the user, where the user is connected to the multimedia content delivery system over a network. The method further includes the multimedia content delivery system identifying attributes of a plurality of multimedia assets. Based on the identified user attributes and the identified attributes of the plurality of multimedia assets, the method includes creating a multimedia offering for delivery to the user. The multimedia offering satisfies a curriculum requirement specific to the user based on the user attributes. The multimedia offering is delivered over the network to the user.

Type: Grant

Filed: April 14, 2022

Date of Patent: October 8, 2024

Assignee: Western Governors University

Inventors: Jon Morley, Mike Hassett, Adel Lelo, Jerry Damon Jasperson, Timothy Andrus, Brandon Karratti
Information processing apparatus, information processing method, and non-transitory computer readable media storing a program

Patent number: 12093459

Abstract: An information processing apparatus comprises a processor configured to execute a program so as to: acquire viewpoint information, motion information and object information; process an object image wherein: the object image is an image representing a virtual substance and is updated based on the viewpoint information and the motion information; calculate a position when superimposing a background image and the object image, wherein: the background image is an image according to the field of view of the user; generate visual information superimposing the background image and the object image based on the position calculated, and output the generated visual information to a display device; and generate tactile information for an ultrasound generator irradiating the user with ultrasound corresponding to the object based on the position calculated, the motion information and the object information, and output the generated tactile information to the ultrasound generator.

Type: Grant

Filed: December 8, 2022

Date of Patent: September 17, 2024

Assignee: THE UNIVERSITY OF TOKYO

Inventors: Mamoru Miyawaki, Hiroyuki Shinoda, Takaaki Kamigaki, Mitsuru Ito, Tao Morisaki, Shun Suzuki, Atsushi Matsubayashi, Ryoya Onishi, Yasutoshi Makino, Seki Inoue
Voice recognition method and device

Patent number: 12094469

Abstract: A method for recognizing speech comprises: respectively setting initial values of a Chinese character coefficient and a Pinyin coefficient, generating a Chinese character mapping function according to the initial value of the Chinese character coefficient, and generating a Pinyin mapping function according to the initial value of the Pinyin coefficient (S101); training the Chinese character mapping function and the Pinyin mapping function using a plurality of preset training samples, calculating training results as parameters of a joint loss function, and generating a target mapping function according to calculation results (S102); and recognizing, according to the target mapping function, speech to be recognized, so as to obtain a Chinese character recognition result and a Pinyin recognition result of the speech to be recognized (S103). The method reduces the cost of speech recognition while ensuring the accuracy of speech recognition.

Type: Grant

Filed: March 3, 2020

Date of Patent: September 17, 2024

Assignee: JINGDONG TECHNOLOGY HOLDING CO., LTD.

Inventors: Li Fu, Xiaoxiao Li
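One way to picture the joint loss mentioned in this abstract is as a weighted sum of a Chinese character loss and a Pinyin loss. The sketch below is only an illustration under that assumption: the cross-entropy form, the function names, and the weights `hanzi_weight` and `pinyin_weight` are hypothetical and are not taken from the patent.

```python
import math
from typing import Sequence


def cross_entropy(probs: Sequence[float], target: int) -> float:
    """Negative log-probability that the model assigned to the correct class."""
    return -math.log(probs[target])


def joint_loss(
    hanzi_probs: Sequence[float],
    hanzi_target: int,
    pinyin_probs: Sequence[float],
    pinyin_target: int,
    hanzi_weight: float = 0.5,   # assumed weighting, not specified in the abstract
    pinyin_weight: float = 0.5,  # assumed weighting, not specified in the abstract
) -> float:
    """Combine the two per-task losses into a single training objective."""
    return (
        hanzi_weight * cross_entropy(hanzi_probs, hanzi_target)
        + pinyin_weight * cross_entropy(pinyin_probs, pinyin_target)
    )


# Toy example: one frame, two candidate characters and two candidate syllables.
loss = joint_loss([0.5, 0.5], 0, [0.25, 0.75], 0)
```

Training both outputs against a single combined objective is what would let one model return a character result and a Pinyin result together.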
Systems and methods for selectively powering TV remote microphones

Patent number: 12089014

Abstract: A remote control with a microphone subsystem comprising a pair of internal microphones is shown and described. When connected to a remote-control base station that is itself connected to an external power source, the microphone subsystem is continuously energized by the external power source, and the pair of internal microphones operate as far field microphones that receive oral commands uttered by a user from a distance. When the remote control is removed from the base, the microphone subsystem is configured for selective connection to an internal power source by actuating a user control on the remote control. In the external power source mode, signals from both microphones are digitally processed to provide a far-field microphone array with beam forming. In the direct current mode, only one microphone's signals are digitally processed as a simple monaural signal (or they are not digitally processed).

Type: Grant

Filed: April 7, 2022

Date of Patent: September 10, 2024

Assignee: Vizio, Inc.

Inventors: W. Leo Hoarty, Glen Gihong Kim
Automatic learning of entities, words, pronunciations, and parts of speech

Patent number: 12080275

Abstract: Systems for automatic speech recognition and/or natural language understanding automatically learn new words by finding subsequences of phonemes that, if they were a new word, would enable a successful tokenization of a phoneme sequence. Systems can learn alternate pronunciations of words by finding phoneme sequences with a small edit distance to existing pronunciations. Systems can learn the part of speech of words by finding part-of-speech variations that would enable parses by syntactic grammars. Systems can learn what types of entities a word describes by finding sentences that could be parsed by a semantic grammar but for the words not being on an entity list.

Type: Grant

Filed: January 11, 2021

Date of Patent: September 3, 2024

Assignee: SoundHound AI IP, LLC.

Inventor: Anton V. Relin
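The idea of learning alternate pronunciations from phoneme sequences with a small edit distance can be sketched with a plain Levenshtein distance over phoneme symbols. This is a minimal illustration, not the patented method: the function names, the ARPAbet-style symbols, and the `max_distance` threshold are assumptions.

```python
from typing import Sequence


def phoneme_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # delete a phoneme from a
                            curr[j - 1] + 1,      # insert a phoneme from b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def is_alternate_pronunciation(
    candidate: Sequence[str], known: Sequence[str], max_distance: int = 1
) -> bool:
    """True when the candidate differs from the known pronunciation, but only slightly."""
    return 0 < phoneme_edit_distance(candidate, known) <= max_distance


# Hypothetical pronunciations of "tomato".
known = ["T", "AH", "M", "EY", "T", "OW"]
heard = ["T", "AH", "M", "AA", "T", "OW"]
```

Here `heard` differs from `known` by one substitution, so it would qualify as a candidate alternate pronunciation under the assumed threshold.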
Speech recognition for providing assistance during customer interaction

Patent number: 12079261

Abstract: A computer-implemented method for presenting relevant information to a customer service representative of a business may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; determining one or more keywords from the text stream; comparing the one or more keywords with a history of keywords that have previously been searched; and/or searching a database for information related to the one or more keywords that have not been previously searched. As a result of the keyword search, information about topics that the customer is interested in, may be located and displayed on a customer service representative display to facilitate the customer service representative timely relaying the information found by the keyword search to enhance the customer experience. Exemplary keywords may relate to insurance and financial services, such as “auto,” “home,” “life,” “insurance,” or “vehicle loan.”

Type: Grant

Filed: June 28, 2022

Date of Patent: September 3, 2024

Assignee: State Farm Mutual Automobile Insurance Company

Inventor: Sylvia Hernandez
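The keyword step in this abstract (find keywords in the transcribed text, then search only those not already in the search history) can be sketched as follows. The function name, the watchlist, and the whole-word matching rule are illustrative assumptions, not details from the patent.

```python
import re
from typing import Iterable, List, Set


def keywords_to_search(
    transcript: str, watchlist: Iterable[str], already_searched: Set[str]
) -> List[str]:
    """Return watchlist keywords present in the transcript that were not searched before."""
    text = transcript.lower()
    found = []
    for keyword in watchlist:
        # Whole-word (or whole-phrase) match, so "auto" does not match "automatic".
        pattern = r"\b" + re.escape(keyword.lower()) + r"\b"
        if re.search(pattern, text) and keyword not in already_searched:
            found.append(keyword)
    return found


watchlist = ["auto", "home", "life", "insurance", "vehicle loan"]
history = {"auto"}  # keywords already searched earlier in the call
pending = keywords_to_search(
    "I need auto insurance and maybe a vehicle loan", watchlist, history
)
history.update(pending)  # remember them so they are not searched again
```

Keeping the history as a set makes the "previously searched" check a constant-time lookup, which matters when the transcript is processed continuously during a call.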
Systems and methods for voice assistant for electronic health records

Patent number: 12080292

Abstract: An electronic record voice assistant system can include one or more processors that receive audio data, apply a machine learning model to the audio data to generate speech data including at least one value, determine a state of an electronic record, and update one or more fields of the electronic record using the state and the at least one value.

Type: Grant

Filed: June 6, 2022

Date of Patent: September 3, 2024

Assignee: Bola Technologies, Inc.

Inventors: Rushi M. Ganmukhi, Daniel Brownwood, Sidharth Malhotra, Augusto Monteiro Nobre Amanco
Audio-visual hearing aid

Patent number: 12073844

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

Type: Grant

Filed: October 1, 2020

Date of Patent: August 27, 2024

Assignee: Google LLC

Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
Real-time data communication for elevator system

Patent number: 12065329

Abstract: A method for controlling an elevator system comprises receiving media data, receiving at least one of control data and sensor data, transmitting the media data and at least one of the control data and sensor data via a real-time data communication network between a central controller adapted for controlling a drive of the elevator system and a local controller, the local controller being provided in one of an elevator car control panel and a door control panel, wherein the real-time data communication network is adapted for transmitting data packets while ensuring that a data packet is transmitted during a maximal transmission time.

Type: Grant

Filed: January 16, 2018

Date of Patent: August 20, 2024

Assignee: Inventio AG

Inventor: Stefano Carriero
Using a natural language model to interface with a closed domain system

Patent number: 12057113

Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.

Type: Grant

Filed: June 6, 2023

Date of Patent: August 6, 2024

Assignee: NVIDIA Corporation

Inventors: Shubhadeep Das, Sumit Bhattacharya, Ratin Kumar
System and method for enhancing vehicle occupant voice data privacy

Patent number: 12045353

Abstract: A microphone controller includes a processor programmed to receive voice input from one or more microphones to be utilized in a voice recognition session initiated by the microphone controller. Further the microphone controller includes a key store including one or more keys configured to encrypt the received voice input to an encrypted voice data.

Type: Grant

Filed: May 29, 2019

Date of Patent: July 23, 2024

Assignee: Denso Corporation

Inventors: Ameer Kashani, Gopalakrishnan Iyer
Surgical system with voice control

Patent number: 12042237

Abstract: A surgical system includes a plurality of voice sensors located in a surgical environment and configured to detect sound and generate a first plurality of signals. The surgical system also includes a position indicator, in proximity to a designated user, configured to indicate a first position of the designated user and generate a second signal representative of the first position. The surgical system further includes a processor configured to receive the first plurality of signals and the second signal and determine, based on the first plurality of signals, a second position. The processor is also configured to compare the detected sound with registered voice command of the designated user stored in a memory to verify the designated user's credentials, and send a command signal to a surgical instrument to carry out an operation related to the voice command based on at least one of the verification of the designated user's credentials, the first position and the second position.

Type: Grant

Filed: November 4, 2022

Date of Patent: July 23, 2024

Assignee: Cilag GmbH International

Inventors: David J. Cagle, Eric Smith, Jeffrey L. Aldridge, Mary E. Mootoo, Ryan Asher
Robot control device and robot control method

Patent number: 12023811

Abstract: The present disclosure provides a robot control device including a detection section that detects an external force applied to a movable part of a robot, on the basis of a parameter obtained from a joint driving the movable part, and a driving control section that controls an interaction of the robot, according to the detected external force. With this configuration, in a case where a user touches the robot, the robot can perform an interaction according to the touch.

Type: Grant

Filed: January 27, 2020

Date of Patent: July 2, 2024

Assignee: SONY GROUP CORPORATION

Inventors: Yusuke Kawabe, Kensuke Kitamura, Katsuhisa Ito
Method and system for a parametric speech synthesis

Patent number: 12020687

Abstract: Embodiments of the present systems and methods may provide techniques for synthesizing speech in any voice in any language in any accent. For example, in an embodiment, a text-to-speech conversion system may comprise a text converter adapted to convert input text to at least one phoneme selected from a plurality of phonemes stored in memory, a machine-learning model storing voice patterns for a plurality of individuals and adapted to receive the at least one phoneme and an identity of a speaker and to generate acoustic features for each phoneme, and a decoder adapted to receive the generated acoustic features and to generate a speech signal simulating a voice of the identified speaker in a language.

Type: Grant

Filed: February 6, 2023

Date of Patent: June 25, 2024

Assignee: Georgetown University

Inventors: Joe Garman, Ophir Frieder
Multimodal intent entity resolver

Patent number: 12020695

Abstract: A method comprises receiving from an input device, a capture of user action as an initial command; interpreting the initial command into an interpreted command; generating a first set of modified commands that are based on the interpreted command, including: a first modified command that has a phonetic similarity to the interpreted command within a certain threshold, and a second modified command that is semantically related to an earlier command; transmitting, to an output device, the first set of modified commands; receiving a response to a group of commands including the first set of modified commands; recording an identifier of an input device from which the response was received and a type of the response in a log; when the response includes acknowledging a specific command of the group of commands as an accepted command, executing, the accepted command; otherwise, generating a second set of modified commands.

Type: Grant

Filed: February 28, 2023

Date of Patent: June 25, 2024

Assignee: Merlyn Mind, Inc.

Inventors: Aditya Vempaty, Ravindranath Kokku, Tamer Abuelsaad, Sharad C. Sundararajan, Satyanarayana Nitta
Method for controlling a plurality of robot effectors

Patent number: 12011828

Abstract: A method for controlling effectors of a robot by means of primitives made up of parameterizable coded functions, the primitives being activated conditionally by actions selected by an action selection system, the method based on associating coded objects with a sequence of characters corresponding to their semantic description, and comprising: i. a semantic description of the coded objects stored in the memory, made up of a string of characters representing a perception function of the robot and of another string of characters representing a perceived object, ii. a semantic description of the primitives, made up of a string of characters representing a possible action of the robot and of another, optional string of characters representing the optional parameters of this action, iii. a semantic description of rules made up of the combination of a string of characters representing the associated context and another string of characters representing the associated action.

Type: Grant

Filed: April 26, 2019

Date of Patent: June 18, 2024

Assignee: SPOON

Inventors: Jérôme Monceaux, Thibault Hervier, Aymeric Masurelle
Intercepting and challenging unwanted phone calls

Patent number: 12003667

Abstract: A call challenger can receive a user input from a called party identity to opt-in to a call challenge service, and a second user input of a keyword. When the call challenger receives a call directed to a user equipment of the called party identity, the call challenger can prompt the calling party to provide an audible response. In response to a receipt of the audible response, the call challenger can convert the audible response to a text. The call challenger can compare the text with the keyword to determine if there is a sufficient match. In response to the determining the output of the comparing does not satisfy a threshold match score, the call challenger can prevent the call from connecting with the user equipment.

Type: Grant

Filed: January 10, 2023

Date of Patent: June 4, 2024

Assignees: AT&T Intellectual Property I, L.P., AT&T Mobility II LLC

Inventors: Sheldon Meredith, Brandon Hilliard, Zachary Meredith
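The comparison between the transcribed response and the stored keyword can be sketched with a string-similarity score and a threshold. The abstract does not say how the match score is computed; the use of `difflib.SequenceMatcher`, the function names, and the 0.8 threshold below are assumptions for illustration.

```python
from difflib import SequenceMatcher


def match_score(spoken_text: str, keyword: str) -> float:
    """Similarity in [0, 1] between the transcribed response and the stored keyword."""
    a = spoken_text.strip().lower()
    b = keyword.strip().lower()
    return SequenceMatcher(None, a, b).ratio()


def should_connect(spoken_text: str, keyword: str, threshold: float = 0.8) -> bool:
    """Connect the call only when the score meets the (assumed) threshold."""
    return match_score(spoken_text, keyword) >= threshold
```

A fuzzy score rather than exact equality leaves some room for transcription errors in the caller's spoken response.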
System and method for voice actuated configuration of a controlling device

Patent number: 11996100

Abstract: A speech recognition engine is provided voice data indicative of at least a brand of a target appliance. The speech recognition engine uses the voice data indicative of at least a brand of the target appliance to identify within a library of codesets at least one codeset that is cross-referenced to the brand of the target appliance. The at least one codeset so identified is then caused to be provisioned to the controlling device for use in commanding functional operations of the target appliance.

Type: Grant

Filed: January 13, 2022

Date of Patent: May 28, 2024

Assignee: Universal Electronics Inc.

Inventor: Jonathan Lim
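The lookup from a recognized brand to its cross-referenced codesets can be pictured as a keyed table. The sketch below assumes the speech recognizer has already produced the brand as text; the brand names, codeset identifiers, and function name are hypothetical.

```python
from typing import Dict, List

# Hypothetical library: brand name -> identifiers of cross-referenced codesets.
CODESET_LIBRARY: Dict[str, List[str]] = {
    "acme": ["C0001", "C0007"],
    "globex": ["C0042"],
}


def codesets_for_brand(
    recognized_brand: str, library: Dict[str, List[str]] = CODESET_LIBRARY
) -> List[str]:
    """Return the codesets cross-referenced to the recognized brand, if any."""
    # Normalize case and whitespace so recognizer output matches the library keys.
    key = " ".join(recognized_brand.lower().split())
    return list(library.get(key, []))
```

When a brand maps to several codesets, each candidate could be provisioned to the controlling device in turn until one operates the appliance.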
Speech recognition device, search device, speech recognition method, search method, and program

Patent number: 11990136

Abstract: It is intended to acquire a highly accurate speech recognition result for a subject of a conversation, while inhibiting an increase in the amount of calculation. A speech recognition device (10) according to the present invention includes a first speech recognition unit (11) that performs speech recognition processing using a first method on speech data of a conversation made by a plurality of speakers and outputs a speech recognition result for each of respective uttered speech segments of the plurality of speakers, a determination unit (13) that determines a subject segment based on a result of the speech recognition processing by the first speech recognition unit (11), and a second speech recognition unit (14) that performs speech recognition processing using a second method higher in accuracy than the first method on the speech data in the segment determined to be the subject segment and outputs a speech recognition result as a subject text.

Type: Grant

Filed: January 24, 2020

Date of Patent: May 21, 2024

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Tetsuo Amakasu, Kaname Kasahara, Takafumi Hikichi, Masayuki Sugizaki
Information providing method

Patent number: 11978477

Abstract: An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.

Type: Grant

Filed: July 13, 2023

Date of Patent: May 7, 2024

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Masaki Yamauchi, Nanami Fujiwara
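The two-part condition in this abstract (enough time elapsed since food preparation started, and dining-space volume at or above a threshold) reduces to a simple conjunction. The sketch below is illustrative only; the function name, the 30-minute and 60 dB defaults, and the decibel unit are assumptions, since the abstract gives no concrete values.

```python
from datetime import datetime, timedelta


def friendly_gathering_detected(
    prep_start: datetime,
    now: datetime,
    dining_volume_db: float,
    min_elapsed: timedelta = timedelta(minutes=30),  # assumed threshold
    min_volume_db: float = 60.0,                     # assumed threshold
) -> bool:
    """True when both conditions (i) and (ii) of the abstract hold."""
    elapsed_long_enough = (now - prep_start) >= min_elapsed
    loud_enough = dining_volume_db >= min_volume_db
    return elapsed_long_enough and loud_enough


start = datetime(2024, 5, 7, 18, 0)
```

For example, with preparation starting at 18:00, a 65 dB reading at 18:45 satisfies both conditions, while the same reading at 18:10 does not.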
