Systems Using Speech Recognizers (epo) Patents (Class 704/E15.045)
  • Patent number: 12260421
    Abstract: Provided herein are systems, methods and computer readable media for receiving consumer search data, aggregating by consumer and location, and utilizing the aggregated consumer search data in demand forecasting and relevance determination. An example method may include receiving consumer search data, the consumer search data indicative of search performed by a consumer, the consumer search data comprising one or more search terms and at least one of a consumer location or consumer identification information, storing the consumer search data for a predetermined time interval, and providing at least one of consumer aggregated search data to a relevance module for determining which of a plurality of promotions to present to a consumer at a second time or providing location aggregated search data to a demand forecasting module for utilization in forecasting promotion demand in a particular location.
    Type: Grant
    Filed: April 1, 2022
    Date of Patent: March 25, 2025
    Assignee: BYTEDANCE INC.
    Inventors: Greyson Gregory, Vincenzo Mannino, Alex Lester
  • Patent number: 12246676
    Abstract: To determine whether a user is authorized to make a particular audio request during navigation, a client device receives a request for navigation directions from a starting location to a destination location. The client device provides a set of navigation directions for traversing to the destination location along a route. During a navigation session, an audio request related to the route is received from a user. The client device determines an authorization level of the user based on the audio request, and provides a response to the request based on the authorization level of the user.
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: March 11, 2025
    Assignee: GOOGLE LLC
    Inventor: Matthew Sharifi
  • Patent number: 12240112
    Abstract: Apparatuses, systems, and techniques provide a policy that can be executed to cause a machine to move. In at least one embodiment, a first policy layer is provided to cause the machine to execute a first motion that causes the machine to accelerate to reach an unbiased state. A second policy layer is provided to cause the machine to execute a second motion without influencing the unbiased state to be reached by machine. The policy can comprise the first and second policy layers.
    Type: Grant
    Filed: April 26, 2022
    Date of Patent: March 4, 2025
    Assignee: NVIDIA Corporation
    Inventors: Nathan Donald Ratliff, Karl Van Wyk, Man Xie, Anqi Li, Muhammad Asif Rana
  • Patent number: 12243532
    Abstract: Techniques for configuring a speech processing system with a privacy mode that is associated with the identity of a user that activated the privacy mode are described. A user may speak an indication to have the speech processing system activate a privacy mode. When such an indication is detected by the speech processing system, the speech processing system determines an identity of the user, determines a unique system identifier associated with the user, and generates a privacy mode flag. The speech processing system then associates the privacy mode flag with the user's unique system identifier. The privacy mode flag indicates to components of the speech processing system that any data related to processing of the user's utterances should not be sent to long term storage, thus causing various components of the system to delete data once the respective component is finished processing with respect to an utterance of the user.
    Type: Grant
    Filed: November 22, 2023
    Date of Patent: March 4, 2025
    Assignee: Amazon Technologies, Inc.
    Inventor: Zhenhua Wang
  • Patent number: 12243537
    Abstract: Disclosed is a method of editing a speech recognition result, the method being performed by a computing device. The method may include: displaying a word list satisfying a predetermined condition based on text information generated by speech recognition; determining a target word within the word list; and displaying a region corresponding to the target word within the text information, in which the predetermined condition includes at least one of predetermined word information for each user account and predetermined threshold information associated with a frequency of occurrence of a word.
    Type: Grant
    Filed: August 22, 2023
    Date of Patent: March 4, 2025
    Assignee: ActionPower Corp.
    Inventors: Jihwa Lee, Hwanseok Choi, Jinsuk Park, Yunseop Kim, Woochan Jeong
  • Patent number: 12242477
    Abstract: In order to perform a semantic search based on a graph database, sets of nodes are selected from a plurality of nodes in a graph database. A set of nodes semantically matches a keyword in a natural language query. At least one target node is identified in the sets of nodes. A path is selected from candidate paths based on similarities between the candidate paths and a plurality of paths in the graph database. A graph query for retrieving information from the graph database is generated based on the selected path and the query target.
    Type: Grant
    Filed: September 7, 2021
    Date of Patent: March 4, 2025
    Assignee: International Business Machines Corporation
    Inventors: Teng Sun, Tong Liu, Si Tong Zhao, XueLiang Zhao, Frank Feng, Yu Zui Wy You, Zhong Fang Yuan
  • Patent number: 12235898
    Abstract: The present disclosure provides a technical solution of multi-modal chatting, which may provide response to user query by using multi-modal response in the interaction between chatbot and human beings, so that the expressing ways and the expressed content by the chatbot could be richer by using such response in a multi-modal way.
    Type: Grant
    Filed: February 20, 2024
    Date of Patent: February 25, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Nan Duan, Lei Ji, Ming Zhou
  • Patent number: 12220181
    Abstract: A camera control system may access surgical session data for a surgical session, the surgical session including performance of one or more operations by a computer-assisted surgical system. The camera control system may identify, based on the surgical session data, an event associated with the surgical session, and may determine, based on the surgical session data, a location associated with the event. In response to the determination of the location of the event, the camera control system may direct an automatic adjustment of a view of a camera to capture a specific view of the location associated with the event.
    Type: Grant
    Filed: January 28, 2020
    Date of Patent: February 11, 2025
    Assignee: Intuitive Surgical Operations, Inc.
    Inventors: Govinda Payyavula, Anthony M. Jarc
  • Patent number: 12220805
    Abstract: An information processing device including: an output control unit that controls an output from an interaction device to a user; an action evaluation unit that determines an action of the user performed in correspondence with an output of the interaction device; an emotion estimation unit that estimates an emotion of the user corresponding to the action of the user; and an information accumulation unit that accumulates the output of the interaction device, the action of the user, and the emotion of the user in association with each other as interaction information, in which the output control unit controls the output from the interaction device to the user based on the interaction information accumulated.
    Type: Grant
    Filed: February 14, 2020
    Date of Patent: February 11, 2025
    Assignee: SONY GROUP CORPORATION
    Inventors: Fumihiko Iida, Ryuichi Suzuki, Kuniaki Torii, Emika Kaneko
  • Patent number: 12216999
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for extracting entities from conversation transcript data. One of the methods includes obtaining a conversation transcript sequence, processing the conversation transcript sequence using a span detection neural network configured to generate a set of text token spans; and for each text token span: processing a span representation using an entity name neural network to generate an entity name probability distribution over a set of entity names, each probability in the entity name probability distribution representing a likelihood that a corresponding entity name is a name of the entity referenced by the text token span; and processing the span representation using an entity status neural network to generate an entity status probability distribution over a set of entity statuses.
    Type: Grant
    Filed: February 19, 2020
    Date of Patent: February 4, 2025
    Assignee: Google LLC
    Inventors: Nan Du, Linh Mai Tran, Yu-Hui Chen, Izhak Shafran
  • Patent number: 12216621
    Abstract: Compressing files is disclosed. An input file to be compressed is first aligned. During or prior to aligning the input file, hyperparameters are set, determined, or configured. The hyperparameters may be set, determined, or configured to achieve a particular performance characteristic. Aligning the file includes splitting the file into sequences that can be aligned. The result is a compression matrix, where each row of the matrix corresponds to part of the file. A consensus sequence id determined from the compression matrix. Using the consensus sequence, pointer pairs are generated. Each pointer pair identifies a subsequence of the consensus matrix. The compressed file includes the pointer pairs and the consensus sequence.
    Type: Grant
    Filed: April 12, 2022
    Date of Patent: February 4, 2025
    Assignee: Dell Products L.P.
    Inventors: Ofir Ezrielev, Ilan Buyum, Jehuda Shemer
  • Patent number: 12217746
    Abstract: A controller for a furniture drive includes an operating device which includes a speech controller. The speech controller includes a speech control subunit operatively connected to an adjustment drive, and a microphone interacting with the speech control subunit. The speech controller includes three speech control subunits arranged in the operating unit, with two of the speech control subunits forming actuators of adjustment functions and one of the speech control units forming an actuator of stopping the adjustment drive.
    Type: Grant
    Filed: April 9, 2019
    Date of Patent: February 4, 2025
    Assignee: Dewertokin Technology Group Co., Ltd
    Inventor: Armin Hille
  • Patent number: 12217749
    Abstract: Devices and techniques are generally described for targeting of devices. In various examples, a first natural language input comprising a first request to output a response may be received by an input device. A first component may determine first data associated with the input device. A plurality of devices associated with the first data may be determined. First state data describing a state of each device of the plurality of devices may be determined. A first device of the plurality of devices may be determined as a target device for the first request based at least in part on the first state data. The first device may be different from the input device. First instructions may be sent to the first device effective to cause the first device to display the first visual content.
    Type: Grant
    Filed: December 10, 2021
    Date of Patent: February 4, 2025
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ratika Anand, Zhen Hua, Trisha Hajela, Evan Victor Chang, Tom Vasella
  • Patent number: 12211308
    Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: January 28, 2025
    Assignee: Nvidia Corporation
    Inventors: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
  • Patent number: 12205122
    Abstract: A customer service system for providing customer service using a robot for customer service is provided. A first acquisition unit acquires first information regarding a topic provided by a person in charge of response in a negotiation between a visitor and the person in charge of response. A second acquisition unit acquires a detection result regarding speech and behavior of the visitor by the robot. An estimation unit estimates a reaction of the visitor on a basis of the detection result acquired by the second acquisition unit. An output unit outputs second information regarding the reaction estimated by the estimation unit.
    Type: Grant
    Filed: February 16, 2022
    Date of Patent: January 21, 2025
    Assignee: HONDA MOTOR CO., LTD.
    Inventors: Leo Ito, Jingwen Han, Tomohiro Tsukamoto, Eriko Okabe, Yuichi Kawasaki, Misato Fukushima
  • Patent number: 12205613
    Abstract: A communication system and a method can be configured to facilitate the performance of a conference. The system can include a conference organizer terminal and at least two participants' terminals each assigned to respective conference participants who each log in to start a conference on the communication system. The communication system can be configured to calculate a decision situation at a particular point in time of the ongoing conference by analyzing the views expressed by the conference participants during the conference and send data relating to the decision situation for that point in time to the conference organizer's terminal and/or other conference participant terminals for use in facilitating the conference. IN some embodiments, such data can be used to assist the conference participants' in recognizing when there is a consensus made on at least one decision to be made during the conference.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: January 21, 2025
    Assignee: RINGCENTRAL, INC.
    Inventors: Jurgen Totzke, Karl Klug
  • Patent number: 12169522
    Abstract: A method includes receiving a content feed that includes audio data corresponding to speech utterances and processing the content feed to generate a semantically-rich, structured document. The structured document includes a transcription of the speech utterances and includes a plurality of words each aligned with a corresponding audio segment of the audio data that indicates a time when the word was recognized in the audio data. During playback of the content feed, the method also includes receiving a query from a user requesting information contained in the content feed and processing, by a large language model, the query and the structured document to generate a response to the query. The response conveys the requested information contained in the content feed. The method also includes providing, for output from a user device associated with the user, the response to the query.
    Type: Grant
    Filed: March 2, 2023
    Date of Patent: December 17, 2024
    Assignee: Google LLC
    Inventors: Johan Schalkwyk, Francoise Beaufays
  • Patent number: 12159628
    Abstract: Techniques for facilitating natural language interactions with visual interactive content are described. During a build time, a system analyzes various websites and applications relating to a particular user goal to understand website and application navigation and information relating to the user goal. The learned information is used to store configuration data. During runtime, when a user request performance of an action, the system engages in a dialog with the user to complete the user's goal. The system uses the stored configuration data to determine actions to be performed at a website or application to complete the user's goal, and determines system responses to present to the user to facilitate completion of the goal. Such system responses may request information from the user, may inform the user of information displayed at the website or application, etc.
    Type: Grant
    Filed: December 10, 2021
    Date of Patent: December 3, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Amitabh Saikia, Devesh Mohan Pandey, Tagyoung Chung, Shanchan Wu, Chien-Wei Lin, Govindarajan Sundaram Thattai, Aishwarya Naresh Reganti, Arindam Mandal, Prakash Krishnan, Raefer Christopher Gabriel, Meyyappan Sundaram
  • Patent number: 12154554
    Abstract: A man-machine dialogue method, includes: for each round of a plurality of rounds of dialogue wherein each round includes dialogue information input by a user, determining semantic information corresponding to the dialogue information; determining a target slot position corresponding to an item indicated by the semantic information, establishing a new pre-order data structure including the target slot position when there is no established pre-order data structure including the target slot position; outputting reply information responsive to the dialogue information, wherein the reply information is configured to guide the user to input new dialogue information in a subsequent round of dialogue; and in a case that the dialogue information input by the user in the subsequent round includes a keyword for indicating ordering, performing an ordering operation according to a finally-established pre-order data structure.
    Type: Grant
    Filed: February 25, 2022
    Date of Patent: November 26, 2024
    Assignees: Beijing Xiaomi Mobile Software Co., Ltd., Beijing Xiaomi Pinecone Electronics Co., Ltd.
    Inventors: Zhennan Ming, Junjie Jiang
  • Patent number: 12145603
    Abstract: A driving assistance device executes processing relating to a behavior model of a vehicle. Detected information from the vehicle is input to a detected information inputter. An acquirer derives at least one of a travel difficulty level of a vehicle, a wakefulness level of a driver, and a driving proficiency level of the driver on the basis of the detected information that is input to the detected information inputter. A determiner determines whether or not to execute processing on the basis of at least one information item derived by the acquirer. If the determiner has made a determination to execute the processing, a processor executes the processing relating to the behavior model. It is assumed that the processor does not execute the processing relating to the behavior model if the determiner has made a determination to not execute the processing.
    Type: Grant
    Filed: May 11, 2023
    Date of Patent: November 19, 2024
    Assignee: PANASONIC AUTOMOTIVE SYSTEMS CO., LTD.
    Inventor: Koichi Emura
  • Patent number: 12141530
    Abstract: A computer-implemented method for learning unknown concepts during natural language processing is disclosed, including identifying a sentence associated with an unknown concept, selecting a first sequential set of sentences from a first document, including the sentence associated with the unknown concept, one sentence prior, and subsequent to the sentence associated with the unknown concept, selecting a second sequential set of sentences from a second document, including a sentence associated with a known concept, and one sentence prior and subsequent to the sentence associated with the known concept, comparing concepts associated with the first sequential set of sentences and second sequential set of sentences, determining whether an inference can be made between the unknown concept associated with the sentence from the first document and the sentence associated with the known concept associated with the sentence from the second document, and tagging the unknown concept associated with the known concept.
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: November 12, 2024
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Maulana Bachtiar, Thi Thanh Thao Lai, Wen Rui Siow, Yida Lee, Ronny Syarif, Cheranellore Vasudevan
  • Patent number: 12141882
    Abstract: Methods, systems, and media for determining and presenting information related to embedded sound recordings are provided.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: November 12, 2024
    Assignee: Google LLC
    Inventors: Kevin Song Zhu, Thomas Bugnon, Keith Wedelich, George Huang, Jacob Levine, Sha Chang, Julian Bill, Arthur Gaudriot, Nicholas Bryan Johnson, Vishaal Prasad
  • Patent number: 12143673
    Abstract: Systems and methods are provided for generating for display an indication of a segment of media content relevant to a voice communication. This may be accomplished by a media guidance application that monitors a voice communication between users. The media up guidance application determines that a first user is describing media content. In response to determining that the first user is describing the media content, the media guidance application retrieves media asset viewing history of the first user. The media guidance application determines, based on metadata of each media asset in the media asset viewing history of the first user and the voice communication, a media asset that the first user is describing. The media guidance application determines, based on metadata of the media asset, a segment of the media asset that the first user is describing. The media guidance application generates, for display, an indication of the segment.
    Type: Grant
    Filed: July 25, 2023
    Date of Patent: November 12, 2024
    Assignee: Adeia Guides Inc.
    Inventors: Michael K. McCarty, Glen E. Roe
  • Patent number: 12141100
    Abstract: A repository for quick retrieval of object(s) of a communication platform is described. Server(s) of the communication platform can receive, in association with a user interface, a request to associate an object with a repository. The server(s) can store an object identifier of the object in the repository and cause display of an object user interface element representative of the object to be presented in association with a repository user interface element representative of the repository. In response to receiving a selection of the object user interface element, the server(s) can retrieve the object using the object identifier and cause the object to be presented, in the user interface with contextual data, wherein the contextual data comprises other object(s) associated with the object.
    Type: Grant
    Filed: April 9, 2021
    Date of Patent: November 12, 2024
    Assignee: Salesforce, Inc.
    Inventors: Jason Hon-Son Wong, Julie Punturo, Elizabeth Anne Millikin, Zachery Floyd
  • Patent number: 12130965
    Abstract: Control systems and methods are provided that utilize a device, which can be worn by a user, to enable the user to enter control commands for causing a controller to control one or more electronic devices in a local network, such as a Wi-Fi system. A local control system, according to one implementation, includes a smart ring configured to obtain movement information related to one or more movements of the smart ring while a user is wearing the smart ring. The local control system also includes a controller device configured to communicate with the smart ring using Bluetooth or Wi-Fi signals. Characteristics of the movement information can be translated in order to obtain one or more control commands. The controller device is configured to control one or more aspects of one or more electronic devices based on the one or more control commands.
    Type: Grant
    Filed: July 7, 2022
    Date of Patent: October 29, 2024
    Assignee: PLUME DESIGN, INC.
    Inventors: Zhicheng Qiu, William J. McFarland
  • Patent number: 12128908
    Abstract: Systems and methods for assisting an operator in operating vehicle controls are disclosed herein. One embodiment detects that an operator is touching a control in a vehicle and automatically takes, in response to the operator touching the control, one or more actions to assist the operator with regard to the vehicle being a left-hand-drive vehicle or a right-hand-drive vehicle.
    Type: Grant
    Filed: October 24, 2022
    Date of Patent: October 29, 2024
    Assignee: Woven by Toyota, Inc.
    Inventors: Manuel Ludwig Kuehner, Hiroshi Yasuda
  • Patent number: 12121812
    Abstract: An audio signal processing method, an audio signal processing apparatus, a terminal, and a storage medium are provided. In the method, audio signal in a first period of time is collected. A reference sound volume is determined based on the collected audio signal. Audio signal sampling is performed on the audio signal at multiple audio sampling points in a second period of time to obtain multiple audio sampling signals. In a case of determining that the multiple audio sampling signals meet a predetermined condition based on the reference sound volume, a basis sound volume is determined based on the multiple audio sampling signals. The basis sound volume is used for controlling a display effect of a target object in a virtual scene, and different basis sound volumes correspond to different display effects of the target object.
    Type: Grant
    Filed: March 11, 2020
    Date of Patent: October 22, 2024
    Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.
    Inventor: Wei Zheng
  • Patent number: 12118846
    Abstract: Devices, systems and methods are provided. A device may include a gesture input device to detect gesture inputs performed by a user, a processor circuit, and a memory coupled to the processor circuit. The memory includes machine-readable instructions that, when executed by the processor circuit, cause the processor circuit to receive a first gesture input value from the first gesture input device and that corresponds to a user-specific gesture that the user performs, associate the first gesture input value with a first gaming device operation to be performed by the gaming device, receive the first gesture input value that is associated with the first gaming device operation, and responsive to receiving the first gesture input value that is associated with the first gaming device operation, cause the gaming device to perform the first gaming device operation.
    Type: Grant
    Filed: April 16, 2021
    Date of Patent: October 15, 2024
    Assignee: IGT
    Inventors: David Small, David Froy, Jr., Michael Russ
  • Patent number: 12118999
    Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
    Type: Grant
    Filed: August 7, 2023
    Date of Patent: October 15, 2024
    Assignee: Apple Inc.
    Inventors: Philippe P. Piernot, Justin G. Binder
  • Patent number: 12112095
    Abstract: Provided is a device for executing an application including a graphics user interface (GUI) for receiving an input value of an input field, the device including an audio output unit, a user input unit receiving a user input to request execution of the application, and a control unit configured to output, through the audio output unit, an audio signal indicating an induced inquiry corresponding to the input field, based on whether the user input is a voice input, to receive a voice input indicating a response to the induced inquiry, and to execute the application by setting an input value for the input field based on the voice input indicating the response to the induced inquiry.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: October 8, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Dong-hyeon Lee, Se-chun Kang, Yu-bin Seo, He-jung Yang
  • Patent number: 12112651
    Abstract: Delivering multimedia content for academic curriculum, over a network, to a user. The method includes a multimedia content delivery system identifying user attributes for the user, where the user is connected to the multimedia content delivery system over a network. The method further includes the multimedia content delivery system identifying attributes of a plurality of multimedia assets. Based on the identified user attributes and the identified attributes of the plurality of multimedia assets, the method includes creating a multimedia offering for delivery to the user. The multimedia offering satisfies a curriculum requirement specific to the user based on the user attributes. The multimedia offering is delivered over the network to the user.
    Type: Grant
    Filed: April 14, 2022
    Date of Patent: October 8, 2024
    Assignee: Western Governors University
    Inventors: Jon Morley, Mike Hassett, Adel Lelo, Jerry Damon Jasperson, Timothy Andrus, Brandon Karratti
  • Patent number: 12093459
    Abstract: An information processing apparatus comprises a processor configured to execute a program so as to: acquire viewpoint information, motion information and object information; process an object image wherein: the object image is an image representing a virtual substance and is updated based on the viewpoint information and the motion information; calculate a position when superimposing a background image and the object image, wherein: the background image is an image according to the field of view of the user; generate visual information superimposing the background image and the object image based on the position calculated, and output the generated visual information to a display device; and generate tactile information for an ultrasound generator irradiating the user with ultrasound corresponding to the object based on the position calculated, the motion information and the object information, and output the generated tactile information to the ultrasound generator.
    Type: Grant
    Filed: December 8, 2022
    Date of Patent: September 17, 2024
    Assignee: THE UNIVERSITY OF TOKYO
    Inventors: Mamoru Miyawaki, Hiroyuki Shinoda, Takaaki Kamigaki, Mitsuru Ito, Tao Morisaki, Shun Suzuki, Atsushi Matsubayashi, Ryoya Onishi, Yasutoshi Makino, Seki Inoue
  • Patent number: 12094469
    Abstract: A method for recognizing speech comprises: respectively setting initial values of a Chinese character coefficient and a Pinyin coefficient, generating a Chinese character mapping function according to the initial value of the Chinese character coefficient, and generating a Pinyin mapping function according to the initial value of the Pinyin coefficient (S101); training the Chinese character mapping function and the Pinyin mapping function using a plurality of preset training samples, calculating training results as parameters of a joint loss function, and generating a target mapping function according to calculation results (S102); and recognizing, according to the target mapping function, speech to be recognized, so as to obtain a Chinese character recognition result and a Pinyin recognition result of the speech to be recognized (S103). The method reduces the cost of speech recognition while ensuring the accuracy of speech recognition.
    Type: Grant
    Filed: March 3, 2020
    Date of Patent: September 17, 2024
    Assignee: JINGDONG TECHNOLOGY HOLDING CO., LTD.
    Inventors: Li Fu, Xiaoxiao Li
  • Patent number: 12089014
    Abstract: A remote control with a microphone subsystem comprising a pair of internal microphones is shown and described. When connected to a remote-control base station that is itself connected to an external power source, the microphone subsystem is continuously energized by the external power source, and the pair of internal microphones operate as far field microphones that receive oral commands uttered by a user from a distance. When the remote control is removed from the base, the microphone subsystem is configured for selective connection to an internal power source by actuating a user control on the remote control. In the external power source mode, signals from both microphones are digitally processed to provide a far-field microphone array with beam forming. In the direct current mode, only one microphone's signals are digitally processed as a simple monaural signal (or they are not digitally processed).
    Type: Grant
    Filed: April 7, 2022
    Date of Patent: September 10, 2024
    Assignee: Vizio, Inc.
    Inventors: W. Leo Hoarty, Glen Gihong Kim
  • Patent number: 12080275
    Abstract: Systems for automatic speech recognition and/or natural language understanding automatically learn new words by finding subsequences of phonemes that, if they were a new word, would enable a successful tokenization of a phoneme sequence. Systems can learn alternate pronunciations of words by finding phoneme sequences with a small edit distance to existing pronunciations. Systems can learn the part of speech of words by finding part-of-speech variations that would enable parses by syntactic grammars. Systems can learn what types of entities a word describes by finding sentences that could be parsed by a semantic grammar but for the words not being on an entity list.
    Type: Grant
    Filed: January 11, 2021
    Date of Patent: September 3, 2024
    Assignee: SoundHound AI IP, LLC.
    Inventor: Anton V. Relin
  • Patent number: 12079261
    Abstract: A computer-implemented method for presenting relevant information to a customer service representative of a business may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; determining one or more keywords from the text stream; comparing the one or more keywords with a history of keywords that have previously been searched; and/or searching a database for information related to the one or more keywords that have not been previously searched. As a result of the keyword search, information about topics that the customer is interested in, may be located and displayed on a customer service representative display to facilitate the customer service representative timely relaying the information found by the keyword search to enhance the customer experience. Exemplary keywords may relate to insurance and financial services, such as “auto,” “home,” “life,” “insurance,” or “vehicle loan.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: September 3, 2024
    Assignee: State Farm Mutual Automobile Insurance Company
    Inventor: Sylvia Hernandez
  • Patent number: 12080292
    Abstract: An electronic record voice assistant system can include one or more processors that receive audio data, apply a machine learning model to the audio data to generate speech data including at least one value, determine a state of an electronic record, and update one or more fields of the electronic record using the state and the at least one value.
    Type: Grant
    Filed: June 6, 2022
    Date of Patent: September 3, 2024
    Assignee: Bola Technologies, Inc.
    Inventors: Rushi M. Ganmukhi, Daniel Brownwood, Sidharth Malhotra, Augusto Monteiro Nobre Amanco
  • Patent number: 12073844
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: August 27, 2024
    Assignee: Google LLC
    Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
  • Patent number: 12065329
    Abstract: A method for controlling an elevator system comprises receiving media data, receiving at least one of control data and sensor data, transmitting the media data and at least one of the control data and sensor data via a real-time data communication network between a central controller adapted for controlling a drive of the elevator system and a local controller, the local controller being provided in one of an elevator car control panel and a door control panel, wherein the real-time data communication network is adapted for transmitting data packets while ensuring that a data packet is transmitted during a maximal transmission time.
    Type: Grant
    Filed: January 16, 2018
    Date of Patent: August 20, 2024
    Assignee: Inventio AG
    Inventor: Stefano Carriero
  • Patent number: 12057113
    Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.
    Type: Grant
    Filed: June 6, 2023
    Date of Patent: August 6, 2024
    Assignee: NVIDIA Corporation
    Inventors: Shubhadeep Das, Sumit Bhattacharya, Ratin Kumar
  • Patent number: 12045353
    Abstract: A microphone controller includes a processor programmed to receive voice input from one or more microphones to be utilized in a voice recognition session initiated by the microphone controller. Further the microphone controller includes a key store including one or more keys configured to encrypt the received voice input to an encrypted voice data.
    Type: Grant
    Filed: May 29, 2019
    Date of Patent: July 23, 2024
    Assignee: Denso Corporation
    Inventors: Ameer Kashani, Gopalakrishnan Iyer
  • Patent number: 12042237
    Abstract: A surgical system includes a plurality of voice sensors located in a surgical environment and configured to detect sound and generate a first plurality of signals. The surgical system also includes a position indicator, in proximity to a designated user, configured to indicate a first position of the designated user and generate a second signal representative of the first position. The surgical system further includes a processor configured to receive the first plurality of signals and the second signal and determine, based on the first plurality of signals, a second position. The processor is also configured to compare the detected sound with registered voice command of the designated user stored in a memory to verify the designated user's credentials, and send a command signal to a surgical instrument to carry out an operation related to the voice command based on at least one of the verification of the designated user's credentials, the first position and the second position.
    Type: Grant
    Filed: November 4, 2022
    Date of Patent: July 23, 2024
    Assignee: Cilag GmbH International
    Inventors: David J. Cagle, Eric Smith, Jeffrey L. Aldridge, Mary E. Mootoo, Ryan Asher
  • Patent number: 12023811
    Abstract: The present disclosure provides a robot control device including a detection section that detects an external force applied to a movable part of a robot, on the basis of a parameter obtained from a joint driving the movable part, and a driving control section that controls an interaction of the robot, according to the detected external force. With this configuration, in a case where a user touches the robot, the robot can perform an interaction according to the touch.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: July 2, 2024
    Assignee: SONY GROUP CORPORATION
    Inventors: Yusuke Kawabe, Kensuke Kitamura, Katsuhisa Ito
  • Patent number: 12020687
    Abstract: Embodiments of the present systems and methods may provide techniques for synthesizing speech in any voice in any language in any accent. For example, in an embodiment, a text-to-speech conversion system may comprise a text converter adapted to convert input text to at least one phoneme selected from a plurality of phonemes stored in memory, a machine-learning model storing voice patterns for a plurality of individuals and adapted to receive the at least one phoneme and an identity of a speaker and to generate acoustic features for each phoneme, and a decoder adapted to receive the generated acoustic features and to generate a speech signal simulating a voice of the identified speaker in a language.
    Type: Grant
    Filed: February 6, 2023
    Date of Patent: June 25, 2024
    Assignee: Georgetown University
    Inventors: Joe Garman, Ophir Frieder
  • Patent number: 12020695
    Abstract: A method comprises receiving from an input device, a capture of user action as an initial command; interpreting the initial command into an interpreted command; generating a first set of modified commands that are based on the interpreted command, including: a first modified command that has a phonetic similarity to the interpreted command within a certain threshold, and a second modified command that is semantically related to an earlier command; transmitting, to an output device, the first set of modified commands; receiving a response to a group of commands including the first set of modified commands; recording an identifier of an input device from which the response was received and a type of the response in a log; when the response includes acknowledging a specific command of the group of commands as an accepted command, executing, the accepted command; otherwise, generating a second set of modified commands.
    Type: Grant
    Filed: February 28, 2023
    Date of Patent: June 25, 2024
    Assignee: Merlyn Mind, Inc.
    Inventors: Aditya Vempaty, Ravindranath Kokku, Tamer Abuelsaad, Sharad C. Sundararajan, Satyanarayana Nitta
  • Patent number: 12011828
    Abstract: A method for controlling effectors of a robot by means of primitives made up of parameterizable coded functions, the primitives being activated conditionally by actions selected by an action selection system, the method based on associating coded objects with a sequence of characters corresponding to their semantic description, and comprising: i. a semantic description of the coded objects stored in the memory, made up of a string of characters representing a perception function of the robot and of another string of characters representing a perceived object, ii. a semantic description of the primitives, made up of a string of characters representing a possible action of the robot and of another, optional string of characters representing the optional parameters of this action, iii. a semantic description of rules made up of the combination of a string of characters representing the associated context and another string of characters representing the associated action.
    Type: Grant
    Filed: April 26, 2019
    Date of Patent: June 18, 2024
    Assignee: SPOON
    Inventors: Jérôme Monceaux, Thibault Hervier, Aymeric Masurelle
  • Patent number: 12003667
    Abstract: A call challenger can receive a user input from a called party identity to opt-in to a call challenge service, and a second user input of a keyword. When the call challenger receives a call directed to a user equipment of the called party identity, the call challenger can prompt the calling party to provide an audible response. In response to a receipt of the audible response, the call challenger can convert the audible response to a text. The call challenger can compare the text with the keyword to determine if there is a sufficient match. In response to the determining the output of the comparing does not satisfy a threshold match score, the call challenger can prevent the call from connecting with the user equipment.
    Type: Grant
    Filed: January 10, 2023
    Date of Patent: June 4, 2024
    Assignees: AT&T Intellectual Property I, L.P., AT&T Mobility II LLC
    Inventors: Sheldon Meredith, Brandon Hilliard, Zachary Meredith
  • Patent number: 11996100
    Abstract: A speech recognition engine is provided voice data indicative of at least a brand of a target appliance. The speech recognition engine uses the voice data indicative of at least a brand of the target appliance to identify within a library of codesets at least one codeset that is cross-referenced to the brand of the target appliance. The at least one codeset so identified is then caused to be provisioned to the controlling device for use in commanding functional operations of the target appliance.
    Type: Grant
    Filed: January 13, 2022
    Date of Patent: May 28, 2024
    Assignee: Universal Electronics Inc.
    Inventor: Jonathan Lim
  • Patent number: 11990136
    Abstract: It is intended to acquire a highly accurate speech recognition result for a subject of a conversation, while inhibiting an increase in the amount of calculation. A speech recognition device (10) according to the present invention includes a first speech recognition unit (11) that performs speech recognition processing using a first method on speech data of a conversation made by a plurality of speakers and outputs a speech recognition result for each of respective uttered speech segments of the plurality of speakers, a determination unit (13) that determines a subject segment based on a result of the speech recognition processing by the first speech recognition unit 11, and a second speech recognition unit (14) that performs speech recognition processing using a second method higher in accuracy than the first method on the speech data in the segment determined to be the subject segment and outputs a speech recognition result as a subject text.
    Type: Grant
    Filed: January 24, 2020
    Date of Patent: May 21, 2024
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Tetsuo Amakasu, Kaname Kasahara, Takafumi Hikichi, Masayuki Sugizaki
  • Patent number: 11978477
    Abstract: An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.
    Type: Grant
    Filed: July 13, 2023
    Date of Patent: May 7, 2024
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Masaki Yamauchi, Nanami Fujiwara