Systems Using Speech Recognizers (epo) Patents (Class 704/E15.045)
-
Patent number: 12260421Abstract: Provided herein are systems, methods and computer readable media for receiving consumer search data, aggregating by consumer and location, and utilizing the aggregated consumer search data in demand forecasting and relevance determination. An example method may include receiving consumer search data, the consumer search data indicative of search performed by a consumer, the consumer search data comprising one or more search terms and at least one of a consumer location or consumer identification information, storing the consumer search data for a predetermined time interval, and providing at least one of consumer aggregated search data to a relevance module for determining which of a plurality of promotions to present to a consumer at a second time or providing location aggregated search data to a demand forecasting module for utilization in forecasting promotion demand in a particular location.Type: GrantFiled: April 1, 2022Date of Patent: March 25, 2025Assignee: BYTEDANCE INC.Inventors: Greyson Gregory, Vincenzo Mannino, Alex Lester
-
Patent number: 12246676Abstract: To determine whether a user is authorized to make a particular audio request during navigation, a client device receives a request for navigation directions from a starting location to a destination location. The client device provides a set of navigation directions for traversing to the destination location along a route. During a navigation session, an audio request related to the route is received from a user. The client device determines an authorization level of the user based on the audio request, and provides a response to the request based on the authorization level of the user.Type: GrantFiled: June 23, 2021Date of Patent: March 11, 2025Assignee: GOOGLE LLCInventor: Matthew Sharifi
-
Patent number: 12240112Abstract: Apparatuses, systems, and techniques provide a policy that can be executed to cause a machine to move. In at least one embodiment, a first policy layer is provided to cause the machine to execute a first motion that causes the machine to accelerate to reach an unbiased state. A second policy layer is provided to cause the machine to execute a second motion without influencing the unbiased state to be reached by machine. The policy can comprise the first and second policy layers.Type: GrantFiled: April 26, 2022Date of Patent: March 4, 2025Assignee: NVIDIA CorporationInventors: Nathan Donald Ratliff, Karl Van Wyk, Man Xie, Anqi Li, Muhammad Asif Rana
-
Patent number: 12243532Abstract: Techniques for configuring a speech processing system with a privacy mode that is associated with the identity of a user that activated the privacy mode are described. A user may speak an indication to have the speech processing system activate a privacy mode. When such an indication is detected by the speech processing system, the speech processing system determines an identity of the user, determines a unique system identifier associated with the user, and generates a privacy mode flag. The speech processing system then associates the privacy mode flag with the user's unique system identifier. The privacy mode flag indicates to components of the speech processing system that any data related to processing of the user's utterances should not be sent to long term storage, thus causing various components of the system to delete data once the respective component is finished processing with respect to an utterance of the user.Type: GrantFiled: November 22, 2023Date of Patent: March 4, 2025Assignee: Amazon Technologies, Inc.Inventor: Zhenhua Wang
-
Patent number: 12243537Abstract: Disclosed is a method of editing a speech recognition result, the method being performed by a computing device. The method may include: displaying a word list satisfying a predetermined condition based on text information generated by speech recognition; determining a target word within the word list; and displaying a region corresponding to the target word within the text information, in which the predetermined condition includes at least one of predetermined word information for each user account and predetermined threshold information associated with a frequency of occurrence of a word.Type: GrantFiled: August 22, 2023Date of Patent: March 4, 2025Assignee: ActionPower Corp.Inventors: Jihwa Lee, Hwanseok Choi, Jinsuk Park, Yunseop Kim, Woochan Jeong
-
Patent number: 12242477Abstract: In order to perform a semantic search based on a graph database, sets of nodes are selected from a plurality of nodes in a graph database. A set of nodes semantically matches a keyword in a natural language query. At least one target node is identified in the sets of nodes. A path is selected from candidate paths based on similarities between the candidate paths and a plurality of paths in the graph database. A graph query for retrieving information from the graph database is generated based on the selected path and the query target.Type: GrantFiled: September 7, 2021Date of Patent: March 4, 2025Assignee: International Business Machines CorporationInventors: Teng Sun, Tong Liu, Si Tong Zhao, XueLiang Zhao, Frank Feng, Yu Zui Wy You, Zhong Fang Yuan
-
Patent number: 12235898Abstract: The present disclosure provides a technical solution of multi-modal chatting, which may provide response to user query by using multi-modal response in the interaction between chatbot and human beings, so that the expressing ways and the expressed content by the chatbot could be richer by using such response in a multi-modal way.Type: GrantFiled: February 20, 2024Date of Patent: February 25, 2025Assignee: Microsoft Technology Licensing, LLCInventors: Nan Duan, Lei Ji, Ming Zhou
-
Patent number: 12220181Abstract: A camera control system may access surgical session data for a surgical session, the surgical session including performance of one or more operations by a computer-assisted surgical system. The camera control system may identify, based on the surgical session data, an event associated with the surgical session, and may determine, based on the surgical session data, a location associated with the event. In response to the determination of the location of the event, the camera control system may direct an automatic adjustment of a view of a camera to capture a specific view of the location associated with the event.Type: GrantFiled: January 28, 2020Date of Patent: February 11, 2025Assignee: Intuitive Surgical Operations, Inc.Inventors: Govinda Payyavula, Anthony M. Jarc
-
Patent number: 12220805Abstract: An information processing device including: an output control unit that controls an output from an interaction device to a user; an action evaluation unit that determines an action of the user performed in correspondence with an output of the interaction device; an emotion estimation unit that estimates an emotion of the user corresponding to the action of the user; and an information accumulation unit that accumulates the output of the interaction device, the action of the user, and the emotion of the user in association with each other as interaction information, in which the output control unit controls the output from the interaction device to the user based on the interaction information accumulated.Type: GrantFiled: February 14, 2020Date of Patent: February 11, 2025Assignee: SONY GROUP CORPORATIONInventors: Fumihiko Iida, Ryuichi Suzuki, Kuniaki Torii, Emika Kaneko
-
Patent number: 12216999Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for extracting entities from conversation transcript data. One of the methods includes obtaining a conversation transcript sequence, processing the conversation transcript sequence using a span detection neural network configured to generate a set of text token spans; and for each text token span: processing a span representation using an entity name neural network to generate an entity name probability distribution over a set of entity names, each probability in the entity name probability distribution representing a likelihood that a corresponding entity name is a name of the entity referenced by the text token span; and processing the span representation using an entity status neural network to generate an entity status probability distribution over a set of entity statuses.Type: GrantFiled: February 19, 2020Date of Patent: February 4, 2025Assignee: Google LLCInventors: Nan Du, Linh Mai Tran, Yu-Hui Chen, Izhak Shafran
-
Patent number: 12216621Abstract: Compressing files is disclosed. An input file to be compressed is first aligned. During or prior to aligning the input file, hyperparameters are set, determined, or configured. The hyperparameters may be set, determined, or configured to achieve a particular performance characteristic. Aligning the file includes splitting the file into sequences that can be aligned. The result is a compression matrix, where each row of the matrix corresponds to part of the file. A consensus sequence id determined from the compression matrix. Using the consensus sequence, pointer pairs are generated. Each pointer pair identifies a subsequence of the consensus matrix. The compressed file includes the pointer pairs and the consensus sequence.Type: GrantFiled: April 12, 2022Date of Patent: February 4, 2025Assignee: Dell Products L.P.Inventors: Ofir Ezrielev, Ilan Buyum, Jehuda Shemer
-
Patent number: 12217746Abstract: A controller for a furniture drive includes an operating device which includes a speech controller. The speech controller includes a speech control subunit operatively connected to an adjustment drive, and a microphone interacting with the speech control subunit. The speech controller includes three speech control subunits arranged in the operating unit, with two of the speech control subunits forming actuators of adjustment functions and one of the speech control units forming an actuator of stopping the adjustment drive.Type: GrantFiled: April 9, 2019Date of Patent: February 4, 2025Assignee: Dewertokin Technology Group Co., LtdInventor: Armin Hille
-
Patent number: 12217749Abstract: Devices and techniques are generally described for targeting of devices. In various examples, a first natural language input comprising a first request to output a response may be received by an input device. A first component may determine first data associated with the input device. A plurality of devices associated with the first data may be determined. First state data describing a state of each device of the plurality of devices may be determined. A first device of the plurality of devices may be determined as a target device for the first request based at least in part on the first state data. The first device may be different from the input device. First instructions may be sent to the first device effective to cause the first device to display the first visual content.Type: GrantFiled: December 10, 2021Date of Patent: February 4, 2025Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Ratika Anand, Zhen Hua, Trisha Hajela, Evan Victor Chang, Tom Vasella
-
Patent number: 12211308Abstract: Interactions with virtual systems may be difficult when users inadvertently fail to provide sufficient information to proceed with their requests. Certain types of inputs, such as auditory inputs, may lack sufficient information to properly provide a response to the user. Additional information, such as image data, may enable user gestures or poses to supplement the auditory inputs to enable response generation without requesting additional information from users.Type: GrantFiled: August 31, 2021Date of Patent: January 28, 2025Assignee: Nvidia CorporationInventors: Sakthivel Sivaraman, Nishant Puri, Yuzhuo Ren, Atousa Torabi, Shubhadeep Das, Niranjan Avadhanam, Sumit Kumar Bhattacharya, Jason Roche
-
Patent number: 12205122Abstract: A customer service system for providing customer service using a robot for customer service is provided. A first acquisition unit acquires first information regarding a topic provided by a person in charge of response in a negotiation between a visitor and the person in charge of response. A second acquisition unit acquires a detection result regarding speech and behavior of the visitor by the robot. An estimation unit estimates a reaction of the visitor on a basis of the detection result acquired by the second acquisition unit. An output unit outputs second information regarding the reaction estimated by the estimation unit.Type: GrantFiled: February 16, 2022Date of Patent: January 21, 2025Assignee: HONDA MOTOR CO., LTD.Inventors: Leo Ito, Jingwen Han, Tomohiro Tsukamoto, Eriko Okabe, Yuichi Kawasaki, Misato Fukushima
-
Patent number: 12205613Abstract: A communication system and a method can be configured to facilitate the performance of a conference. The system can include a conference organizer terminal and at least two participants' terminals each assigned to respective conference participants who each log in to start a conference on the communication system. The communication system can be configured to calculate a decision situation at a particular point in time of the ongoing conference by analyzing the views expressed by the conference participants during the conference and send data relating to the decision situation for that point in time to the conference organizer's terminal and/or other conference participant terminals for use in facilitating the conference. IN some embodiments, such data can be used to assist the conference participants' in recognizing when there is a consensus made on at least one decision to be made during the conference.Type: GrantFiled: June 11, 2020Date of Patent: January 21, 2025Assignee: RINGCENTRAL, INC.Inventors: Jurgen Totzke, Karl Klug
-
Patent number: 12169522Abstract: A method includes receiving a content feed that includes audio data corresponding to speech utterances and processing the content feed to generate a semantically-rich, structured document. The structured document includes a transcription of the speech utterances and includes a plurality of words each aligned with a corresponding audio segment of the audio data that indicates a time when the word was recognized in the audio data. During playback of the content feed, the method also includes receiving a query from a user requesting information contained in the content feed and processing, by a large language model, the query and the structured document to generate a response to the query. The response conveys the requested information contained in the content feed. The method also includes providing, for output from a user device associated with the user, the response to the query.Type: GrantFiled: March 2, 2023Date of Patent: December 17, 2024Assignee: Google LLCInventors: Johan Schalkwyk, Francoise Beaufays
-
Patent number: 12159628Abstract: Techniques for facilitating natural language interactions with visual interactive content are described. During a build time, a system analyzes various websites and applications relating to a particular user goal to understand website and application navigation and information relating to the user goal. The learned information is used to store configuration data. During runtime, when a user request performance of an action, the system engages in a dialog with the user to complete the user's goal. The system uses the stored configuration data to determine actions to be performed at a website or application to complete the user's goal, and determines system responses to present to the user to facilitate completion of the goal. Such system responses may request information from the user, may inform the user of information displayed at the website or application, etc.Type: GrantFiled: December 10, 2021Date of Patent: December 3, 2024Assignee: Amazon Technologies, Inc.Inventors: Amitabh Saikia, Devesh Mohan Pandey, Tagyoung Chung, Shanchan Wu, Chien-Wei Lin, Govindarajan Sundaram Thattai, Aishwarya Naresh Reganti, Arindam Mandal, Prakash Krishnan, Raefer Christopher Gabriel, Meyyappan Sundaram
-
Patent number: 12154554Abstract: A man-machine dialogue method, includes: for each round of a plurality of rounds of dialogue wherein each round includes dialogue information input by a user, determining semantic information corresponding to the dialogue information; determining a target slot position corresponding to an item indicated by the semantic information, establishing a new pre-order data structure including the target slot position when there is no established pre-order data structure including the target slot position; outputting reply information responsive to the dialogue information, wherein the reply information is configured to guide the user to input new dialogue information in a subsequent round of dialogue; and in a case that the dialogue information input by the user in the subsequent round includes a keyword for indicating ordering, performing an ordering operation according to a finally-established pre-order data structure.Type: GrantFiled: February 25, 2022Date of Patent: November 26, 2024Assignees: Beijing Xiaomi Mobile Software Co., Ltd., Beijing Xiaomi Pinecone Electronics Co., Ltd.Inventors: Zhennan Ming, Junjie Jiang
-
Patent number: 12145603Abstract: A driving assistance device executes processing relating to a behavior model of a vehicle. Detected information from the vehicle is input to a detected information inputter. An acquirer derives at least one of a travel difficulty level of a vehicle, a wakefulness level of a driver, and a driving proficiency level of the driver on the basis of the detected information that is input to the detected information inputter. A determiner determines whether or not to execute processing on the basis of at least one information item derived by the acquirer. If the determiner has made a determination to execute the processing, a processor executes the processing relating to the behavior model. It is assumed that the processor does not execute the processing relating to the behavior model if the determiner has made a determination to not execute the processing.Type: GrantFiled: May 11, 2023Date of Patent: November 19, 2024Assignee: PANASONIC AUTOMOTIVE SYSTEMS CO., LTD.Inventor: Koichi Emura
-
Patent number: 12141530Abstract: A computer-implemented method for learning unknown concepts during natural language processing is disclosed, including identifying a sentence associated with an unknown concept, selecting a first sequential set of sentences from a first document, including the sentence associated with the unknown concept, one sentence prior, and subsequent to the sentence associated with the unknown concept, selecting a second sequential set of sentences from a second document, including a sentence associated with a known concept, and one sentence prior and subsequent to the sentence associated with the known concept, comparing concepts associated with the first sequential set of sentences and second sequential set of sentences, determining whether an inference can be made between the unknown concept associated with the sentence from the first document and the sentence associated with the known concept associated with the sentence from the second document, and tagging the unknown concept associated with the known concept.Type: GrantFiled: June 9, 2021Date of Patent: November 12, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Maulana Bachtiar, Thi Thanh Thao Lai, Wen Rui Siow, Yida Lee, Ronny Syarif, Cheranellore Vasudevan
-
Patent number: 12141882Abstract: Methods, systems, and media for determining and presenting information related to embedded sound recordings are provided.Type: GrantFiled: December 11, 2019Date of Patent: November 12, 2024Assignee: Google LLCInventors: Kevin Song Zhu, Thomas Bugnon, Keith Wedelich, George Huang, Jacob Levine, Sha Chang, Julian Bill, Arthur Gaudriot, Nicholas Bryan Johnson, Vishaal Prasad
-
Patent number: 12143673Abstract: Systems and methods are provided for generating for display an indication of a segment of media content relevant to a voice communication. This may be accomplished by a media guidance application that monitors a voice communication between users. The media up guidance application determines that a first user is describing media content. In response to determining that the first user is describing the media content, the media guidance application retrieves media asset viewing history of the first user. The media guidance application determines, based on metadata of each media asset in the media asset viewing history of the first user and the voice communication, a media asset that the first user is describing. The media guidance application determines, based on metadata of the media asset, a segment of the media asset that the first user is describing. The media guidance application generates, for display, an indication of the segment.Type: GrantFiled: July 25, 2023Date of Patent: November 12, 2024Assignee: Adeia Guides Inc.Inventors: Michael K. McCarty, Glen E. Roe
-
Patent number: 12141100Abstract: A repository for quick retrieval of object(s) of a communication platform is described. Server(s) of the communication platform can receive, in association with a user interface, a request to associate an object with a repository. The server(s) can store an object identifier of the object in the repository and cause display of an object user interface element representative of the object to be presented in association with a repository user interface element representative of the repository. In response to receiving a selection of the object user interface element, the server(s) can retrieve the object using the object identifier and cause the object to be presented, in the user interface with contextual data, wherein the contextual data comprises other object(s) associated with the object.Type: GrantFiled: April 9, 2021Date of Patent: November 12, 2024Assignee: Salesforce, Inc.Inventors: Jason Hon-Son Wong, Julie Punturo, Elizabeth Anne Millikin, Zachery Floyd
-
Patent number: 12130965Abstract: Control systems and methods are provided that utilize a device, which can be worn by a user, to enable the user to enter control commands for causing a controller to control one or more electronic devices in a local network, such as a Wi-Fi system. A local control system, according to one implementation, includes a smart ring configured to obtain movement information related to one or more movements of the smart ring while a user is wearing the smart ring. The local control system also includes a controller device configured to communicate with the smart ring using Bluetooth or Wi-Fi signals. Characteristics of the movement information can be translated in order to obtain one or more control commands. The controller device is configured to control one or more aspects of one or more electronic devices based on the one or more control commands.Type: GrantFiled: July 7, 2022Date of Patent: October 29, 2024Assignee: PLUME DESIGN, INC.Inventors: Zhicheng Qiu, William J. McFarland
-
Patent number: 12128908Abstract: Systems and methods for assisting an operator in operating vehicle controls are disclosed herein. One embodiment detects that an operator is touching a control in a vehicle and automatically takes, in response to the operator touching the control, one or more actions to assist the operator with regard to the vehicle being a left-hand-drive vehicle or a right-hand-drive vehicle.Type: GrantFiled: October 24, 2022Date of Patent: October 29, 2024Assignee: Woven by Toyota, Inc.Inventors: Manuel Ludwig Kuehner, Hiroshi Yasuda
-
Patent number: 12121812Abstract: An audio signal processing method, an audio signal processing apparatus, a terminal, and a storage medium are provided. In the method, audio signal in a first period of time is collected. A reference sound volume is determined based on the collected audio signal. Audio signal sampling is performed on the audio signal at multiple audio sampling points in a second period of time to obtain multiple audio sampling signals. In a case of determining that the multiple audio sampling signals meet a predetermined condition based on the reference sound volume, a basis sound volume is determined based on the multiple audio sampling signals. The basis sound volume is used for controlling a display effect of a target object in a virtual scene, and different basis sound volumes correspond to different display effects of the target object.Type: GrantFiled: March 11, 2020Date of Patent: October 22, 2024Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.Inventor: Wei Zheng
-
Patent number: 12118846Abstract: Devices, systems and methods are provided. A device may include a gesture input device to detect gesture inputs performed by a user, a processor circuit, and a memory coupled to the processor circuit. The memory includes machine-readable instructions that, when executed by the processor circuit, cause the processor circuit to receive a first gesture input value from the first gesture input device and that corresponds to a user-specific gesture that the user performs, associate the first gesture input value with a first gaming device operation to be performed by the gaming device, receive the first gesture input value that is associated with the first gaming device operation, and responsive to receiving the first gesture input value that is associated with the first gaming device operation, cause the gaming device to perform the first gaming device operation.Type: GrantFiled: April 16, 2021Date of Patent: October 15, 2024Assignee: IGTInventors: David Small, David Froy, Jr., Michael Russ
-
Patent number: 12118999Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.Type: GrantFiled: August 7, 2023Date of Patent: October 15, 2024Assignee: Apple Inc.Inventors: Philippe P. Piernot, Justin G. Binder
-
Patent number: 12112095Abstract: Provided is a device for executing an application including a graphics user interface (GUI) for receiving an input value of an input field, the device including an audio output unit, a user input unit receiving a user input to request execution of the application, and a control unit configured to output, through the audio output unit, an audio signal indicating an induced inquiry corresponding to the input field, based on whether the user input is a voice input, to receive a voice input indicating a response to the induced inquiry, and to execute the application by setting an input value for the input field based on the voice input indicating the response to the induced inquiry.Type: GrantFiled: February 28, 2018Date of Patent: October 8, 2024Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Dong-hyeon Lee, Se-chun Kang, Yu-bin Seo, He-jung Yang
-
Patent number: 12112651Abstract: Delivering multimedia content for academic curriculum, over a network, to a user. The method includes a multimedia content delivery system identifying user attributes for the user, where the user is connected to the multimedia content delivery system over a network. The method further includes the multimedia content delivery system identifying attributes of a plurality of multimedia assets. Based on the identified user attributes and the identified attributes of the plurality of multimedia assets, the method includes creating a multimedia offering for delivery to the user. The multimedia offering satisfies a curriculum requirement specific to the user based on the user attributes. The multimedia offering is delivered over the network to the user.Type: GrantFiled: April 14, 2022Date of Patent: October 8, 2024Assignee: Western Governors UniversityInventors: Jon Morley, Mike Hassett, Adel Lelo, Jerry Damon Jasperson, Timothy Andrus, Brandon Karratti
-
Patent number: 12093459Abstract: An information processing apparatus comprises a processor configured to execute a program so as to: acquire viewpoint information, motion information and object information; process an object image wherein: the object image is an image representing a virtual substance and is updated based on the viewpoint information and the motion information; calculate a position when superimposing a background image and the object image, wherein: the background image is an image according to the field of view of the user; generate visual information superimposing the background image and the object image based on the position calculated, and output the generated visual information to a display device; and generate tactile information for an ultrasound generator irradiating the user with ultrasound corresponding to the object based on the position calculated, the motion information and the object information, and output the generated tactile information to the ultrasound generator.Type: GrantFiled: December 8, 2022Date of Patent: September 17, 2024Assignee: THE UNIVERSITY OF TOKYOInventors: Mamoru Miyawaki, Hiroyuki Shinoda, Takaaki Kamigaki, Mitsuru Ito, Tao Morisaki, Shun Suzuki, Atsushi Matsubayashi, Ryoya Onishi, Yasutoshi Makino, Seki Inoue
-
Patent number: 12094469Abstract: A method for recognizing speech comprises: respectively setting initial values of a Chinese character coefficient and a Pinyin coefficient, generating a Chinese character mapping function according to the initial value of the Chinese character coefficient, and generating a Pinyin mapping function according to the initial value of the Pinyin coefficient (S101); training the Chinese character mapping function and the Pinyin mapping function using a plurality of preset training samples, calculating training results as parameters of a joint loss function, and generating a target mapping function according to calculation results (S102); and recognizing, according to the target mapping function, speech to be recognized, so as to obtain a Chinese character recognition result and a Pinyin recognition result of the speech to be recognized (S103). The method reduces the cost of speech recognition while ensuring the accuracy of speech recognition.Type: GrantFiled: March 3, 2020Date of Patent: September 17, 2024Assignee: JINGDONG TECHNOLOGY HOLDING CO., LTD.Inventors: Li Fu, Xiaoxiao Li
-
Patent number: 12089014Abstract: A remote control with a microphone subsystem comprising a pair of internal microphones is shown and described. When connected to a remote-control base station that is itself connected to an external power source, the microphone subsystem is continuously energized by the external power source, and the pair of internal microphones operate as far field microphones that receive oral commands uttered by a user from a distance. When the remote control is removed from the base, the microphone subsystem is configured for selective connection to an internal power source by actuating a user control on the remote control. In the external power source mode, signals from both microphones are digitally processed to provide a far-field microphone array with beam forming. In the direct current mode, only one microphone's signals are digitally processed as a simple monaural signal (or they are not digitally processed).Type: GrantFiled: April 7, 2022Date of Patent: September 10, 2024Assignee: Vizio, Inc.Inventors: W. Leo Hoarty, Glen Gihong Kim
-
Patent number: 12080275Abstract: Systems for automatic speech recognition and/or natural language understanding automatically learn new words by finding subsequences of phonemes that, if they were a new word, would enable a successful tokenization of a phoneme sequence. Systems can learn alternate pronunciations of words by finding phoneme sequences with a small edit distance to existing pronunciations. Systems can learn the part of speech of words by finding part-of-speech variations that would enable parses by syntactic grammars. Systems can learn what types of entities a word describes by finding sentences that could be parsed by a semantic grammar but for the words not being on an entity list.Type: GrantFiled: January 11, 2021Date of Patent: September 3, 2024Assignee: SoundHound AI IP, LLC.Inventor: Anton V. Relin
-
Patent number: 12079261Abstract: A computer-implemented method for presenting relevant information to a customer service representative of a business may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; determining one or more keywords from the text stream; comparing the one or more keywords with a history of keywords that have previously been searched; and/or searching a database for information related to the one or more keywords that have not been previously searched. As a result of the keyword search, information about topics that the customer is interested in, may be located and displayed on a customer service representative display to facilitate the customer service representative timely relaying the information found by the keyword search to enhance the customer experience. Exemplary keywords may relate to insurance and financial services, such as “auto,” “home,” “life,” “insurance,” or “vehicle loan.Type: GrantFiled: June 28, 2022Date of Patent: September 3, 2024Assignee: State Farm Mutual Automobile Insurance CompanyInventor: Sylvia Hernandez
-
Patent number: 12080292Abstract: An electronic record voice assistant system can include one or more processors that receive audio data, apply a machine learning model to the audio data to generate speech data including at least one value, determine a state of an electronic record, and update one or more fields of the electronic record using the state and the at least one value.Type: GrantFiled: June 6, 2022Date of Patent: September 3, 2024Assignee: Bola Technologies, Inc.Inventors: Rushi M. Ganmukhi, Daniel Brownwood, Sidharth Malhotra, Augusto Monteiro Nobre Amanco
-
Patent number: 12073844Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.Type: GrantFiled: October 1, 2020Date of Patent: August 27, 2024Assignee: Google LLCInventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
-
Patent number: 12065329Abstract: A method for controlling an elevator system comprises receiving media data, receiving at least one of control data and sensor data, transmitting the media data and at least one of the control data and sensor data via a real-time data communication network between a central controller adapted for controlling a drive of the elevator system and a local controller, the local controller being provided in one of an elevator car control panel and a door control panel, wherein the real-time data communication network is adapted for transmitting data packets while ensuring that a data packet is transmitted during a maximal transmission time.Type: GrantFiled: January 16, 2018Date of Patent: August 20, 2024Assignee: Inventio AGInventor: Stefano Carriero
-
Patent number: 12057113Abstract: In various examples, systems and methods of the present disclosure combine open and closed dialog systems into an intelligent dialog management system. A text query may be processed by a natural language understanding model trained to associate the text query with a domain tag, intent classification, and/or input slots. Using the domain tag, the natural language understanding model may identify information in the text query corresponding to input slots needed for answering the text query. The text query and related information may then be passed to a dialog manager to direct the text query to the proper domain dialog system. Responses retrieved from the domain dialog system may be provided to the user via text output and/or via a text to speech component of the dialog management system.Type: GrantFiled: June 6, 2023Date of Patent: August 6, 2024Assignee: NVIDIA CorporationInventors: Shubhadeep Das, Sumit Bhattacharya, Ratin Kumar
-
Patent number: 12045353Abstract: A microphone controller includes a processor programmed to receive voice input from one or more microphones to be utilized in a voice recognition session initiated by the microphone controller. Further the microphone controller includes a key store including one or more keys configured to encrypt the received voice input to an encrypted voice data.Type: GrantFiled: May 29, 2019Date of Patent: July 23, 2024Assignee: Denso CorporationInventors: Ameer Kashani, Gopalakrishnan Iyer
-
Patent number: 12042237Abstract: A surgical system includes a plurality of voice sensors located in a surgical environment and configured to detect sound and generate a first plurality of signals. The surgical system also includes a position indicator, in proximity to a designated user, configured to indicate a first position of the designated user and generate a second signal representative of the first position. The surgical system further includes a processor configured to receive the first plurality of signals and the second signal and determine, based on the first plurality of signals, a second position. The processor is also configured to compare the detected sound with registered voice command of the designated user stored in a memory to verify the designated user's credentials, and send a command signal to a surgical instrument to carry out an operation related to the voice command based on at least one of the verification of the designated user's credentials, the first position and the second position.Type: GrantFiled: November 4, 2022Date of Patent: July 23, 2024Assignee: Cilag GmbH InternationalInventors: David J. Cagle, Eric Smith, Jeffrey L. Aldridge, Mary E. Mootoo, Ryan Asher
-
Patent number: 12023811Abstract: The present disclosure provides a robot control device including a detection section that detects an external force applied to a movable part of a robot, on the basis of a parameter obtained from a joint driving the movable part, and a driving control section that controls an interaction of the robot, according to the detected external force. With this configuration, in a case where a user touches the robot, the robot can perform an interaction according to the touch.Type: GrantFiled: January 27, 2020Date of Patent: July 2, 2024Assignee: SONY GROUP CORPORATIONInventors: Yusuke Kawabe, Kensuke Kitamura, Katsuhisa Ito
-
Patent number: 12020687Abstract: Embodiments of the present systems and methods may provide techniques for synthesizing speech in any voice in any language in any accent. For example, in an embodiment, a text-to-speech conversion system may comprise a text converter adapted to convert input text to at least one phoneme selected from a plurality of phonemes stored in memory, a machine-learning model storing voice patterns for a plurality of individuals and adapted to receive the at least one phoneme and an identity of a speaker and to generate acoustic features for each phoneme, and a decoder adapted to receive the generated acoustic features and to generate a speech signal simulating a voice of the identified speaker in a language.Type: GrantFiled: February 6, 2023Date of Patent: June 25, 2024Assignee: Georgetown UniversityInventors: Joe Garman, Ophir Frieder
-
Patent number: 12020695Abstract: A method comprises receiving from an input device, a capture of user action as an initial command; interpreting the initial command into an interpreted command; generating a first set of modified commands that are based on the interpreted command, including: a first modified command that has a phonetic similarity to the interpreted command within a certain threshold, and a second modified command that is semantically related to an earlier command; transmitting, to an output device, the first set of modified commands; receiving a response to a group of commands including the first set of modified commands; recording an identifier of an input device from which the response was received and a type of the response in a log; when the response includes acknowledging a specific command of the group of commands as an accepted command, executing, the accepted command; otherwise, generating a second set of modified commands.Type: GrantFiled: February 28, 2023Date of Patent: June 25, 2024Assignee: Merlyn Mind, Inc.Inventors: Aditya Vempaty, Ravindranath Kokku, Tamer Abuelsaad, Sharad C. Sundararajan, Satyanarayana Nitta
-
Patent number: 12011828Abstract: A method for controlling effectors of a robot by means of primitives made up of parameterizable coded functions, the primitives being activated conditionally by actions selected by an action selection system, the method based on associating coded objects with a sequence of characters corresponding to their semantic description, and comprising: i. a semantic description of the coded objects stored in the memory, made up of a string of characters representing a perception function of the robot and of another string of characters representing a perceived object, ii. a semantic description of the primitives, made up of a string of characters representing a possible action of the robot and of another, optional string of characters representing the optional parameters of this action, iii. a semantic description of rules made up of the combination of a string of characters representing the associated context and another string of characters representing the associated action.Type: GrantFiled: April 26, 2019Date of Patent: June 18, 2024Assignee: SPOONInventors: Jérôme Monceaux, Thibault Hervier, Aymeric Masurelle
-
Patent number: 12003667Abstract: A call challenger can receive a user input from a called party identity to opt-in to a call challenge service, and a second user input of a keyword. When the call challenger receives a call directed to a user equipment of the called party identity, the call challenger can prompt the calling party to provide an audible response. In response to a receipt of the audible response, the call challenger can convert the audible response to a text. The call challenger can compare the text with the keyword to determine if there is a sufficient match. In response to the determining the output of the comparing does not satisfy a threshold match score, the call challenger can prevent the call from connecting with the user equipment.Type: GrantFiled: January 10, 2023Date of Patent: June 4, 2024Assignees: AT&T Intellectual Property I, L.P., AT&T Mobility II LLCInventors: Sheldon Meredith, Brandon Hilliard, Zachary Meredith
-
Patent number: 11996100Abstract: A speech recognition engine is provided voice data indicative of at least a brand of a target appliance. The speech recognition engine uses the voice data indicative of at least a brand of the target appliance to identify within a library of codesets at least one codeset that is cross-referenced to the brand of the target appliance. The at least one codeset so identified is then caused to be provisioned to the controlling device for use in commanding functional operations of the target appliance.Type: GrantFiled: January 13, 2022Date of Patent: May 28, 2024Assignee: Universal Electronics Inc.Inventor: Jonathan Lim
-
Patent number: 11990136Abstract: It is intended to acquire a highly accurate speech recognition result for a subject of a conversation, while inhibiting an increase in the amount of calculation. A speech recognition device (10) according to the present invention includes a first speech recognition unit (11) that performs speech recognition processing using a first method on speech data of a conversation made by a plurality of speakers and outputs a speech recognition result for each of respective uttered speech segments of the plurality of speakers, a determination unit (13) that determines a subject segment based on a result of the speech recognition processing by the first speech recognition unit 11, and a second speech recognition unit (14) that performs speech recognition processing using a second method higher in accuracy than the first method on the speech data in the segment determined to be the subject segment and outputs a speech recognition result as a subject text.Type: GrantFiled: January 24, 2020Date of Patent: May 21, 2024Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Tetsuo Amakasu, Kaname Kasahara, Takafumi Hikichi, Masayuki Sugizaki
-
Patent number: 11978477Abstract: An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.Type: GrantFiled: July 13, 2023Date of Patent: May 7, 2024Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICAInventors: Masaki Yamauchi, Nanami Fujiwara