Distributed Recognition, e.g., in Client-Server Systems for Mobile Phones or Network Applications, etc. (EPO) Patents (Class 704/E15.047)
  • Patent number: 11967318
    Abstract: The present subject matter at least describes a method and a system (300, 1200) of performing speech recognition in an electronic device having an embedded speech recognizer. The method comprises receiving an input audio comprising speech at a device. In real time, at least one speech-recognition module is selected within at least one of the device and a server for recognition of at least a portion of the received speech based on criteria defined in terms of a) past performance of speech-recognition modules within the device and server; b) an orator of speech; and c) a quality of service associated with at least one of the device and a networking environment thereof. Based upon the selection of the server, output of the selected speech-recognition modules within the device is selected for processing by corresponding speech-recognition modules of the server. An uttered speech is determined within the input audio based on output of the selected speech-recognition modules of the device or the server.
    Type: Grant
    Filed: December 19, 2019
    Date of Patent: April 23, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jithendra Vepa, Periyasamy Paramasivam, Ramya Viswanathan, Rajesh Krishna Selvaraj Krishnan
  • Patent number: 11948564
    Abstract: Provided is an information processing device including a response control unit that controls a response to a user's utterance based on a first utterance interpretation result and a second utterance interpretation result. The first utterance interpretation result is a result of natural language understanding processing for an utterance text generated by automatic speech recognition processing based on the user's utterance and the second utterance interpretation result is an interpretation result acquired based on learning data in which the first utterance interpretation result and the utterance text used to acquire the first utterance interpretation result are associated with each other. The response control unit further controls the response to the user's utterance based on the second utterance interpretation result in a case where the second utterance interpretation result is acquired based on the user's utterance before acquisition of the first utterance interpretation result.
    Type: Grant
    Filed: March 13, 2019
    Date of Patent: April 2, 2024
    Assignee: SONY CORPORATION
    Inventors: Hiro Iwase, Yuhei Taki, Kunihito Sawai
  • Patent number: 11935517
    Abstract: A speech decoding method is performed by a computer device, the speech including a current audio frame and a previous audio frame. The method includes: obtaining a target token corresponding to a smallest decoding score from a first token list including first tokens obtained by decoding the previous audio frame, each first token including a state pair and a decoding score, the state pair being used for characterizing a correspondence between a first state of the first token in a first decoding network corresponding to a low-order language model and a second state of the first token in a second decoding network corresponding to a differential language model; determining pruning parameters according to the target token and an acoustic vector of the current audio frame when the current audio frame is decoded; and decoding the current audio frame according to the first token list, the pruning parameters, and the acoustic vector.
    Type: Grant
    Filed: March 3, 2021
    Date of Patent: March 19, 2024
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Yiheng Huang, Xiaozheng Jian, Liqiang He
  • Patent number: 11935539
    Abstract: A voice support server is used to provide voice control functionality to a third party application that does not natively support voice control functions. The voice support server implements a domain specific to the third party application that maintains a domain-specific language model (DLM) reflecting the functionality of the third party application. The DLM comprises a plurality of intent patterns corresponding to different commands and their possible variations that may be issued by the user, and maps each intent pattern to a corresponding action to be performed by the third party application. Received audio data is analyzed to determine one or more user utterances, which are transcribed and compared to the intent patterns of the DLM to identify an intent corresponding to the user utterance. The voice control module may then transmit instructions to the third party application to perform the action corresponding to the identified intent.
    Type: Grant
    Filed: January 24, 2020
    Date of Patent: March 19, 2024
    Assignee: Alan AI, Inc.
    Inventors: Andrey Ryabov, Anna Miroshnichenko, Evgeny Yusov, Alex Sotnikov
  • Patent number: 11915065
    Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.
    Type: Grant
    Filed: January 20, 2022
    Date of Patent: February 27, 2024
    Assignee: VMware, Inc.
    Inventors: Rusko Atanasov, Kalin Tsvetkov
  • Patent number: 11906613
    Abstract: An electronic device includes memory circuitry, interface circuitry, and processor circuitry. The processor circuitry is configured to transmit, to a plurality of electronic reference devices, a first signal, the first signal having a pulse width below a threshold. The processor circuitry is configured to determine, based on the received second signals and at least one predetermined time period, a time of flight of each of the second signals. The processor circuitry is configured to obtain, from the memory circuitry, reference positions of the plurality of electronic reference devices. The processor circuitry is configured to determine, based on the associations, one or more candidate positions of the electronic device. The processor circuitry is configured to determine, based on the distances, the one or more candidate positions, and the obtained reference positions, a position of the electronic device.
    Type: Grant
    Filed: June 3, 2021
    Date of Patent: February 20, 2024
    Assignee: Sony Group Corporation
    Inventor: Peter Ljung
  • Patent number: 11900921
    Abstract: Techniques for partially processing an input on a device and completing processing at a remote system are provided. The device may process an input using an on-device machine learning (ML) model, and determine to cease processing at an intermediary node of the ML model based on the output of the intermediary node. Based on the output of the intermediary node satisfying a condition, the device may use the output of the intermediary node to generate an output responsive to the input. Conversely, if the output of the intermediary node does not satisfy the condition, the device may send the output of the intermediary node to the remote system, so the remote system can use another machine learning model to complete processing with respect to the input.
    Type: Grant
    Filed: October 26, 2020
    Date of Patent: February 13, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Rahul Gupta, Christophe Dupuy, Jacob Ryan Stolee, Clement Chung
  • Patent number: 11893308
    Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, an NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD forgoes selection of the cloud-based VAS when the local voice assistant is selected.
    Type: Grant
    Filed: March 28, 2022
    Date of Patent: February 6, 2024
    Assignee: Sonos, Inc.
    Inventors: Dayn Wilberding, John Tolomei
  • Patent number: 11869503
    Abstract: Example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting of the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.
    Type: Grant
    Filed: December 13, 2021
    Date of Patent: January 9, 2024
    Assignee: Sonos, Inc.
    Inventor: Connor Smith
  • Patent number: 11869487
    Abstract: Speech processing tasks may be allocated at least partly to a local device (e.g., user computing device that receives spoken words) and at least partly to a remote device to determine one or more user commands or tasks to be performed by the local device. The remote device may be used to process speech that the local device could not process or understand, or for other reasons, such as for error checking. The local device may then execute or begin to execute locally determined tasks to reduce user-perceived latency. Meanwhile, the entire media input, or a portion thereof, may be sent to the remote device to process speech, verify the tasks and/or identify other user commands in the media input (or portion thereof).
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: January 9, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Sanjoy Ghosh, Pieter Sierd van der Meulen
  • Patent number: 11863646
    Abstract: Disclosed is technology for a computer-based “Daily Brief” service, which includes methods and corresponding systems for proactively providing push notifications for users of chat information systems. The push notifications are dynamically generated and presented to the user based on identification of one or more triggering events, which may include a predetermined time/date, current geographical location, activity of peers and friends in social media associated with the user, scheduled events, appointments, meetings, emails, instant messages, and many more. The described technology improves the interaction interface between the user and the chat information system.
    Type: Grant
    Filed: February 3, 2023
    Date of Patent: January 2, 2024
    Assignee: GOOGLE LLC
    Inventors: Ilya Gennadyevich Gelfenbeyn, Artem Goncharuk, Ilya Andreevich Platonov, Pavel Aleksandrovich Sirotin, Olga Aleksandrovna Gelfenbeyn
  • Patent number: 11830490
    Abstract: Disambiguating question answering responses by receiving voice command data associated with a first user, determining a first user identity according to the first user voice command data, determining a first user activity context according to the first user voice command data, determining a first response for the first user, receiving voice command data associated with a second user, determining a second user identity according to the second user voice command data, determining a second user activity context according to the second user voice command data, determining a second response for the second user, determining a predicted ambiguity between the first response and the second response, altering the first response according to the predicted ambiguity, and providing the first response and the second response.
    Type: Grant
    Filed: August 11, 2021
    Date of Patent: November 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Venkata Vara Prasad Karri, Sarbajit K. Rakshit, Sri Harsha Varada, Sampath Kumar Pulupula Venkata
  • Patent number: 11824894
    Abstract: Embodiments of the invention are directed to techniques that include receiving a query intended for a targeted database and determining that the query is from an unauthorized user. A response is returned to the unauthorized user generated by a model, the response being dynamically generated to fulfill the query. The model is configured to generate responses consistent with any previous responses returned to the unauthorized user.
    Type: Grant
    Filed: November 25, 2020
    Date of Patent: November 21, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Marco Simioni, Stefano Braghin, Killian Levacher
  • Patent number: 11803431
    Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.
    Type: Grant
    Filed: March 14, 2022
    Date of Patent: October 31, 2023
    Assignee: VMware, Inc.
    Inventors: Rusko Atanasov, Kalin Tsvetkov, Viktoriya Bambaldokova
  • Patent number: 11790893
    Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors, extracted from first and second utterances that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: October 17, 2023
    Assignee: LG ELECTRONICS INC.
    Inventors: Kwangyong Lee, Hyun Yu, Byeongha Kim, Yejin Kim
  • Patent number: 11790902
    Abstract: A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing system may then process the response data to determine second response data that includes an indication of the second speech-processing system and cause output of audio corresponding to the second response data.
    Type: Grant
    Filed: February 4, 2020
    Date of Patent: October 17, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Whalin, Catherine Michelle Loo, Calvin Phuong Nguyen
  • Patent number: 11769012
    Abstract: A system and method for updating computerized language models is provided that automatically adds terms to or deletes terms from the language model to capture trending events or products, while maximizing computer efficiencies by deleting terms that are no longer trending and by using knowledge bases, machine learning model training and evaluation corpora, analysis tools, and databases.
    Type: Grant
    Filed: March 25, 2020
    Date of Patent: September 26, 2023
    Assignee: Verint Americas Inc.
    Inventors: Ian Roy Beaver, Christopher James Jeffs
  • Patent number: 11763404
    Abstract: Systems, methods, and apparatuses for implementing a geo-demographic zoning optimization engine are disclosed.
    Type: Grant
    Filed: June 15, 2021
    Date of Patent: September 19, 2023
    Assignee: Arizona Board of Regents on behalf of Arizona State University
    Inventors: Jon J. Miller, Vikash Bajaj, Srinivasa Srivatsav Kandala, Fangwu Wei, Michael Kuby, Wangshu Mu, Daoqin Tong
  • Patent number: 11741385
    Abstract: To simplify assisting a user in their day-to-day activities, a communication for performing an action may be sent to a user in the form of a query, where the query includes the most likely set of choices for the action arranged in a group of dichotomous (e.g., yes/no) or multiple choice answers. In this manner, a user may respond to the query by simply selecting one of the dichotomous or multiple choice answers. Historical logs of past actions, responses, queries, and so forth, may be used to predict future user actions or needs, and to formulate future queries for sending to the user. These techniques may be implemented, for example, through a remote coordination server or directly through a user's personal electronics device.
    Type: Grant
    Filed: July 28, 2022
    Date of Patent: August 29, 2023
    Assignee: Telepathy Labs, Inc.
    Inventors: Damien Phelan Stolarz, David Joseph Diaz, James Rossfeld, Scott Raven, Christopher O'Malley, Christopher Kurpinski
  • Patent number: 11735185
    Abstract: The present invention provides a caption service system for remote speech recognition, which provides a caption service for the hearing impaired. The system includes a speaker and live broadcast equipment at A, a listener-typist and a computer at B, a hearing-impaired viewer and a live screen at C, and an automatic speech recognition (ASR) caption server at D. The live broadcast equipment, the computer, the live screen, and the ASR caption server are connected over a network. The speaker's audio is sent to the ASR caption server to be converted into text, which is corrected by the listener-typist. The text caption is then sent to the live screen of the hearing-impaired viewer together with the speaker's video and audio, so that the viewer can read a text caption of what the speaker says.
    Type: Grant
    Filed: August 19, 2021
    Date of Patent: August 22, 2023
    Assignee: NATIONAL YANG MING CHIAO TUNG UNIVERSITY
    Inventors: Sin Horng Chen, Yuan Fu Liao, Yih Ru Wang, Shaw Hwa Hwang, Bing Chih Yao, Cheng Yu Yeh, You Shuo Chen, Yao Hsing Chung, Yen Chun Huang, Chi Jung Huang, Li Te Shen, Ning Yun Ku
  • Patent number: 11729596
    Abstract: A communication device and method can include one or more processors operatively coupled to memory, a sensor, and an output device, where the one or more processors perform operations of identifying target person locations using internet searching and short-range communication enabled devices such as Bluetooth LE devices.
    Type: Grant
    Filed: December 2, 2022
    Date of Patent: August 15, 2023
    Assignee: Staton Techiya LLC
    Inventor: Steven Wayne Goldstein
  • Patent number: 11721347
    Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.
    Type: Grant
    Filed: June 29, 2021
    Date of Patent: August 8, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow
  • Patent number: 11710488
    Abstract: A method may include obtaining audio data originating at a first device during a communication session between the first device and a second device and providing the audio data to a first speech recognition system to generate a first transcript based on the audio data and directing the first transcript to the second device. The method may also include in response to obtaining a quality indication regarding a quality of the first transcript, multiplexing the audio data to provide the audio data to a second speech recognition system to generate a second transcript based on the audio data while continuing to provide the audio data to the first speech recognition system and direct the first transcript to the second device, and in response to obtaining a transfer indication that occurs after multiplexing of the audio data, directing the second transcript to the second device instead of the first transcript.
    Type: Grant
    Filed: December 19, 2018
    Date of Patent: July 25, 2023
    Assignee: Sorenson IP Holdings, LLC
    Inventors: Kenneth Boehme, Michael Holm, Shane Roylance
  • Patent number: 11700484
    Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.
    Type: Grant
    Filed: February 10, 2022
    Date of Patent: July 11, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Lae-Hoon Kim, Sunkuk Moon, Erik Visser, Prajakt Kulkarni
  • Patent number: 11695836
    Abstract: A computer program and the like are provided that are capable of causing an information processing device connected to a private network, to automatically execute operation processing of a browser. The computer program is a computer program for causing the information processing device connected to the private network, to automatically execute the operation of the browser that accesses a web server on the private network, based on an instruction from a server connected to a global network, and causes the information processing device to execute the processing of: requesting the server to establish a connection; obtaining an operation instruction related to the operation processing which is push-transmitted from the server, by using the connection; executing the operation processing of the browser based on the obtained operation instruction; obtaining an execution result of the operation processing; and outputting the obtained execution result to the server.
    Type: Grant
    Filed: June 18, 2020
    Date of Patent: July 4, 2023
    Assignee: C-RISE Ltd.
    Inventors: Masanori Murai, Yutaka Mitsubayashi
  • Patent number: 11683320
    Abstract: The present disclosure is generally directed to a data processing system for customizing content in a voice activated computer network environment. With user consent, the data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, increasing the accuracy of the voice identification process used in the generation of customized content. The present solution can make accurate identifications while generating fewer audio identification models, which are computationally intensive to generate.
    Type: Grant
    Filed: April 22, 2021
    Date of Patent: June 20, 2023
    Assignee: GOOGLE LLC
    Inventors: Victor Carbune, Thomas Deselaers, Sandro Feuz
  • Patent number: 11669697
    Abstract: A method for providing responsive actions to user inputs in a multi-domain context includes receiving, by a speech-based user interface, a first speech input from a user and converting said first speech input into a text-based representation of the first speech input. A natural language processor processes the text-based representation to determine an intent, entity and internal state of the first speech input. The method further includes determining, by a model-based module based on the intent, entity and internal state, a first data processing policy to apply to the first speech input, wherein the first data processing policy is either a rules-based data processing policy applied by a rules-based module or a statistical model-based data processing policy applied by the model-based module. The first responsive action is generated by the determined first data processing module, and outputted via the speech-based user interface and/or a machine interface.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: June 6, 2023
    Assignee: Bayerische Motoren Werke Aktiengesellschaft
    Inventors: Wangsu Hu, Jilei Tian
  • Patent number: 11650983
    Abstract: A method is provided for generating a classification model configured to select an optimal execution combination for query processing. The method provides, to a processor, training queries and different execution combinations for executing the training queries. Each different execution combination involves a respective different query engine and a respective different runtime. The method extracts, from a set of Directed Acyclic Graphs (DAGs) using a set of Cost-Based Optimizers (CBOs), a set of feature vectors for each of the plurality of training queries. The method adds, by the processor to each of the merged feature vectors, a respective label indicative of the optimal execution combination based on actual respective execution times of the plurality of different execution combinations, to obtain a set of labels. The method trains, by the processor, the classification model by learning the set of merged feature vectors with the set of labels.
    Type: Grant
    Filed: December 22, 2020
    Date of Patent: May 16, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Tatsuhiro Chiba
  • Patent number: 11646023
    Abstract: Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.
    Type: Grant
    Filed: December 14, 2020
    Date of Patent: May 9, 2023
    Assignee: Sonos, Inc.
    Inventors: Connor Kristopher Smith, John Tolomei, Betty Lee
  • Patent number: 11594213
    Abstract: Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term is associated with a respective part of speech, and a frequency of occurrence of each term in content metadata is determined. A relevance of each term is then determined based on its respective part of speech and frequency. The natural language search query is then interpreted based on the importance or relevance of each term.
    Type: Grant
    Filed: March 3, 2020
    Date of Patent: February 28, 2023
    Assignee: ROVI GUIDES, INC.
    Inventors: Jeffry Copps Robert Jose, Ajay Kumar Mishra
  • Patent number: 11588799
    Abstract: A moving object control system includes a server, a portable terminal configured to transmit authentication information issued by the server, and a controller provided in a moving object and configured to authenticate the portable terminal according to the authentication information transmitted from the portable terminal, and when the portable terminal is authenticated, control the moving object in response to an operation signal from the portable terminal. The controller is configured to perform information communication with the server and control the moving object according to control information received from the server.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: February 21, 2023
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventor: Yasuhisa Fujiwara
  • Patent number: 11587571
    Abstract: An electronic apparatus includes: at least one processor configured to: receive audio of a voice input of a user; obtain, from a plurality of voice recognizers capable of recognizing the voice input, a plurality of recognition results of the received audio; and perform an operation based on a recognition result of which recognition suitability for the voice input is identified to be high, among the plurality of recognition results.
    Type: Grant
    Filed: September 2, 2020
    Date of Patent: February 21, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Chanhee Choi
  • Patent number: 11550846
    Abstract: Methods, apparatus, systems, and computer-readable media are provided for transferring dialog sessions between devices using deep links. The dialog sessions can correspond to interactions, mediated by an automated assistant, between a user and a third party application. During the dialog session, a user can request that the dialog session be transferred to a different device, for example, to interact with the third party application through a different modality. In response, the automated assistant and/or the third party application can generate a link that can be transferred to the transferee device to allow the transferee device to seamlessly take over the dialog session. In this way, computational resources and electrical power can be preserved by not requiring a recipient device to re-process natural language inputs previously provided during the dialog session.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: January 10, 2023
    Assignee: GOOGLE LLC
    Inventors: Justin Lewis, Scott Davies
  • Patent number: 11538458
    Abstract: Disclosed is an electronic apparatus capable of controlling voice recognition. The electronic apparatus increases a score of a category corresponding to a word included in a user's utterance in a database when the instruction included in the user's utterance is present in the database. The electronic apparatus checks whether the score of the category corresponding to the word is equal to or greater than a preset value when the instruction is not present in the database. The electronic apparatus registers the instruction in the database so that the instruction is included in the category corresponding to the word when the check shows that the score is equal to or greater than the preset value.
    Type: Grant
    Filed: September 16, 2020
    Date of Patent: December 27, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Heejae Kim
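    The scoring rule in the abstract of patent 11538458 above can be illustrated with a minimal sketch. It is not the patented implementation: the class name `InstructionDatabase`, the `handle` method, the word-to-category mapping, and the integer score increment are all assumptions made for illustration.

```python
from collections import defaultdict


class InstructionDatabase:
    """Hypothetical stand-in for the database described in the abstract."""

    def __init__(self, threshold, word_to_category):
        self.threshold = threshold                      # the "preset value"
        self.word_to_category = dict(word_to_category)  # assumed lookup table
        self.instructions = defaultdict(set)            # category -> known instructions
        self.scores = defaultdict(int)                  # category -> score

    def handle(self, word, instruction):
        category = self.word_to_category[word]
        known = any(instruction in group for group in self.instructions.values())
        if known:
            # Instruction already present: raise the score of the word's category.
            self.scores[category] += 1
            return "recognized"
        if self.scores[category] >= self.threshold:
            # Instruction absent but the category score is high enough: register it.
            self.instructions[category].add(instruction)
            return "registered"
        return "rejected"
```

    Under these assumptions, an unknown instruction is only registered after its category has accumulated enough score from instructions that were already known.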
  • Patent number: 11538476
    Abstract: A terminal device is provided and includes a communication interface including circuitry, a display and at least one processor configured to control the communication interface to transmit a user voice including a plurality of intents to an external server, based on word use information included in the user voice and summary information regarding the user voice generated based on user-related information being received from the external server, control the display to display the received summary information, based on a user feedback regarding the summary information being input, transmit information regarding the user feedback to the external server, and based on response information regarding the user voice generated based on the user feedback being received from the external server, control the display to provide the response information.
    Type: Grant
    Filed: November 24, 2020
    Date of Patent: December 27, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Sanghyuk Yoon, Heejun Song, Heejae Yu
  • Patent number: 11501879
    Abstract: Techniques for voice control of a patient care device are described. A patient care device receives an audio request from a user. The patient care device records the audio request. The patient care device transmits the audio request over a communication network to a speech recognition service, and in response receives, from the speech recognition service, a textual representation of the audio request. The patient care device matches the textual representation, using the computer processor, to a first command in a vocabulary of available commands, and in response performs the first command.
    Type: Grant
    Filed: October 1, 2019
    Date of Patent: November 15, 2022
    Assignee: PREVENTICE TECHNOLOGIES, INC.
    Inventors: Richard M. Smith, Scott J. Burrichter, Jon P. Otterstatter
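    The last step in the abstract of patent 11501879 above, matching the returned text against a vocabulary of available commands, might look roughly like the sketch below. The command phrases, the `match_command` helper, and the case and whitespace normalization are hypothetical; the abstract does not say how the match is performed.

```python
def match_command(text, vocabulary):
    """Return the handler for the recognized text, or None if no command matches."""
    # Normalize case and whitespace so "  Start   Recording " matches "start recording".
    normalized = " ".join(text.lower().split())
    return vocabulary.get(normalized)


# Hypothetical vocabulary: phrase -> action to perform on the device.
VOCABULARY = {
    "start recording": lambda: "recording started",
    "stop recording": lambda: "recording stopped",
}
```

    A caller would invoke the returned handler when one is found and ignore or re-prompt otherwise, for example `handler = match_command(text, VOCABULARY)` followed by `handler()` if `handler` is not `None`.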
  • Patent number: 11475068
    Abstract: Disclosed are an automatic question answering method and apparatus, a storage medium, and a server. The method includes: acquiring numerical features of a sentence to be queried; querying a target sentence in a question database according to the numerical features of the sentence to be queried, the question database including a plurality of sentences and answers corresponding to the plurality of sentences; and determining a target answer according to an answer corresponding to the target sentence. In this method, the sentence is represented by the numerical features, such that it is convenient to search questions similar to a question of a user in the question database, thereby achieving an effect of improving a search speed of the question.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: October 18, 2022
    Assignee: BEIJING BOE TECHNOLOGY DEVELOPMENT CO., LTD.
    Inventors: Jianbo Han, Bingqian Wang
  • Patent number: 11475459
    Abstract: A system for classification of a customer query is disclosed. The system includes a customer interaction subsystem to receive the customer query from a customer, and a tokenizer subsystem to split the customer query into tokens. The system also includes a multitask profiler subsystem including a mapping module to map the tokens with pre-trained embedding data to assign mathematical codes to the tokens, an attention module to apply attention models hierarchically on a contextual embedding layer to obtain contextual mathematical codes corresponding to the tokens based on the mathematical codes, a classification module to classify the multiple tokens into profiles based on the contextual mathematical codes, and a profile generator to generate a human-readable profile and a machine-readable profile based on the profiles. The machine-readable profile and the human-readable profile include at least one of a customer profile, a product profile, an issue profile or a combination thereof.
    Type: Grant
    Filed: March 20, 2020
    Date of Patent: October 18, 2022
    Assignee: PM Labs, Inc.
    Inventors: Arjun Maheswaran, Akhilesh Sudhakar
  • Patent number: 11443561
    Abstract: A vehicle device includes: a communication device configured to transmit or receive a signal to or from a cloud server operating in conjunction with a service device of a parking lot upon entering the parking lot; and a controller configured to output parking lot information received from the cloud server upon entering the parking lot, and to identify charge settlement information of the parking lot from the cloud server to pay a settlement charge, when a predetermined charge settlement event occurs.
    Type: Grant
    Filed: June 4, 2019
    Date of Patent: September 13, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
    Inventors: Yun Joong Park, Jong Pil Park, Kyowoong Choo
  • Patent number: 11398238
    Abstract: Disclosed herein is a speech recognition method in a distributed network environment. A method of performing a speech recognition operation in an edge computing device includes receiving a natural language understanding (NLU) model from the cloud server, storing the received NLU model, receiving voice data spoken by a user from the client device, performing a natural language processing operation on the received voice data using the NLU model, performing speech recognition according to the natural language processing operation, and transmitting a result of the speech recognition to the client device. At least one of the edge computing device, a voice recognition device, and a server may be associated with an artificial intelligence module, a drone (an unmanned aerial vehicle (UAV)), a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to a 5G service, and the like.
    Type: Grant
    Filed: June 7, 2019
    Date of Patent: July 26, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Sungjin Kim, Dongho Kim, Jingyeong Kim, Taehyun Kim
  • Patent number: 9928256
    Abstract: A universal data management interface (UDMI) system includes a processing system that generates a visual interface through which a user can access, manage, and manipulate data on plural different types of remote databases. The UDMI connects to multiple standard database management systems and allows multiple users to access, manage, and manipulate data within each of the multiple standard database management systems. The UDMI also allows multiple virtual databases that reside in a single database to be available as a network service.
    Type: Grant
    Filed: March 29, 2016
    Date of Patent: March 27, 2018
    Assignee: S. AQUA SEMICONDUCTOR, LLC
    Inventor: Jasmin Cosic
  • Publication number: 20120179457
    Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.
    Type: Application
    Filed: January 6, 2012
    Publication date: July 12, 2012
    Applicant: Nuance Communications, Inc.
    Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
  • Publication number: 20100191529
    Abstract: Systems and methods are described for a speech system that manages multiple grammars from one or more speech-enabled applications. The speech system includes a speech server that supports different grammars and different types of grammars by exposing several methods to the speech-enabled applications. The speech server supports static grammars that do not change and dynamic grammars that may change after a commit. The speech server provides persistence by supporting persistent grammars that enable a user to issue a command to an application even when the application is not loaded. In such a circumstance, the application is automatically launched and the command is processed. The speech server may enable or disable a grammar in order to limit confusion between grammars. Global and yielding grammars are also supported by the speech server. Global grammars are always active (e.g., “call 9-1-1”) while yielding grammars may be deactivated when an interaction whose grammar requires priority is active.
    Type: Application
    Filed: March 31, 2010
    Publication date: July 29, 2010
    Applicant: Microsoft Corporation
    Inventors: Stephen Russell Falcon, Clement Chun Pong Yip, David Michael Miller, Dan Banay
  • Publication number: 20090138265
    Abstract: Adjusting model parameters is described for a speech recognition system that combines recognition outputs from multiple speech recognition processes. Discriminative adjustments are made to model parameters of at least one acoustic model based on a joint discriminative criterion over multiple complementary acoustic models to lower recognition word error rate in the system.
    Type: Application
    Filed: November 26, 2007
    Publication date: May 28, 2009
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Daniel Willett, Chuang He
  • Publication number: 20080228480
    Abstract: A speech recognition method comprises a model selection step which selects a recognition model based on characteristic information of input speech and a speech recognition step which translates the input speech into text data based on the selected recognition model.
    Type: Application
    Filed: October 30, 2007
    Publication date: September 18, 2008
    Inventor: Shuhei Maegawa
  • Publication number: 20080215327
    Abstract: Speech signal information is formatted, processed and transported in accordance with a format adapted for TCP/IP protocols used on the Internet and other communications networks. NULL characters are used for indicating the end of a voice segment. The method is useful for distributed speech recognition systems such as a client-server system, typically implemented on an intranet or over the Internet based on user queries at his/her computer, a PDA, or a workstation using a speech input interface.
    Type: Application
    Filed: May 19, 2008
    Publication date: September 4, 2008
    Inventor: Ian M. Bennett
  • Publication number: 20080167872
    Abstract: A speech recognition device that is capable of presenting, to a user in an easy-to-understand manner, whether or not the user's utterance is a word unregistered in a speech recognition dictionary and whether or not the utterance should be repeated due to a recognition error includes: a speech recognition vocabulary storage unit (102) which defines vocabulary for speech recognition; a speech recognition unit (101) which checks the uttered speech against the registered words; a reference similarity calculation unit (103) which calculates a similarity between the uttered speech and a combination of acoustic units, which are subwords; an unregistered word judgment unit (104) which judges, based on the result of the check by the speech recognition unit (101) and a result of the calculation performed by the reference similarity calculation unit (103), whether the uttered speech is a registered word or an unregistered word; an unregistered word storage (106) which stores unregistered words; an unregistered word cand
    Type: Application
    Filed: June 2, 2005
    Publication date: July 10, 2008
    Inventors: Yoshiyuki Okimoto, Tsuyoshi Inoue, Takashi Tsuzuki
  • Publication number: 20080154596
    Abstract: The present invention can include a speech enrollment system including an ordered stack of grammars and a recognition engine. The ordered stack of grammars can include an application grammars layer, a confusable grammar layer, a personal grammar layer, a phrase enrolled grammar layer, and an enrollment grammar layer. The recognition engine can return recognition results for speech input by processing the input using the ordered stack of grammars. The processing can occur from the topmost layer in the stack to the bottommost layer in the stack. Each layer in the stack can include exit criteria based upon a defined condition. When the exit criteria are satisfied, a result can be returned based upon that layer and lower layers of the ordered stack can be ignored.
    Type: Application
    Filed: December 22, 2006
    Publication date: June 26, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: WILLIAM V. DA PALMA, BRIEN H. MUSCHETT
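    The top-to-bottom processing described in the abstract of publication 20080154596 above can be sketched as follows. This is an illustration only: the `GrammarLayer` class and `recognize` function are hypothetical, text phrases stand in for speech input, and the exit criterion is assumed to be a simple phrase match because the abstract leaves the "defined condition" open.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class GrammarLayer:
    """One layer of the ordered stack; phrases stand in for a real grammar."""
    name: str
    phrases: frozenset

    def exit_satisfied(self, utterance: str) -> bool:
        # Assumed exit criterion: the utterance is covered by this layer's grammar.
        return utterance in self.phrases


def recognize(utterance: str, stack: Sequence[GrammarLayer]) -> Optional[Tuple[str, str]]:
    """Walk the stack from topmost to bottommost and stop at the first layer that exits."""
    normalized = utterance.strip().lower()
    for layer in stack:
        if layer.exit_satisfied(normalized):
            # Lower layers are ignored once a layer's exit criterion is met.
            return layer.name, normalized
    return None
```

    With a stack such as `[GrammarLayer("application", frozenset({"check balance"})), GrammarLayer("personal", frozenset({"call mom"}))]`, a phrase present in both layers would be resolved by the higher one, which is the point of ordering the stack.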
  • Publication number: 20080140419
    Abstract: A system and method for improving voice recognition processing at a server system that receives voice input from a remotely located user system. The user system includes a microphone, a processor that performs front-end voice recognition processing of the received user voice input, and a communication component configured to send the front-end processed user voice input to a destination wirelessly over a network. The server system includes a communication component configured to receive the sent front-end processed user voice input, and a processor configured to complete voice recognition processing of the sent front-end processed user voice input.
    Type: Application
    Filed: October 30, 2007
    Publication date: June 12, 2008
    Inventor: Gilad Odinak
  • Patent number: RE49284
    Abstract: Disclosed is a method for controlling a cordless telephone device for use in a system that allows remote control of a home electric appliance. The method includes a first generation step of causing a first generation unit in a handset to encode audio input via a sound receiving unit in the handset to generate a first stream, and a first transmission step of transmitting the first stream to a base unit. The first generation step includes causing the first generation unit to generate instruction bit information and a first instruction stream when a first trigger indicating a request to start the remote control is given to the first generation unit. The first transmission step includes transmitting the instruction bit information and the first instruction stream to the base unit through a multiplexing scheme that is common to transmission of a first stream generated when the first trigger is not given.
    Type: Grant
    Filed: August 21, 2020
    Date of Patent: November 8, 2022
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Masayuki Kozuka, Shingo Matsumoto, Hideyuki Oka, Akihiko Inoue, Hiroshi Yahata, Tomoki Ogawa, Tohru Wakabayashi, Keizo Ishiguro