Distributed Recognition, e.g., in Client-Server Systems for Mobile Phones or Network Applications, etc. (EPO) Patents (Class 704/E15.047)
-
Patent number: 12210801
Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, a NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD foregoes selection of the cloud-based VAS when the local voice assistant is selected.
Type: Grant
Filed: February 5, 2024
Date of Patent: January 28, 2025
Assignee: Sonos, Inc.
Inventors: Dayn Wilberding, John Tolomei
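The selection step in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the command listing, the criteria, the function name `select_assistant`, and the use of an already transcribed word sequence are all assumptions made for illustration. The abstract also does not say what happens when no listed command is detected; falling back to the cloud-based service here is an assumption.

```python
from __future__ import annotations

from typing import Callable

# Hypothetical listing of playback commands. Each command maps to a
# criterion that is checked against the words following the command.
COMMAND_LISTING: dict[str, Callable[[list[str]], bool]] = {
    "play": lambda rest: len(rest) > 0,  # assumed criterion: something to play
    "pause": lambda rest: True,
    "skip": lambda rest: True,
}


def select_assistant(transcript: str) -> str:
    """Return "local" if a listed playback command and its criterion match."""
    words = transcript.lower().split()
    for i, word in enumerate(words):
        criterion = COMMAND_LISTING.get(word)
        if criterion is not None and criterion(words[i + 1:]):
            # The cloud-based service is not selected for this input.
            return "local"
    # Assumed fallback: no listed playback command was detected.
    return "cloud"
```

For instance, `select_assistant("play some jazz")` selects the local assistant, while a request with no listed command falls through to the assumed cloud fallback.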
-
Patent number: 12190056
Abstract: Systems and methods for autofilling and autocorrecting an AI supported UI form text field via smart glasses are provided. The data autofilled and autocorrected may be identification data, including account numbers and user IDs. The smart glasses may work together with a conversation tracking application and a smart glasses UI. Methods may include prompting agents based on real-time conversational analysis with context clues and texts to assist and maintain customer eye contact. Methods may further include capturing segments of data within conversational analysis and storing the segments of data in memory on the smart glasses. Methods may further include updating smart glasses UI form text fields based on AI supported autocorrection from real-time conversation, by autofilling the segment of data in the smart glasses UI form text fields.
Type: Grant
Filed: August 1, 2023
Date of Patent: January 7, 2025
Assignee: Bank of America Corporation
Inventor: Sandeep Verma
-
Patent number: 12165646
Abstract: Provided herein are systems and methods for delta models for providing privatized speech-to-text during virtual meetings. In one embodiment, a system may include a non-transitory computer-readable medium; a communications interface; and a processor. The processor may be configured to execute processor-executable instructions to: join a virtual meeting. Each participant in the virtual meeting may exchange audio streams with other participants in the virtual meeting. The instructions may include receiving, from a video conference provider, a local model for speech recognition. The local model may be a copy of a centralized model. The instructions may include performing speech recognition using the local model on the audio streams.
Type: Grant
Filed: April 29, 2022
Date of Patent: December 10, 2024
Assignee: Zoom Video Communications, Inc.
Inventors: Shane Paul Springer, Alexander Waibel
-
Patent number: 12165631
Abstract: A method of generating keyword-based dialogue summaries is provided. The method includes inputting a transcript of an audio conversation and a keyword into a machine learning model trained based on encodings representing the keyword and the transcript, generating computer-generated text different from and semantically descriptive of the transcript and semantically associated with the keyword, and outputting the computer-generated text in association with a selectable item selectable for inclusion of the computer-generated text in displayed text representing the transcript, the selectable item associated with the keyword.
Type: Grant
Filed: May 3, 2022
Date of Patent: December 10, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventors: Abedelkader Asi, Royi Ronen, Roy Eisenstadt, Dean Geckt
-
Patent number: 12159243
Abstract: To simplify assisting a user in their day-to-day activities, a communication for performing an action may be sent to a user in the form of a query, where the query includes the most likely set of choices for the action arranged in a group of dichotomous (e.g., yes/no) or multiple choice answers. In this manner, a user may respond to the query by simply selecting one of the dichotomous or multiple choice answers. Historical logs of past actions, responses, queries, and so forth, may be used to predict future user actions or needs, and to formulate future queries for sending to the user. These techniques may be implemented, for example, through a remote coordination server or directly through a user's personal electronics device.
Type: Grant
Filed: July 11, 2023
Date of Patent: December 3, 2024
Assignee: Telepathy Labs, Inc.
Inventors: Damien Phelan Stolarz, David Joseph Diaz, James Rossfeld, Scott Raven, Christopher O'Malley, Christopher Kurpinski
-
Patent number: 12089140
Abstract: According to various embodiments, a multi-link device (MLD) operating in multiple links including a first link may transmit, through a first station (STA) and to a first AP of an AP multi-link device, a request frame including an information field for requesting at least one element related to a second link. The multi-link device may receive at least one element related to the second link on the basis of the request frame.
Type: Grant
Filed: July 6, 2023
Date of Patent: September 10, 2024
Assignee: LG ELECTRONICS INC.
Inventors: Namyeong Kim, Jeongki Kim, Jinsoo Choi, Sungjin Park, Taewon Song, Insun Jang
-
Patent number: 12073453
Abstract: There are provided systems and methods for generating a sales transaction from voice data input by a user. A user device may receive voice data including a preference for purchasing an item. The user device may convert the voice data to the preferences and perform a search for a sales transaction corresponding to the preferences. The search may include parameters about the user, such as a location. The sales transaction may include purchase prices, times, locations, or other relevant data. A user may accept or decline the sales transaction with additional user data. If the user accepts the sales transaction, the sales transaction may be completed with a payment provider and a transaction history given to the user for later redemption of the item. If the user declines the sales transaction, further sales transactions with additional items may be presented to the user.
Type: Grant
Filed: August 17, 2021
Date of Patent: August 27, 2024
Assignee: PAYPAL, INC.
Inventors: Hyunju Lee, Joel P. Yarbrough, Francisco Vittorio Octavio Joachin D. Barretto, Gokul G Narayana Pillai
-
Patent number: 12062381
Abstract: Two-stage speech/music classification device and method classify an input sound signal and select a core encoder for encoding the sound signal. A first stage classifies the input sound signal into one of a number of final classes. A second stage extracts high-level features of the input sound signal and selects the core encoder for encoding the input sound signal in response to the extracted high-level features and the final class selected in the first stage.
Type: Grant
Filed: April 8, 2021
Date of Patent: August 13, 2024
Assignee: VOICEAGE CORPORATION
Inventor: Vladimir Malenovsky
-
Patent number: 12014733
Abstract: A vehicle occupant aid system is disclosed. The system may comprise a rearview assembly. Further, the rearview assembly may comprise a button. The system may further comprise one or more data capturing elements. Each element may be a microphone, an imager, a location device, and/or a sensor. In some embodiments, a controller may record the data for a predetermined period of time. Further, the controller may transmit information to a remote device based upon initiation of a trigger. The information being based, at least in part, on the data. In other embodiments, the controller may operably record the data in response to a first operation of the button. Further, the controller may transmit information to a remote device based upon a second operation of the button. The information being based, at least in part, on the data recorded between the first and second operations of the button.
Type: Grant
Filed: May 27, 2021
Date of Patent: June 18, 2024
Assignee: GENTEX CORPORATION
Inventors: Thomas S. Wright, Eric P. Bigoness
-
Patent number: 12008988
Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone, a camera, a memory configured to store at least one command, and at least one processor configured to, based on a first user voice being input from a user, provide a response to the first user voice, based on an audio signal including a voice being input while the response to the first user voice is provided, analyze an image captured by the camera and determine whether there is a second user voice uttered by the user in the audio signal, and based on determining that there is the second user voice uttered by the user in the audio signal, stop providing the response to the first user voice and obtain and provide a response to the second user voice.
Type: Grant
Filed: October 7, 2020
Date of Patent: June 11, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: Hyeontaek Lim, Sejin Kwak, Youngjin Kim
-
Patent number: 11990130
Abstract: A method, apparatus, device and computer storage medium for processing voices, which relate to the technical field of voices, are disclosed. An implementation includes: recognizing a voice request received by a first voice assistant to obtain a text request; determining information of a second voice assistant which is able to process the text request; and calling the second voice assistant to respond to the text request.
Type: Grant
Filed: May 7, 2020
Date of Patent: May 21, 2024
Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Inventors: Jizhou Huang, Shiqiang Ding, Changshun Hou
-
Patent number: 11990146
Abstract: An apparatus for providing a processed audio signal representation on the basis of an input audio signal representation is configured to apply an un-windowing, in order to provide the processed audio signal representation on the basis of the input audio signal representation. The apparatus is configured to adapt the un-windowing in dependence on one or more signal characteristics and/or in dependence on one or more processing parameters used for a provision of the input audio signal representation.
Type: Grant
Filed: May 4, 2021
Date of Patent: May 21, 2024
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Stefan Bayer, Pallavi Maben, Emmanuel Ravelli, Guillaume Fuchs, Eleni Fotopoulou, Markus Multrus
-
Patent number: 11990997
Abstract: This disclosure provides systems, methods and apparatus, including computer programs encoded on computer storage media, for mapping coding indices associated with a rateless coding scheme to respective sets of communication resources over which a transmitting device may perform transmissions of portions of a message corresponding to each of the coding indices. In some aspects, for example, different coding indices may correspond to different cumulative portions of the message and the transmitting device may transmit a first cumulative portion of the message corresponding to a first coding index over a first set of communication resources to which the first coding index is mapped. The transmitting device may signal an indication of such a resource mapping scheme to a receiving device and the receiving device may attempt to decode transmissions of one or more cumulative portions of the message using the indicated resource mapping scheme.
Type: Grant
Filed: November 17, 2021
Date of Patent: May 21, 2024
Assignee: QUALCOMM Incorporated
Inventors: Gideon Shlomo Kutz, David Yunusov, Tal Oved, Assaf Touboul, Amit Bar-Or Tillinger
-
Patent number: 11984137
Abstract: A two-way communication support system supports two-way communication between a second terminal device which receives, from a first terminal device, first state data changed according to the state of a first user present on a side where the first terminal device is arranged and which outputs an image and a voice indicated by the first state data and the first terminal device. The two-way communication support system includes a state identifier and an output controller. The state identifier analyzes second state data changed according to the state of a second user present on a side where the second terminal device is arranged so as to identify the state of the second user. The output controller causes the first terminal device to output analysis information indicating a result of the identification of the state of the second user performed by the state identifier.
Type: Grant
Filed: December 21, 2021
Date of Patent: May 14, 2024
Assignee: SHARP KABUSHIKI KAISHA
Inventor: Mamoru Takaya
-
Patent number: 11967318
Abstract: The present subject matter at least describes a method and a system (300, 1200) of performing speech-recognition in an electronic device having an embedded speech recognizer. The method comprises receiving an input-audio comprising speech at a device. In real-time, at least one speech-recognition module is selected within at least one of the device and a server for recognition of at least a portion of the received speech based on criteria defined in terms of a) past-performance of speech-recognition modules within the device and server; b) an orator of speech; and c) a quality of service associated with at least one of the device and a networking environment thereof. Based upon the selection of the server, output of the selected speech-recognition modules within the device are selected for processing by corresponding speech-recognition modules of the server. An uttered-speech is determined within the input-audio based on output of the selected speech-recognition modules of the device or the server.
Type: Grant
Filed: December 19, 2019
Date of Patent: April 23, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: Jithendra Vepa, Periyasamy Paramasivam, Ramya Viswanathan, Rajesh Krishna Selvaraj Krishnan
-
Patent number: 11948564
Abstract: Provided is an information processing device including a response control unit that controls a response to a user's utterance based on a first utterance interpretation result and a second utterance interpretation result. The first utterance interpretation result is a result of natural language understanding processing for an utterance text generated by automatic speech recognition processing based on the user's utterance and the second utterance interpretation result is an interpretation result acquired based on learning data in which the first utterance interpretation result and the utterance text used to acquire the first utterance interpretation result are associated with each other. The response control unit further controls the response to the user's utterance based on the second utterance interpretation result in a case where the second utterance interpretation result is acquired based on the user's utterance before acquisition of the first utterance interpretation result.
Type: Grant
Filed: March 13, 2019
Date of Patent: April 2, 2024
Assignee: SONY CORPORATION
Inventors: Hiro Iwase, Yuhei Taki, Kunihito Sawai
-
Patent number: 11935517
Abstract: A speech decoding method is performed by a computer device, the speech including a current audio frame and a previous audio frame. The method includes: obtaining a target token corresponding to a smallest decoding score from a first token list including first tokens obtained by decoding the previous audio frame, each first token including a state pair and a decoding score, the state pair being used for characterizing a correspondence between a first state of the first token in a first decoding network corresponding to a low-order language model and a second state of the first token in a second decoding network corresponding to a differential language model; determining pruning parameters according to the target token and an acoustic vector of the current audio frame when the current audio frame is decoded; and decoding the current audio frame according to the first token list, the pruning parameters, and the acoustic vector.
Type: Grant
Filed: March 3, 2021
Date of Patent: March 19, 2024
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Yiheng Huang, Xiaozheng Jian, Liqiang He
-
Patent number: 11935539
Abstract: A voice support server is used to provide voice control functionality to a third party application that does not natively support voice control functions. The voice support server implements a domain specific to the third party application that maintains a domain-specific language model (DLM) reflecting the functionality of the third party application. The DLM comprises a plurality of intent patterns corresponding to different commands and their possible variations that may be issued by the user, and maps each intent pattern to a corresponding action to be performed by the third party application. Received audio data is analyzed to determine one or more user utterances, which are transcribed and compared to the intent patterns of the DLM to identify an intent corresponding to the user utterance. The voice control module may then transmit instructions to the third party application to perform the action corresponding to the identified intent.
Type: Grant
Filed: January 24, 2020
Date of Patent: March 19, 2024
Assignee: Alan AI, Inc.
Inventors: Andrey Ryabov, Anna Miroshnichenko, Evgeny Yusov, Alex Sotnikov
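The mapping from intent patterns to application actions described in this abstract can be sketched in Python as follows. This is an illustrative sketch only: regular expressions stand in for the DLM's intent patterns, and the to-do application, the pattern list, the action names, and `match_intent` are hypothetical and not taken from the patent.

```python
from __future__ import annotations

import re
from typing import Optional

# Hypothetical intent patterns for an imaginary to-do application. Each
# pattern covers a few phrasings of one command and maps to an action name.
INTENT_PATTERNS = [
    (re.compile(r"^(?:add|create) (?:a )?task (?P<title>.+)$"), "create_task"),
    (re.compile(r"^(?:delete|remove) task (?P<title>.+)$"), "delete_task"),
    (re.compile(r"^(?:show|list) (?:my )?tasks$"), "list_tasks"),
]


def match_intent(utterance: str) -> Optional[tuple[str, dict[str, str]]]:
    """Compare a transcribed utterance to the patterns; return (action, slots)."""
    text = utterance.strip().lower()
    for pattern, action in INTENT_PATTERNS:
        match = pattern.match(text)
        if match:
            return action, match.groupdict()
    return None  # no intent pattern matched the utterance
```

A caller would pass the returned action name and slots on to the third party application; an utterance that matches no pattern yields `None`.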
-
Patent number: 11915065
Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.
Type: Grant
Filed: January 20, 2022
Date of Patent: February 27, 2024
Assignee: VMware, Inc.
Inventors: Rusko Atanasov, Kalin Tsvetkov
-
Patent number: 11906613
Abstract: An electronic device includes memory circuitry, interface circuitry, and processor circuitry. The processor circuitry is configured to transmit, to a plurality of electronic reference devices, a first signal, the first signal having a pulse width below a threshold. The processor circuitry is configured to determine, based on the received second signals and at least one predetermined time period, a time of flight of each of the second signals. The processor circuitry is configured to obtain, from the memory circuitry, reference positions of the plurality of electronic reference devices. The processor circuitry is configured to determine, based on the associations, one or more candidate positions of the electronic device. The processor circuitry is configured to determine, based on the distances, the one or more candidate positions, and the obtained reference positions, a position of the electronic device.
Type: Grant
Filed: June 3, 2021
Date of Patent: February 20, 2024
Assignee: Sony Group Corporation
Inventor: Peter Ljung
-
Patent number: 11900921
Abstract: Techniques for partially processing an input on a device and completing processing at a remote system are provided. The device may process an input using an on-device machine learning (ML) model, and determine to cease processing at an intermediary node of the ML model based on the output of the intermediary node. Based on the output of the intermediary node satisfying a condition, the device may use the output of the intermediary node to generate an output responsive to the input. Conversely, if the output of the intermediary node does not satisfy a condition, the device may send the output of the intermediary node to the remote system, so the remote system can use another machine learning model to complete processing with respect to the input.
Type: Grant
Filed: October 26, 2020
Date of Patent: February 13, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Rahul Gupta, Christophe Dupuy, Jacob Ryan Stolee, Clement Chung
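The control flow in this abstract (stop at an intermediary node, answer locally if its output satisfies a condition, otherwise hand that output to a remote model) can be sketched in Python. The sketch makes several assumptions for illustration: layers are plain callables on lists of floats, the condition is a caller-supplied predicate, and `remote_finish` stands in for the remote system. None of these names come from the patent.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Layer = Callable[[Vector], Vector]


def run_with_early_exit(
    features: Vector,
    local_layers: Sequence[Layer],
    exit_index: int,
    satisfies_condition: Callable[[Vector], bool],
    remote_finish: Callable[[Vector], Vector],
) -> Tuple[str, Vector]:
    """Run the on-device layers up to exit_index, then decide where to finish."""
    hidden = features
    for layer in local_layers[: exit_index + 1]:
        hidden = layer(hidden)
    if satisfies_condition(hidden):
        # The intermediate output is good enough to answer on the device.
        return "local", hidden
    # Otherwise only the intermediate output is sent to the remote model.
    return "remote", remote_finish(hidden)


# Toy stand-ins for real model layers and for the remote model.
layers = [lambda x: [v * 2 for v in x], lambda x: [v + 1 for v in x]]
is_confident = lambda h: max(h) >= 5
finish_remotely = lambda h: [v * 10 for v in h]
```

With these toy layers, the input `[1.0, 2.0]` produces an intermediate output that meets the threshold and is handled locally, while `[0.0, 1.0]` does not and is passed to `finish_remotely`.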
-
Patent number: 11893308
Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, a NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD foregoes selection of the cloud-based VAS when the local voice assistant is selected.
Type: Grant
Filed: March 28, 2022
Date of Patent: February 6, 2024
Assignee: Sonos, Inc.
Inventors: Dayn Wilberding, John Tolomei
-
Patent number: 11869487
Abstract: Speech processing tasks may be allocated at least partly to a local device (e.g., user computing device that receives spoken words) and at least partly to a remote device to determine one or more user commands or tasks to be performed by the local device. The remote device may be used to process speech that the local device could not process or understand, or for other reasons, such as for error checking. The local device may then execute or begin to execute locally determined tasks to reduce user-perceived latency. Meanwhile, the entire media input, or a portion thereof, may be sent to the remote device to process speech, verify the tasks and/or identify other user commands in the media input (or portion thereof).
Type: Grant
Filed: August 16, 2019
Date of Patent: January 9, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Sanjoy Ghosh, Pieter Sierd van der Meulen
-
Patent number: 11869503
Abstract: Example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting of the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.
Type: Grant
Filed: December 13, 2021
Date of Patent: January 9, 2024
Assignee: Sonos, Inc.
Inventor: Connor Smith
-
Patent number: 11863646
Abstract: Disclosed is technology for a computer-based “Daily Brief” service, which includes methods and corresponding systems for proactively providing push notifications for users of chat information systems. The push notifications are dynamically generated and presented to the user based on identification of one or more triggering events, which may include predetermined time/date, current geographical location, activity of peers and friends in social media associated with the user, scheduled events, appointments, meetings, emails, instant messages, and many more. The described technology improves the interaction interface between the user and chat information system.
Type: Grant
Filed: February 3, 2023
Date of Patent: January 2, 2024
Assignee: GOOGLE LLC
Inventors: Ilya Gennadyevich Gelfenbeyn, Artem Goncharuk, Ilya Andreevich Platonov, Pavel Aleksandrovich Sirotin, Olga Aleksandrovna Gelfenbeyn
-
Patent number: 11830490
Abstract: Disambiguating question answering responses by receiving voice command data associated with a first user, determining a first user identity according to the first user voice command data, determining a first user activity context according to the first user voice command data, determining a first response for the first user, receiving voice command data associated with a second user, determining a second user identity according to the second user voice command data, determining a second user activity context according to the second user voice command data, determining a second response for the second user, determining a predicted ambiguity between the first response and the second response, altering the first response according to the predicted ambiguity, and providing the first response and the second response.
Type: Grant
Filed: August 11, 2021
Date of Patent: November 28, 2023
Assignee: International Business Machines Corporation
Inventors: Venkata Vara Prasad Karri, Sarbajit K. Rakshit, Sri Harsha Varada, Sampath Kumar Pulupula Venkata
-
Patent number: 11824894
Abstract: Embodiments of the invention are directed to techniques that include receiving a query intended for a targeted database and determining that the query is from an unauthorized user. A response is returned to the unauthorized user generated by a model, the response being dynamically generated to fulfill the query. The model is configured to generate responses consistent with any previous responses returned to the unauthorized user.
Type: Grant
Filed: November 25, 2020
Date of Patent: November 21, 2023
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Marco Simioni, Stefano Braghin, Killian Levacher
-
Patent number: 11803431
Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.
Type: Grant
Filed: March 14, 2022
Date of Patent: October 31, 2023
Assignee: VMware, Inc.
Inventors: Rusko Atanasov, Kalin Tsvetkov, Viktoriya Bambaldokova
-
Patent number: 11790902
Abstract: A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing system may then process the response data to determine second response data that includes an indication of the second speech-processing system and cause output of audio corresponding to the second response data.
Type: Grant
Filed: February 4, 2020
Date of Patent: October 17, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Timothy Whalin, Catherine Michelle Loo, Calvin Phuong Nguyen
-
Patent number: 11790893
Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors extracted from first and second utterances, that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.
Type: Grant
Filed: September 30, 2020
Date of Patent: October 17, 2023
Assignee: LG ELECTRONICS INC.
Inventors: Kwangyong Lee, Hyun Yu, Byeongha Kim, Yejin Kim
-
Patent number: 11769012
Abstract: A system and method for updating computerized language models is provided that automatically adds or deletes terms from the language model to capture trending events or products, while maximizing computer efficiencies by deleting terms that are no longer trending and use of knowledge bases, machine learning model training and evaluation corpora, analysis tools and databases.
Type: Grant
Filed: March 25, 2020
Date of Patent: September 26, 2023
Assignee: Verint Americas Inc.
Inventors: Ian Roy Beaver, Christopher James Jeffs
-
Patent number: 11763404
Abstract: Systems, methods, and apparatuses for implementing a geo-demographic zoning optimization engine are disclosed.
Type: Grant
Filed: June 15, 2021
Date of Patent: September 19, 2023
Assignee: Arizona Board of Regents on behalf of Arizona State University
Inventors: Jon J. Miller, Vikash Bajaj, Srinivasa Srivatsav Kandala, Fangwu Wei, Michael Kuby, Wangshu Mu, Daoqin Tong
-
Patent number: 11741385
Abstract: To simplify assisting a user in their day-to-day activities, a communication for performing an action may be sent to a user in the form of a query, where the query includes the most likely set of choices for the action arranged in a group of dichotomous (e.g., yes/no) or multiple choice answers. In this manner, a user may respond to the query by simply selecting one of the dichotomous or multiple choice answers. Historical logs of past actions, responses, queries, and so forth, may be used to predict future user actions or needs, and to formulate future queries for sending to the user. These techniques may be implemented, for example, through a remote coordination server or directly through a user's personal electronics device.
Type: Grant
Filed: July 28, 2022
Date of Patent: August 29, 2023
Assignee: Telepathy Labs, Inc
Inventors: Damien Phelan Stolarz, David Joseph Diaz, James Rossfeld, Scott Raven, Christopher O'Malley, Christopher Kurpinski
-
Patent number: 11735185
Abstract: The present invention provides a caption service system for remote speech recognition, which provides caption service for the hearing impaired. This system includes a speaker and live broadcast equipment at A, a listener-typist and a computer at B, a hearing-impaired person and a live screen at C, and an automatic speech recognition (ASR) caption server at D. The live broadcast equipment, the computer, the live screen and the ASR caption server are connected by a network. The speaker's audio is sent to the ASR caption server to be converted into text, which is corrected by the listener-typist, and then the text caption is sent to the live screen of the hearing-impaired person together with the speaker's video and audio, so that the hearing-impaired person can see the text caption of what the speaker said.
Type: Grant
Filed: August 19, 2021
Date of Patent: August 22, 2023
Assignee: NATIONAL YANG MING CHIAO TUNG UNIVERSITY
Inventors: Sin Horng Chen, Yuan Fu Liao, Yih Ru Wang, Shaw Hwa Hwang, Bing Chih Yao, Cheng Yu Yeh, You Shuo Chen, Yao Hsing Chung, Yen Chun Huang, Chi Jung Huang, Li Te Shen, Ning Yun Ku
-
Patent number: 11729596
Abstract: A communication device and method can include one or more processors operatively coupled to memory, a sensor and an output device, where the one or more processors perform operations of identifying target person locations using internet searching and short range communication enabled devices such as Bluetooth LE devices.
Type: Grant
Filed: December 2, 2022
Date of Patent: August 15, 2023
Assignee: Staton Techiya LLC
Inventor: Steven Wayne Goldstein
-
Patent number: 11721347
Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.
Type: Grant
Filed: June 29, 2021
Date of Patent: August 8, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow
-
Patent number: 11710488
Abstract: A method may include obtaining audio data originating at a first device during a communication session between the first device and a second device and providing the audio data to a first speech recognition system to generate a first transcript based on the audio data and directing the first transcript to the second device. The method may also include in response to obtaining a quality indication regarding a quality of the first transcript, multiplexing the audio data to provide the audio data to a second speech recognition system to generate a second transcript based on the audio data while continuing to provide the audio data to the first speech recognition system and direct the first transcript to the second device, and in response to obtaining a transfer indication that occurs after multiplexing of the audio data, directing the second transcript to the second device instead of the first transcript.
Type: Grant
Filed: December 19, 2018
Date of Patent: July 25, 2023
Assignee: Sorenson IP Holdings, LLC
Inventors: Kenneth Boehme, Michael Holm, Shane Roylance
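The hand-off described in the abstract above can be pictured as a small state machine. The following is only an illustrative sketch, not the patented implementation: the class name `TranscriptRouter`, its method names, and the choice to stop feeding the first recognizer after the transfer are all assumptions made for the example, and the recognizers are stand-in callables.

```python
from typing import Callable

# A recognizer is modeled as any callable turning audio bytes into text (assumption).
Recognizer = Callable[[bytes], str]


class TranscriptRouter:
    """Chooses which recognizer's transcript is directed to the second device."""

    def __init__(self, first: Recognizer, second: Recognizer) -> None:
        self.first = first
        self.second = second
        self.multiplexing = False  # audio is also sent to the second recognizer
        self.transferred = False   # second transcript has replaced the first

    def on_quality_indication(self) -> None:
        # A quality indication about the first transcript starts multiplexing.
        self.multiplexing = True

    def on_transfer_indication(self) -> None:
        # A transfer indication is honored only after multiplexing has begun.
        if self.multiplexing:
            self.transferred = True

    def process(self, audio: bytes) -> str:
        """Return the transcript that should be sent for this audio chunk."""
        if self.transferred:
            # Assumption: after the transfer only the second recognizer is used.
            return self.second(audio)
        first_text = self.first(audio)
        if self.multiplexing:
            # Second recognizer runs in parallel; its output is not delivered yet.
            self.second(audio)
        return first_text
```

In this sketch a transfer indication that arrives before any quality indication is ignored, which mirrors the ordering in the abstract (the transfer indication "occurs after multiplexing").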
-
Patent number: 11700484
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.
Type: Grant
Filed: February 10, 2022
Date of Patent: July 11, 2023
Assignee: QUALCOMM Incorporated
Inventors: Lae-Hoon Kim, Sunkuk Moon, Erik Visser, Prajakt Kulkarni
-
Patent number: 11695836
Abstract: A computer program and the like are provided that are capable of causing an information processing device connected to a private network, to automatically execute operation processing of a browser. The computer program is a computer program for causing the information processing device connected to the private network, to automatically execute the operation of the browser that accesses a web server on the private network, based on an instruction from a server connected to a global network, and causes the information processing device to execute the processing of: requesting the server to establish a connection; obtaining an operation instruction related to the operation processing which is push-transmitted from the server, by using the connection; executing the operation processing of the browser based on the obtained operation instruction; obtaining an execution result of the operation processing; and outputting the obtained execution result to the server.
Type: Grant
Filed: June 18, 2020
Date of Patent: July 4, 2023
Assignee: C-RISE Ltd.
Inventors: Masanori Murai, Yutaka Mitsubayashi
-
Patent number: 11683320
Abstract: The present disclosure is generally directed to a data processing system for customizing content in a voice activated computer network environment. With user consent, the data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, increasing the accuracy of the voice identification process used in the generation of customized content. The present solution can make accurate identifications while generating fewer audio identification models, which are computationally intensive to generate.
Type: Grant
Filed: April 22, 2021
Date of Patent: June 20, 2023
Assignee: GOOGLE LLC
Inventors: Victor Carbune, Thomas Deselaers, Sandro Feuz
-
Patent number: 11669697
Abstract: A method for providing responsive actions to user inputs in a multi-domain context includes receiving, by a speech-based user interface, a first speech input from a user and converting said first speech input into a text-based representation of the first speech input. A natural language processor processes the text-based representation to determine an intent, entity and internal state of the first speech input. The method further includes determining, by a model-based module based on the intent, entity and internal state, a first data processing policy to apply to the first speech input, wherein the first data processing policy is either a rules-based data processing policy applied by a rules-based module or a statistical model-based data processing policy applied by the model-based module. The first responsive action is generated by the determined first data processing module, and outputted via the speech-based user interface and/or a machine interface.
Type: Grant
Filed: October 23, 2019
Date of Patent: June 6, 2023
Assignee: Bayerische Motoren Werke Aktiengesellschaft
Inventors: Wangsu Hu, Jilei Tian
-
Patent number: 11650983
Abstract: A method is provided for generating a classification model configured to select an optimal execution combination for query processing. The method provides, to a processor, training queries and different execution combinations for executing the training queries. Each different execution combination involves a respective different query engine and a respective different runtime. The method extracts, from a set of Directed Acyclic Graphs (DAGs) using a set of Cost-Based Optimizers (CBOs), a set of feature vectors for each of the plurality of training queries. The method adds, by the processor to each of merged feature vectors a respective label indicative of the optimal execution combination based on actual respective execution times of the plurality of different execution combinations, to obtain a set of labels. The method trains, by the processor, the classification model by learning the set of merged feature vectors with the set of labels.
Type: Grant
Filed: December 22, 2020
Date of Patent: May 16, 2023
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Tatsuhiro Chiba
-
Patent number: 11646023
Abstract: Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.
Type: Grant
Filed: December 14, 2020
Date of Patent: May 9, 2023
Assignee: Sonos, Inc.
Inventors: Connor Kristopher Smith, John Tolomei, Betty Lee
-
Patent number: 11594213
Abstract: Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term is associated with a respective part of speech, and a frequency of occurrence of each term in content metadata is determined. A relevance of each term is then determined based on its respective part of speech and frequency. The natural language search query is then interpreted based on the importance or relevance of each term.
Type: Grant
Filed: March 3, 2020
Date of Patent: February 28, 2023
Assignee: ROVI GUIDES, INC.
Inventors: Jeffry Copps Robert Jose, Ajay Kumar Mishra
-
Patent number: 11588799
Abstract: A moving object control system includes a server, a portable terminal configured to transmit authentication information issued by the server, and a controller provided in a moving object and configured to authenticate the portable terminal according to the authentication information transmitted from the portable terminal, and when the portable terminal is authenticated, control the moving object in response to an operation signal from the portable terminal. The controller is configured to perform information communication with the server and control the moving object according to control information received from the server.
Type: Grant
Filed: July 29, 2019
Date of Patent: February 21, 2023
Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
Inventor: Yasuhisa Fujiwara
-
Patent number: 11587571
Abstract: An electronic apparatus includes: at least one processor configured to: receive audio of a voice input of a user; obtain, from a plurality of voice recognizers capable of recognizing the voice input, a plurality of recognition results of the received audio; and perform an operation based on a recognition result of which recognition suitability for the voice input is identified to be high, among the plurality of recognition results.
Type: Grant
Filed: September 2, 2020
Date of Patent: February 21, 2023
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Chanhee Choi
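As a rough illustration of the selection step in the abstract above, the sketch below runs several recognizers on the same audio and keeps the result with the highest suitability. The abstract does not say how suitability is computed; here it is assumed that each recognizer reports a numeric score, and the names `RecognitionResult` and `select_result` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class RecognitionResult:
    recognizer: str      # which recognizer produced the result
    text: str            # recognized text
    suitability: float   # assumed score; higher means more suitable


# A recognizer is modeled as a callable from audio bytes to a result (assumption).
Recognizer = Callable[[bytes], RecognitionResult]


def select_result(audio: bytes, recognizers: Sequence[Recognizer]) -> RecognitionResult:
    """Run every recognizer on the audio and return the most suitable result."""
    results = [recognize(audio) for recognize in recognizers]
    if not results:
        raise ValueError("at least one recognizer is required")
    return max(results, key=lambda result: result.suitability)
```

The apparatus would then perform its operation based on the returned result's text.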
-
Patent number: 11550846
Abstract: Methods, apparatus, systems, and computer-readable media are provided for transferring dialog sessions between devices using deep links. The dialog sessions can correspond to interactions, mediated by an automated assistant, between a user and a third party application. During the dialog session, a user can request that the dialog session be transferred to a different device, for example, to interact with the third party application through a different modality. In response, the automated assistant and/or the third party application can generate a link that can be transferred to the transferee device to allow the transferee device to seamlessly take over the dialog session. In this way, computational resources and electrical power can be preserved by not requiring a recipient device to re-process natural language inputs previously provided during the dialog session.
Type: Grant
Filed: May 17, 2021
Date of Patent: January 10, 2023
Assignee: GOOGLE LLC
Inventors: Justin Lewis, Scott Davies
-
Patent number: 11538476
Abstract: A terminal device is provided and includes a communication interface including circuitry, a display and at least one processor configured to control the communication interface to transmit a user voice including a plurality of intents to an external server, based on word use information included in the user voice and summary information regarding the user voice generated based on user-related information being received from the external server, control the display to display the received summary information, based on a user feedback regarding the summary information being input, transmit information regarding the user feedback to the external server, and based on response information regarding the user voice generated based on the user feedback being received from the external server, control the display to provide the response information.
Type: Grant
Filed: November 24, 2020
Date of Patent: December 27, 2022
Assignee: Samsung Electronics Co., Ltd.
Inventors: Sanghyuk Yoon, Heejun Song, Heejae Yu
-
Patent number: 11538458
Abstract: Disclosed is an electronic apparatus capable of controlling voice recognition. The electronic apparatus increases a score of a category corresponding to a word included in user's utterance in a database when the instruction included in the user's utterance is present in the database. The electronic apparatus checks whether the score of the category corresponding to the word is equal to or greater than a preset value when the instruction is not present in the database. The electronic apparatus registers the instruction in the database so that the instruction is included in the category corresponding to the word when the score is equal to or greater than the preset value as the check result.
Type: Grant
Filed: September 16, 2020
Date of Patent: December 27, 2022
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Heejae Kim
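The scoring rule in the abstract above can be sketched in a few lines. This is a minimal in-memory illustration under stated assumptions, not the apparatus's actual database: the class name `InstructionDatabase`, the word-to-category mapping, and the return labels are all invented for the example.

```python
from collections import defaultdict


class InstructionDatabase:
    """Toy model of category scores and per-category instruction sets."""

    def __init__(self, word_categories: dict, preset_value: int) -> None:
        self.word_categories = dict(word_categories)  # word -> category (assumed given)
        self.preset_value = preset_value              # registration threshold
        self.scores = defaultdict(int)                # category -> score
        self.instructions = defaultdict(set)          # category -> known instructions

    def has_instruction(self, instruction: str) -> bool:
        return any(instruction in known for known in self.instructions.values())

    def observe(self, word: str, instruction: str) -> str:
        """Process one utterance, reduced here to a (word, instruction) pair."""
        category = self.word_categories.get(word)
        if category is None:
            return "ignored"
        if self.has_instruction(instruction):
            # Known instruction: raise the score of the word's category.
            self.scores[category] += 1
            return "known"
        if self.scores[category] >= self.preset_value:
            # Unknown instruction, but the category is well established: register it.
            self.instructions[category].add(instruction)
            return "registered"
        return "ignored"
```

With a threshold of 2, an unknown instruction such as "start" used with a music-category word would be registered only after that category has been reinforced twice by known instructions.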
-
Patent number: 11501879
Abstract: Techniques for voice control of a patient care device are described. A patient care device receives an audio request from a user. The patient care device records the audio request. The patient care device transmits the audio request over a communication network to a speech recognition service, and in response receives, from the speech recognition service, a textual representation of the audio request. The patient care device matches the textual representation, using the computer processor, to a first command in a vocabulary of available commands, and in response performs the first command.
Type: Grant
Filed: October 1, 2019
Date of Patent: November 15, 2022
Assignee: PREVENTICE TECHNOLOGIES, INC.
Inventors: Richard M. Smith, Scott J. Burrichter, Jon P. Otterstatter
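The final matching step in the abstract above, mapping returned text onto a fixed command vocabulary, might look like the sketch below. The abstract does not specify the matching method; fuzzy string matching via the standard-library `difflib.get_close_matches` is an assumption chosen for illustration, and the vocabulary entries and the `match_command` helper are hypothetical.

```python
import difflib
import re
from typing import Optional

# Hypothetical vocabulary of available commands (not taken from the patent).
VOCABULARY = ["start recording", "stop recording", "show battery level"]


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]+", "", text.lower())
    return " ".join(cleaned.split())


def match_command(transcript: str, cutoff: float = 0.8) -> Optional[str]:
    """Return the closest vocabulary command, or None if nothing is close enough."""
    matches = difflib.get_close_matches(
        normalize(transcript), VOCABULARY, n=1, cutoff=cutoff
    )
    return matches[0] if matches else None
```

A fairly strict cutoff is used so that unrelated speech returns `None` rather than triggering a command, which seems the safer default for a patient care device.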