Application Patents (Class 704/270)
-
Patent number: 11556696Abstract: Systems and methods include receiving, with a processor, two or more messages from a first user device participating in a communication session, processing, with the processor, the two or more messages, generating, with the processor, a processed message, and displaying, with the processor, the processed message on a second user device participating in the communication session.Type: GrantFiled: March 15, 2021Date of Patent: January 17, 2023Assignee: Avaya Management L.P.Inventors: Sandesh Chopdekar, Pushkar Deole, Navin Daga
-
Patent number: 11558663Abstract: Methods, apparatus, systems and articles of manufacture are disclosed. An example apparatus includes a controller to cause a people meter to emit a prompt for input of audience identification information at a first time and determine a first audience count based on the input, an audio detector to determine a second audience count based on signatures generated from audio data captured in the media environment, and a comparator to cause the people meter to not emit the prompt for at least a first time period after the first time when the first audience count is equal to the second audience count.Type: GrantFiled: August 20, 2020Date of Patent: January 17, 2023Assignee: THE NIELSEN COMPANY (US), LLCInventors: John T. LiVoti, Stanley Wellington Woodruff, Rajakumar Madhanganesh, Khushboo Agarwal
-
Patent number: 11551663Abstract: A natural language processing system may use system response configuration data to determine customized output data forms when outputting data for a user. The system response configuration data may represent various output attributes the system may use when creating output data. The system may have a certain number of existing profiles where a profile is associated with certain settings for the system response configuration data/attributes. The system may also use various data such as context data, sentiment data, or the like to customize system response configuration data during a dialog. Other components, such as natural language generation (NLG), text-to-speech (TTS), or the like, may use the customized system response configuration data to determine the form, timing, etc. of output data to be presented to a user.Type: GrantFiled: December 10, 2020Date of Patent: January 10, 2023Assignee: Amazon Technologies, Inc.Inventors: Anthony Bissell, Janet Slifka
-
Patent number: 11544685Abstract: A multimedia keepsake is created containing multimedia content created by a customer and stored online as content information. After the customer selects the type of keepsake, the content information is converted to keepsake information having a format appropriate for storage in the selected type of keepsake. The keepsake information is stored online so as to be accessible via an access code, and it is downloaded to a vendor providing the access code.Type: GrantFiled: August 12, 2014Date of Patent: January 3, 2023Inventor: Geoffrey S. Stern
-
Patent number: 11532007Abstract: A system and method are provided for employing voice-activated user interfaces to determine user attention to particularly-presented advertising content by collecting user contact/consumer information, presenting content to the user/consumer, and proposing at least one question, inquiry or query to the user regarding the presented content, the at least one inquiry or query calling for a user/consumer response to be collected, at least one of (a) the user/consumer contact information and (b) the user/consumer response to the question, inquiry or query being collected by the system via a voice-activated user interface and evaluated to assess a level of engagement of the user/consumer with the advertising content. The disclosed systems and methods uniquely provide voice-activated user interface coupled with display of certain advertising content in a manner that promotes user/consumer attention to the advertising content and ease of interaction with the presentation system.Type: GrantFiled: August 16, 2019Date of Patent: December 20, 2022Inventor: Frank S. Maggio
-
Patent number: 11521114Abstract: This document relates to creating and/or updating a chatbot using a graphical user interface. For example, training dialogs for a chatbot can be displayed in a tree form on a graphical user interface. Based at least on interactions between a developer and the graphical user interface, the training dialogs can be modified in the tree form, and training dialogs can be updated based on the modifications provided on the tree form via the graphical user interface.Type: GrantFiled: April 18, 2019Date of Patent: December 6, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Lars H. Liden, Swadheen K. Shukla, Shahin Shayandeh, Matthew D. Mazzola
-
Patent number: 11521618Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.Type: GrantFiled: December 17, 2019Date of Patent: December 6, 2022Assignee: GOOGLE LLCInventors: Victor Carbune, Pedro Gonnet Anders, Thomas Deselaers, Sandro Feuz
-
Patent number: 11516346Abstract: A three-way calling terminal for a mobile human-machine coordination calling robot. Technical solutions include: a first speech interface, configured to transfer call audio between a call object and a back-end processing module; a CODEC1 module, configured to encode and decode the call audio between the call object and the back-end processing module; a second speech interface, configured to transfer call audio between the human agent and the call object; a CODEC2 module, configured to encode and decode the call audio between the human agent and the call object; a call control module, configured to process a control signal, and automatically make, answer, and hang up a call; a data processing submodule, configured to process speech data and perform data transfer between the data processing submodule and the back-end processing module; and a networking submodule, configured to be connected to the back-end processing module.Type: GrantFiled: July 8, 2021Date of Patent: November 29, 2022Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.Inventor: Huapeng Sima
-
Patent number: 11514663Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a face matching unit that matches, against face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.Type: GrantFiled: February 13, 2019Date of Patent: November 29, 2022Assignee: NEC CORPORATIONInventors: Nobuaki Kawase, Makoto Igarashi
-
Patent number: 11508366Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.Type: GrantFiled: June 15, 2018Date of Patent: November 22, 2022Assignee: IFLYTEK CO., LTD.Inventors: Jia Pan, Cong Liu, Haikun Wang, Zhiguo Wang, Guoping Hu
-
Patent number: 11508357Abstract: An extended role play-based utterance set generation apparatus includes a first data store storing role play-based utterance sets and a second data store storing non-role-played utterance sets. The role play-based utterance sets include a first query and a role play-based response to the query. The non-role-played utterance sets include a second query and a non-role-played response to the query. The disclosed technology determines similarity between the role play-based response and the non-role-played response. Upon determining that the role play-based response is the same or similar to the non-role-played response, the disclosed technology generates an association between the role play-based response and the second query and extends the role play-based utterance sets in the first data store with the second query.Type: GrantFiled: April 5, 2019Date of Patent: November 22, 2022Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Masahiro Mizukami, Ryuichiro Higashinaka
-
Patent number: 11501755Abstract: Provided are an electronic device and method for providing a voice assistant service. The method, performed by the electronic device, of providing the voice assistant service includes: obtaining a voice of a user; obtaining voice analysis information of the voice of the user by inputting the voice of the user to a natural language understanding model; determining whether a response operation with respect to the voice of the user is performable, according to a preset criterion, based on the obtained voice analysis information; and based on the determining that the response operation is not performable, outputting a series of guide messages for learning the response operation related to the voice of the user.Type: GrantFiled: September 1, 2020Date of Patent: November 15, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Inchul Hwang
-
Patent number: 11483425Abstract: A communication system, method and communication terminal are configured to facilitate private outputting of content of a message or communication session. A communication terminal can be configured via data included in a message or via a privacy setting to output content of data from a communication session or message in accordance with a pre-selected privacy setting or one or more privacy rules. For instance, a communication terminal may be configured to suppress a text to speech function for certain text messages, email messages, instant messages, or social networking messages that it receives having the privacy parameter set therein. As another example, a user may set the privacy parameter in his or her terminal so that any such message received by that terminal is output in accordance with the privacy setting or rules. A detection of nearby people can affect how certain content may be output via a terminal.Type: GrantFiled: October 23, 2018Date of Patent: October 25, 2022Assignee: RINGCENTRAL, INC.Inventors: Christian Garbin, Johannes Ruetschi
-
Patent number: 11475880Abstract: A method includes receiving audio data of an utterance and processing the audio data to obtain, as output from a speech recognition model configured to jointly perform speech decoding and endpointing of utterances: partial speech recognition results for the utterance; and an endpoint indication indicating when the utterance has ended. While processing the audio data, the method also includes detecting, based on the endpoint indication, the end of the utterance. In response to detecting the end of the utterance, the method also includes terminating the processing of any subsequent audio data received after the end of the utterance was detected.Type: GrantFiled: March 4, 2020Date of Patent: October 18, 2022Assignee: Google LLCInventors: Shuo-yiin Chang, Rohit Prakash Prabhavalkar, Gabor Simko, Tara N. Sainath, Bo Li, Yangzhang He
-
Patent number: 11468884Abstract: An information processing apparatus includes a voice acquisition section, a reliability generation section, and a processing execution section. The voice acquisition section acquires an ambient voice. The reliability generation section generates reliability indicating a degree in which the acquired voice is uttered from the particular position on the basis of a predetermined transfer characteristic. As the predetermined transfer characteristic, a phase difference or acoustic characteristic of the voice can be assumed. The processing execution section executes a process according to the generated reliability. As the process according to the reliability, a notification according to the reliability or a predetermined command can be assumed to be executed.Type: GrantFiled: March 13, 2018Date of Patent: October 11, 2022Assignee: Sony CorporationInventors: Ryosuke Sawata, Yuichiro Koyama
-
Patent number: 11470027Abstract: A method, an apparatus, an electronic device and a storage medium for broadcasting a voice are provided. The method may include: sending a voice broadcast request to a server, where the voice broadcast request includes at least one of scenario information, user information or voice packet setting information; receiving a voice broadcast instruction corresponding to the voice broadcast request returned by the server; and acquiring a personalized voice packet corresponding to the voice broadcast instruction in a local database and broadcasting the personalized voice packet.Type: GrantFiled: March 25, 2021Date of Patent: October 11, 2022Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.Inventors: Shiqiang Ding, Jinyi Lei
-
Patent number: 11470419Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.Type: GrantFiled: August 29, 2019Date of Patent: October 11, 2022Assignee: Google LLCInventors: Rajeev Conrad Nongpiur, Ananya Misra, Chanwoo Kim
-
Patent number: 11455907Abstract: A computer-implemented method includes recognizing, by a computer device, a word as a new learned word for a user; registering, by the computer device, the new leaned word in a user's new learned word list as a registered new learned word; associating, by the computer device, the registered new learned word with related known words in a user's known word library, the known word library including words known to the user; tracking, by the computer device, uses of the related known words by the user; identifying, by the computer device, a used sentence used by the user that contains one of the related known words; and suggesting, by the computer device, to the user a new sentence that replaces the one of the related known words in the used sentence with the new learned word.Type: GrantFiled: November 27, 2018Date of Patent: September 27, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Su Liu, Manjunath Ravi, Zhichao Li, Kai Liu
-
Patent number: 11456005Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.Type: GrantFiled: November 21, 2018Date of Patent: September 27, 2022Assignee: Google LLCInventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
-
Patent number: 11443745Abstract: Included are: an apparatus function information acquiring unit for acquiring apparatus function information in which a target apparatus and one or more target functions to be executed by the target apparatus, which are determined on the basis of uttered speech, are associated with each other; a procedure determining unit for determining one or more manual operations for executing the one or more target functions and an order of the one or more manual operations on the basis of the apparatus function information acquired by the apparatus function information acquiring unit; and an operation command transmission controlling unit for sequentially transmitting, to the target apparatus, operation commands for outputting operation response output control information corresponding to each of the one or more manual operations in accordance with the order of the one or more manual operations determined by the procedure determining unit.Type: GrantFiled: October 21, 2020Date of Patent: September 13, 2022Assignee: MITSUBISHI ELECTRIC CORPORATIONInventors: Masato Hirai, Kenshiro Kitamura, Miho Ishikawa, Daisuke Iizawa
-
Patent number: 11443759Abstract: An information processing apparatus includes a memory storing instructions. The instructions cause the apparatus to extract a plurality of local features from data indicating a speech, the characteristics of feature extraction being formed through learning; and to encode a series of chronological features of the data based on the plurality of local features, characteristics of encoding the series of chronological features being formed through learning. The instructions also cause the apparatus to generate information obtained by weighting features at a specific point in time associated with emotion classification, of the series of chronological features encoded, characteristics of weighting the features at the specific point in time being formed through learning; and to classify emotion corresponding to the data using the information obtained by weighting the features at the specific point in time, characteristics of classification being formed through learning.Type: GrantFiled: July 27, 2020Date of Patent: September 13, 2022Assignee: HONDA MOTOR CO., LTD.Inventor: Yuanchao Li
-
Patent number: 11443730Abstract: A system, a computer program product, and method for controlling synthesized speech output on a voice-controlled device. A sensor is used to capture an image of a face of a person. A database of previously stored images of facial features is accessed. In response to i) not recognizing the at least one person the voice-controlled device selects a first set of conversational starters; ii) recognizing the person and recognizing previous communications with the person, the voice-controlled device selects a second set of conversational starters; iii) recognizing the person and not recognizing previous communications with the person, the voice-controlled device selects a third set of conversational starters; or iv) recognizing the at least one person and recognizing previous communications with the person selecting but do not know the person's name selecting a fourth set of conversational starters. The voice controlled device outputs the selected set of conversational starters.Type: GrantFiled: January 28, 2020Date of Patent: September 13, 2022Assignee: International Business Machines CorporationInventors: Shang Qing Guo, Jonathan Lenchner
-
Patent number: 11430014Abstract: Systems and methods of facilitating transactions related to targeted or customized commercial offerings based on derived sentiment states are provided. The sentiment states are derived from digital representations such as images, videos and sound recordings.Type: GrantFiled: October 1, 2020Date of Patent: August 30, 2022Assignee: Nant Holdings IP, LLCInventor: Patrick Soon-Shiong
-
Patent number: 11430207Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a conversation processing unit that acquires reception information including a content of conversation with the user; a face matching unit that matches, against the face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.Type: GrantFiled: June 8, 2017Date of Patent: August 30, 2022Assignee: NEC CORPORATIONInventors: Nobuaki Kawase, Makoto Igarashi
-
Patent number: 11423879Abstract: A technique for controlling a voice-enabled device using voice commands includes receiving an audio signal that is generated in response to a verbal utterance, generating a verbal utterance indicator for the verbal utterance based on the audio signal, selecting a first command for a voice-controlled application residing within the voice-enabled device based on the verbal utterance indicator, and transmitting the first command to the voice-controlled application as an input.Type: GrantFiled: July 18, 2017Date of Patent: August 23, 2022Assignee: Disney Enterprises, Inc.Inventor: William Valentine Zajac, III
-
Patent number: 11423881Abstract: According to an embodiment of the present disclosure, a method of updating a speech recognition model using a mobile agent in real-time comprises obtaining, in real-time, space type information for a particular space where the mobile agent is located, varying, in real-time, parameters of a speech recognition model used in the particular space based on the space type information, and performing a speech recognition service based on the speech recognition model including the varied parameters. Embodiments of the present disclosure may be related to artificial intelligence (AI) devices, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.Type: GrantFiled: September 17, 2019Date of Patent: August 23, 2022Assignee: LG ELECTRONICS INC.Inventor: Jonghoon Chae
-
Patent number: 11422764Abstract: Augmented reality display systems, apparatuses, and methods are disclosed for enabling a wearer of an augmented reality optical display to continue wearing the same optical display while moving between different platforms or vehicles. Example embodiments include optical displays that use a wired connection to connect with each platform to minimize the electromagnetic signature of the system. Embodiments include changing the information displayed to the user depending on the type of vehicle to which the optical display is connected. Additional embodiment display information about weapon systems associated with the platform to which the optical display is connected.Type: GrantFiled: May 31, 2019Date of Patent: August 23, 2022Assignee: EPIC OPTIX, INC.Inventor: Ray Kwong
-
Patent number: 11417302Abstract: Apparatus, methods, and systems that operate to provide interactive streaming content identification and processing are disclosed. An example apparatus includes a classifier to determine an audio characteristic value representative of an audio characteristic in audio; a transition detector to detect a transition between a first category and a second category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the first category and the second category; and a context manager to control a device to switch from a first fingerprinting algorithm to a second fingerprinting algorithm different than the first fingerprinting algorithm, responsive to the detected transition between the first category and the second category.Type: GrantFiled: September 10, 2020Date of Patent: August 16, 2022Assignee: Gracenote, Inc.Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
-
Patent number: 11417329Abstract: A system for performing magnetic resonance tomography is disclosed. A control system creates a speech data stream from an acquired linguistic expression and generates a command library, which contains a selection of speech commands, to each of which one or more linguistic expressions are assigned. The selection of speech commands is loaded from a command database depending on a current system status of a magnetic resonance (MR) scanner. The control system applies a speech recognition algorithm to the speech data stream to determine whether a linguistic expression contained in the command library can be assigned to the speech data stream. If so, the acquired linguistic expression is recognized, a speech command from the command library assigned to the recognized linguistic expression is established, and a control command for controlling the MR scanner in accordance with the speech command is created.Type: GrantFiled: January 30, 2020Date of Patent: August 16, 2022Assignee: Siemens Healthcare GmbHInventors: Rainer Schneider, Dirk Franger
-
Patent number: 11416741Abstract: A technique for constructing a model supporting a plurality of domains is disclosed. In the technique, a plurality of teacher models, each of which is specialized for different one of the plurality of the domains, is prepared. A plurality of training data collections, each of which is collected for different one of the plurality of the domains, is obtained. A plurality of soft label sets is generated by inputting each training data in the plurality of the training data collections into corresponding one of the plurality of the teacher models. A student model is trained using the plurality of the soft label sets.Type: GrantFiled: June 8, 2018Date of Patent: August 16, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Takashi Fukuda, Osamu Ichikawa, Samuel Thomas, Bhuvana Ramabhadran
-
Patent number: 11410401Abstract: The subject technology receives a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect including at least one beautification operation. The subject technology captures image data and depth data using a camera. The subject technology applies, to the image data and the depth data, the 3D effect including the at least one beautification operation based at least in part on the augmented reality content generator, the beautification operation being performed as part of applying the 3D effect. The subject technology generates a 3D message based at least in part on the applied 3D effect including the at least one beautification operation. The subject technology renders a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation.Type: GrantFiled: August 28, 2020Date of Patent: August 9, 2022Assignee: Snap Inc.Inventors: Kyle Goodrich, Samuel Edward Hare, Maxim Maximov Lazarov, Tony Mathew, Andrew James McPhee, Daniel Moreno, Dhritiman Sagar, Wentao Shang
-
Patent number: 11410646Abstract: A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.Type: GrantFiled: March 28, 2019Date of Patent: August 9, 2022Assignee: Amazon Technologies, Inc.Inventors: Cengiz Erbas, Thomas Kollar, Avnish Sikka, Spyridon Matsoukas, Simon Peter Reavely
-
Patent number: 11410144Abstract: Embodiments disclosed herein describe intelligent e-book readers which provide a significant improvement over the conventional e-books that simply render static content. The intelligent e-book readers may customize a rendered e-book based on, for example, the reading level and preferences of the user, the user's social media profile and activity, and current events. Furthermore, the intelligent e-book reader may provide additional augmented reality (AR)/virtual reality (VR) content associated with one or more portions of the rendered e-book. The intelligent e-book reader may also facilitate virtual, real time communication between multiple users and experts. The intelligent e-book reader may also facilitate one or more users to provide feedback and suggestions to authors and future movie-makers. The intelligent e-book reader may automatically determine difficult portions of an e-book based on the virtual communications and/or real time eye-tracking of a user.Type: GrantFiled: January 25, 2021Date of Patent: August 9, 2022Assignee: MASSACHUSETTS MUTUAL LIFE INSURANCE COMPANYInventors: Michal Knas, Payton A. Shubrick, Damon Ryan Depaolo, Jiby John
-
Patent number: 11403598Abstract: The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation.Type: GrantFiled: October 18, 2021Date of Patent: August 2, 2022Assignee: Korn FerryInventors: Thom Steinhoff, Panos S. Stamus, Bryan Ackermann, John Deyto
-
Patent number: 11398225Abstract: A method and apparatus for controlling a device are disclosed. The method includes: performing voice recognition on a received sound signal to obtain a voice recognition result; determining keywords using the voice recognition result; determining a target intelligent device having attribute information matched with the keywords from intelligent devices, where relationships between the intelligent devices and attribute information of the intelligent devices are constructed in advance, and the attribute information characterizes a device operation provided by the intelligent device corresponding to the attribute information; and controlling the target intelligent device to perform an operation indicated by the voice recognition result.Type: GrantFiled: September 25, 2019Date of Patent: July 26, 2022Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventor: Fuxin Li
-
Patent number: 11398222Abstract: Provided is an artificial intelligence (AI) device for recognizing speech of user. The AI apparatus includes: a microphone; and a processor configured to: receive, via the microphone, a sound signal corresponding to speech of the user, recognize the speech from the sound signal using a language model, determine an intention of the user based on the recognition result, determine whether the determination of the intention is successful, obtain a user's application usage log if the determination of the intention is not successful, and update the language model using the obtained user's application usage log.Type: GrantFiled: August 13, 2019Date of Patent: July 26, 2022Assignee: LG ELECTRONICS INC.Inventors: Jaehong Kim, Boseop Kim
-
Patent number: 11393462Abstract: A device with a microphone acquires audio data of a user's speech. That speech comprises utterances, that together comprise a session. The audio data is processed to determine sentiment data indicative of perceived emotional content of the speech as conveyed by individual utterances of the user. That information is then used to determine the emotional content of the session. For example, the information may include several words describing the overall and outlying emotions of the session. Numeric metrics may also be determined, such as activation and valence. A user interface may present the words and metrics to the user. The user may use this information to assess their state of mind, facilitate interactions with others, and so forth.Type: GrantFiled: May 13, 2020Date of Patent: July 19, 2022Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Narendra Gyanchandani, Bilyana Slavova, Daniel Kenneth Bone, Hanhan Wang, Njenga Kariuki
-
Patent number: 11393481Abstract: A method is described which decodes a downmix matrix for mapping a plurality of input channels of audio content to a plurality of output channels, the input and output channels being associated with respective speakers at predetermined positions relative to a listener position, wherein the downmix matrix is encoded by exploiting the symmetry of speaker pairs of the plurality of input channels and the symmetry of speaker pairs of the plurality of output channels. Encoded information representing the encoded downmix matrix is received and decoded for obtaining the decoded downmix matrix.Type: GrantFiled: September 23, 2019Date of Patent: July 19, 2022Inventors: Florin Ghido, Achim Kuntz, Bernhard Grill
-
Patent number: 11386901Abstract: An audio confirmation system includes a voice acquiring section configured to acquire a voice contained in a motion picture; a voice text producing section configured to produce a voice text based on the acquired voice; a determining section configured to determine whether or not the produced voice text and a caption text that is embedded in an image contained in the motion picture correspond to each other; and an outputting section configured to output a result of the determination of the determining section.Type: GrantFiled: March 18, 2020Date of Patent: July 12, 2022Assignee: Sony Interactive Entertainment Inc.Inventors: Masaomi Nishidate, Isamu Terasaka, Norihiro Nagai
-
Patent number: 11386889Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.Type: GrantFiled: November 27, 2019Date of Patent: July 12, 2022Assignee: Google LLCInventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Patent number: 11381670Abstract: An electronic device including circuitry configured to perform control in a manner that a Physical Layer Convergence Protocol (PLCP) header format is selected from a plurality of PLCP header formats; and append the selected PLCP header to a physical layer packet for transmission.Type: GrantFiled: August 17, 2020Date of Patent: July 5, 2022Assignee: SONY CORPORATIONInventors: Takeshi Itagaki, Tomoya Yamaura, Kazuyuki Sakoda, Masanori Sato
-
Patent number: 11373641Abstract: Embodiments of the present invention provide an intelligent interactive method and apparatus, a computer device and a computer readable storage medium, which solves problems that a deep intention of a user message cannot be analyzed in an intelligent interactive manner in the prior art and humanized interactive experiences cannot be provided. The intelligent interactive method includes: obtaining an emotion recognition result according to a user message, where the user message includes at least a user voice message; performing an intention analysis according to a text content of the user voice message to obtain corresponding basic intention information; and determining a corresponding interactive instruction according to the emotion recognition result and the basic intention information.Type: GrantFiled: May 16, 2019Date of Patent: June 28, 2022Assignee: Shanghai Xiaoi Robot Technology Co., Ltd.Inventors: Hui Wang, Shijing Yu, Pinpin Zhu
-
Patent number: 11373651Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script, storing desired attributes associated with the simulation file, retrieving the simulation file from the database and providing a user interface to conduct the voice analysis using the simulation file from the database, receiving one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.Type: GrantFiled: February 21, 2020Date of Patent: June 28, 2022Assignee: SALESBOOST, LLCInventor: Margaret L Brooks
-
Patent number: 11373643Abstract: An output method includes obtaining voice information, determining whether the voice information is a voice request, in response to the voice information being the voice request, obtaining reply information for replying to the voice request, and supplemental information, and transmitting the reply information and the supplementary information to an output device for outputting. The supplemental information is information that needs to be outputted in association with the reply information.Type: GrantFiled: March 28, 2019Date of Patent: June 28, 2022Assignee: LENOVO (BEIJING) CO., LTD.Inventors: Wenlin Yan, Shifeng Peng
-
Patent number: 11366851Abstract: Computer systems and methods are provided for processing audio queries. An electronic device receives an audio clip and performs a matching process on the audio clip. The matching process includes comparing at least a portion of the audio clip to a plurality of reference audio tracks and identifying, based on the comparing, a first portion of a particular reference track that corresponds to the audio sample. Upon identifying the matching portion, the electronic device provides a backing track for playback which corresponds to the particular reference track, and an initial playback position of the backing track.Type: GrantFiled: December 18, 2019Date of Patent: June 21, 2022Assignee: Spotify ABInventors: Marco Marchini, Nicola Montecchio
-
Patent number: 11361764Abstract: Systems and methods for device naming-indicator generation are disclosed. Friendly names for accessory devices, such as smart-home devices, may be utilized to generate formatted text data that includes capitalization and/or punctuation for the friendly names. The formatted text data may be utilized to generate tag data indicating attributes of the friendly name. The tag data and/or contextual data indicating historical usage of the accessory device may be utilized to generate naming indicator(s) for the accessory device. The naming indicator(s) may be utilized, for example, during target inference and/or for communicating with a user about the accessory device.Type: GrantFiled: January 3, 2019Date of Patent: June 14, 2022Assignee: Amazon Technologies, Inc.Inventors: David Y Zhao, Akshay Kumar, William Evan Welbourne
-
Patent number: 11363083Abstract: Methods and apparatus are disclosed for managing streamed audio communication sessions between user devices (50) configured to send streamed data indicative of received audio contributions from respective participants in a multiple-participant audio communication session via a communications network to one or more other user devices (50) for conversion to audio representations of said received audio contributions for other participants.Type: GrantFiled: December 21, 2018Date of Patent: June 14, 2022Assignee: BRITISH TELECOMMUNICATIONS public limited companyInventors: Ian Kegel, Karis Bailey, Martin Reed, Peter Hughes
-
Patent number: 11354089Abstract: A method for generating a user interface with a user interface device in a distributed automation system includes receiving a service message from a home automation device in the distributed automation system, identifying a state of a dialog manager of the user interface device in response to receiving the service message, and generating a natural language output message based at least in part on a device identifier parameter in the service message and a plurality of natural language templates stored in the memory in response to the dialog manager being in an idle state. The method further includes storing the service message in a priority queue in the memory based on a priority level parameter corresponding to the service message in response to the dialog manager being in an active state.Type: GrantFiled: December 9, 2016Date of Patent: June 7, 2022Assignee: Robert Bosch GmbHInventors: Leah Nicolich-Henkin, Cory Henson, Joao P. Sousa
-
Patent number: 11355126Abstract: Provided are methods and systems to verify user identity for voice enabled devices. A voice input can instruct a voice enabled device to perform a plurality of functions/services that, depending on the function/service, may require additional user verification. Primary user verification can be performed by associating voice characteristics of the voice input to a profile associated with a user/user device. A signal (e.g., a BLE beacon) can be sent to the user device that causes the user device to perform secondary user verification. The secondary user verification can be based on a biometric input, passcode verification, authenticated message reply, for example. Based on the secondary user verification, an operational command associated with the voice input can be executed.Type: GrantFiled: January 24, 2018Date of Patent: June 7, 2022Assignee: Comcast Cable Communications, LLCInventor: Franklyn Athias
-
Patent number: 11341970Abstract: A method of providing navigation directions includes receiving, at a user terminal, a query spoken by a user, wherein the query spoken by the user includes a speech utterance indicating (i) a category of business, (ii) a name of the business, and (iii) a location at which or near which the business is disposed; identifying, by processing hardware, the business based on the speech utterance; and providing navigation directions to the business via the user terminal.Type: GrantFiled: June 8, 2020Date of Patent: May 24, 2022Assignee: GOOGLE LLCInventors: Brian Strope, Francoise Beaufays, William J. Byrne