Application Patents (Class 704/270)
-
Patent number: 11626112Abstract: Systems and methods for detecting demographic bias in automatic speech recognition (ASR) systems. Corpuses of transcriptions from different demographic groups are analyzed, where one of the groups is known to be susceptible to bias and another group is known not to be susceptible to bias. ASR accuracy for each group is measured and compared to each other using both statistics-based and practicality-based methodologies to determine whether a given ASR system or model exhibits a meaningful level of bias.Type: GrantFiled: February 5, 2021Date of Patent: April 11, 2023Assignee: Wells Fargo Bank, N.A.Inventors: Yong Yi Bay, Menglin Cao, Yang Yang
-
Patent number: 11626127Abstract: System and methods for processing audio signals are disclosed. In one implementation, a system may comprise a wearable camera configured to capture images from an environment of a user; a microphone; and a processor. The processor may be configured to receive an audio signal representative of sounds captured by the microphone during a time period; and receive the images captured by the wearable camera. The processor may process the audio signal in a first mode based on audio data accumulated in a buffer prior to the time period; detect a change in the active speaker from the first individual to a second individual; and cease processing in the first mode and process the audio signal in a second mode that differs from the first mode.Type: GrantFiled: January 19, 2021Date of Patent: April 11, 2023Assignee: OrCam Technologies Ltd.Inventors: Yonatan Wexler, Amnon Shashua
-
Patent number: 11627203Abstract: Systems and methods are described herein to automate managing of service layer operations comprised of multiple elementary operations and offloading the burden of performing such multi-step operations from a requesting entity to the service layer. A Request Abstraction Service (RAS) is described herein for the autonomous execution of such multi-step operations. Methods and apparatuses are also described herein for a service layer framework for integrating generic and functional user interfaces as services managed by the SL on behalf of requesting entities.Type: GrantFiled: May 7, 2019Date of Patent: April 11, 2023Assignee: Convida Wireless, LLCInventors: Catalina Mihaela Mladin, Dale N. Seed, Quang Ly, William Robert Flynn, IV, Zhuo Chen, Hongkun Li, Lu Liu, Chonggang Wang, Jiwan L. Ninglekhu
-
Patent number: 11620333Abstract: A conversation topic providing method includes: converting voice data, of a conversation of a user who is on a phone, into text; selecting a keyword, indicating an intention of the user, from the text; obtaining information of interest with respect to the keyword; and determining topics relating to the keyword based on user information.Type: GrantFiled: June 29, 2021Date of Patent: April 4, 2023Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Hue-yin Kim, Sang-Il Lee, Sung-kyu Lee, Seong-seol Hong, Jung-hoon Shin, Yeon-woo Lee
-
Patent number: 11615791Abstract: Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.Type: GrantFiled: October 1, 2019Date of Patent: March 28, 2023Assignee: Voicify, LLCInventors: Robert T. Naughton, Nicholas G. Laidlaw, Alexander M. Dunn, Jeffrey K. McMahon
-
Patent number: 11607138Abstract: A method and system for determining a respiratory rate of a user using an electrocardiogram (ECG) segment of the user are disclosed. The method comprises decomposing the ECG segment into a plurality of functions and evaluating the plurality of functions to choose one of the plurality of functions based on a respiratory band power. The method includes determining the respiratory rate using the one of the plurality of functions and a domain detection.Type: GrantFiled: July 19, 2019Date of Patent: March 21, 2023Assignee: Vital Connect, Inc.Inventors: Nandakumar Selvaraj, Ravi Narasimhan
-
Patent number: 11600264Abstract: A prosodic speech recognition engine configured to identify prosodic features and patterns in a speech continuum for the extraction of linguistic content including para-syntactic content, discourse function, information structure, meaning, and speaker sentiment.Type: GrantFiled: November 26, 2018Date of Patent: March 7, 2023Assignee: YEDA RESEARCH AND DEVELOPMENT CO. LTD.Inventors: Elisha Moses, Tirza Biron, Dominik Freche, Daniel Baum, Nadav Matalon, Netanel Ehrmann, Eyal Weinreb
-
Patent number: 11600276Abstract: One embodiment provides a method for predicting a next action in a conversation system that includes obtaining, by a processor, information from conversation logs and a conversation design. The processor further creates a dialog graph based on the conversation design. Weights and attributes for edges in the dialog graph are determined based on the information from the conversation logs and adding user input and external context information to an edge attributes set. An unrecognized user input is analyzed and a next action is predicted based on dialog nodes in the dialog graph and historical paths. A guiding conversation response is generated based on the predicted next action.Type: GrantFiled: January 11, 2021Date of Patent: March 7, 2023Assignee: International Business Machines CorporationInventors: Lei Huang, Robert J. Moore, Guangjie Ren, Shun Jiang
-
Patent number: 11595723Abstract: Methods, apparatus, systems and articles of manufacture are disclosed. An example apparatus includes a controller to cause a people meter to emit a prompt for input of audience identification information at a first time and determine a first audience count based on the input, an audio detector to determine a second audience count based on signatures generated from audio data captured in the media environment, and a comparator to cause the people meter to not emit the prompt for at least a first time period after the first time when the first audience count is equal to the second audience count.Type: GrantFiled: August 20, 2020Date of Patent: February 28, 2023Assignee: THE NIELSEN COMPANY (US), LLCInventors: John T. LiVoti, Stanley Wellington Woodruff, Rajakumar Madhanganesh, Khushboo Agarwal
-
Patent number: 11594147Abstract: An interactive system and method for development of the voice, preferably for singing. The system and methods provide and utilize an animated, interactive, preferably 3D, visual character for illustrating the various human physiological components involved in producing vocals, and how best to strengthen and train such components to prevent injury. The system and methods are designed to visually replicate how the human body, and more specifically the internal organs for voice, interact and synchronize muscular movements that are involved in abdominal support, release of air control, and neural stimulation, in unison with Larynx mobility and gravity.Type: GrantFiled: February 27, 2019Date of Patent: February 28, 2023Assignee: VOIXTEK VR, LLCInventors: Juan Felipe Perez, Ronald Warren Anderson
-
Patent number: 11594242Abstract: A sound pickup transducer array, deployed within an enclosed area, is coupled to a sound recorder. A processor, coupled to the sound recorder, provides a button or speech recognizer through which a person in the enclosed area issues a command signifying the occurrence of a sound for which categorizing is requested. The processor is programmed to respond to the issued command by extracting and storing an audio snippet copied from the audio recorder, in a digital memory, where the snippet corresponds to sound captured before, during and after the issued command. The processor communicates the stored audio snippet to an artificial intelligence system trained to categorize sounds as to what produced them. The artificial intelligence system may employ trained model feature extraction, a neural network categorization system, and/or direction of sound arrival analysis.Type: GrantFiled: May 3, 2021Date of Patent: February 28, 2023Assignee: Gulfstream Aerospace CorporationInventors: Tongan Wang, Scott Bohanan, Jim Jordan
-
Patent number: 11589153Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.Type: GrantFiled: March 15, 2021Date of Patent: February 21, 2023Assignee: Qualcomm IncorporatedInventors: Lae-Hoon Kim, Dongmei Wang, Fatemeh Saki, Taher Shahbazi Mirzahasanloo, Erik Visser, Rogerio Guedes Alves
-
Patent number: 11587559Abstract: Systems and processes for intelligent device identification are provided. In one example process, audio input may be sampled with a microphone at each of two or more of the plurality of electronic devices. A first electronic device of the plurality of electronic devices for determining a task associated with sampled audio input may be identified. The process may determine the task based on the sampled audio input with the first electronic device and identify identifying a second electronic device of the plurality of electronic devices for performing the task. The task be performed with the second electronic device. The second electronic device is not the first electronic device in some examples.Type: GrantFiled: May 2, 2016Date of Patent: February 21, 2023Assignee: Apple Inc.Inventors: Brandon J. Newendorp, Lia T. Napolitano
-
Patent number: 11583998Abstract: Disclosed herein is a robot including an output interface including at least one of a display or a speaker, and a processor configured to acquire output data of a predetermined playback time point of content output via the robot or an external device, recognize a first emotion corresponding to the acquired output data, and control the output interface to output an expression based on the recognized first emotion.Type: GrantFiled: March 17, 2020Date of Patent: February 21, 2023Assignee: LG ELECTRONICS INC.Inventor: Yoonji Moon
-
Patent number: 11580997Abstract: A jitter buffer control for controlling a provision of a decoded audio content on the basis of an input audio content is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner. An audio decoder uses such a jitter buffer control.Type: GrantFiled: June 11, 2020Date of Patent: February 14, 2023Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Stefan Reuschl, Stefan Doehla, Jérémie Lecomte, Manuel Jander
-
Patent number: 11582532Abstract: Methods and systems are described herein for improving audio for hearing impaired content consumers. An example method may comprise determining a content asset. Closed caption data associated with the content asset may be determined. At least a portion of the closed caption data may be determined based on a user setting associated with a hearing impairment. Compensating audio comprising a frequency translation associated with at least the portion of the closed caption data may be generated. The content asset may be caused to be output with audio content comprising the compensating audio and the original audio.Type: GrantFiled: March 12, 2021Date of Patent: February 14, 2023Assignee: Comcast Cable Communications, LLCInventor: Jeff Calkins
-
Patent number: 11580981Abstract: An in-vehicle apparatus is connectable to a device that includes a voice assistant function. The in-vehicle apparatus includes: a voice detector that performs voice recognition of an audio signal input from a microphone and that controls functions of the in-vehicle apparatus based on a result of the voice recognition; and an interface that communicates with the device. When being informed of a detection of a predetermined word in the audio signal as the result of the voice recognition of the audio signal performed by the voice detector, the interface sends to the device, not via the voice detector, the audio signal input from the microphone. The predetermined word is for activating the voice assistant function of the device.Type: GrantFiled: March 3, 2021Date of Patent: February 14, 2023Assignee: DENSO TEN LimitedInventors: Katsuaki Hikima, Daisuke Yamasaki, Futoshi Kosuga
-
Patent number: 11568231Abstract: A contact center analysis system can receive various types of communications from customers, such as audio from telephone calls, voicemails, or video conferences; text from speech-to-text translations, emails, live chat transcripts, text messages, and the like; and other media or multimedia. The system can segment the communication data using temporal, lexical, semantic, syntactic, prosodic, user, and/or other features of the segments. The system can cluster the segments according to one or more similarity measures of the segments. The system can use the clusters to train a machine learning classifier to identify one or more of the clusters as waypoints (e.g., portions of the communications of particular relevance to a user training the classifier). The system can automatically classify new communications using the classifier and facilitate various analyses of the communications using the waypoints.Type: GrantFiled: December 8, 2017Date of Patent: January 31, 2023Assignee: Raytheon BBN Technologies Corp.Inventors: Marie Wenzel Meteer, Patrick Mangan Peterson
-
Patent number: 11558663Abstract: Methods, apparatus, systems and articles of manufacture are disclosed. An example apparatus includes a controller to cause a people meter to emit a prompt for input of audience identification information at a first time and determine a first audience count based on the input, an audio detector to determine a second audience count based on signatures generated from audio data captured in the media environment, and a comparator to cause the people meter to not emit the prompt for at least a first time period after the first time when the first audience count is equal to the second audience count.Type: GrantFiled: August 20, 2020Date of Patent: January 17, 2023Assignee: THE NIELSEN COMPANY (US), LLCInventors: John T. LiVoti, Stanley Wellington Woodruff, Rajakumar Madhanganesh, Khushboo Agarwal
-
Patent number: 11556696Abstract: Systems and methods include receiving, with a processor, two or more messages from a first user device participating in a communication session, processing, with the processor, the two or more messages, generating, with the processor, a processed message, and displaying, with the processor, the processed message on a second user device participating in the communication session.Type: GrantFiled: March 15, 2021Date of Patent: January 17, 2023Assignee: Avaya Management L.P.Inventors: Sandesh Chopdekar, Pushkar Deole, Navin Daga
-
Patent number: 11551663Abstract: A natural language processing system may use system response configuration data to determine customized output data forms when outputting data for a user. The system response configuration data may represent various output attributes the system may use when creating output data. The system may have a certain number of existing profiles where a profile is associated with certain settings for the system response configuration data/attributes. The system may also use various data such as context data, sentiment data, or the like to customize system response configuration data during a dialog. Other components, such as natural language generation (NLG), text-to-speech (TTS), or the like, may use the customized system response configuration data to determine the form, timing, etc. of output data to be presented to a user.Type: GrantFiled: December 10, 2020Date of Patent: January 10, 2023Assignee: Amazon Technologies, Inc.Inventors: Anthony Bissell, Janet Slifka
-
Patent number: 11544685Abstract: A multimedia keepsake is created containing multimedia content created by a customer and stored online as content information. After the customer selects the type of keepsake, the content information is converted to keepsake information having a format appropriate for storage in the selected type of keepsake. The keepsake information is stored online so as to be accessible via an access code, and it is downloaded to a vendor providing the access code.Type: GrantFiled: August 12, 2014Date of Patent: January 3, 2023Inventor: Geoffrey S. Stern
-
Patent number: 11532007Abstract: A system and method are provided for employing voice-activated user interfaces to determine user attention to particularly-presented advertising content by collecting user contact/consumer information, presenting content to the user/consumer, and proposing at least one question, inquiry or query to the user regarding the presented content, the at least one inquiry or query calling for a user/consumer response to be collected, at least one of (a) the user/consumer contact information and (b) the user/consumer response to the question, inquiry or query being collected by the system via a voice-activated user interface and evaluated to assess a level of engagement of the user/consumer with the advertising content. The disclosed systems and methods uniquely provide voice-activated user interface coupled with display of certain advertising content in a manner that promotes user/consumer attention to the advertising content and ease of interaction with the presentation system.Type: GrantFiled: August 16, 2019Date of Patent: December 20, 2022Inventor: Frank S. Maggio
-
Patent number: 11521618Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.Type: GrantFiled: December 17, 2019Date of Patent: December 6, 2022Assignee: GOOGLE LLCInventors: Victor Carbune, Pedro Gonnet Anders, Thomas Deselaers, Sandro Feuz
-
Patent number: 11521114Abstract: This document relates to creating and/or updating a chatbot using a graphical user interface. For example, training dialogs for a chatbot can be displayed in a tree form on a graphical user interface. Based at least on interactions between a developer and the graphical user interface, the training dialogs can be modified in the tree form, and training dialogs can be updated based on the modifications provided on the tree form via the graphical user interface.Type: GrantFiled: April 18, 2019Date of Patent: December 6, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Lars H. Liden, Swadheen K. Shukla, Shahin Shayandeh, Matthew D. Mazzola
-
Patent number: 11514663Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a face matching unit that matches, against face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.Type: GrantFiled: February 13, 2019Date of Patent: November 29, 2022Assignee: NEC CORPORATIONInventors: Nobuaki Kawase, Makoto Igarashi
-
Patent number: 11516346Abstract: A three-way calling terminal for a mobile human-machine coordination calling robot. Technical solutions include: a first speech interface, configured to transfer call audio between a call object and a back-end processing module; a CODEC1 module, configured to encode and decode the call audio between the call object and the back-end processing module; a second speech interface, configured to transfer call audio between the human agent and the call object; a CODEC2 module, configured to encode and decode the call audio between the human agent and the call object; a call control module, configured to process a control signal, and automatically make, answer, and hang up a call; a data processing submodule, configured to process speech data and perform data transfer between the data processing submodule and the back-end processing module; and a networking submodule, configured to be connected to the back-end processing module.Type: GrantFiled: July 8, 2021Date of Patent: November 29, 2022Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.Inventor: Huapeng Sima
-
Patent number: 11508357Abstract: An extended role play-based utterance set generation apparatus includes a first data store storing role play-based utterance sets and a second data store storing non-role-played utterance sets. The role play-based utterance sets include a first query and a role play-based response to the query. The non-role-played utterance sets include a second query and a non-role-played response to the query. The disclosed technology determines similarity between the role play-based response and the non-role-played response. Upon determining that the role play-based response is the same or similar to the non-role-played response, the disclosed technology generates an association between the role play-based response and the second query and extends the role play-based utterance sets in the first data store with the second query.Type: GrantFiled: April 5, 2019Date of Patent: November 22, 2022Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Masahiro Mizukami, Ryuichiro Higashinaka
-
Patent number: 11508366Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.Type: GrantFiled: June 15, 2018Date of Patent: November 22, 2022Assignee: IFLYTEK CO., LTD.Inventors: Jia Pan, Cong Liu, Haikun Wang, Zhiguo Wang, Guoping Hu
-
Patent number: 11501755Abstract: Provided are an electronic device and method for providing a voice assistant service. The method, performed by the electronic device, of providing the voice assistant service includes: obtaining a voice of a user; obtaining voice analysis information of the voice of the user by inputting the voice of the user to a natural language understanding model; determining whether a response operation with respect to the voice of the user is performable, according to a preset criterion, based on the obtained voice analysis information; and based on the determining that the response operation is not performable, outputting a series of guide messages for learning the response operation related to the voice of the user.Type: GrantFiled: September 1, 2020Date of Patent: November 15, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Inchul Hwang
-
Patent number: 11483425Abstract: A communication system, method and communication terminal are configured to facilitate private outputting of content of a message or communication session. A communication terminal can be configured via data included in a message or via a privacy setting to output content of data from a communication session or message in accordance with a pre-selected privacy setting or one or more privacy rules. For instance, a communication terminal may be configured to suppress a text to speech function for certain text messages, email messages, instant messages, or social networking messages that it receives having the privacy parameter set therein. As another example, a user may set the privacy parameter in his or her terminal so that any such message received by that terminal is output in accordance with the privacy setting or rules. A detection of nearby people can affect how certain content may be output via a terminal.Type: GrantFiled: October 23, 2018Date of Patent: October 25, 2022Assignee: RINGCENTRAL, INC.Inventors: Christian Garbin, Johannes Ruetschi
-
Patent number: 11475880Abstract: A method includes receiving audio data of an utterance and processing the audio data to obtain, as output from a speech recognition model configured to jointly perform speech decoding and endpointing of utterances: partial speech recognition results for the utterance; and an endpoint indication indicating when the utterance has ended. While processing the audio data, the method also includes detecting, based on the endpoint indication, the end of the utterance. In response to detecting the end of the utterance, the method also includes terminating the processing of any subsequent audio data received after the end of the utterance was detected.Type: GrantFiled: March 4, 2020Date of Patent: October 18, 2022Assignee: Google LLCInventors: Shuo-yiin Chang, Rohit Prakash Prabhavalkar, Gabor Simko, Tara N. Sainath, Bo Li, Yangzhang He
-
Patent number: 11470027Abstract: A method, an apparatus, an electronic device and a storage medium for broadcasting a voice are provided. The method may include: sending a voice broadcast request to a server, where the voice broadcast request includes at least one of scenario information, user information or voice packet setting information; receiving a voice broadcast instruction corresponding to the voice broadcast request returned by the server; and acquiring a personalized voice packet corresponding to the voice broadcast instruction in a local database and broadcasting the personalized voice packet.Type: GrantFiled: March 25, 2021Date of Patent: October 11, 2022Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.Inventors: Shiqiang Ding, Jinyi Lei
-
Patent number: 11468884Abstract: An information processing apparatus includes a voice acquisition section, a reliability generation section, and a processing execution section. The voice acquisition section acquires an ambient voice. The reliability generation section generates reliability indicating a degree in which the acquired voice is uttered from the particular position on the basis of a predetermined transfer characteristic. As the predetermined transfer characteristic, a phase difference or acoustic characteristic of the voice can be assumed. The processing execution section executes a process according to the generated reliability. As the process according to the reliability, a notification according to the reliability or a predetermined command can be assumed to be executed.Type: GrantFiled: March 13, 2018Date of Patent: October 11, 2022Assignee: Sony CorporationInventors: Ryosuke Sawata, Yuichiro Koyama
-
Patent number: 11470419Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.Type: GrantFiled: August 29, 2019Date of Patent: October 11, 2022Assignee: Google LLCInventors: Rajeev Conrad Nongpiur, Ananya Misra, Chanwoo Kim
-
Patent number: 11456005Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.Type: GrantFiled: November 21, 2018Date of Patent: September 27, 2022Assignee: Google LLCInventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
-
Patent number: 11455907Abstract: A computer-implemented method includes recognizing, by a computer device, a word as a new learned word for a user; registering, by the computer device, the new leaned word in a user's new learned word list as a registered new learned word; associating, by the computer device, the registered new learned word with related known words in a user's known word library, the known word library including words known to the user; tracking, by the computer device, uses of the related known words by the user; identifying, by the computer device, a used sentence used by the user that contains one of the related known words; and suggesting, by the computer device, to the user a new sentence that replaces the one of the related known words in the used sentence with the new learned word.Type: GrantFiled: November 27, 2018Date of Patent: September 27, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Su Liu, Manjunath Ravi, Zhichao Li, Kai Liu
-
Patent number: 11443759Abstract: An information processing apparatus includes a memory storing instructions. The instructions cause the apparatus to extract a plurality of local features from data indicating a speech, the characteristics of feature extraction being formed through learning; and to encode a series of chronological features of the data based on the plurality of local features, characteristics of encoding the series of chronological features being formed through learning. The instructions also cause the apparatus to generate information obtained by weighting features at a specific point in time associated with emotion classification, of the series of chronological features encoded, characteristics of weighting the features at the specific point in time being formed through learning; and to classify emotion corresponding to the data using the information obtained by weighting the features at the specific point in time, characteristics of classification being formed through learning.Type: GrantFiled: July 27, 2020Date of Patent: September 13, 2022Assignee: HONDA MOTOR CO., LTD.Inventor: Yuanchao Li
-
Patent number: 11443730Abstract: A system, a computer program product, and method for controlling synthesized speech output on a voice-controlled device. A sensor is used to capture an image of a face of a person. A database of previously stored images of facial features is accessed. In response to i) not recognizing the at least one person the voice-controlled device selects a first set of conversational starters; ii) recognizing the person and recognizing previous communications with the person, the voice-controlled device selects a second set of conversational starters; iii) recognizing the person and not recognizing previous communications with the person, the voice-controlled device selects a third set of conversational starters; or iv) recognizing the at least one person and recognizing previous communications with the person selecting but do not know the person's name selecting a fourth set of conversational starters. The voice controlled device outputs the selected set of conversational starters.Type: GrantFiled: January 28, 2020Date of Patent: September 13, 2022Assignee: International Business Machines CorporationInventors: Shang Qing Guo, Jonathan Lenchner
-
Patent number: 11443745Abstract: Included are: an apparatus function information acquiring unit for acquiring apparatus function information in which a target apparatus and one or more target functions to be executed by the target apparatus, which are determined on the basis of uttered speech, are associated with each other; a procedure determining unit for determining one or more manual operations for executing the one or more target functions and an order of the one or more manual operations on the basis of the apparatus function information acquired by the apparatus function information acquiring unit; and an operation command transmission controlling unit for sequentially transmitting, to the target apparatus, operation commands for outputting operation response output control information corresponding to each of the one or more manual operations in accordance with the order of the one or more manual operations determined by the procedure determining unit.Type: GrantFiled: October 21, 2020Date of Patent: September 13, 2022Assignee: MITSUBISHI ELECTRIC CORPORATIONInventors: Masato Hirai, Kenshiro Kitamura, Miho Ishikawa, Daisuke Iizawa
-
Patent number: 11430014Abstract: Systems and methods of facilitating transactions related to targeted or customized commercial offerings based on derived sentiment states are provided. The sentiment states are derived from digital representations such as images, videos and sound recordings.Type: GrantFiled: October 1, 2020Date of Patent: August 30, 2022Assignee: Nant Holdings IP, LLCInventor: Patrick Soon-Shiong
-
Patent number: 11430207Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a conversation processing unit that acquires reception information including a content of conversation with the user; a face matching unit that matches, against the face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.Type: GrantFiled: June 8, 2017Date of Patent: August 30, 2022Assignee: NEC CORPORATIONInventors: Nobuaki Kawase, Makoto Igarashi
-
Patent number: 11422764Abstract: Augmented reality display systems, apparatuses, and methods are disclosed for enabling a wearer of an augmented reality optical display to continue wearing the same optical display while moving between different platforms or vehicles. Example embodiments include optical displays that use a wired connection to connect with each platform to minimize the electromagnetic signature of the system. Embodiments include changing the information displayed to the user depending on the type of vehicle to which the optical display is connected. Additional embodiment display information about weapon systems associated with the platform to which the optical display is connected.Type: GrantFiled: May 31, 2019Date of Patent: August 23, 2022Assignee: EPIC OPTIX, INC.Inventor: Ray Kwong
-
Patent number: 11423879Abstract: A technique for controlling a voice-enabled device using voice commands includes receiving an audio signal that is generated in response to a verbal utterance, generating a verbal utterance indicator for the verbal utterance based on the audio signal, selecting a first command for a voice-controlled application residing within the voice-enabled device based on the verbal utterance indicator, and transmitting the first command to the voice-controlled application as an input.Type: GrantFiled: July 18, 2017Date of Patent: August 23, 2022Assignee: Disney Enterprises, Inc.Inventor: William Valentine Zajac, III
-
Patent number: 11423881Abstract: According to an embodiment of the present disclosure, a method of updating a speech recognition model using a mobile agent in real-time comprises obtaining, in real-time, space type information for a particular space where the mobile agent is located, varying, in real-time, parameters of a speech recognition model used in the particular space based on the space type information, and performing a speech recognition service based on the speech recognition model including the varied parameters. Embodiments of the present disclosure may be related to artificial intelligence (AI) devices, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.Type: GrantFiled: September 17, 2019Date of Patent: August 23, 2022Assignee: LG ELECTRONICS INC.Inventor: Jonghoon Chae
-
Patent number: 11416741Abstract: A technique for constructing a model supporting a plurality of domains is disclosed. In the technique, a plurality of teacher models, each of which is specialized for different one of the plurality of the domains, is prepared. A plurality of training data collections, each of which is collected for different one of the plurality of the domains, is obtained. A plurality of soft label sets is generated by inputting each training data in the plurality of the training data collections into corresponding one of the plurality of the teacher models. A student model is trained using the plurality of the soft label sets.Type: GrantFiled: June 8, 2018Date of Patent: August 16, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Takashi Fukuda, Osamu Ichikawa, Samuel Thomas, Bhuvana Ramabhadran
-
Patent number: 11417302Abstract: Apparatus, methods, and systems that operate to provide interactive streaming content identification and processing are disclosed. An example apparatus includes a classifier to determine an audio characteristic value representative of an audio characteristic in audio; a transition detector to detect a transition between a first category and a second category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the first category and the second category; and a context manager to control a device to switch from a first fingerprinting algorithm to a second fingerprinting algorithm different than the first fingerprinting algorithm, responsive to the detected transition between the first category and the second category.Type: GrantFiled: September 10, 2020Date of Patent: August 16, 2022Assignee: Gracenote, Inc.Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
-
Patent number: 11417329Abstract: A system for performing magnetic resonance tomography is disclosed. A control system creates a speech data stream from an acquired linguistic expression and generates a command library, which contains a selection of speech commands, to each of which one or more linguistic expressions are assigned. The selection of speech commands is loaded from a command database depending on a current system status of a magnetic resonance (MR) scanner. The control system applies a speech recognition algorithm to the speech data stream to determine whether a linguistic expression contained in the command library can be assigned to the speech data stream. If so, the acquired linguistic expression is recognized, a speech command from the command library assigned to the recognized linguistic expression is established, and a control command for controlling the MR scanner in accordance with the speech command is created.Type: GrantFiled: January 30, 2020Date of Patent: August 16, 2022Assignee: Siemens Healthcare GmbHInventors: Rainer Schneider, Dirk Franger
-
Patent number: 11410401Abstract: The subject technology receives a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect including at least one beautification operation. The subject technology captures image data and depth data using a camera. The subject technology applies, to the image data and the depth data, the 3D effect including the at least one beautification operation based at least in part on the augmented reality content generator, the beautification operation being performed as part of applying the 3D effect. The subject technology generates a 3D message based at least in part on the applied 3D effect including the at least one beautification operation. The subject technology renders a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation.Type: GrantFiled: August 28, 2020Date of Patent: August 9, 2022Assignee: Snap Inc.Inventors: Kyle Goodrich, Samuel Edward Hare, Maxim Maximov Lazarov, Tony Mathew, Andrew James McPhee, Daniel Moreno, Dhritiman Sagar, Wentao Shang
-
Patent number: 11410144Abstract: Embodiments disclosed herein describe intelligent e-book readers which provide a significant improvement over the conventional e-books that simply render static content. The intelligent e-book readers may customize a rendered e-book based on, for example, the reading level and preferences of the user, the user's social media profile and activity, and current events. Furthermore, the intelligent e-book reader may provide additional augmented reality (AR)/virtual reality (VR) content associated with one or more portions of the rendered e-book. The intelligent e-book reader may also facilitate virtual, real time communication between multiple users and experts. The intelligent e-book reader may also facilitate one or more users to provide feedback and suggestions to authors and future movie-makers. The intelligent e-book reader may automatically determine difficult portions of an e-book based on the virtual communications and/or real time eye-tracking of a user.Type: GrantFiled: January 25, 2021Date of Patent: August 9, 2022Assignee: MASSACHUSETTS MUTUAL LIFE INSURANCE COMPANYInventors: Michal Knas, Payton A. Shubrick, Damon Ryan Depaolo, Jiby John