Speech To Image Patents (Class 704/235)
  • Patent number: 11973807
    Abstract: A connection procedure for data communications devices is implemented in a variety of embodiments. In one such embodiment, the procedure uses a first set of connection data for attempting to connect and upon failure to connect uses a second set of connection information in addition to the first set of connection information to attempt a connection. In another embodiment, a delay is implemented before transmitting the connection information and a subsequent delay is implemented to allow for additional connection information to be input and transmitted.
    Type: Grant
    Filed: June 23, 2022
    Date of Patent: April 30, 2024
    Assignee: 8x8, Inc.
    Inventor: Marc Petit-Huguenin
  • Patent number: 11971920
    Abstract: Disclosed is a method for determining a content associated with a voice signal, which is performed by a computing device. The method may include converting a voice signal and generating text information. The method may include determining a plurality of target word candidates. The method may include determining a target word among the plurality of target word candidates based on a comparison between the plurality of target word candidates and the generated text information. The method may also include determining a content associated with the target word.
    Type: Grant
    Filed: July 26, 2023
    Date of Patent: April 30, 2024
    Assignee: ActionPower Corp.
    Inventors: Hyungwoo Kim, Seungho Kwak
  • Patent number: 11971911
    Abstract: Systems and methods for generating customized annotations of a medical record are provided. The system receives a medical record and processes it using a predictive model to identify evidence of a finding. The system then determines whether to have a recall enhancement or validation of a specific finding. Recall enhancement is used to tune or develop the predictive model, while validation is used to rapidly validate the evidence. The source document is provided to the user and feedback is requested. When asking for validation, the system also highlights the evidence already identified and requests the user to indicate if the evidence is valid for a particular finding. If recall enhancement is utilized, the source document is provided and the user is asked to find evidence in the document for a particular finding. The user may then highlight the evidence that supports the finding. The user may also annotate the evidence using free form text.
    Type: Grant
    Filed: August 2, 2022
    Date of Patent: April 30, 2024
    Assignee: Apixio, LLC
    Inventors: Darren Matthew Schulte, John O. Schneider, Robert Derward Rogers, Vishnuvyas Sethumadhavan
  • Patent number: 11962482
    Abstract: At least one high-quality image of a speaker is captured. A low network quality condition may be detected between a client device and a video service node. In response to detecting the low network quality condition, a data stream comprising changes to the high-quality image of the speaker needed to recreate a representation of the speaker is generated. Transmission of the video stream of the speaker between the client device of the speaker and the video service node is stopped and, simultaneously, transmission of the data stream is begun. A digital twin of the speaker is then generated for display at the client device based on the data stream and the high-quality image of the speaker.
    Type: Grant
    Filed: July 14, 2022
    Date of Patent: April 16, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Johan Kölhi, Anthony Friede
  • Patent number: 11960534
    Abstract: Coordinating processing of audio queries is provided. A system receives a query. The system provides the query to a first digital assistant component and a second digital assistant component for processing. The system receives a first response to the query from the first digital assistant component, and a second response to the query from the second digital assistant component. The first digital assistant component can be authorized to access a database the second digital assistant component is prohibited from accessing. The system determines, based on a ranking decision function, to select the second response to the query from the second digital assistant component. The system provides, responsive to the selection, the second response from the second digital assistant to a computing device.
    Type: Grant
    Filed: April 8, 2019
    Date of Patent: April 16, 2024
    Assignee: GOOGLE LLC
    Inventors: Bo Wang, Smita Rai, Max Ohlendorf, Venkat Kotla, Chad Yoshikawa, Abhinav Taneja, Amit Agarwal, Chris Ramsdale, Chris Turkstra
  • Patent number: 11960694
    Abstract: Virtual assistants intelligently emulate a representative of a service provider by providing variable responses to user queries received via the virtual assistants. These variable responses may take the context of a user's query into account both when identifying an intent of a user's query and when identifying an appropriate response to the user's query.
    Type: Grant
    Filed: April 16, 2021
    Date of Patent: April 16, 2024
    Assignee: Verint Americas Inc.
    Inventors: Fred A. Brown, Tanya M. Miller, Mark Zartler
  • Patent number: 11961507
    Abstract: A transcription of a query for content discovery is generated, and a context of the query is identified, as well as a first plurality of candidate entities to which the query refers. A search is performed based on the context of the query and the first plurality of candidate entities, and results are generated for output. A transcription of a second voice query is then generated, and it is determined whether the second transcription includes a trigger term indicating a corrective query. If so, the context of the first query is retrieved. A second term of the second query similar to a term of the first query is identified, and a second plurality of candidate entities to which the second term refers is determined. A second search is performed based on the second plurality of candidates and the context, and new search results are generated for output.
    Type: Grant
    Filed: March 2, 2023
    Date of Patent: April 16, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Jeffry Copps Robert Jose, Sindhuja Chonat Sri
  • Patent number: 11960636
    Abstract: Examples of wearable systems and methods can use multiple inputs (e.g., gesture, head pose, eye gaze, voice, and/or environmental factors (e.g., location)) to determine a command that should be executed and objects in the three-dimensional (3D) environment that should be operated on. The multiple inputs can also be used by the wearable system to permit a user to interact with text, such as, e.g., composing, selecting, or editing text.
    Type: Grant
    Filed: December 22, 2021
    Date of Patent: April 16, 2024
    Assignee: MAGIC LEAP, INC.
    Inventors: James M. Powderly, Savannah Niles, Jennifer M. R. Devine, Adam C. Carlson, Jeffrey Scott Sommers, Praveen Babu J D, Ajoy Savio Fernandes, Anthony Robert Sheeder
  • Patent number: 11954223
    Abstract: A search index is generated from one or more data records, wherein the one or more data records have contents in a plurality of different fields. Field information of the one or more data records is stored in the search index as specialized indexed elements, wherein the specialized indexed elements overlap with other indexed elements of the one or more data records. A search query is received from a user allowed to access only a portion of the plurality of different fields. The search query is processed within the portion of the plurality of different fields using the search index including the specialized indexed elements.
    Type: Grant
    Filed: October 12, 2020
    Date of Patent: April 9, 2024
    Assignee: ServiceNow, Inc.
    Inventors: William Kimble Johnson, III, Raymond Lau, Benjamin Talcott Borchard
  • Patent number: 11955026
    Abstract: A method, computer program product, and computer system for public speaking guidance is provided. A processor retrieves speaker data regarding a speech made by a user. A processor separates the speaker data into one or more speaker modalities. A processor extracts one or more speaker features from the speaker data for the one or more speaker modalities. A processor generates a performance classification based on the one or more speaker features. A processor sends to the user guidance regarding the speech based on the performance classification.
    Type: Grant
    Filed: September 26, 2019
    Date of Patent: April 9, 2024
    Assignee: International Business Machines Corporation
    Inventors: Cheng-Fang Lin, Ching-Chun Liu, Ting-Chieh Yu, Yu-Siang Chen, Ryan Young
  • Patent number: 11955127
    Abstract: An embodiment extracts a set of designated entities and a set of relationships between designated entities from speech content of an audio feed of a plurality of participants of a current web conference using a machine learning model trained to classify parts of speech content. The embodiment generates a list of current action items based on the extracted set of designated entities and relationships between designated entities. The embodiment identifies a first current action item that is an updated version of an ongoing action item on a progress list of ongoing action items from past web conferences. The embodiment also identifies a second current action item that is unrelated to any of the ongoing action items on the progress list. The embodiment updates the progress list to include updates for the first current action item and by adding the second current action item.
    Type: Grant
    Filed: April 8, 2021
    Date of Patent: April 9, 2024
    Assignee: KYNDRYL, INC.
    Inventors: Muhammad Ammar Ahmed, Madiha Ijaz, Sreekrishnan Venkateswaran
  • Patent number: 11955117
    Abstract: A system and method are provided for analyzing and reacting to interactions between entities using electronic communication channels. The method includes receiving, via the communications module, data captured from a conversational exchange between a first entity communicating with a second entity using an electronic communication channel. The method also includes analyzing the captured data to detect an indication that the first entity is or was distracted during the conversational exchange, is or was disinterested in a portion of the conversational exchange or missed the portion of the conversational exchange. The method also includes determining based on the indication an action to address the distraction during, disinterest in, or missing of, the portion of the conversational exchange; and providing, via the communications module, an automated message to at least one of the first entity and the second entity for executing the action.
    Type: Grant
    Filed: May 27, 2021
    Date of Patent: April 9, 2024
    Assignee: The Toronto-Dominion Bank
    Inventors: Bridget McDermid, Brian Bellwood, Natalie Thien Huong Cornwall, Jeffery David True, Ryan Wall, Stella Pui Kwan Chan, Venetia D'Souza, Christopher Michael Arthur Caravan, Pranavan Premathas, Sahifa Habib Qazi, Mah Noor Siddiqui, Joe Moghaizel, Jonathan K. Barnett
  • Patent number: 11948578
    Abstract: Systems, methods, devices and non-transitory, computer-readable storage mediums are disclosed for a wearable multimedia device and cloud computing platform with an application ecosystem for processing multimedia data captured by the wearable multimedia device. In an embodiment, a wearable multimedia device receives a first speech input from a user, including a first command to generate a message, and first content for inclusion in the message. The device determines second content for inclusion in the message based on the first content, and generates the message such that the messages includes the first and second content. The device receives a second speech input from the user, including a second command to modify the message. In response, the device determines third content for inclusion in the message based on the first content and/or the second content, and modifies the message using the third content. The device transmits the modified message to a recipient.
    Type: Grant
    Filed: March 4, 2022
    Date of Patent: April 2, 2024
    Assignee: Humane, Inc.
    Inventors: Kenneth Luke Kocienda, Imran A. Chaudhri
  • Patent number: 11947924
    Abstract: The present disclosure relates to systems and methods for providing subtitle for a video. The video's audio is transcribed to obtain caption text for the video. A first machine-trained model identifies sentences in the caption text. A second model identifies intra-sentence breaks with in the sentences identified using the first machine-trained model. Based on the identified sentences and intra-sentence breaks, one or more words in the caption text are grouped into a clip caption to be displayed for a corresponding clip of the video.
    Type: Grant
    Filed: September 18, 2023
    Date of Patent: April 2, 2024
    Assignee: VoyagerX, Inc.
    Inventors: Hyeonsoo Oh, Sedong Nam
  • Patent number: 11942093
    Abstract: A system and method to perform dubbing automatically for multiple languages at the same time using speech-to-text transcriptions, language translation, and artificial intelligence engines to perform the actual dubbing in the voice likeness of the original speaker.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: March 26, 2024
    Assignee: SYNCWORDS LLC
    Inventors: Aleksandr Dubinsky, Taras Sereda
  • Patent number: 11941345
    Abstract: A computer-implemented process is programmed to process a source input, determine text enhancements, and present the text enhancements to apply to the sentences dictated from the source input. A text processor may use machine-learning models to process an audio input to generate sentences in a presentable format. An audio input can be processed by an automatic speech recognition model to generate electronic text. The electronic text may be used to generate sentence structures using a normalization model. A comprehension model may be used to identify instructions associated with the sentence structures and generate sentences based on the instructions and the sentence structures. An enhancement model may be used to identify enhancements to apply to the sentences. The enhancements may be presented alongside sentences generated by the comprehension model to provide the user an option to select either the enhancements or the sentences.
    Type: Grant
    Filed: October 26, 2021
    Date of Patent: March 26, 2024
    Assignee: Grammarly, Inc.
    Inventors: Timo Mertens, Vipul Raheja, Chad Mills, Ihor Skliarevskyi, Ignat Blazhko, Robyn Perry, Nicholas Bern, Dhruv Kumar, Melissa Lopez
  • Patent number: 11943492
    Abstract: A method includes that a media asset server receives an identifier and a new-language file of a target video and converts the new-language file into a new-language medium file. The media asset server finds a first index file based on the identifier of the target video, and obtains a second index file based on a storage address of the new-language medium file on the media asset server. The media asset server sends the new-language medium file and the second index file to a content delivery server. The content delivery server replaces the storage address of the new-language medium file on the media asset server in the second index file with a storage address of the new-language medium file on the content delivery server to obtain a third index file. The content delivery server generates a first URL of the target video.
    Type: Grant
    Filed: July 13, 2021
    Date of Patent: March 26, 2024
    Assignee: PETAL CLOUD TECHNOLOGY CO., LTD.
    Inventor: Wei Yan
  • Patent number: 11935540
    Abstract: A method may include obtaining first audio data originating at a first device during a communication session between the first device and a second device. The method may also include obtaining an availability of revoiced transcription units in a transcription system and in response to establishment of the communication session, selecting, based on the availability of revoiced transcription units, a revoiced transcription unit instead of a non-revoiced transcription unit to generate a transcript of the first audio data. The method may also include obtaining revoiced audio generated by a revoicing of the first audio data by a captioning assistant and generating a transcription of the revoiced audio using an automatic speech recognition system. The method may further include in response to selecting the revoiced transcription unit, directing the transcription of the revoiced audio to the second device as the transcript of the first audio data.
    Type: Grant
    Filed: October 5, 2021
    Date of Patent: March 19, 2024
    Assignee: Sorenson IP Holdings, LLC
    Inventors: David Thomson, David Black, Jonathan Skaggs, Kenneth Boehme, Shane Roylance
  • Patent number: 11936487
    Abstract: Systems and methods are provided herein for providing context to users who access video conferences late. This may be accomplished by a system receiving an audio segment of a video conference and generating a subtitle corresponding to the audio segment. The system may determine a summary relating to the audio segment and then display the subtitle, summary, and video conference on a device. The system allows a user, who accesses a video conference late, to quickly and accurately understand the current video conference discussion, improving the user's experience and increasing the productivity of the video conference.
    Type: Grant
    Filed: August 17, 2021
    Date of Patent: March 19, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Padmassri Chandrashekar, Daina Emmanuel
  • Patent number: 11929067
    Abstract: A security panel for controlling home automation devices via a voice assistant device is provided, in which the security panel includes a processor, a memory, a microphone, and a speaker. In one example implementation, the security panel is configured to receive a text input from a user, convert the text input into an audio format via a text-to-speech engine to generate a first voice command for controlling one or more home automation devices via a voice assistant device, and to output the first voice command via the speaker of the security panel, in which the first voice command is received by the voice assistant device via a microphone of the voice assistant device, in which the voice assistant device is configured to control the one or more home automation devices based on the first voice command.
    Type: Grant
    Filed: May 7, 2019
    Date of Patent: March 12, 2024
    Assignee: CARRIER CORPORATION
    Inventors: Pirammanayagam Nallaperumal, Vijayakumar Ummadisinghu, Srikanth Govindavaram
  • Patent number: 11922373
    Abstract: An automated system updates electronic medical records (EMRs) based on dictated reports, without requiring manual data entry into on-screen forms. A dictated report is transcribed by an automatic speech recognizer, and facts are extracted from the report and stored in encoded form. Information from a patient's report is also stored in encoded form. The resulting encoded information from the report and EMR are reconciled with each other, and changes to be made to the EMR are identified based on the reconciliation. The identified changes are made to the EMR automatically, without requiring manual data entry into the EMR.
    Type: Grant
    Filed: January 20, 2022
    Date of Patent: March 5, 2024
    Inventors: Detlef Koll, Juergen Fritsch
  • Patent number: 11922372
    Abstract: The following relates generally to voice assisted delivery of goods. In some embodiments, a digital assistant receives audio data, and determines an intent from the audio data. The digital assistant may then match the determined intent to a flow of a set of flows, where the set of flows may include at least one of: (i) requesting curbside pickup of an item, (ii) requesting locker storage of the item, (iii) requesting an indication of locations that the item is available, (iv) requesting an indication of an inventory of a retail store, (v) requesting ads for an additional item that is related to the item, (vi) requesting a status of an order for the item, (vii) requesting drone delivery of the item from a retail store to a residence, or (viii) requesting drone delivery of the item from a warehouse to the residence. The matched flow of the set of flows may then be executed.
    Type: Grant
    Filed: August 31, 2020
    Date of Patent: March 5, 2024
    Assignee: WALGREEN CO.
    Inventors: Julija Alegra Petkus, Andrew David Schweinfurth, Stephen Elijah Zambo
  • Patent number: 11922141
    Abstract: Systems and methods are disclosed for a voice/chatbot building system. The voice/chatbot builder may involve receiving an identified intent, receiving a task related to the identified intent, and receiving a response related to both the identified intent and the task. The identified intent, task, and response may form a first conversation. The first conversation may be linked to other conversations to establish contextual relationships among conversations and determine conversation priority. Voice/chatbot building may also train natural language processing machine learning algorithms.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: March 5, 2024
    Assignee: Walmart Apollo, LLC
    Inventors: John Brian Moss, Don Bambico, Jason Charles Benesch, Snehasish Mukherjee
  • Patent number: 11922974
    Abstract: A multimedia dashboard application runs on a computing device that is in networked communication with a seller's inventory database and is also in operative communication with the seller's distribution server. The multimedia dashboard application includes an item selector, recording modules, multimedia editors, and a distribution controller. Without adding or opening another application, the multimedia dashboard application records multimedia segments, selects segments to be uploaded to and downloaded from the inventory database, and edits the segments to produce multimedia promotions. The multimedia dashboard application also controls distribution of the promotions. The multimedia dashboard application can add closed-captioning, voiceover tracks, and background effects to the promotions. The multimedia dashboard application can use a video around a product to produce 360° views of the product and can combine a group of photos into a stitched video.
    Type: Grant
    Filed: February 1, 2022
    Date of Patent: March 5, 2024
    Inventors: James E. Plankey, Thomas G. Gallaher
  • Patent number: 11914643
    Abstract: Coordinating processing of audio queries is provided. A system receives a query. The system provides the query to a first digital assistant component and a second digital assistant component for processing. The system receives a first response to the query from the first digital assistant component, and a second response to the query from the second digital assistant component. The first digital assistant component can be authorized to access a database the second digital assistant component is prohibited from accessing. The system determines, based on a ranking decision function, to select the second response to the query from the second digital assistant component. The system provides, responsive to the selection, the second response from the second digital assistant to a computing device.
    Type: Grant
    Filed: April 8, 2019
    Date of Patent: February 27, 2024
    Assignee: GOOGLE LLC
    Inventors: Bo Wang, Smita Rai, Max Ohlendorf, Venkat Kotla, Chad Yoshikawa, Abhinav Taneja, Amit Agarwal, Chris Ramsdale, Chris Turkstra
  • Patent number: 11914840
    Abstract: Systems and methods are provided for presenting information related to electric charging stations for charging an electric vehicle. A first plurality of icons corresponding to an identified plurality of electric charging stations may be generated for presentation on a display. A zoom command to modify a zoom level of the map interface may be received, and in response to receiving the zoom command, a subset of the plurality of electric charging stations having a charging speed above a threshold charging speed may be identified, and a second plurality of icons, corresponding to the identified subset of the plurality of electric charging stations, may be generated for presentation at the display on a zoomed-out view of the map interface. Selectable options to display a detailed view of information related to an electric charger associated with an electric charger category from among multiple electric charger categories may be provided.
    Type: Grant
    Filed: July 20, 2021
    Date of Patent: February 27, 2024
    Assignee: Rivian IP Holdings, LLC
    Inventors: Jason Meyer Quint, Kok Wei Koh, Brennan Matthew Boblett
  • Patent number: 11910852
    Abstract: A facemask system with automated voice display is disclosed. The facemask includes a covering configured to be positioned about the face of a user. At least one microphone is positioned adjacent the covering and the facemask includes a display for displaying language spoken captured by at least one microphone. A processor is in operable communication with the microphone and the display. The facemask includes a memory that contains instructions that can be executed by the processor to perform operations. The operations include receiving signals from at least one microphone, identifying language contained within the signals, and displaying the language on the display.
    Type: Grant
    Filed: March 3, 2021
    Date of Patent: February 27, 2024
    Assignee: Sorenson IP Holdings, LLC
    Inventors: Doug Bergman, Glenn Andrew Mohan, David Thomson, Kerry Brown
  • Patent number: 11911200
    Abstract: Systems and techniques for producing image-based radiology reports including contextual cropping of image data and radiologist supplied notes and annotations are provided herein. Computer vision and natural language processing algorithms may enable processing of image data and language inputs to identify objects associated with annotations, aid in cropping the image data according to the annotations and object identification and in producing a final text and image laden report.
    Type: Grant
    Filed: August 25, 2020
    Date of Patent: February 27, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Sharon Alpert, Antonio Criminisi
  • Patent number: 11907665
    Abstract: The disclosure relates to system and method for processing user input using Natural Language Processing (NLP). The method includes generating, by an NLP model, a set of input intent maps associated with a user input. The method includes matching each of the set of input intent maps with each of a plurality of pre-stored sets of intent maps. Each of the plurality of pre-stored sets of intent maps is generated from a single predefined training input and is mapped to a predefined intent and a predetermined response. The method includes determining a distance of each of the set of input intent maps relative to each of the plurality of pre-stored sets of intent maps. Further, the method includes identifying a pre-stored intent map closest to the set of input intent maps and rendering the predetermined response mapped to the pre-stored sets of intent maps to the user.
    Type: Grant
    Filed: May 11, 2021
    Date of Patent: February 20, 2024
    Assignee: RAJIV TREHAN
    Inventor: Rajiv Trehan
  • Patent number: 11900928
    Abstract: Natural language grammars interpret expressions at the conversational human-machine interfaces of devices. Under conditions favoring engagement, as specified in a unit of conversational code, the device initiates a discussion using one or more of TTS, images, video, audio, and animation depending on the device capabilities of screen and audio output. Conversational code units specify conditions based on conversation state, mood, and privacy. Grammars provide intents that cause calls to system functions. Units can provide scripts for guiding the conversation. The device, or supporting server system, can provide feedback to creators of the conversational code units for analysis and machine learning.
    Type: Grant
    Filed: December 23, 2017
    Date of Patent: February 13, 2024
    Assignee: SoundHound AI IP, LLC
    Inventors: Joel McKenzie, Qindi Zhang
  • Patent number: 11893997
    Abstract: A system and method of automatic transcription using a visual display device and an ear-wearable device. The system is configured to process an input audio signal at the display device to identify a first voice signal and a second voice signal from the input audio signal. A representation of the first voice signal and the second voice signal can be displayed on the display device and input can be received comprising the user selecting one of the first voice signal and the second voice signal as a selected voice signal. The system is configured to convert the selected voice signal to text data and display a transcript on the display device. The system can further generate an output signal sound at the first transducer of the ear-wearable device based on the input audio signal.
    Type: Grant
    Filed: January 26, 2022
    Date of Patent: February 6, 2024
    Assignee: Starkey Laboratories, Inc.
    Inventors: Achintya Kumar Bhowmik, David Alan Fabry, Amit Shahar, Clifford Anthony Tallman
  • Patent number: 11891678
    Abstract: The present disclosure provides a method for optimizing a liquid injection process of ionic rare earth ore, including the following steps of: 1) testing the hydraulic properties of an ore body; 2) determining the diffusion degree of the ore body; 3) determining the spatial distribution of the rare earth grade and the impurity grade of the ore body prior to leaching; 4) determining model parameters of competitive exchange of rare earth ions and impurity ions with ammonium ions; 5) obtaining distribution of rare earth ion concentration within the ore body after completion of leaching; 6) obtaining a profile plot of a rare earth leaching rate as a function of the concentration and dosage of an injected leaching agent; and 7) determining a minimum leaching agent dosage to achieve a target leaching rate according to the profile plot, and then determining the ammonium sulfate concentration according to the minimum leaching agent dosage.
    Type: Grant
    Filed: March 11, 2021
    Date of Patent: February 6, 2024
    Assignees: JIANGXI UNIVERSITY OF SCIENCE AND TECHNOLOGY, LONGYAN RARE-EARTH DEVELOPMENT CO., LTD.
    Inventors: Guanshi Wang, Ping Long, Wenli Liu, Ying Huang, Dingshun He, Lei Qin, Shili Hu, Chenliang Peng, Sihai Luo, Guoqiang Deng
  • Patent number: 11893992
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, a first input including activation of an affordance is received. A domain associated with the affordance is determined. A second input including user speech is received, where a user intent is determined based on the domain and the user speech. A determination is made whether the user intent includes a command associated with the affordance. In accordance with a determination that the user intent includes a command associated with the affordance, a task in furtherance of the command is performed.
    Type: Grant
    Filed: August 25, 2022
    Date of Patent: February 6, 2024
    Assignee: Apple Inc.
    Inventors: Philippe P. Piernot, Garrett L. Weinberg
  • Patent number: 11894008
    Abstract: Provided is a signal processing apparatus that includes a voice quality conversion unit that converts acoustic data of any sound of an input sound source to acoustic data of voice quality of a target sound source different from the input sound source on the basis of a voice quality converter parameter obtained by training using acoustic data for each of one or more sound sources as training data, the acoustic data being different from parallel data or clean data.
    Type: Grant
    Filed: November 28, 2018
    Date of Patent: February 6, 2024
    Assignee: SONY CORPORATION
    Inventor: Naoya Takahashi
  • Patent number: 11893993
    Abstract: Dynamic interfacing with applications is provided. For example, a system receives a first input audio signal. The system processes, via a natural language processing technique, the first input audio signal to identify an application. The system activates the application for execution on the client computing device. The application declares a function the application is configured to perform. The system modifies the natural language processing technique responsive to the function declared by the application. The system receives a second input audio signal. The system processes, via the modified natural language processing technique, the second input audio signal to detect one or more parameters. The system determines that the one or more parameters are compatible for input into an input field of the application. The system generates an action data structure for the application. The system inputs the action data structure into the application, which executes the action data structure.
    Type: Grant
    Filed: November 28, 2022
    Date of Patent: February 6, 2024
    Assignee: GOOGLE LLC
    Inventors: Quazi Hussain, Adam Coimbra, Ilya Firman
  • Patent number: 11893988
    Abstract: The disclosure provides a speech control method, a speech control apparatus, an electronic device, and a storage medium. The method includes: acquiring target audio data sent by a client, the target audio data including audio data collected by the client within a target duration before wake-up and audio data collected by the client after wake-up; performing speech recognition on the target audio data; and controlling the client based on an instruction recognized from a second audio segment of the target audio data in response to recognizing a wake-up word from a first audio segment at beginning of the target audio data; in which, the second audio segment is later than the first audio segment or has an overlapping portion with the first audio segment.
    Type: Grant
    Filed: June 24, 2021
    Date of Patent: February 6, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Song Yang, Saisai Zou, Jieyi Cao, Junyao Shao
  • Patent number: 11887589
    Abstract: Techniques for voice-based interactions are described. In an example, a device presents a user interface on a display. The device starts an operational mode of the device. The operational mode restricts voice-based interactions with the user interface to a set of commands. The set of commands is defined in a language model that is stored on the device. Further, the device receives, at a microphone of the device, audio data corresponding to a natural language utterance and generates, from the audio data, text data that corresponds to the natural language utterance. The device determines, based at least in part on the language model, that semantics of the text data correspond to a command from the set of commands and presents, on the display, an outcome of performing the command.
    Type: Grant
    Filed: June 17, 2020
    Date of Patent: January 30, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Senthil Kumar Dayalan, Manikandan Thangarathnam, Sai Vinayak, Suraj Gopalakrishnan
  • Patent number: 11887585
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example method includes, at an electronic device having one or more processors and memory: receiving a natural language speech input; determining, based on the natural language speech input, a plurality of candidate intents; obtaining contextual data associated with the user device; ranking, based on the contextual data, the plurality of candidate intents using a machine learning model, wherein the machine learning model is pre-trained at least partially on the user device; determining a user intent based on the ranked candidate intents; and performing a task corresponding to the determined user intent.
    Type: Grant
    Filed: May 5, 2020
    Date of Patent: January 30, 2024
    Assignee: Apple Inc.
    Inventors: Srinivas Chappidi, Arash Dawoodi
  • Patent number: 11880421
    Abstract: Disclosed herein are an apparatus and method for providing a search service based on important sentences. The apparatus for providing a search service based on important sentences includes memory in which at least one program and a previously trained word importance measurement model are recorded and a processor for executing the program. The program may include a word importance measurement unit for measuring the importance of each of multiple words included in input text in the corresponding input text based on the word importance measurement model and a sentence importance measurement unit for measuring the importance of each of at least one sentence included in the text based on the measured importance of each of the multiple words.
    Type: Grant
    Filed: November 24, 2021
    Date of Patent: January 23, 2024
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yong-Jin Bae, Joon-Ho Lim, Min-Ho Kim, Hyun Kim, Hyun-Ki Kim, Ji-Hee Ryu, Kyung-Man Bae, Hyung-Jik Lee, Soo-Jong Lim, Myung-Gil Jang, Mi-Ran Choi, Jeong Heo
  • Patent number: 11880545
    Abstract: Systems and methods disclosed herein relate to assigning dynamic eye-gaze dwell-times. Dynamic dwell-times may be tailored to the individual user. For example, a dynamic dwell-time system may be configured to receive data from the user, such as the duration of time the user takes to execute certain actions within applications (e.g., read a word suggestion before actually selecting it). The dynamic dwell-time system may also prevent users from making unintended selections by providing different dwell times for different buttons. Specifically, on a user interface, longer dwell times may be established for the critical keys (e.g., “close” program key, “send” key, word suggestions, and the like) and shorter dwell times may be established for the less critical keys (e.g., individual character keys on a virtual keyboard, spacebar, backspace, and the like).
    Type: Grant
    Filed: June 24, 2021
    Date of Patent: January 23, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dmytro Rudchenko, Eric N. Badger, Akhilesh Kaza, Jacob Daniel Cohen, Peter John Ansell, Jonathan T. Campbell, Harish S. Kulkarni
  • Patent number: 11880645
    Abstract: Systems and methods for generating encoded text representations of spoken utterances are disclosed. Audio data is received for a spoken utterance and analyzed to identify a nonverbal characteristic, such as a sentiment, a speaking rate, or a volume. An encoded text representation of the spoken utterance is generated, comprising a text transcription and a visual representation of the nonverbal characteristic. The visual representation comprises a geometric element, such as a graph or shape, or a variation in a text attribute, such as font, font size, or color. Analysis of the audio data and/or generation of the encoded text representation can be performed using machine learning.
    Type: Grant
    Filed: June 15, 2022
    Date of Patent: January 23, 2024
    Assignee: T-Mobile USA, Inc.
    Inventors: Peter P. Myron, Michael Mitchell
  • Patent number: 11880806
    Abstract: In an illustrative embodiment, systems and methods for automating recorded candidate assessments include receiving a submission for an available position including a question response recording for each of one or more interview questions. For each question response recording, a transcript can be generated by applying a speech-to-text algorithm to an audio portion of the recording. The systems and methods can detect, within the transcript, identifiers each associated with the personality aspects by applying a natural language classifier trained to detect words and phrases associated with the personality aspects of the personality model. Scores may be calculated for each of the personality aspects based on a relevance of the respective personality aspect to the respective interview question and detected identifiers. The scores can be presented within a user interface screen responsive to receiving a request to view interview results.
    Type: Grant
    Filed: July 7, 2021
    Date of Patent: January 23, 2024
    Assignee: Cut-E Assessment Global Holdings Limited
    Inventors: Achim Preuss, Richard Justenhoven, Niels Kruse, Nicholas Martin
  • Patent number: 11881302
    Abstract: In some aspects, a method of using a virtual medical assistant to assist a medical professional, the virtual medical assistant implemented, at least in part, by at least one processor of a host device capable of connecting to at least one network is provided. The method comprises receiving free-form instruction from the medical professional, providing the free-form instruction for processing to assist in identifying from the free-form instruction at least one medical task to be performed, obtaining identification of at least one impediment to performing the at least one medical task, and inferring at least some information needed to overcome the at least one impediment.
    Type: Grant
    Filed: November 11, 2019
    Date of Patent: January 23, 2024
    Assignee: Microsoft Technology Licensing, LLC.
    Inventors: Guido Gallopyn, Reid W. Coleman
  • Patent number: 11881209
    Abstract: Disclosed are an artificial intelligence (AI) system using a machine learning algorithm such as deep learning, and an application thereof. The present disclosure provides an electronic device comprising: an input unit for receiving content data; a memory for storing information on the content data; an audio output unit for outputting the content data; and a processor, which acquires a plurality of data keywords by analyzing the inputted content data, matches and stores time stamps, of the content data, respectively corresponding to the plurality of acquired keywords, based on a user command being inputted, searches for a data keyword corresponding to the inputted user command among the stored data keywords, and plays the content data based on the time stamp corresponding to the searched data keyword.
    Type: Grant
    Filed: January 21, 2022
    Date of Patent: January 23, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chan-jong Park, Ji-man Kim, Do-jun Yang, Hyun-woo Lee
  • Patent number: 11881010
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for machine learning for video analysis and feedback. In some implementations, a machine learning model is trained to classify videos into performance level classifications based on characteristics of image data and audio data in the videos. Video data captured by a device of a user following a prompt that the device provides to the user is received. A set of feature values that describe audio and video characteristics of the video data are determined. The set of feature values are provided as input to the trained machine learning model to generate output that classifies the video data with respect to the performance level classifications. A user interface of the device is updated based on the performance level classification for the video data.
    Type: Grant
    Filed: September 1, 2022
    Date of Patent: January 23, 2024
    Assignee: Voomer, Inc.
    Inventor: David Wesley Anderton-Yang
  • Patent number: 11875883
    Abstract: Methods and systems for natural language processing/understanding of voice conversations are provided. Using natural language processing, a clinical condition is extracted from a voice conversation. A clinical ontology identifies clinical concepts associated with the clinical conditions. The clinical concepts are classified for documentation. The clinical concepts are searched and validated from within an individual's longitudinal record.
    Type: Grant
    Filed: December 23, 2020
    Date of Patent: January 16, 2024
    Assignee: Cerner Innovation, Inc.
    Inventors: Leo V. Perez, Justin Morrison, Tanuj Gupta, Joe Geris, Rachel Gegen, Jacob Geers, Gyandeep Singh, Emin Agassi
  • Patent number: 11875794
    Abstract: Methods and systems for processing of voice input to identify intents and mapped standard terminologies are provided. Using natural language processing, an intent of a voice input is identified. The intent is utilized to identify a standard terminology that maps to the intent. The standard terminology is utilized to identify information relevant to the standard terminology in a patient's electronic health record.
    Type: Grant
    Filed: July 5, 2022
    Date of Patent: January 16, 2024
    Assignee: Cerner Innovation, Inc.
    Inventors: Emin Agassi, Jodi Kodish-Wachs
  • Patent number: 11875796
    Abstract: A computer implemented method includes receiving information streams on a meeting server from a set of multiple distributed devices included in a meeting, receiving audio signals representative of speech by at least two users in at least two of the information streams, receiving at least one video signal of at least one user in the information streams, associating a specific user with speech in the received audio signals as a function of the received audio and video signals, and generating a transcript of the meeting with an indication of the specific user associated with the speech.
    Type: Grant
    Filed: April 30, 2019
    Date of Patent: January 16, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lijuan Qin, Nanshan Zeng, Dimitrios Basile Dimitriadis, Zhuo Chen, Andreas Stolcke, Takuya Yoshioka, William Isaac Hinthorn, Xuedong Huang
  • Patent number: 11869017
    Abstract: Described herein are systems and methods of facilitating remote notarization of electronic instruments and storing records memorializing these notarization events. A notarization system provider may capture video feeds from different angles, and combine the feeds into a single archive multimedia file, which may be stored as a secured record of a notarization. The notarization system may be utilized as an alternative to a conventional notarization settings involving a notary, in-person. The notarization system provider may supply a signing party with an electronic instrument, which could be any instrument requiring notarization to be effective. Parties signing the document may go to a system provider's location or representatives could bring the requisite devices to the signing parties. Cameras generate video feeds of the notary, the notary, and any witnesses, and then forward the feeds to other devices in the system, allowing each party to observe the signing party sign the instrument.
    Type: Grant
    Filed: June 11, 2014
    Date of Patent: January 9, 2024
    Assignee: United Services Automobile Association (USAA)
    Inventor: Michael Joseph Gaeta
  • Patent number: 11868319
    Abstract: Providing an Artificial Intelligence (AI) and Internet of Things (IoT) based system and method that predicts the chronological requirements for various components of the file-being-stored, and then takes an appropriate storage action on each component based on the predicted chronological requirements.
    Type: Grant
    Filed: December 8, 2021
    Date of Patent: January 9, 2024
    Assignee: International Business Machines Corporation
    Inventors: Raghuveer Prasad Nagar, Harshit Sharma, Satisha C Honnavalli, Parvathy Rajeev