Patents by Inventor Alexander Waibel

Alexander Waibel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Providing off-the-record functionality during virtual meetings

Patent number: 11974074

Abstract: A system for providing off-the-record functionality is provided herein. The system may include a processor configured to execute processor-executable instructions stored in non-transitory computer-readable medium to establish a video conference having a plurality of participants, each participant of the plurality of participants exchanging a plurality of audio or video streams via the video conference. The processor may also be configured to receive, from a first client device associated with one of the plurality of participants, a first audio stream or a first video stream of the plurality of audio or video streams, and record the plurality of audio or video streams within a recording. The processor may also be configured to receive an off-the-record request to begin an off-the-record time period, and in response to the off-the-record request, prevent at least one of the first audio stream or the first video stream from being included in the recording.

Type: Grant

Filed: April 29, 2022

Date of Patent: April 30, 2024

Assignee: Zoom Video Communications, Inc.

Inventors: Shane P. Springer, Alexander Waibel
Lexicon development via shared translation database

Patent number: 11972227

Abstract: A speech translation system and methods for cross-lingual communication that enable users to improve and customize content and usage of the system and easily. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.

Type: Grant

Filed: December 7, 2021

Date of Patent: April 30, 2024

Assignee: Meta Platforms, Inc.

Inventors: Alexander Waibel, Ian R. Lane
ACCENT CONVERSION FOR VIRTUAL CONFERENCES

Publication number: 20240098218

Abstract: One example method includes receiving, during a virtual conference hosted by a virtual conference provider, a first audio stream comprising speech having first speech patterns according to a first accent, the first audio stream received from a first client device associated with a first participant in the virtual conference; generating, by a first trained machine learning (“ML”) model, a second audio stream comprising the speech having second speech patterns according to a second accent; and outputting the second audio stream.

Type: Application

Filed: January 30, 2023

Publication date: March 21, 2024

Inventors: Tuan Nam Nguyen, Alexander Waibel
ENFORCING CONSENT REQUIREMENTS FOR SHARING VIRTUAL MEETING RECORDINGS

Publication number: 20230351060

Abstract: Systems and methods for enforcing consent requirements for sharing virtual meeting recordings are provided herein. In an example, a method may include receiving, from a first client device, a recording privacy request associated with a virtual meeting, and receiving, from a second client device, a request to share a recording of the virtual meeting with one or more recipients. The method may also include modifying, by a video conference provider, at least one of a first audio stream or a first video stream associated with the first client device in the recording based on the recording privacy request, and generating, by the video conference provider, a privatized recording based on the modification of at least one of the first audio stream or the first video stream. The method may also include transmitting, by the video conference provider, the privatized recording to the one or more recipients.

Type: Application

Filed: April 29, 2022

Publication date: November 2, 2023

Inventors: Shane Paul Springer, Alexander Waibel
PROVIDING AUTOMATED PERSONAL PRIVACY DURING VIRTUAL MEETINGS

Publication number: 20230351059

Abstract: Systems and methods for providing automated personal privacy during virtual meetings are provided herein. The method may include establishing, by a video conference provider, a video conference having a plurality of participants. The method may also include receiving, from a first client device associated with one of the plurality of participants, a first audio stream and a first video stream, and recording responsive to an indication from one of the plurality of participants, one or more audio or video streams within a recording. The method may include receiving, from the first client device, a personal privacy request. In response to the personal privacy request, the method may include modifying, by the video conference provider, at least one of the first audio stream or the first video stream in the recording and storing the least one of the first audio stream or the first video stream as modified to the recording.

Type: Application

Filed: April 29, 2022

Publication date: November 2, 2023

Applicant: Zoom Video Communications, Inc.

Inventors: Shane P. SPRINGER, Alexander Waibel
PROVIDING INSTANT PROCESSING OF VIRTUAL MEETING RECORDINGS

Publication number: 20230353704

Abstract: Systems and methods for providing instant processing of virtual meeting recordings are provided.

Type: Application

Filed: April 29, 2022

Publication date: November 2, 2023

Inventors: Shane Paul Springer, Alexander Waibel
DELTA MODELS FOR PROVIDING PRIVATIZED SPEECH-TO-TEXT DURING VIRTUAL MEETINGS

Publication number: 20230352026

Abstract: Provided herein are systems and methods for delta models for providing privatized speech-to-text during virtual meetings. In one embodiment, a system may include a non-transitory computer-readable medium; a communications interface; and a processor. The processor may be configured to execute processor-executable instructions to: join a virtual meeting. Each participant in the virtual meeting may exchange audio streams with other participants in the virtual meeting. The instructions may include receiving, from a video conference provider, a local model for speech recognition. The local model may be a copy of a centralized model. The instructions may include performing speech recognition using the local model on the audio streams.

Type: Application

Filed: April 29, 2022

Publication date: November 2, 2023

Inventors: Shane Paul Springer, Alexander Waibel
PROVIDING OFF-THE-RECORD FUNCTIONALITY DURING VIRTUAL MEETINGS

Publication number: 20230353708

Abstract: A system for providing off-the-record functionality is provided herein. The system may include a processor configured to execute processor-executable instructions stored in non-transitory computer-readable medium to establish a video conference having a plurality of participants, each participant of the plurality of participants exchanging a plurality of audio or video streams via the video conference. The processor may also be configured to receive, from a first client device associated with one of the plurality of participants, a first audio stream or a first video stream of the plurality of audio or video streams, and record the plurality of audio or video streams within a recording. The processor may also be configured to receive an off-the-record request to begin an off-the-record time period, and in response to the off-the-record request, prevent at least one of the first audio stream or the first video stream from being included in the recording.

Type: Application

Filed: April 29, 2022

Publication date: November 2, 2023

Applicant: Zoom Video Communications, Inc.

Inventors: Shane P. SPRINGER, Alexander Waibel
INCREMENTAL POST-EDITING AND LEARNING IN SPEECH TRANSCRIPTION AND TRANSLATION SERVICES

Publication number: 20230186899

Abstract: Computer systems and computer-implemented methods provide for interactive and incremental post-editing of real-time speech transcription and translation. A first component is automatic identification of potentially problematic regions in the output (e.g., transcription or translation) that are either likely to be technically processed badly or risky in terms of their content or expression. A second component is intelligent, efficient interfaces that permit multiple editors to correct system output concurrently, collaboratively, efficiently, and simultaneously, so that corrections can be seamlessly inserted and become part of a running presentation. A third component is incremental learning and adaptation that allows the system to use the human corrective feedback to deliver instantaneous improvement of system behavior down-stream. A fourth component is transfer learning to transfer short-term learning into long term learning if the modifications warrant long-term retention.

Type: Application

Filed: April 2, 2021

Publication date: June 15, 2023

Inventors: Alexander Waibel, Sebastian Stuker
Voice agent for sidebars during virtual meetings

Patent number: 11671472

Abstract: Systems and methods for providing a voice agent for sidebars during virtual meetings are provided. In an Example, a system including a non-transitory computer-readable medium, a communications interface, and a processor is provided. The processor configured to execute processor-executable instructions stored in the non-transitory computer-readable medium to: establish a video conference, receive, from a first client device, a request for a sidebar meeting, and transmit to the first client device: a first set of audio and video streams corresponding to a main meeting, and a second set of audio and video streams corresponding to the sidebar meeting. The processor may be configured to identify, by a voice agent, an attention cue in an audio stream from the first set of audio and video streams, and generate, by the voice agent, an alert based on the attention cue identified in the audio stream.

Type: Grant

Filed: January 31, 2022

Date of Patent: June 6, 2023

Assignee: Zoom Video Communications, Inc.

Inventor: Alexander Waibel
ERROR-CORRECTION AND EXTRACTION IN REQUEST DIALOGS

Publication number: 20230013768

Abstract: A system comprises a machine that is configured to act upon requests from a user and sensing means for sensing an operational-mode dialog stream from the user for the machine. The system also comprises a computing system that is configured to train a neural network through machine learning to output, for each training example in a training dialog stream dataset, a corrected request for the machine. The computing system is also configure to, in an operational mode, using the trained neural network, generate a corrected, operational-mode request for the machine based on the operational-mode dialog stream from the user for the machine, wherein the operational-mode dialog stream is sensed by the sensing means.

Type: Application

Filed: December 14, 2020

Publication date: January 19, 2023

Inventors: Stefan Constantin, Alexander Waibel
LEXICON DEVELOPMENT VIA SHARED TRANSLATION DATABASE

Publication number: 20220092278

Abstract: A speech translation system and methods for cross-lingual communication that enable users to improve and customize content and usage of the system and easily. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.

Type: Application

Filed: December 7, 2021

Publication date: March 24, 2022

Inventors: Alexander Waibel, Ian R. Lane
NEURAL MODULATION CODES FOR MULTILINGUAL AND STYLE DEPENDENT SPEECH AND LANGUAGE PROCESSING

Publication number: 20220059083

Abstract: Computer-implemented methods and apparatus that use neural modulation codes as an alternative to training many individual recognition models or to loosing performance by training mixed models. Large neural models are modulated by codes that model the different conditions. The codes directly alter (modulate) the behavior of connections in a multiconditional perceptual classifier, so as to permit the most appropriate neuronal units and their features to be applied to each condition. The approach may be applied to multilingual ASR, where the resulting multilingual network arrangement is able to achieve performance that is competitive or better than individually trained mono-lingual network. Moreover, the approach requires no adaptation data or extensive adaptation/training time to operate in a manner tuned to each condition. Beyond multilingual speech processing systems the approach can be applied to many other perceptual processing problems (e.g.

Type: Application

Filed: June 7, 2019

Publication date: February 24, 2022

Applicant: Interactive-AI, LLC

Inventors: Markus MULLER, Alexander WAIBEL
Translation training with cross-lingual multi-media support

Patent number: 11256882

Abstract: An improved lecture support system integrates multi-media presentation materials with spoken content so that the listener can follow with both the speech and the supporting materials that accompany the presentation to provide additional understanding. Computer-based systems and methods are disclosed for translation of a spoken presentation (e.g., a lecture, a video) along with the accompanying presentation materials. The content of the presentation materials can be used to improve presentation translation, as it extracts supportive material from the presentation materials as they relate to the speech.

Type: Grant

Filed: November 5, 2020

Date of Patent: February 22, 2022

Assignee: Meta Platforms, Inc.

Inventor: Alexander Waibel
Lexicon development via shared translation database

Patent number: 11222185

Abstract: A speech translation system and methods for cross-lingual communication that enable users to improve and customize content and usage of the system and easily. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.

Type: Grant

Filed: September 1, 2017

Date of Patent: January 11, 2022

Assignee: Meta Platforms, Inc.

Inventors: Alexander Waibel, Ian R. Lane
Translation training with cross-lingual multi-media support

Patent number: 10839169

Abstract: An improved lecture support system integrates multi-media presentation materials with spoken content so that the listener can follow with both the speech and the supporting materials that accompany the presentation to provide additional understanding. Computer-based systems and methods are disclosed for translation of a spoken presentation (e.g., a lecture, a video) along with the accompanying presentation materials. The content of the presentation materials can be used to improve presentation translation, as it extracts supportive material from the presentation materials as they relate to the speech.

Type: Grant

Filed: May 8, 2019

Date of Patent: November 17, 2020

Assignee: Facebook, Inc.

Inventor: Alexander Waibel
Training statistical speech translation systems from speech

Patent number: 10755054

Abstract: An iterative language translation system includes multiple communicatively connected statistical speech translation systems. The system includes an automatic speech recognition component adapted to recognize spoken language in a source language and to create a source language hypothesis. A machine translation component is adapted to translate the source language hypothesis into a target language. The system also includes a second automatic speech recognition component and second machine translation component. The translation results are used to adapt the automatic speech recognition components and the language hypotheses are used to adapt the machine translation components.

Type: Grant

Filed: June 15, 2018

Date of Patent: August 25, 2020

Assignee: Facebook, Inc.

Inventors: Alexander Waibel, Matthias Paulik
Device for extracting information from a dialog

Patent number: 10606942

Abstract: Computer-implemented systems and methods for extracting information during a human-to-human mono-lingual or multi-lingual dialog between two speakers are disclosed. Information from either the recognized speech (or the translation thereof) by the second speaker and/or the recognized speech by the first speaker (or the translation thereof) is extracted. The extracted information is then entered into an electronic form stored in a data store.

Type: Grant

Filed: April 25, 2019

Date of Patent: March 31, 2020

Assignee: Facebook, Inc.

Inventor: Alexander Waibel
Incorporation of user-provided natural language translations in a social networking system

Patent number: 10528677

Abstract: A social networking system determines whether a particular user is qualified to provide translations of text from a first language to a second language. The determination may include evaluation of the language competencies of the user, and also of the trustworthiness of the user as a translator, as determined based on prior translations submitted by the user. The social networking system also selects translations of a text item for a user to whom that text is to be shown. When evaluating a candidate translation for presentation to the user, the evaluation may assess factors such as the determined qualification as a translator of the user who provided the candidate translation; a quality score of the candidate translation itself; and/or the similarity of the user viewing the content and the user providing the candidate translation.

Type: Grant

Filed: February 21, 2019

Date of Patent: January 7, 2020

Assignee: Facebook, Inc.

Inventors: Ying Zhang, Alexander Waibel
Device for Extracting Information from a Dialog

Publication number: 20190251156

Abstract: Computer-implemented systems and methods for extracting information during a human-to-human mono-lingual or multi-lingual dialog between two speakers are disclosed. Information from either the recognized speech (or the translation thereof) by the second speaker and/or the recognized speech by the first speaker (or the translation thereof) is extracted. The extracted information is then entered into an electronic form stored in a data store.

Type: Application

Filed: April 25, 2019

Publication date: August 15, 2019

Inventor: Alexander Waibel

1 2 3 4 next