Patents by Inventor Alexander Waibel

Alexander Waibel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11974074
    Abstract: A system for providing off-the-record functionality is provided herein. The system may include a processor configured to execute processor-executable instructions stored in non-transitory computer-readable medium to establish a video conference having a plurality of participants, each participant of the plurality of participants exchanging a plurality of audio or video streams via the video conference. The processor may also be configured to receive, from a first client device associated with one of the plurality of participants, a first audio stream or a first video stream of the plurality of audio or video streams, and record the plurality of audio or video streams within a recording. The processor may also be configured to receive an off-the-record request to begin an off-the-record time period, and in response to the off-the-record request, prevent at least one of the first audio stream or the first video stream from being included in the recording.
    Type: Grant
    Filed: April 29, 2022
    Date of Patent: April 30, 2024
    Assignee: Zoom Video Communications, Inc.
    Inventors: Shane P. Springer, Alexander Waibel
  • Patent number: 11972227
    Abstract: A speech translation system and methods for cross-lingual communication that enable users to easily improve and customize the content and usage of the system. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.
    Type: Grant
    Filed: December 7, 2021
    Date of Patent: April 30, 2024
    Assignee: Meta Platforms, Inc.
    Inventors: Alexander Waibel, Ian R. Lane
  • Publication number: 20240098218
    Abstract: One example method includes receiving, during a virtual conference hosted by a virtual conference provider, a first audio stream comprising speech having first speech patterns according to a first accent, the first audio stream received from a first client device associated with a first participant in the virtual conference; generating, by a first trained machine learning (“ML”) model, a second audio stream comprising the speech having second speech patterns according to a second accent; and outputting the second audio stream.
    Type: Application
    Filed: January 30, 2023
    Publication date: March 21, 2024
    Inventors: Tuan Nam Nguyen, Alexander Waibel
  • Publication number: 20230351060
    Abstract: Systems and methods for enforcing consent requirements for sharing virtual meeting recordings are provided herein. In an example, a method may include receiving, from a first client device, a recording privacy request associated with a virtual meeting, and receiving, from a second client device, a request to share a recording of the virtual meeting with one or more recipients. The method may also include modifying, by a video conference provider, at least one of a first audio stream or a first video stream associated with the first client device in the recording based on the recording privacy request, and generating, by the video conference provider, a privatized recording based on the modification of at least one of the first audio stream or the first video stream. The method may also include transmitting, by the video conference provider, the privatized recording to the one or more recipients.
    Type: Application
    Filed: April 29, 2022
    Publication date: November 2, 2023
    Inventors: Shane Paul Springer, Alexander Waibel
  • Publication number: 20230351059
    Abstract: Systems and methods for providing automated personal privacy during virtual meetings are provided herein. The method may include establishing, by a video conference provider, a video conference having a plurality of participants. The method may also include receiving, from a first client device associated with one of the plurality of participants, a first audio stream and a first video stream, and recording, responsive to an indication from one of the plurality of participants, one or more audio or video streams within a recording. The method may include receiving, from the first client device, a personal privacy request. In response to the personal privacy request, the method may include modifying, by the video conference provider, at least one of the first audio stream or the first video stream in the recording and storing the at least one of the first audio stream or the first video stream as modified to the recording.
    Type: Application
    Filed: April 29, 2022
    Publication date: November 2, 2023
    Applicant: Zoom Video Communications, Inc.
    Inventors: Shane P. Springer, Alexander Waibel
  • Publication number: 20230353704
    Abstract: Systems and methods for providing instant processing of virtual meeting recordings are provided.
    Type: Application
    Filed: April 29, 2022
    Publication date: November 2, 2023
    Inventors: Shane Paul Springer, Alexander Waibel
  • Publication number: 20230352026
    Abstract: Provided herein are systems and methods for delta models for providing privatized speech-to-text during virtual meetings. In one embodiment, a system may include a non-transitory computer-readable medium; a communications interface; and a processor. The processor may be configured to execute processor-executable instructions to: join a virtual meeting. Each participant in the virtual meeting may exchange audio streams with other participants in the virtual meeting. The instructions may include receiving, from a video conference provider, a local model for speech recognition. The local model may be a copy of a centralized model. The instructions may include performing speech recognition using the local model on the audio streams.
    Type: Application
    Filed: April 29, 2022
    Publication date: November 2, 2023
    Inventors: Shane Paul Springer, Alexander Waibel
  • Publication number: 20230353708
    Abstract: A system for providing off-the-record functionality is provided herein. The system may include a processor configured to execute processor-executable instructions stored in non-transitory computer-readable medium to establish a video conference having a plurality of participants, each participant of the plurality of participants exchanging a plurality of audio or video streams via the video conference. The processor may also be configured to receive, from a first client device associated with one of the plurality of participants, a first audio stream or a first video stream of the plurality of audio or video streams, and record the plurality of audio or video streams within a recording. The processor may also be configured to receive an off-the-record request to begin an off-the-record time period, and in response to the off-the-record request, prevent at least one of the first audio stream or the first video stream from being included in the recording.
    Type: Application
    Filed: April 29, 2022
    Publication date: November 2, 2023
    Applicant: Zoom Video Communications, Inc.
    Inventors: Shane P. Springer, Alexander Waibel
  • Publication number: 20230186899
    Abstract: Computer systems and computer-implemented methods provide for interactive and incremental post-editing of real-time speech transcription and translation. A first component is automatic identification of potentially problematic regions in the output (e.g., transcription or translation) that are either likely to be technically processed badly or risky in terms of their content or expression. A second component is intelligent, efficient interfaces that permit multiple editors to correct system output concurrently, collaboratively, efficiently, and simultaneously, so that corrections can be seamlessly inserted and become part of a running presentation. A third component is incremental learning and adaptation that allows the system to use the human corrective feedback to deliver instantaneous improvement of system behavior down-stream. A fourth component is transfer learning to transfer short-term learning into long term learning if the modifications warrant long-term retention.
    Type: Application
    Filed: April 2, 2021
    Publication date: June 15, 2023
    Inventors: Alexander Waibel, Sebastian Stuker
  • Patent number: 11671472
    Abstract: Systems and methods for providing a voice agent for sidebars during virtual meetings are provided. In an example, a system including a non-transitory computer-readable medium, a communications interface, and a processor is provided. The processor is configured to execute processor-executable instructions stored in the non-transitory computer-readable medium to: establish a video conference, receive, from a first client device, a request for a sidebar meeting, and transmit to the first client device: a first set of audio and video streams corresponding to a main meeting, and a second set of audio and video streams corresponding to the sidebar meeting. The processor may be configured to identify, by a voice agent, an attention cue in an audio stream from the first set of audio and video streams, and generate, by the voice agent, an alert based on the attention cue identified in the audio stream.
    Type: Grant
    Filed: January 31, 2022
    Date of Patent: June 6, 2023
    Assignee: Zoom Video Communications, Inc.
    Inventor: Alexander Waibel
  • Publication number: 20230013768
    Abstract: A system comprises a machine that is configured to act upon requests from a user and sensing means for sensing an operational-mode dialog stream from the user for the machine. The system also comprises a computing system that is configured to train a neural network through machine learning to output, for each training example in a training dialog stream dataset, a corrected request for the machine. The computing system is also configured to, in an operational mode, using the trained neural network, generate a corrected, operational-mode request for the machine based on the operational-mode dialog stream from the user for the machine, wherein the operational-mode dialog stream is sensed by the sensing means.
    Type: Application
    Filed: December 14, 2020
    Publication date: January 19, 2023
    Inventors: Stefan Constantin, Alexander Waibel
  • Publication number: 20220092278
    Abstract: A speech translation system and methods for cross-lingual communication that enable users to easily improve and customize the content and usage of the system. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.
    Type: Application
    Filed: December 7, 2021
    Publication date: March 24, 2022
    Inventors: Alexander Waibel, Ian R. Lane
  • Publication number: 20220059083
    Abstract: Computer-implemented methods and apparatus that use neural modulation codes as an alternative to training many individual recognition models or to losing performance by training mixed models. Large neural models are modulated by codes that model the different conditions. The codes directly alter (modulate) the behavior of connections in a multiconditional perceptual classifier, so as to permit the most appropriate neuronal units and their features to be applied to each condition. The approach may be applied to multilingual ASR, where the resulting multilingual network arrangement is able to achieve performance that is competitive with or better than individually trained monolingual networks. Moreover, the approach requires no adaptation data or extensive adaptation/training time to operate in a manner tuned to each condition. Beyond multilingual speech processing systems the approach can be applied to many other perceptual processing problems (e.g.
    Type: Application
    Filed: June 7, 2019
    Publication date: February 24, 2022
    Applicant: Interactive-AI, LLC
    Inventors: Markus Muller, Alexander Waibel
  • Patent number: 11256882
    Abstract: An improved lecture support system integrates multi-media presentation materials with spoken content so that the listener can follow with both the speech and the supporting materials that accompany the presentation to provide additional understanding. Computer-based systems and methods are disclosed for translation of a spoken presentation (e.g., a lecture, a video) along with the accompanying presentation materials. The content of the presentation materials can be used to improve presentation translation, as it extracts supportive material from the presentation materials as they relate to the speech.
    Type: Grant
    Filed: November 5, 2020
    Date of Patent: February 22, 2022
    Assignee: Meta Platforms, Inc.
    Inventor: Alexander Waibel
  • Patent number: 11222185
    Abstract: A speech translation system and methods for cross-lingual communication that enable users to easily improve and customize the content and usage of the system. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.
    Type: Grant
    Filed: September 1, 2017
    Date of Patent: January 11, 2022
    Assignee: Meta Platforms, Inc.
    Inventors: Alexander Waibel, Ian R. Lane
  • Patent number: 10839169
    Abstract: An improved lecture support system integrates multi-media presentation materials with spoken content so that the listener can follow with both the speech and the supporting materials that accompany the presentation to provide additional understanding. Computer-based systems and methods are disclosed for translation of a spoken presentation (e.g., a lecture, a video) along with the accompanying presentation materials. The content of the presentation materials can be used to improve presentation translation, as it extracts supportive material from the presentation materials as they relate to the speech.
    Type: Grant
    Filed: May 8, 2019
    Date of Patent: November 17, 2020
    Assignee: Facebook, Inc.
    Inventor: Alexander Waibel
  • Patent number: 10755054
    Abstract: An iterative language translation system includes multiple communicatively connected statistical speech translation systems. The system includes an automatic speech recognition component adapted to recognize spoken language in a source language and to create a source language hypothesis. A machine translation component is adapted to translate the source language hypothesis into a target language. The system also includes a second automatic speech recognition component and second machine translation component. The translation results are used to adapt the automatic speech recognition components and the language hypotheses are used to adapt the machine translation components.
    Type: Grant
    Filed: June 15, 2018
    Date of Patent: August 25, 2020
    Assignee: Facebook, Inc.
    Inventors: Alexander Waibel, Matthias Paulik
  • Patent number: 10606942
    Abstract: Computer-implemented systems and methods for extracting information during a human-to-human mono-lingual or multi-lingual dialog between two speakers are disclosed. Information from either the recognized speech (or the translation thereof) by the second speaker and/or the recognized speech by the first speaker (or the translation thereof) is extracted. The extracted information is then entered into an electronic form stored in a data store.
    Type: Grant
    Filed: April 25, 2019
    Date of Patent: March 31, 2020
    Assignee: Facebook, Inc.
    Inventor: Alexander Waibel
  • Patent number: 10528677
    Abstract: A social networking system determines whether a particular user is qualified to provide translations of text from a first language to a second language. The determination may include evaluation of the language competencies of the user, and also of the trustworthiness of the user as a translator, as determined based on prior translations submitted by the user. The social networking system also selects translations of a text item for a user to whom that text is to be shown. When evaluating a candidate translation for presentation to the user, the evaluation may assess factors such as the determined qualification as a translator of the user who provided the candidate translation; a quality score of the candidate translation itself; and/or the similarity of the user viewing the content and the user providing the candidate translation.
    Type: Grant
    Filed: February 21, 2019
    Date of Patent: January 7, 2020
    Assignee: Facebook, Inc.
    Inventors: Ying Zhang, Alexander Waibel
  • Publication number: 20190251156
    Abstract: Computer-implemented systems and methods for extracting information during a human-to-human mono-lingual or multi-lingual dialog between two speakers are disclosed. Information from either the recognized speech (or the translation thereof) by the second speaker and/or the recognized speech by the first speaker (or the translation thereof) is extracted. The extracted information is then entered into an electronic form stored in a data store.
    Type: Application
    Filed: April 25, 2019
    Publication date: August 15, 2019
    Inventor: Alexander Waibel