Patents by Inventor David Yuheng Zhao

David Yuheng Zhao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230334322
    Abstract: Apparatus and methods are disclosed for using machine learning models with private and public domains. Operations can be applied to transform input to a machine learning model in a private domain that is kept secret or otherwise made unavailable to third parties. In one example of the disclosed technology, a method includes applying a private transform to produce transformed input, providing the transformed input to a machine learning model that was trained using a training set modified by the private transform, and generating inferences with the machine learning model using the transformed input. Examples of suitable transforms that can be employed include matrix multiplication, time or spatial domain to frequency domains, and partitioning a neural network model such that an input and at least one hidden layer form part of the private domain, while the remaining layers form part of the public domain.
    Type: Application
    Filed: June 23, 2023
    Publication date: October 19, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Sriram Srinivasan, David Yuheng Zhao, Ming-Chieh Lee, Mu Han
  • Publication number: 20230308699
    Abstract: Systems and methods may be used for access control. These systems and methods may include using a data processing system to access a video stream, the video stream including an image including a virtual background, segmenting the image into a foreground portion and a background portion to determine whether the foreground portion or the background portion of the image meets a threshold requirement, and outputting an alert in response to determining that the foreground portion or the background portion of the image fails to meet the threshold requirement.
    Type: Application
    Filed: August 19, 2021
    Publication date: September 28, 2023
    Inventors: Henrik Valdemar TURBELL, David Yuheng ZHAO, Qingfen LIN
  • Publication number: 20230308492
    Abstract: Systems and methods may be used for access control. These systems and methods may include using a data processing system to identify a video stream, determine an identification of a source type of the video stream, determine whether the video stream is live or pre-recorded based on a determined type of the source type, and output an alert to a device in response to determining that the source type indicates the video stream is pre-recorded.
    Type: Application
    Filed: August 19, 2021
    Publication date: September 28, 2023
    Inventors: Henrik Valdemar TURBELL, David Yuheng ZHAO, Qingfen LIN
  • Patent number: 11763157
    Abstract: Apparatus and methods are disclosed for using machine learning models with private and public domains. Operations can be applied to transform input to a machine learning model in a private domain that is kept secret or otherwise made unavailable to third parties. In one example of the disclosed technology, a method includes applying a private transform to produce transformed input, providing the transformed input to a machine learning model that was trained using a training set modified by the private transform, and generating inferences with the machine learning model using the transformed input. Examples of suitable transforms that can be employed include matrix multiplication, time or spatial domain to frequency domains, and partitioning a neural network model such that an input and at least one hidden layer form part of the private domain, while the remaining layers form part of the public domain.
    Type: Grant
    Filed: March 24, 2020
    Date of Patent: September 19, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sriram Srinivasan, David Yuheng Zhao, Ming-Chieh Lee, Mu Han
  • Patent number: 11699084
    Abstract: An agent interfaces with a sending computing device and a receiving computing device to automatically adjust one-way or two-way real-time audio and real-time video transmission parameters responsive to changing network conditions and/or application requirements. The agent incorporates a reinforcement learning model that adjusts transmission parameters to maximize an expected value of a sum of future rewards; the expected value of the sum of future rewards is based on a current state of the sending computing device, a current action (e.g. a current set of transmission parameters) at the sending computing device and a reward provided by the receiving computing device. The reward is representative of a user-perceived quality of experience at the receiving computing device.
    Type: Grant
    Filed: May 24, 2022
    Date of Patent: July 11, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Joyce Fang, Martin Ellis, Ross Garrett Cutler, David Yuheng Zhao, Thang Duong
  • Publication number: 20230188743
    Abstract: Techniques are described for efficiently embedding frame masks in a video stream. In some solutions, a computer implemented method includes operations for encoding a frame of video data comprising an array of pixels to generate an encoded video frame, and transmitting the encoded video frame to a video decoder. The array of pixels includes foreground pixels and background pixels. The foreground pixels have respective first chroma component values which are bounded within a first chroma component range. Encoding the frame of video data can include converting the first chroma component values of the foreground pixels to second chroma component values by applying a mapping function. The second chroma component values are bounded within a second chroma component range. The second chroma component range is shifted or compressed from the first chroma component range.
    Type: Application
    Filed: February 9, 2023
    Publication date: June 15, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar TURBELL, David Yuheng ZHAO, Zhuangfei WU
  • Patent number: 11601665
    Abstract: Techniques are described for efficiently embedding frame masks in a video stream. In some solutions, a computer implemented method includes operations for encoding a frame of video data comprising an array of pixels to generate an encoded video frame and transmitting the encoded video frame. The array of pixels can include foreground pixels and background pixels. The foreground pixels can have respective original luma values which are bounded within a first luma range. In certain examples, encoding the frame of video data can include converting the original luma values of the foreground pixels to updated luma values which are bounded within a second luma range. The second luma range can be shifted and/or compressed from the first luma range.
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: March 7, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar Turbell, David Yuheng Zhao, Zhuangfei Wu
  • Patent number: 11582464
    Abstract: A computer implemented method can decode a frame of video data comprising an array of pixels to obtain decoded luma values and decoded chroma values corresponding to the array of pixels, and extract a frame mask based on the decoded luma values. The frame mask can include an array of mask values respectively corresponding to the array of pixels. A mask value indicates whether a corresponding pixel is in the foreground or background of the frame. The method can perform a morphological operation on the frame mask to change one or more mask values to indicate their corresponding pixels are removed from the foreground and added to the background of the frame. The method can also identify foreground pixels after performing the morphological operation on the frame mask, and render a foreground image for display based on the decoded luma values and decoded chroma values of the foreground pixels.
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: February 14, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar Turbell, David Yuheng Zhao, Zhuangfei Wu
  • Publication number: 20220417532
    Abstract: A computer implemented method can decode a frame of video data comprising an array of pixels to obtain decoded luma values and decoded chroma values corresponding to the array of pixels, and extract a frame mask based on the decoded luma values. The frame mask can include an array of mask values respectively corresponding to the array of pixels. A mask value indicates whether a corresponding pixel is in the foreground or background of the frame. The method can perform a morphological operation on the frame mask to change one or more mask values to indicate their corresponding pixels are removed from the foreground and added to the background of the frame. The method can also identify foreground pixels after performing the morphological operation on the frame mask, and render a foreground image for display based on the decoded luma values and decoded chroma values of the foreground pixels.
    Type: Application
    Filed: June 23, 2021
    Publication date: December 29, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar TURBELL, David Yuheng ZHAO, Zhuangfei WU
  • Publication number: 20220417546
    Abstract: Techniques are described for efficiently embedding frame masks in a video stream. In some solutions, a computer implemented method includes operations for encoding a frame of video data comprising an array of pixels to generate an encoded video frame and transmitting the encoded video frame. The array of pixels can include foreground pixels and background pixels. The foreground pixels can have respective original luma values which are bounded within a first luma range. In certain examples, encoding the frame of video data can include converting the original luma values of the foreground pixels to updated luma values which are bounded within a second luma range. The second luma range can be shifted and/or compressed from the first luma range.
    Type: Application
    Filed: June 23, 2021
    Publication date: December 29, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar TURBELL, David Yuheng ZHAO, Zhuangfei WU
  • Publication number: 20220300841
    Abstract: An agent interfaces with a sending computing device and a receiving computing device to automatically adjust one-way or two-way real-time audio and real-time video transmission parameters responsive to changing network conditions and/or application requirements. The agent incorporates a reinforcement learning model that adjusts transmission parameters to maximize an expected value of a sum of future rewards; the expected value of the sum of future rewards is based on a current state of the sending computing device, a current action (e.g. a current set of transmission parameters) at the sending computing device and a reward provided by the receiving computing device. The reward is representative of a user-perceived quality of experience at the receiving computing device.
    Type: Application
    Filed: May 24, 2022
    Publication date: September 22, 2022
    Inventors: Joyce FANG, Martin ELLIS, Ross Garrett CUTLER, David Yuheng ZHAO, Thang DUONG
  • Patent number: 11373108
    Abstract: An agent interfaces with a sending computing device and a receiving computing device to automatically adjust one-way or two-way real-time audio and real-time video transmission parameters responsive to changing network conditions and/or application requirements. The agent incorporates a reinforcement learning model that adjusts transmission parameters to maximize an expected value of a sum of future rewards; the expected value of the sum of future rewards is based on a current state of the sending computing device, a current action (e.g. a current set of transmission parameters) at the sending computing device and a reward provided by the receiving computing device. The reward is representative of a user-perceived quality of experience at the receiving computing device.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: June 28, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Joyce S Fang, Martin Ellis, Ross Garrett Cutler, David Yuheng Zhao, Thang Duong
  • Publication number: 20210133577
    Abstract: Apparatus and methods are disclosed for using machine learning models with private and public domains. Operations can be applied to transform input to a machine learning model in a private domain that is kept secret or otherwise made unavailable to third parties. In one example of the disclosed technology, a method includes applying a private transform to produce transformed input, providing the transformed input to a machine learning model that was trained using a training set modified by the private transform, and generating inferences with the machine learning model using the transformed input. Examples of suitable transforms that can be employed include matrix multiplication, time or spatial domain to frequency domains, and partitioning a neural network model such that an input and at least one hidden layer form part of the private domain, while the remaining layers form part of the public domain.
    Type: Application
    Filed: March 24, 2020
    Publication date: May 6, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Sriram Srinivasan, David Yuheng Zhao, Ming-Chieh Lee, Mu Han
  • Publication number: 20210012227
    Abstract: An agent interfaces with a sending computing device and a receiving computing device to automatically adjust one-way or two-way real-time audio and real-time video transmission parameters responsive to changing network conditions and/or application requirements. The agent incorporates a reinforcement learning model that adjusts transmission parameters to maximize an expected value of a sum of future rewards; the expected value of the sum of future rewards is based on a current state of the sending computing device, a current action (e.g. a current set of transmission parameters) at the sending computing device and a reward provided by the receiving computing device. The reward is representative of a user-perceived quality of experience at the receiving computing device.
    Type: Application
    Filed: July 10, 2019
    Publication date: January 14, 2021
    Inventors: Joyce S FANG, Martin ELLIS, Ross Garrett CUTLER, David Yuheng ZHAO, Thang DUONG
  • Patent number: 10798141
    Abstract: Multiple data streams are transmitted from a transmitting device via an end-to-end or process-to-process channel to a receiving device. At the transmitting device a media data stream is generated. A maximum packet size for another data stream is determined based on a determined end-to-end or process-to-process bandwidth and using playout information pertaining to the media data, which conveys a desired playout rate of the media data. At the transmitting device, based on the determined maximum packet size, a plurality of data packets of the other data stream is generated, each having a packet size no more than the determined maximum. The media and other data streams are multiplexed onto the channel at the transmitting device, thereby transmitting both of the data streams from the transmitting device to the receiving device via the same channel.
    Type: Grant
    Filed: November 16, 2018
    Date of Patent: October 6, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sergey Sablin, Sergei Nikiforov, Matthieu Michel Jean-Alexis Moinard, Victor Miguel Barbero, David Yuheng Zhao, Mattias D. Nilsson
  • Publication number: 20200304713
    Abstract: A method and system for providing an enhanced video stream includes receiving a video stream captured by a camera positioned to include a target in a camera field of view and, before the target has been detected, entering a detection mode to detect the target in the received video stream. Upon detecting the target in the video stream, the method automatically switches from the detection mode to an enhancement mode configured to process the video stream to obtain an enhanced video stream of the detected target. The enhanced video stream is then presented for display on a display device, and a position of at least one of the camera or the target is monitored while the enhanced video stream is presented. The method and system then detect a change in the position of at least one of the camera or the target, and upon detecting the change in the position, automatically generate a signal to switch back to the detection mode and detect a position of the target in the video stream.
    Type: Application
    Filed: July 15, 2019
    Publication date: September 24, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David Yuheng ZHAO, Henrik TURBELL
  • Patent number: 10609332
    Abstract: According to a disclosed example, a first video stream is captured via a first camera associated with a first communication device engaged in a multi-party video conference. The first video stream includes a plurality of two-dimensional image frames. A subset of pixels corresponding to a first human subject is identified within each image frame of the first video stream. A second video stream is captured via a second camera associated with a second communication device engaged in the multi-party video conference. A composite video stream formed by at least a portion of the second video stream and the subset of pixels of the first video stream is rendered, and the composite video stream is output for display at one or more of the first and/or second communication devices. The composite video stream may provide the appearance of remotely located participants being physically present within the same visual scene.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: March 31, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Henrik Valdemar Turbell, Matthew Jason Pope, John C. Tang, Gina D. Venolia, David Yuheng Zhao, Kori Marie Inkpen
  • Patent number: 10595025
    Abstract: A transmitting device for generating a plurality of encoded portions of a video to be transmitted to a receiving device over a network configured to: receive an error message over a feedback channel from the receiving device indicating at least one of said plurality of encoded portions that has been lost at the receiving device; encode a recovery portion responsive to said receiving said error message; and transmit said recovery portion to the receiving device over said network; wherein said error message includes information pertaining to a decoded portion successfully decoded at the receiving device and said recovery portion is encoded relative to said decoded portion.
    Type: Grant
    Filed: September 8, 2015
    Date of Patent: March 17, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ming-Chieh Lee, Amy Lu, Pontus Carlsson, Mattias Dan Nilsson, Sergey Sablin, Sergey Silkin, David Yuheng Zhao, Magnus Hemmendorff, Sergei Nikiforov
  • Patent number: 10412305
    Abstract: Method, device and computer program product for stabilizing a video signal. A plurality of frames of the video signal are captured using a camera. A motion sensor associated with the camera is used to generate a plurality of samples representing motion of the camera. The samples are used to determine a displacement of the camera between a first time and a second time, wherein the first time corresponds to an exposure time midpoint of a first frame of the video signal and the second time corresponds to an exposure time midpoint of a second frame of the video signal. The determined displacement is used to compensate for motion in the video signal between the first and second frames caused by the motion of the camera, to thereby stabilize the video signal.
    Type: Grant
    Filed: April 1, 2014
    Date of Patent: September 10, 2019
    Assignee: SKYPE
    Inventors: Andrei Jefremov, David Yuheng Zhao, Pontus Carlsson
  • Patent number: 10313685
    Abstract: A transmitting device for generating a plurality of encoded portions of a video to be transmitted to a receiving device over a network configured to: receive an error message over a feedback channel from the receiving device indicating at least one of said plurality of encoded portions that has been lost at the receiving device; encode a recovery portion responsive to said receiving said error message; and transmit said recovery portion to the receiving device over said network; wherein said error message includes information pertaining to a decoded portion successfully decoded at the receiving device and said recovery portion is encoded relative to said decoded portion.
    Type: Grant
    Filed: September 8, 2015
    Date of Patent: June 4, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ming-Chieh Lee, Amy Lu, Pontus Carlsson, Mattias Dan Nilsson, Sergey Sablin, Sergey Silkin, David Yuheng Zhao, Magnus Hemmendorff, Sergei Nikiforov
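
The frame-mask abstracts listed above (patent numbers 11601665 and 11582464) describe compressing foreground luma values into a narrower range at the encoder, then extracting the mask from decoded luma and applying a morphological operation at the decoder. The following is only a rough sketch of that general idea, not the patented method. Every constant and function name here (FG_MIN, BG_LUMA, THRESHOLD, embed_mask, extract_mask, restore_luma, erode) is a hypothetical choice made for illustration; the abstracts give no concrete ranges, thresholds, or structuring elements, and the sketch ignores chroma and codec loss.

```python
# Illustrative sketch only: all constants below are assumptions, not values
# taken from the patents.
FULL_MIN, FULL_MAX = 0, 255   # assumed original 8-bit luma range
FG_MIN = 32                   # assumed lower bound of the compressed foreground range
BG_LUMA = 0                   # assumed luma value written for background pixels
THRESHOLD = FG_MIN // 2       # assumed decoder-side threshold separating the two


def embed_mask(luma, mask):
    """Compress foreground luma into [FG_MIN, FULL_MAX]; set background to BG_LUMA."""
    scale = (FULL_MAX - FG_MIN) / (FULL_MAX - FULL_MIN)
    return [
        [round(FG_MIN + y * scale) if fg else BG_LUMA for y, fg in zip(row_y, row_m)]
        for row_y, row_m in zip(luma, mask)
    ]


def extract_mask(encoded):
    """Recover the mask: pixels at or above THRESHOLD are treated as foreground."""
    return [[y >= THRESHOLD for y in row] for row in encoded]


def restore_luma(encoded):
    """Approximately undo the compression for foreground pixels (background -> 0)."""
    scale = (FULL_MAX - FULL_MIN) / (FULL_MAX - FG_MIN)
    return [
        [
            max(FULL_MIN, min(FULL_MAX, round((y - FG_MIN) * scale)))
            if y >= THRESHOLD
            else 0
            for y in row
        ]
        for row in encoded
    ]


def erode(mask):
    """4-neighbour binary erosion: a pixel stays foreground only if it and all
    of its in-frame neighbours are foreground (out-of-frame neighbours are
    treated as foreground, which is an arbitrary choice for this sketch)."""
    h, w = len(mask), len(mask[0])

    def fg(r, c):
        return mask[r][c] if 0 <= r < h and 0 <= c < w else True

    return [
        [
            mask[r][c] and fg(r - 1, c) and fg(r + 1, c) and fg(r, c - 1) and fg(r, c + 1)
            for c in range(w)
        ]
        for r in range(h)
    ]


if __name__ == "__main__":
    luma = [[0, 128, 255], [10, 200, 50]]
    mask = [[True, True, False], [False, True, True]]
    encoded = embed_mask(luma, mask)
    print(extract_mask(encoded) == mask)
    print(restore_luma(encoded))
```

In this sketch, erosion shrinks the foreground region, which corresponds to the abstract's description of moving pixels from the foreground to the background; a real decoder would also have to cope with luma values perturbed by lossy compression, which is not modelled here.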