Patents by Inventor Yongjun Wu

Yongjun Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11418809
    Abstract: Innovations for signaling state of a decoded picture buffer (“DPB”) and reference picture lists (“RPLs”). In example implementations, rather than rely on internal state of a decoder to manage and update DPB and RPLs, state information about the DPB and RPLs is explicitly signaled. This permits a decoder to determine which pictures are expected to be available for reference from the signaled state information. For example, an encoder determines state information that identifies which pictures are available for use as reference pictures (optionally considering feedback information from a decoder about which pictures are available). The encoder sets syntax elements that represent the state information. In doing so, the encoder sets identifying information for a long-term reference picture (“LTRP”), where the identifying information is a value of picture order count least significant bits for the LTRB. The encoder then outputs the syntax elements as part of a bitstream.
    Type: Grant
    Filed: December 4, 2020
    Date of Patent: August 16, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Yongjun Wu
  • Patent number: 11386644
    Abstract: An example preprocessor circuit includes: a first buffer configured to store rows of image data and output a row thereof; a second buffer, coupled to the first buffer, including storage locations to store respective image samples of the row output by the first buffer; shift registers; an interconnect network including connections, each connection coupling a respective one of the shift registers to more than one of the storage locations, one or more of the storage locations being coupled to more than one of the connections; and a control circuit configured to load the shift registers with the image samples based on the connections and shift the shift registers to output streams of image samples.
    Type: Grant
    Filed: October 17, 2017
    Date of Patent: July 12, 2022
    Assignee: XILINX, INC.
    Inventors: Elliott Delaye, Ashish Sirasao, Aaron Ng, Yongjun Wu, Jindrich Zejda
  • Publication number: 20220180898
    Abstract: Techniques are described for automating virtual placements in video content.
    Type: Application
    Filed: December 9, 2020
    Publication date: June 9, 2022
    Inventors: Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Yongjun Wu, Yash Chaturvedi, Steven James Cox, Travis Silvers, Amit S. Jain, Amjad Y. A. Abu Jbara, Prasanth Saraswatula
  • Patent number: 11356516
    Abstract: Methods and apparatus are described for facilitating the client-side caching of media content based on one or more properties of the media content. Information relating to the cacheability of different types of content is communicated to the media player on a client device in the manifest or playlist employed by the media player to request fragments of the media content. The media player uses this information to make decisions about how to cache the corresponding content.
    Type: Grant
    Filed: July 23, 2021
    Date of Patent: June 7, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Yongjun Wu, Kyle Koceski, Kyle Alexander Woo, Vasanthakumar Soundararajan, Shiranchai Taneja, Shyam Krishnan Krishnan Moni, Jake O'Halloran
  • Patent number: 11336935
    Abstract: Techniques are described for detecting desynchronization between an audio stream and a video stream.
    Type: Grant
    Filed: November 25, 2020
    Date of Patent: May 17, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Yongjun Wu, Joshua Peter Ebenezer, Sriram Sethuraman, Hai Wei, Zongyi Liu
  • Patent number: 11317172
    Abstract: Techniques for video fragment aware audio packaging that ensure a periodic and best alignment of audio and video fragments at any corresponding audio and video fragments are described. As one example, a video fragment aware audio packaging service determines a number of audio frames for a corresponding video fragment of video frames and generates an audio fragment that includes those audio frames, with flexible choices of video fragment duration, which may be considered and decided for device compatibility or content encoding optimization purposes.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: April 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Yongjun Wu, Vasanthakumar Soundararajan, Yang Yang, Sitaraman Ganapathy, Mukund Manikarnike
  • Patent number: 11297355
    Abstract: Techniques are described for creating and using playback-conditions-adaptive live video encoding ladders.
    Type: Grant
    Filed: February 18, 2021
    Date of Patent: April 5, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Hai Wei, Brian Manuel Alvarez, Yongjun Wu, Abhishek Kumar, Lei Li
  • Patent number: 11290735
    Abstract: Techniques are described for adaptive encoding of different visual elements in a video frame. Characteristics of visual elements can be determined and used to set encoding parameters for the visual elements. The visual elements can be encoded such that one visual element is encoded differently than another visual element if they have different characteristics.
    Type: Grant
    Filed: May 20, 2020
    Date of Patent: March 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Charles Benjamin Franklin Waggoner, Yongjun Wu, Hai Wei
  • Patent number: 11284103
    Abstract: Innovations in intra block copy (“BC”) prediction as well as innovations in encoder-side search patterns and approaches to partitioning. For example, some of the innovations relate to use of asymmetric partitions for intra BC prediction. Other innovations relate to search patterns or approaches that an encoder uses during block vector estimation (for intra BC prediction) or motion estimation. Still other innovations relate to uses of BV search ranges that have a horizontal or vertical bias during BV estimation.
    Type: Grant
    Filed: August 8, 2014
    Date of Patent: March 22, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lihua Zhu, Yongjun Wu, Gary J. Sullivan
  • Patent number: 11228774
    Abstract: Techniques to enable virtual reality content to be delivered using a video codec that operates according to a scalable video encoding standard. Base layer frames for the different views of the virtual reality content are downloaded by a client device. The views are prioritized using a prediction model that assigns priorities based on the likelihood that a corresponding view will be selected within a particular period of time. Enhancement layer frames are then selected and downloaded based on the priorities.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: January 18, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Charles Benjamin Franklin Waggoner, Yongjun Wu
  • Patent number: 11228773
    Abstract: Techniques to enable virtual reality content to be delivered using a video codec that operates according to a scalable video encoding standard. Base layer frames for the different views of the virtual reality content are downloaded by a client device. The views are prioritized using a prediction model that assigns priorities based on the likelihood that a corresponding view will be selected within a particular period of time. Enhancement layer frames are then selected and downloaded based on the priorities.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: January 18, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Charles Benjamin Franklin Waggoner, Yongjun Wu
  • Patent number: 11204747
    Abstract: Embodiments herein describe techniques for interfacing a neural network application with a neural network accelerator that operate on two heterogeneous computing systems. For example, the neural network application may execute on a central processing unit (CPU) in a computing system while the neural network accelerator executes on a FPGA. As a result, when moving a software-hardware boundary between the two heterogeneous systems, changes may be made to both the neural network application (using software code) and to the accelerator (using RTL). The embodiments herein describe a software defined approach where shared interface code is used to express both sides of the interface between the two heterogeneous systems in a single abstraction (e.g., a software class).
    Type: Grant
    Filed: October 17, 2017
    Date of Patent: December 21, 2021
    Assignee: XILINX, INC.
    Inventors: Jindrich Zejda, Elliott Delaye, Yongjun Wu, Aaron Ng, Ashish Sirasao, Khang K. Dao, Christopher J. Case
  • Patent number: 11201903
    Abstract: Techniques are described for synchronizing video of a live event with separately generated metadata relating to the live event. Timing information representing a delay associated with the encoding and packaging pipeline is determined for each origin that produces a set of content streams, and then provided in the form of a synchronization offset in the client manifest for various streaming technologies. The client device requests the metadata, and synchronizes the metadata with content segments using timing information associated with each and the synchronization offset communicated in the manifest.
    Type: Grant
    Filed: March 10, 2021
    Date of Patent: December 14, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Jeremy Matthew Cabrido, Vinay Raj, Yongjun Wu
  • Publication number: 20210377578
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Application
    Filed: August 5, 2021
    Publication date: December 2, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani
  • Publication number: 20210360264
    Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
    Type: Application
    Filed: July 22, 2021
    Publication date: November 18, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Yongjun Wu
  • Patent number: 11150946
    Abstract: There is provided a method and a system for processing a communication channel. Including a heartbeat channel and a data channel between a master process and a worker process. The method includes determining at least one data channel associated with a heartbeat channel, detecting the determined at least one data channel, disconnecting the heartbeat channel when it is detected that any data channel is in a disconnected state to cause a heartbeat to time out, and ending a current task after it is determined that the heartbeat times out.
    Type: Grant
    Filed: September 10, 2018
    Date of Patent: October 19, 2021
    Assignee: ALIBABA GROUP HOLDING LIMITED
    Inventors: Yongjun Wu, Zhipeng Tang, Yangyu Tao, Chao Li, Yihui Feng
  • Patent number: 11140442
    Abstract: Methods and apparatus are described for providing content playback options to content playback systems based on the capabilities of the systems. These techniques determine whether the content options should be provided based on the capabilities of a source device in a combined playback system. The version of the High-bandwidth Digital Content Protection (HDCP) protocol used to initialize the connection between the source and sink devices is determined. Based on the version of the HDCP protocol used, an override message may be sent to the content service, in response to which the media player on the source device is directed to an content service endpoint corresponding to particular set of playback options.
    Type: Grant
    Filed: June 26, 2019
    Date of Patent: October 5, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Yongjun Wu, Abhishek Kumar, Apoorv Bansal
  • Patent number: 11134287
    Abstract: Techniques are described for synchronizing audio content and video content when server-side fragment insertion techniques are used.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: September 28, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Yongjun Wu, Abhishek Kumar, Kyle Alexander Woo, Mukund Manikarnike
  • Patent number: 11115668
    Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
    Type: Grant
    Filed: May 17, 2019
    Date of Patent: September 7, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Yongjun Wu
  • Patent number: 11115691
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: September 7, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani