Patents by Inventor Dihong Tian

Dihong Tian has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generating and Rendering Synthesized Views with Multiple Video Streams in Telepresence Video Conference Sessions

Publication number: 20150042748

Abstract: Techniques are provided for establishing a videoconference session between participants at different endpoints, where each endpoint includes at least one computing device and one or more displays. A plurality of video streams is received at an endpoint, and each video stream is classified as at least one of a people view and a data view. The classified views are analyzed to determine one or more regions of interest for each of the classified views, where at least one region of interest has a size smaller than a size of the classified view. Synthesized views of at least some of the video streams are generated, wherein the synthesized views include at least one view including a region of interest, and views including the synthesized views are rendered at one or more displays of an endpoint device.

Type: Application

Filed: October 27, 2014

Publication date: February 12, 2015

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
System and method for video coding in a dynamic environment

Patent number: 8934026

Abstract: A method is provided in one example embodiment and includes receiving a camera dynamic parameter; determining a reference transform parameter based on the camera dynamic parameter; applying the reference transform parameter to generate a video image; and encoding the reference transform parameter in a bitstream for transmission with the video image. In other more specific instances, the method may include decoding a particular video image; decoding a particular reference transform parameter; and applying a particular reference transform parameter to the particular video image. The entropy-decoded data can undergo inverse quantization and transformation such that reference transformed data is combined with the entropy-decoded data. Additionally, the entropy-decoded data can be subjected to filtering before decoded video images are rendered on a display.

Type: Grant

Filed: May 12, 2011

Date of Patent: January 13, 2015

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, Wen-Hsiung Chen
Generating and rendering synthesized views with multiple video streams in telepresence video conference sessions

Patent number: 8890923

Abstract: Techniques are provided for establishing a videoconference session between participants at different endpoints, where each endpoint includes at least one computing device and one or more displays. A plurality of video streams is received at an endpoint, and each video stream is classified as at least one of a people view and a data view. The classified views are analyzed to determine one or more regions of interest for each of the classified views, where at least one region of interest has a size smaller than a size of the classified view. Synthesized views of at least some of the video streams are generated, wherein the synthesized views include at least one view including a region of interest, and views including the synthesized views are rendered at one or more displays of an endpoint device.

Type: Grant

Filed: September 4, 2012

Date of Patent: November 18, 2014

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
Video compression using search techniques of long-term reference memory

Patent number: 8861598

Abstract: Particular embodiments generally relate to video compression. In one embodiment, a store of reference frames is provided in memory. The reference frames may be classified based on a plurality of classifiers. The classifiers may correspond to features that are found in the reference frame. A frame to encode is then received. The frame is analyzed to determine features found in the frame. As macroblocks in the frame are encoded, a macroblock is analyzed to determine which feature may be included in the macroblock. The feature is used to determine a classifier, which is used to determine a subset of the reference frames. The subset is then searched to determine a reference frame for the macroblock.

Type: Grant

Filed: March 19, 2008

Date of Patent: October 14, 2014

Assignee: Cisco Technology, Inc.

Inventors: William Mauchly, Joseph T. Friel, Alan MacDonald, Dihong Tian, Wen-Hsiung Chen, Michael A. Arnao
Real-time automatic scene relighting in video conference sessions

Patent number: 8854412

Abstract: Video frames are captured at one or more cameras during a video conference session, where each video frame includes a digital image with a plurality of pixels. Depth values associated with each pixel are determined in at least one video frame, where each depth value represents a distance of a portion of the digital image represented by at least one corresponding pixel from the one or more cameras that capture the at least one video frame. Luminance values of pixels are adjusted within captured video frames based upon the depth values determined for the pixels so as to achieve relighting of the video frames as the video frames are displayed during the video conference session.

Type: Grant

Filed: September 21, 2012

Date of Patent: October 7, 2014

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
ANNOTATING A PRESENTATION IN A TELEPRESENCE MEETING

Publication number: 20140292999

Abstract: A processing system can include an encoder to encode a real-time transmission of a presentation. A memory buffer can copy and store images of the presentation and convert the images into snapshot images. A transmitter can transmit the snapshot images to an external annotation device, and a receiver can receive annotation data of an annotation performed on the snapshot images at the external annotation device. The annotation can be encoded, in accordance with the annotation data, into the real-time transmission of the presentation to display the real-time transmission with the annotation.

Type: Application

Filed: April 22, 2013

Publication date: October 2, 2014

Applicant: Cisco Technology, Inc.

Inventors: Dau Do, Nermin Ismail, Brandon Gulrich, Dihong Tian
UTILIZING A SMART CAMERA SYSTEM FOR IMMERSIVE TELEPRESENCE

Publication number: 20140253667

Abstract: Video content is received at a computing device that camera views provided by video cameras that are aligned to capture images of participants within a defined space. Each camera view is at a first resolution and the video cameras are aligned such that a field of view (FOV) for each camera overlaps a portion of the FOV of at least one other adjacent camera. Positions of participants depicted within the video content are detected, where at least one participant is captured by overlapping FOVs of two adjacent camera views. A target view is generated from the first number of camera views, the target view having a second resolution that is lower than the first resolution, and the target view includes a view of the at least one participant captured within the overlapping FOVs of two adjacent camera views. The target view is displayed at a display device.

Type: Application

Filed: March 11, 2013

Publication date: September 11, 2014

Applicant: CISCO TECHNOLOGY, INC.

Inventor: Dihong Tian
Perceptual Quality Of Content In Video Collaboration

Publication number: 20140254688

Abstract: Techniques are provided for receiving and decoding a sequence of video frames at a computing device, and analyzing a current video frame N to determine whether to skip or render the current video frame N for display by the computing device. The analyzing includes generating color histograms of the current video frame N and one or more previous video frames, determining a difference value representing a difference between the current video frame N and a previous video frame N?K, where K>0, the difference value being based upon the generated color histograms, in response to the difference value not exceeding a threshold value, rendering the current video frame N or a recently rendered video frame N?K using the current video frame, and in response to the difference value exceeding the threshold value, skipping the current video frame N from being rendered.

Type: Application

Filed: March 8, 2013

Publication date: September 11, 2014

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Dihong Tian, Jennifer Sha
Implementing selective image enhancement

Patent number: 8823776

Abstract: A method that includes capturing depth information associated with a first field of view of a depth camera. The depth information is represented by a first plurality of depth pixels. The method also includes capturing color information associated with a second field of view of a video camera that substantially overlaps with the first field of view of the depth camera. The color information is represented by a second plurality of color pixels. The method further includes enhancing color information represented by at least one color pixel of the second plurality of color pixels to generate an enhanced image. The enhanced image adjusts an exposure characteristic of the color information captured by the video camera. The at least one color pixel is enhanced based on depth information represented by at least one corresponding depth pixel of the first plurality of depth pixels.

Type: Grant

Filed: May 20, 2010

Date of Patent: September 2, 2014

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
Context Adaptive Position and Amplitude Coding of Coefficients for Video

Publication number: 20140169453

Abstract: A coding method, apparatus, and medium with software encoded thereon to implement a coding method. The coding method includes encoding the position of non-zero-valued coefficients in an ordered series of quantized transform coefficients of a block of image data, including encoding events using variable length coding using a plurality of variable length code mappings that each maps events to codewords, the position encoding including switching between the code mappings based on the context. The coding method further includes encoding amplitudes of the non-zero-valued coefficients using variable dimensional amplitude coding in the reverse order of the original ordering of the series.

Type: Application

Filed: February 20, 2014

Publication date: June 19, 2014

Applicant: Cisco Technology Inc.

Inventors: Dihong Tian, Wen-hsiung Chen, Pi Sheng Chang
SYSTEM AND METHOD FOR DEPTH-GUIDED FILTERING IN A VIDEO CONFERENCE ENVIRONMENT

Publication number: 20140160239

Abstract: A method is provided in one example embodiment that includes generating a depth map that corresponds to a video image and filtering the depth map with the video image to create a filtered depth map. The video image can be filtered with the filtered depth map to create an image. In one example implementation, the video image is filtered using extended depth-guided filtering that is incorporated into a video encoding-decoding loop.

Type: Application

Filed: December 6, 2012

Publication date: June 12, 2014

Inventor: Dihong Tian
System and method for providing enhanced video processing in a network environment

Patent number: 8723914

Abstract: A method is provided in one example and includes receiving a video input from a camera element; using change detection statistics to identify background image data; using the background image data as a temporal reference to determine foreground image data of a particular video frame within the video input; using a selected foreground image for a background registration of a subsequent video frame; and providing at least a portion of the subsequent video frame to a next destination.

Type: Grant

Filed: November 19, 2010

Date of Patent: May 13, 2014

Assignee: Cisco Technology, Inc.

Inventors: David J. Mackie, Dihong Tian, Andrew P. Weir, Maurice Buttimer, Joseph T. Friel, J. William Mauchly, Wen-Hsiung Chen
REAL-TIME AUTOMATIC SCENE RELIGHTING IN VIDEO CONFERENCE SESSIONS

Publication number: 20140085398

Abstract: Video frames are captured at one or more cameras during a video conference session, where each video frame includes a digital image with a plurality of pixels. Depth values associated with each pixel are determined in at least one video frame, where each depth value represents a distance of a portion of the digital image represented by at least one corresponding pixel from the one or more cameras that capture the at least one video frame. Luminance values of pixels are adjusted within captured video frames based upon the depth values determined for the pixels so as to achieve relighting of the video frames as the video frames are displayed during the video conference session.

Type: Application

Filed: September 21, 2012

Publication date: March 27, 2014

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
System and method for depth-guided image filtering in a video conference environment

Patent number: 8682087

Abstract: A method is provided in one example embodiment that includes receiving a plurality of depth values corresponding to pixels of an image; and filtering the image as a function of a plurality of variations in the depth values between adjacent pixels of a window associated with the image. In more detailed embodiments, the method may include encoding the image into a bit stream for transmission over a network. The filtering can account for a bit rate associated with the encoding of the image.

Type: Grant

Filed: December 19, 2011

Date of Patent: March 25, 2014

Assignee: Cisco Technology, Inc.

Inventor: Dihong Tian
Context adaptive position and amplitude coding of coefficients for video compression

Patent number: 8670485

Abstract: A coding method, apparatus, and medium with software encoded thereon to implement a coding method. The coding method includes encoding the position of non-zero-valued coefficients in an ordered series of quantized transform coefficients of a block of image data, including encoding events using variable length coding using a plurality of variable length code mappings that each maps events to codewords, the position encoding including switching between the code mappings based on the context. The coding method further includes encoding amplitudes of the non-zero-valued coefficients using variable dimensional amplitude coding in the reverse order of the original ordering of the series.

Type: Grant

Filed: September 4, 2012

Date of Patent: March 11, 2014

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, Wen-hsiung Chen, Pi Sheng Chang
Generating and Rendering Synthesized Views with Multiple Video Streams in Telepresence Video Conference Sessions

Publication number: 20140063177

Abstract: Techniques are provided for establishing a videoconference session between participants at different endpoints, where each endpoint includes at least one computing device and one or more displays. A plurality of video streams is received at an endpoint, and each video stream is classified as at least one of a people view and a data view. The classified views are analyzed to determine one or more regions of interest for each of the classified views, where at least one region of interest has a size smaller than a size of the classified view. Synthesized views of at least some of the video streams are generated, wherein the synthesized views include at least one view including a region of interest, and views including the synthesized views are rendered at one or more displays of an endpoint device.

Type: Application

Filed: September 4, 2012

Publication date: March 6, 2014

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel
Video coding reference picture prediction using information available at a decoder

Patent number: 8644384

Abstract: A video coder includes a forward coder and a reconstruction module determining a motion compensated predicted picture from one or more previously decoded pictures in a multi-picture store. The reconstruction module includes a reference picture predictor that uses only previously decoded pictures to determine one or more predicted reference pictures. The predicted reference picture(s) are used for motion compensated prediction. The reference picture predictor may include optical flow analysis that uses a current decoded picture and that may use one or more previously decoded pictures together with affine motion analysis and image warping to determine at least a portion of at least one of the reference pictures.

Type: Grant

Filed: January 24, 2013

Date of Patent: February 4, 2014

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, J. William Mauchly, Joseph T. Friel, Wen-hsiung Chen
Image-Based Real-Time Gesture Recognition

Publication number: 20130342636

Abstract: Techniques are provided for image-based real-time gesture recognition. Video data of a person is obtained. Pixels are classified in the video stream at a given time instance during a time period as a foreground or a background pixel. A data entry is generated comprising data indicating foreground history values for each of a plurality of time instances of the video stream and data indicating a time period value. When the classifying indicates that a first pixel is a foreground pixel, the data structure associated with the first pixel is evaluated to determine whether or not to update a foreground history value associated with the first pixel at the given time instance. A motion gradient vector is generated for the video stream based on the foreground history value associated with the first pixel and foreground history values associated with other pixels.

Type: Application

Filed: June 22, 2012

Publication date: December 26, 2013

Applicant: CISCO TECHNOLOGY, INC.

Inventor: Dihong Tian
System and method for skip coding during video conferencing in a network environment

Patent number: 8599934

Abstract: A method is provided in one example and includes receiving an input video, and identifying values of pixels from noise associated with a current video image within the video input. The method also includes creating a skip-reference video image associated with the identified pixel values, and comparing a portion of the current video image to the skip-reference video image. The method also includes determining a macroblock associated with the current video image to be skipped before an encoding operation occurs.

Type: Grant

Filed: September 8, 2010

Date of Patent: December 3, 2013

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, Joseph T. Friel, Andrew P. Weir
Position coding for context-based adaptive variable length coding

Patent number: 8576915

Abstract: Particular embodiments include a method, an apparatus, and logic embodied in tangible computer-readable medium that when executed carries out a method of encoding an ordered sequence of quantized transform coefficients of a block of image data. One embodiment is a context adaptive variable length coding method that includes position coding the positions of zero-valued and non-zero valued coefficients by either a mixed method that encodes either the run length of zeroes preceding a non-zero coefficient or the run length of nonzero-valued coefficients preceding a zero-valued coefficients. Another includes position coding that uses a variable length code for two parameters respectively indicating the number of zero-valued coefficient positions and nonzero-valued coefficient positions still to be coded.

Type: Grant

Filed: March 21, 2012

Date of Patent: November 5, 2013

Assignee: Cisco Technology, Inc.

Inventors: Dihong Tian, Wen-hsiung Chen, Pi Sheng Chang

prev 1 2 3 4 5 next