Patents by Inventor Tsuhan Chen

Tsuhan Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 6459732
    Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.
    Type: Grant
    Filed: January 9, 2001
    Date of Patent: October 1, 2002
    Assignee: AT&T Corp.
    Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
  • Patent number: 6340991
    Abstract: A technique is provided for calculating the time offsets between different video cameras and re-synchronizing the captured frames in a post-processing manner, thus eliminating the necessity of an explicit common clock for synchronization. This approach allows effective synchronization of frames from different cameras so that a multi-camera system can be used to more accurately analyze a subject under observation.
    Type: Grant
    Filed: December 31, 1998
    Date of Patent: January 22, 2002
    Assignee: AT&T Corporation
    Inventors: Tsuhan Chen, Sun-Yuan Kung, Yun-Ting Lin
  • Patent number: 6330023
    Abstract: A method of increasing the frame rate of an image of a speaking person comprises monitoring an audio signal indicative of utterances by the speaking person and the associated video signal. The audio signal corresponds to one or more fields or frames to be reconstructed, and individual portions of the audio signal are associated with facial feature information. The facial information includes mouth formation and position information derived from phonemes or other speech-based criteria from which the position of a speaker's mouth may be reliably predicted. A field or frame of the image is reconstructed using image features extracted from the existing frame and by utilizing the facial feature information associated with a detected phoneme.
    Type: Grant
    Filed: March 18, 1994
    Date of Patent: December 11, 2001
    Assignee: American Telephone and Telegraph Corporation
    Inventor: Tsuhan Chen
  • Publication number: 20010036229
    Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.
    Type: Application
    Filed: January 9, 2001
    Publication date: November 1, 2001
    Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
  • Patent number: 6301385
    Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.
    Type: Grant
    Filed: December 17, 1998
    Date of Patent: October 9, 2001
    Assignee: AT&T Corp.
    Inventors: Tsuhan Chen, Cassandra Turner Swain
  • Patent number: 6208693
    Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.
    Type: Grant
    Filed: July 9, 1998
    Date of Patent: March 27, 2001
    Assignee: AT&T Corp
    Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
  • Patent number: 6141442
    Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.
    Type: Grant
    Filed: July 21, 1999
    Date of Patent: October 31, 2000
    Assignee: AT&T Corp
    Inventor: Tsuhan Chen
  • Patent number: 6058187
    Abstract: Secure data transmission apparatus comprises a data translator for translating an input string of signals, each signal having incomplete information for identifying an alphanumeric character, into a first encryption key. A data encrypter receives a first encryption key, a choice of encryption algorithm and a message and outputs an encrypted message according to the selected algorithm. The apparatus may be applied whenever the user is confronted with telecommunications apparatus that provides a limited input capability and no means for encrypting a message for transmission to an end user. In this manner, for example, a user may authenticate their name for display on caller identification plus name apparatus and the called party can be assured, before answering their line, that the call is from the party having the displayed name.
    Type: Grant
    Filed: April 17, 1997
    Date of Patent: May 2, 2000
    Assignee: AT&T Corp.
    Inventor: Tsuhan Chen
  • Patent number: 6035060
    Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.
    Type: Grant
    Filed: February 14, 1997
    Date of Patent: March 7, 2000
    Assignee: AT&T Corp
    Inventors: Tsuhan Chen, Barin Geoffry Haskell, Cassandra Turner Swain
  • Patent number: 5974172
    Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.
    Type: Grant
    Filed: February 14, 1997
    Date of Patent: October 26, 1999
    Assignee: AT&T Corp
    Inventor: Tsuhan Chen
  • Patent number: 5960111
    Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.
    Type: Grant
    Filed: February 10, 1997
    Date of Patent: September 28, 1999
    Assignee: AT&T Corp
    Inventors: Tsuhan Chen, Cassandra Turner Swain
  • Patent number: 5907351
    Abstract: A method and apparatus for transmitting and remotely displaying the audio and visual portion of a person speaking so that the audio and visual signals are synchronized. The audio signal is constantly transmitted to the receiver and is also used to create a predicted image of the lips of the talking head. The actual lip image is compared to the predicted lip image. Based upon this comparison, it is determined which of three signals is to be transmitted to the receiver: no signal corresponding to the video signal, a signal corresponding only to the differences between the actual lip image and a predicted lip image, or the actual lip image. The receiver reconstructs a lip image based upon the audio signal received and the signal received, if any, corresponding to the video image and inserts it into the previously received video frame or modifies the previous frame accordingly.
    Type: Grant
    Filed: October 24, 1995
    Date of Patent: May 25, 1999
    Assignee: Lucent Technologies Inc.
    Inventors: Tsuhan Chen, Ram R. Rao
  • Patent number: 5786855
    Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The method and apparatus utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.
    Type: Grant
    Filed: October 26, 1995
    Date of Patent: July 28, 1998
    Assignee: Lucent Technologies Inc.
    Inventors: Tsuhan Chen, Barin Geoffry Haskell
  • Patent number: 5761329
    Abstract: A method and apparatus is provided for determining the authenticity of an individual. In accordance with the method, audio and video data of the individual speaking at least one selected phrase is obtained. Identifying audio features and video features are then extracted from the audio data and the video data, respectively. A feature vector is formed which incorporates both the audio features and the video features. The feature vector is compared to a stored feature vector of a validated user speaking the same selected phrase. The individual is authenticated if the feature vector and the stored feature vector form a match within a prescribed threshold.
    Type: Grant
    Filed: December 15, 1995
    Date of Patent: June 2, 1998
    Inventors: Tsuhan Chen, Mehmet Reha Civanlar
  • Patent number: 5710829
    Abstract: Motion video is represented by digital signals. The digital signals can be compressed by coding to reduce bitrate and thus save time and expense in transmitting and reproducing the video. The present invention is a system and method for reducing the amount of information required to satisfactorily reproduce video signals by distinguishing more pertinent portions of the video from less pertinent portions. In particular, edges are detected based on amplitude variations in the luminance characteristic of the picture elements. Sharp edges are deemed to be "focused". The portion of the video frame between the focused edges is deemed to be focused, as well. A template is created corresponding to the focused portion. A signal corresponding to the outline of the template is combined with the original frame signal to create a segmented frame signal. When motion detection information is available, a motion-based template may be created and intersected with the focus template.
    Type: Grant
    Filed: April 27, 1995
    Date of Patent: January 20, 1998
    Assignee: Lucent Technologies Inc.
    Inventors: Tsuhan Chen, Cassandra Turner Swain
  • Patent number: 5500671
    Abstract: A video conference system provides eye contact and a sense of presence to a plurality of conference participants located in respective remotely-sited conference rooms. Each conference room contains at least one video telephone or communications device that includes a video camera for generating video signals indicative of a sequence of local conferee image frames, and an image receiver for displaying image frames of at least one remote conferee. The image receiver, the video camera, and the eyes of the local conferee define a parallax angle. The video conference system further includes a frame generating system, responsive to the video signals, for analyzing local conferee image frames and generating a corresponding sequence of parallax-compensated frames. A signal indicative of each respective sequence of parallax-compensated frames is transmitted to a corresponding image receiver, whereby apparent eye contact is provided between each local conferee and a displayed image of a corresponding remote conferee.
    Type: Grant
    Filed: October 25, 1994
    Date of Patent: March 19, 1996
    Assignee: AT&T Corp.
    Inventors: Russell L. Andersson, Tsuhan Chen, Barin G. Haskell