Patents by Inventor Tsuhan Chen

Tsuhan Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Chroma-key for efficient and low complexity shape representation of coded arbitrary video objects

Patent number: 6459732

Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.

Type: Grant

Filed: January 9, 2001

Date of Patent: October 1, 2002

Assignee: AT&T Corp.

Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
Frame synchronization in a multi-camera system

Patent number: 6340991

Abstract: A technique is provided for calculating the time offsets between different video cameras and re-synchronizing the captured frames in a post-processing manner, thus eliminating the necessity of an explicit common clock for synchronization. This approach allows effective synchronization of frames from different cameras so that a multi-camera system can be used to more accurately analyze a subject under observation.

Type: Grant

Filed: December 31, 1998

Date of Patent: January 22, 2002

Assignee: AT&T Corporation

Inventors: Tsuhan Chen, Sun-Yuan Kung, Yun-Ting Lin
Video signal processing systems and methods utilizing automated speech analysis

Patent number: 6330023

Abstract: A method of increasing the frame rate of an image of a speaking person comprises monitoring an audio signal indicative of utterances by the speaking person and the associated video signal. The audio signal corresponds to one or more fields or frames to be reconstructed, and individual portions of the audio signal are associated with facial feature information. The facial information includes mouth formation and position information derived from phonemes or other speech-based criteria from which the position of a speaker's mouth may be reliably predicted. A field or frame of the image is reconstructed using image features extracted from the existing frame and by utilizing the facial feature information associated with a detected phoneme.

Type: Grant

Filed: March 18, 1994

Date of Patent: December 11, 2001

Assignee: American Telephone and Telegraph Corporation

Inventor: Tsuhan Chen
Chroma-key for efficient and low complexity shape representation of coded arbitrary video objects

Publication number: 20010036229

Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.

Type: Application

Filed: January 9, 2001

Publication date: November 1, 2001

Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
Method and apparatus for segmenting images prior to coding

Patent number: 6301385

Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.

Type: Grant

Filed: December 17, 1998

Date of Patent: October 9, 2001

Assignee: AT&T Corp.

Inventors: Tsuhan Chen, Cassandra Turner Swain
Chroma-key for efficient and low complexity shape representation of coded arbitrary video objects

Patent number: 6208693

Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.

Type: Grant

Filed: July 9, 1998

Date of Patent: March 27, 2001

Assignee: AT&T Corp

Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
Method and apparatus for coding segmented regions which may be transparent in video sequences for content-based scalability

Patent number: 6141442

Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.

Type: Grant

Filed: July 21, 1999

Date of Patent: October 31, 2000

Assignee: AT&T Corp

Inventor: Tsuhan Chen
Secure telecommunications data transmission

Patent number: 6058187

Abstract: Secure data transmission apparatus comprises a data translator for translating an input string of signals, each signal having incomplete information for identifying an alphanumeric character, into a first encryption key. A data encrypter receives a first encryption key, a choice of encryption algorithm and a message and outputs an encrypted message according to the selected algorithm. The apparatus may be applied whenever the user is confronted with telecommunications apparatus that provides a limited input capability and no means for encrypting a message for transmission to an end user. In this manner, for example, a user may authenticate their name for display on caller identification plus name apparatus and the called party can be assured, before answering their line, that the call is from the party having the displayed name.

Type: Grant

Filed: April 17, 1997

Date of Patent: May 2, 2000

Assignee: AT&T Corp.

Inventor: Tsuhan Chen
Method and apparatus for removing color artifacts in region-based coding

Patent number: 6035060

Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.

Type: Grant

Filed: February 14, 1997

Date of Patent: March 7, 2000

Assignee: AT&T Corp

Inventors: Tsuhan Chen, Barin Geoffry Haskell, Cassandra Turner Swain
Method and apparatus for coding segmented regions which may be transparent in video sequences for content-based scalability

Patent number: 5974172

Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.

Type: Grant

Filed: February 14, 1997

Date of Patent: October 26, 1999

Assignee: AT&T Corp

Inventor: Tsuhan Chen
Method and apparatus for segmenting images prior to coding

Patent number: 5960111

Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.

Type: Grant

Filed: February 10, 1997

Date of Patent: September 28, 1999

Assignee: AT&T Corp

Inventors: Tsuhan Chen, Cassandra Turner Swain
Method and apparatus for cross-modal predictive coding for talking head sequences

Patent number: 5907351

Abstract: A method and apparatus for transmitting and remotely displaying the audio and visual portion of a person speaking so that the audio and visual signals are synchronized. The audio signal is constantly transmitted to the receiver and is also used to create a predicted image of the lips of the talking head. The actual lip image is compared to the predicted lip image. Based upon this comparison, it is determined which of three signals is to be transmitted to the receiver: no signal corresponding to the video signal, a signal corresponding only to the differences between the actual lip image and a predicted lip image, or the actual lip image. The receiver reconstructs a lip image based upon the audio signal received and the signal received, if any, corresponding to the video image and inserts it into the previously received video frame or modifies the previous frame accordingly.

Type: Grant

Filed: October 24, 1995

Date of Patent: May 25, 1999

Assignee: Lucent Technologies Inc.

Inventors: Tsuhan Chen, Ram R. Rao
Method and apparatus for coding segmented regions in video sequences for content-based scalability

Patent number: 5786855

Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The method and apparatus utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.

Type: Grant

Filed: October 26, 1995

Date of Patent: July 28, 1998

Assignee: Lucent Technologies Inc.

Inventors: Tsuhan Chen, Barin Geoffry Haskell
Method and apparatus employing audio and video data from an individual for authentication purposes

Patent number: 5761329

Abstract: A method and apparatus is provided for determining the authenticity of an individual. In accordance with the method, audio and video data of the individual speaking at least one selected phrase is obtained. Identifying audio features and video features are then extracted from the audio data and the video data, respectively. A feature vector is formed which incorporates both the audio features and the video features. The feature vector is compared to a stored feature vector of a validated user speaking the same selected phrase. The individual is authenticated if the feature vector and the stored feature vector form a match within a prescribed threshold.

Type: Grant

Filed: December 15, 1995

Date of Patent: June 2, 1998

Inventors: Tsuhan Chen, Mehmet Reha Civanlar
System and method for focused-based image segmentation for video signals

Patent number: 5710829

Abstract: Motion video is represented by digital signals. The digital signals can be compressed by coding to reduce bitrate and thus save time and expense in transmitting and reproducing the video. The present invention is a system and method for reducing the amount of information required to satisfactorily reproduce video signals by distinguishing more pertinent portions of the video from less pertinent portions. In particular, edges are detected based on amplitude variations in the luminance characteristic of the picture elements. Sharp edges are deemed to be "focused". The portion of the video frame between the focused edges is deemed to be focused, as well. A template is created corresponding to the focused portion. A signal corresponding to the outline of the template is combined with the original frame signal to create a segmented frame signal. When motion detection information is available, a motion-based template may be created and intersected with the focus template.

Type: Grant

Filed: April 27, 1995

Date of Patent: January 20, 1998

Assignee: Lucent Technologies Inc.

Inventors: Tsuhan Chen, Cassandra Turner Swain
Video conference system and method of providing parallax correction and a sense of presence

Patent number: 5500671

Abstract: A video conference system provides eye contact and a sense of presence to a plurality of conference participants located in respective remotely-sited conference rooms. Each conference room contains at least one video telephone or communications device that includes a video camera for generating video signals indicative of a sequence of local conferee image frames, and an image receiver for displaying image frames of at least one remote conferee. The image receiver, the video camera, and the eyes of the local conferee define a parallax angle. The video conference system further includes a frame generating system, responsive to the video signals, for analyzing local conferee image frames and generating a corresponding sequence of parallax-compensated frames. A signal indicative of each respective sequence of parallax-compensated frames is transmitted to a corresponding image receiver, whereby apparent eye contact is provided between each local conferee and a displayed image of a corresponding remote conferee.

Type: Grant

Filed: October 25, 1994

Date of Patent: March 19, 1996

Assignee: AT&T Corp.

Inventors: Russell L. Andersson, Tsuhan Chen, Barin G. Haskell

prev 1 2