Patents by Inventor Tsuhan Chen
Tsuhan Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 6459732Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.Type: GrantFiled: January 9, 2001Date of Patent: October 1, 2002Assignee: AT&T Corp.Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
-
Patent number: 6340991Abstract: A technique is provided for calculating the time offsets between different video cameras and re-synchronizing the captured frames in a post-processing manner, thus eliminating the necessity of an explicit common clock for synchronization. This approach allows effective synchronization of frames from different cameras so that a multi-camera system can be used to more accurately analyze a subject under observation.Type: GrantFiled: December 31, 1998Date of Patent: January 22, 2002Assignee: AT&T CorporationInventors: Tsuhan Chen, Sun-Yuan Kung, Yun-Ting Lin
-
Patent number: 6330023Abstract: A method of increasing the frame rate of an image of a speaking person comprises monitoring an audio signal indicative of utterances by the speaking person and the associated video signal. The audio signal corresponds to one or more fields or frames to be reconstructed, and individual portions of the audio signal are associated with facial feature information. The facial information includes mouth formation and position information derived from phonemes or other speech-based criteria from which the position of a speaker's mouth may be reliably predicted. A field or frame of the image is reconstructed using image features extracted from the existing frame and by utilizing the facial feature information associated with a detected phoneme.Type: GrantFiled: March 18, 1994Date of Patent: December 11, 2001Assignee: American Telephone and Telegraph CorporationInventor: Tsuhan Chen
-
Publication number: 20010036229Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.Type: ApplicationFiled: January 9, 2001Publication date: November 1, 2001Inventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
-
Patent number: 6301385Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.Type: GrantFiled: December 17, 1998Date of Patent: October 9, 2001Assignee: AT&T Corp.Inventors: Tsuhan Chen, Cassandra Turner Swain
-
Patent number: 6208693Abstract: A technique for implicitly encoding shape information by using a chroma-key color. A bounding box is created enclosing the video object. The bounding box is extended to be of size of next integer multiple of macroblock size and divided into a plurality of macroblocks. For each boundary macroblock, each pixel outside the object is replaced with the chroma-key color to implicitly encode shape information. Pixel data for boundary macroblocks and macroblocks inside the object are DCT transformed, scaled and motion compensated. A finer quantizer (smaller quantizer) is used for boundary macroblocks to improve image quality. A first_shape_code can be used to identify each macroblock as either 1) inside the object; 2) outside the object; or 3) on the object boundary. To improve data compression and achieve low complexity shape extraction with DCT and motion compensation, a first_shape_code is sent for all macroblocks, and only macroblocks that are inside the object or on the object boundary are coded.Type: GrantFiled: July 9, 1998Date of Patent: March 27, 2001Assignee: AT&T CorpInventors: Tsuhan Chen, Atul Puri, Robert Lewis Schmidt
-
Patent number: 6141442Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.Type: GrantFiled: July 21, 1999Date of Patent: October 31, 2000Assignee: AT&T CorpInventor: Tsuhan Chen
-
Patent number: 6058187Abstract: Secure data transmission apparatus comprises a data translator for translating an input string of signals, each signal having incomplete information for identifying an alphanumeric character, into a first encryption key. A data encrypter receives a first encryption key, a choice of encryption algorithm and a message and outputs an encrypted message according to the selected algorithm. The apparatus may be applied whenever the user is confronted with telecommunications apparatus that provides a limited input capability and no means for encrypting a message for transmission to an end user. In this manner, for example, a user may authenticate their name for display on caller identification plus name apparatus and the called party can be assured, before answering their line, that the call is from the party having the displayed name.Type: GrantFiled: April 17, 1997Date of Patent: May 2, 2000Assignee: AT&T Corp.Inventor: Tsuhan Chen
-
Patent number: 6035060Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.Type: GrantFiled: February 14, 1997Date of Patent: March 7, 2000Assignee: AT&T CorpInventors: Tsuhan Chen, Barin Geoffry Haskell, Cassandra Turner Swain
-
Patent number: 5974172Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The invention utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.Type: GrantFiled: February 14, 1997Date of Patent: October 26, 1999Assignee: AT&T CorpInventor: Tsuhan Chen
-
Patent number: 5960111Abstract: To segment moving foreground from background, where the moving foreground is of most interest to the viewer, this method uses three detection algorithms as the input to a neural network. The multiple cues used are focus, intensity, and motion. The neural network consists of a two-layered neural network. Focus and motion measurements are taken from high frequency data, edges; whereas, intensity measurements are taken from low frequency data, object interiors. Combined, these measurements are used to segment a complete object. Results indicate that moving foreground can be segmented from stationary foreground and moving or stationary background. The neural network segments the entire object, both interior and exterior, in this integrated approach. Results also demonstrate that combining cues allows flexibility in both type and complexity of scenes. Integration of cues improves accuracy in segmenting complex scenes containing both moving foreground and background.Type: GrantFiled: February 10, 1997Date of Patent: September 28, 1999Assignee: AT&T CorpInventors: Tsuhan Chen, Cassandra Turner Swain
-
Patent number: 5907351Abstract: A method and apparatus for transmitting and remotely displaying the audio and visual portion of a person speaking so that the audio and visual signals are synchronized. The audio signal is constantly transmitted to the receiver and is also used to create a predicted image of the lips of the talking head. The actual lip image is compared to the predicted lip image. Based upon this comparison, it is determined which of three signals is to be transmitted to the receiver: no signal corresponding to the video signal, a signal corresponding only to the differences between the actual lip image and a predicted lip image, or the actual lip image. The receiver reconstructs a lip image based upon the audio signal received and the signal received, if any, corresponding to the video image and inserts it into the previously received video frame or modifies the previous frame accordingly.Type: GrantFiled: October 24, 1995Date of Patent: May 25, 1999Assignee: Lucent Technologies Inc.Inventors: Tsuhan Chen, Ram R. Rao
-
Patent number: 5786855Abstract: A method and apparatus for generating region frames from video frames are disclosed which employs an industry standard encoder to lessen the negative impact on the quality of the transmitted video sequence while consuming fewer bits. The method and apparatus utilizes image segmentation and color replacement techniques to create the region frames. Each region frame includes a subject region, zero or more previously segmented regions and zero or more non-subject regions. The subject region is defined by the pixels of the original video frame. The previously segmented regions and non-subject regions are assigned replacement pixels P.sub.n,y and C.sub.n, respectively. The replacement pixel C.sub.n is chosen to indicate a color that is not likely to be confused with any color in the subject region R.sub.n. The replacement pixels P.sub.n,y are chosen such that the compression ratio of the region frame data is maximized.Type: GrantFiled: October 26, 1995Date of Patent: July 28, 1998Assignee: Lucent Technologies Inc.Inventors: Tsuhan Chen, Barin Geoffry Haskell
-
Patent number: 5761329Abstract: A method and apparatus is provided for determining the authenticity of an individual. In accordance with the method, audio and video data of the individual speaking at least one selected phrase is obtained. Identifying audio features and video features are then extracted from the audio data and the video data, respectively. A feature vector is formed which incorporates both the audio features and the video features. The feature vector is compared to a stored feature vector of a validated user speaking the same selected phrase. The individual is authenticated if the feature vector and the stored feature vector form a match within a prescribed threshold.Type: GrantFiled: December 15, 1995Date of Patent: June 2, 1998Inventors: Tsuhan Chen, Mehmet Reha Civanlar
-
Patent number: 5710829Abstract: Motion video is represented by digital signals. The digital signals can be compressed by coding to reduce bitrate and thus save time and expense in transmitting and reproducing the video. The present invention is a system and method for reducing the amount of information required to satisfactorily reproduce video signals by distinguishing more pertinent portions of the video from less pertinent portions. In particular, edges are detected based on amplitude variations in the luminance characteristic of the picture elements. Sharp edges are deemed to be "focused". The portion of the video frame between the focused edges is deemed to be focused, as well. A template is created corresponding to the focused portion. A signal corresponding to the outline of the template is combined with the original frame signal to create a segmented frame signal. When motion detection information is available, a motion-based template may be created and intersected with the focus template.Type: GrantFiled: April 27, 1995Date of Patent: January 20, 1998Assignee: Lucent Technologies Inc.Inventors: Tsuhan Chen, Cassandra Turner Swain
-
Patent number: 5500671Abstract: A video conference system provides eye contact and a sense of presence to a plurality of conference participants located in respective remotely-sited conference rooms. Each conference room contains at least one video telephone or communications device that includes a video camera for generating video signals indicative of a sequence of local conferee image frames, and an image receiver for displaying image frames of at least one remote conferee. The image receiver, the video camera, and the eyes of the local conferee define a parallax angle. The video conference system further includes a frame generating system, responsive to the video signals, for analyzing local conferee image frames and generating a corresponding sequence of parallax-compensated frames. A signal indicative of each respective sequence of parallax-compensated frames is transmitted to a corresponding image receiver, whereby apparent eye contact is provided between each local conferee and a displayed image of a corresponding remote conferee.Type: GrantFiled: October 25, 1994Date of Patent: March 19, 1996Assignee: AT&T Corp.Inventors: Russell L. Andersson, Tsuhan Chen, Barin G. Haskell