Patents by Inventor Dongge Li
Dongge Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20070156667Abstract: A network element of choice receives (101) information and uses (102) that information to develop a content search query. That network element then instigates (103) a content search using the content search query and receives (104), in turn, content search results comprising a plurality of content items. Profile information for a plurality of playback platforms is then accessed (105) and used (106) to identify which of the content items are best played back on particular ones of the playback platforms.Type: ApplicationFiled: January 4, 2006Publication date: July 5, 2007Inventors: Dongge Li, Bhavan Gandhi, Cuneyt Taskiran, Wei Wang
-
Publication number: 20070140550Abstract: An object detection algorithm that generates a two-layer Gaussian Mixture Model (GMM) during a training session, and subsequent to the training session, uses the two-layer GMM to perform face detection. No labeling of local features is needed. The only input that is provided by a user is the setting of a few global parameters for the image being captured during the training session, such as, for example, the person's facial pose.Type: ApplicationFiled: December 20, 2005Publication date: June 21, 2007Inventors: Dongge Li, Bhavan Gandhi, Zhu Li
-
Publication number: 20060290699Abstract: A system and method is provided for synthesizing audio-visual content in a video image processor. A content synthesis application processor extracts audio features and video features from audio-visual input signals that represent a speaker who is speaking. The processor uses the extracted visual features to create a computer generated animated version of the face of the speaker. The processor synchronizes facial movements of the animated version of the face of the speaker with a plurality of audio logical units such as phonemes that represent the speaker's speech. In this manner the processor synthesizes an audio-visual representation of the speaker's face that is properly synchronized with the speaker's speech.Type: ApplicationFiled: September 28, 2004Publication date: December 28, 2006Inventors: Nevenka Dimtrva, Andrew Miller, Dongge Li
-
Patent number: 7120626Abstract: A method and system which enable a user to query a multimedia archive in one media modality and automatically retrieve correlating data in another media modality without the need for manually associating the data items through a data structure. The correlation method finds the maximum correlation between the data items without being affected by the distribution of the data in the respective subspace of each modality. Once the direction of correlation is disclosed, extracted features can be transferred from one subspace to another.Type: GrantFiled: November 15, 2002Date of Patent: October 10, 2006Assignee: Koninklijke Philips Electronics N.V.Inventors: Dongge Li, Nevenka Dimitrova
-
Patent number: 7058889Abstract: A method of synchronizing visual information with audio playback includes the steps of selecting a desired audio file from a list stored in memory associated with a display device, sending a signal from the display device to a separate playback device to cause the separate playback device to start playing the desired audio file; and displaying visual information associated with the desired audio file on the display device in accordance with timestamp data such that the visual information is displayed synchronously with the playing of the desired audio file, wherein the commencement of playing the desired audio file and the commencement of the displaying step are a function of the signal from the display device.Type: GrantFiled: November 29, 2001Date of Patent: June 6, 2006Assignee: Koninklijke Philips Electronics N.V.Inventors: Karen I. Trovato, Dongge Li, Muralidharan Ramaswamy
-
Publication number: 20060062059Abstract: Meta-data retrieved externally is determined to be relevant to the creation of an electronic programming guide (EPG) or relevant to a user's preferred programs. The meta-data is stored locally if it is determined that it is relevant to the creation of the EPG or if it is relevant to the user's preferred programs. All other meta-data is discarded. When a user requests meta-data, an attempt is made to retrieve the data from a local database, and if the attempt fails, then the meta-data is obtained from an external source.Type: ApplicationFiled: September 20, 2004Publication date: March 23, 2006Inventors: Alfonso Smith, Jianjun Fang, Bhavan Gandhi, Dongge Li
-
Publication number: 20050229233Abstract: A method for providing complementary information 226 for a video program is provided that includes receiving complementary information 226 for a video program. A query is received from a consumer. The query is related to a specified portion of the complementary information 226. A query response is provided to the consumer based on the specified portion of the complementary information 226.Type: ApplicationFiled: April 1, 2003Publication date: October 13, 2005Inventors: John Zimmerman, Nevenka Dimitrova, Dongge Li, Johanna Bont, Andreas Henricus Lamers, Angel Janevski, Lira Nikolovska
-
Publication number: 20040234108Abstract: A processor (10) utilizes information regarding one or more physical dimensions of an individual (14) to better inform a personal identification process. In one embodiment, the measured physical dimensions are utilized to influence the conduct of a face recognition process. In one embodiment, a Bayesian Belief Network can be utilized to facilitate such processes.Type: ApplicationFiled: May 22, 2003Publication date: November 25, 2004Applicant: Motorola, Inc.Inventors: Dongge Li, Bhavan Gandhi
-
Publication number: 20040098376Abstract: A method and system which enable a user to query a multimedia archive in one media modality and automatically retrieve correlating data in another media modality without the need for manually associating the data items through a data structure. The correlation method finds the maximum correlation between the data items without being affected by the distribution of the data in the respective subspace of each modality. Once the direction of correlation is disclosed, extracted features can be transferred from one subspace to another.Type: ApplicationFiled: November 15, 2002Publication date: May 20, 2004Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.Inventors: Dongge Li, Nevenka Dimitrova
-
Publication number: 20040024780Abstract: The present invention provides a method, system and program product for generating a content-based table of contents for a program. Specifically, under the present invention the genre of a program having sequences is determined. Once the genre has been determined, each sequence is assigned a classification. The classifications are assigned based on video content, audio content and textual content within the sequences. Based on the genre and the classifications, keyframe(s) are selected from the sequences for use in a content-based table of contents.Type: ApplicationFiled: August 1, 2002Publication date: February 5, 2004Applicant: Koninklijke Philips Electronics N.V.Inventors: Lalitha Agnihotri, Nevenka Dimitrova, Srinivas Gutta, Dongge Li
-
Publication number: 20030236663Abstract: A memory storing computer readable instructions for causing a processor associated with a mega speaker identification (ID) system to instantiate functions including an audio segmentation and classification function receiving general audio data (GAD) and generating segments, a feature extraction function receiving the segments and extracting features based on mel-frequency cepstral coefficients (MFCC) therefrom, a learning and clustering function receiving the extracted features and reclassifying segments, when required, based on the extracted features, a matching and labeling function assigning a speaker ID to speech signals within the GAD, and a database function for correlating the assigned speaker ID to the respective speech signals within the GAD. The audio segmentation and classification function can assign each segment to one of N audio signal classes including silence, single speaker speech, music, environmental noise, multiple speaker's speech, simultaneous speech and music, and speech and noise.Type: ApplicationFiled: June 19, 2002Publication date: December 25, 2003Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.Inventors: Nevenka Dimitrova, Dongge Li
-
Publication number: 20030154084Abstract: A method and system are disclosed for determining who is the speaking person in video data. This may be used to add in person identification in video content analysis and retrieval applications. A correlation is used to improve the person recognition rate relying on both face recognition and speaker identification. Latent Semantic Association (LSA) process may also be used to improve the association of a speaker's face with his voice. Other sources of data (e.g., text) may be integrated for a broader domain of video content understanding applications.Type: ApplicationFiled: February 14, 2002Publication date: August 14, 2003Applicant: Koninklijke Philips Electronics N.V.Inventors: Mingkun Li, Dongge Li, Nevenka Dimitrova
-
Publication number: 20030123734Abstract: A processing system is provided that detects an object of interest. The system receives an input image of the object. At least one feature is extracted from the input image. The extracted feature is then used to determine a set of candidate models by filtering out image models that do not contain the extracted feature. A sample image template is then formed based on a candidate sample image. The object of interest is then detected by comparing the input image to the sample image template. In a preferred embodiment, the formation of the template further includes calculating a parameter, such as direction, expression, articulation or lighting, of the object.Type: ApplicationFiled: December 28, 2001Publication date: July 3, 2003Applicant: Koninklijke Philips Electronics N.V.Inventors: Dongge Li, Nevenka Dimitrova
-
Publication number: 20030117428Abstract: A visualization system captures and analyzes a video signal to extract features in the video signal to render a graphical multi-dimensional visual representation of the program. The visualization system includes a memory and a processor and is programmed to extract features, augment the feature extraction with supplemental information, and render a visual summary to be displayed on a display device. Using the visual summary, a user can more easily determine the nature of a particular video program.Type: ApplicationFiled: December 20, 2001Publication date: June 26, 2003Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.Inventors: Dongge Li, John D. Zimmerman, Nevenka Dimitrova
-
Publication number: 20030107592Abstract: An information tracking device receives content data, such as a video or television signal from one or more information sources and analyzes the content data according to a query criteria to extract relevant stories. The query criteria utilizes a variety of information, such as but not limited to a user request, a user profile, and a knowledge base of known relationships. Using the query criteria, the information tracking device calculates a probability of a person or event occurring in the content data and spots and extracts stories accordingly. The results are index, ordered, and then displayed on a display device.Type: ApplicationFiled: December 11, 2001Publication date: June 12, 2003Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.Inventors: Dongge Li, Nevenka Dimitrova, Lalitha Agnihotri
-
Publication number: 20030101104Abstract: An information tracking device receives content data, such as a video or television signal from one or more information sources and analyzes the content data according to a query criteria to extract relevant stories. The query criteria utilizes a variety of information, such as but not limited to a user request, a user profile, and a knowledge base of known relationships. Using the query criteria, the information tracking device calculates a probability of a person or event occurring in the content data and spots and extracts stories accordingly. The results are index, ordered, and then displayed on a display device.Type: ApplicationFiled: November 28, 2001Publication date: May 29, 2003Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.Inventors: Nevenka Dimitrova, Dongge Li, Lalitha Agnihotri
-
Publication number: 20020163533Abstract: A method of synchronizing visual information with audio playback includes the steps of selecting a desired audio file from a list stored in memory associated with a display device, sending a signal from the display device to a remote device to cause the remote device to start playing the desired audio file; and displaying visual information associated with the desired audio file on the display device in accordance with timestamp data such that the visual information is displayed synchronously with the playing of the desired audio file, wherein the commencement of playing the desired audio file and the commencement of the displaying step are a function of the signal from the display device.Type: ApplicationFiled: November 29, 2001Publication date: November 7, 2002Applicant: Koninklijke Philips Electronics N.V.Inventors: Karen I. Trovato, Dongge Li, Muralidharan Ramaswamy