Patents by Inventor Ajay Divakaran

Ajay Divakaran has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20050154973
    Abstract: A system and method summarizes multimedia stored in a compressed multimedia file partitioned into a sequence of segments, where the content of the multimedia is, for example, video signals, audio signals, text, and binary data. An associated metadata file includes index information and an importance level for each segment. The importance information is continuous over as closed interval. An importance level threshold is selected in the closed interval, and only segments of the multimedia having a particular importance level greater than the importance level threshold are reproduced. The importance level can also be determined for fixed-length windows of multiple segments, or a sliding window. Furthermore, the importance level can be weighted by a factor, such as the audio volume.
    Type: Application
    Filed: February 13, 2004
    Publication date: July 14, 2005
    Inventors: Isao Otsuka, Ajay Divakaran, Masaharu Ogawa, Kazuhiko Nakane
  • Publication number: 20050154987
    Abstract: A system and method summarizes multimedia stored in a compressed multimedia file partitioned into a sequence of segments, where the content of the multimedia is, for example, video signals, audio signals, text, and binary data. An associated metadata file includes index information and an importance level for each segment. The importance information is continuous over as closed interval. An importance level threshold is selected in the closed interval, and only segments of the multimedia having a particular importance level greater than the importance level threshold are reproduced.
    Type: Application
    Filed: January 14, 2004
    Publication date: July 14, 2005
    Inventors: Isao Otsuka, Ajay Divakaran, Masaharu Ogawa, Kazuhiko Nakane
  • Publication number: 20050131869
    Abstract: A method learns a structure of a video, in an unsupervised setting, to detect events in the video consistent with the structure. Sets of features are selected from the video. Based on the selected features, a hierarchical statistical model is updated, and an information gain of the hierarchical statistical model is evaluated. Redundant features are then filtered, and the hierarchical statistical model is updated, based on the filtered features. A Bayesian information criteria is applied to each model and feature set pair, which can then be rank ordered according to the criteria to detect the events in the video.
    Type: Application
    Filed: December 12, 2003
    Publication date: June 16, 2005
    Inventors: Lexing Xie, Ajay Divakaran, Shih-Fu Chang
  • Publication number: 20050125223
    Abstract: A method uses probabilistic fusion to detect highlights in videos using both audio and visual information. Specifically, the method uses coupled hidden Markov models (CHMMs). Audio labels are generated using audio classification via Gaussian mixture models (GMMs), and visual labels are generated by quantizing average motion vector magnitudes. Highlights are modeled using discrete-observation CHMMs trained with labeled videos. The CHMMs have better performance than conventional hidden Markov models (HMMs) trained only on audio signals, or only on video frames.
    Type: Application
    Filed: December 5, 2003
    Publication date: June 9, 2005
    Inventors: Ajay Divakaran, Ziyou Xiong, Regunathan Radhakrishnan
  • Patent number: 6865226
    Abstract: A method analyzes a high-level syntax and structure of a continuous compressed video according to a plurality of states. First, a set of hidden Markov models for each of the states is trained with a training video segmented into known states. Then, a set of domain specific features are extracted from a fixed-length sliding window of the continuous compressed video, and a set of maximum likelihoods is determined for each set of domain specific features using the sets of trained hidden Markov models. Finally, dynamic programming is applied to each set of maximum likelihoods to determine a specific state for each fixed-length sliding window of frames of the compressed video.
    Type: Grant
    Filed: December 5, 2001
    Date of Patent: March 8, 2005
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun
  • Publication number: 20050018881
    Abstract: A method plays frames of a video adaptively according to a visual complexity of the video. First a spatial frequency of pixel within frames of the video is measured, as well as a temporal velocity of corresponding pixels between frames of the video. The spatial frequency is multiplied by the temporal velocity to obtain a measure of the visual complexity of the frames of the video. The frames of the video are then played at a speed that corresponds to the visual complexity.
    Type: Application
    Filed: July 10, 2003
    Publication date: January 27, 2005
    Inventors: Kadir Peker, Ajay Divakaran
  • Patent number: 6847680
    Abstract: A method identifies a talking head or principal cast in a compressed video by first segmenting the video into shots. Motion activity descriptors are extracted from each of the shots, and combined into a shot motion activity descriptor. A distance between the shot motion activity descriptor and a template motion activity descriptor is measured. The template motion activity descriptor is obtained from a training video. If the measured distance is less than a predetermined threshold, then the shot is identified as including a talking head.
    Type: Grant
    Filed: December 17, 2001
    Date of Patent: January 25, 2005
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ajay Divakaran, Regunathan Radhakrishnan
  • Publication number: 20040268380
    Abstract: A method detects short term, unusual events in a video. First, features are extracted features from the audio and the video portions of the video. Segments of the video are labeled according to the features. A global sliding window is applied to the labeled segments to determine global characteristics over time, while a local sliding window is applied only to the labeled segments of the global sliding window to determine local characteristic over time. The local window is substantially shorter in time than the global window. A distance between the global and local characteristic is measured to determine occurrences of the unusual short time events.
    Type: Application
    Filed: June 30, 2003
    Publication date: December 30, 2004
    Inventors: Ajay Divakaran, Ziyou Xiong, Regunathan Radhakrishnan, Kadir A. Peker, Koji Miyahara
  • Publication number: 20040233987
    Abstract: A method segments a video into objects, without user assistance. An MPEG compressed video is converted to a structure called a pseudo spatial/temporal data using DCT coefficients and motion vectors. The compressed video is first parsed and the pseudo spatial/temporal data are formed. Seeds macro-blocks are identified using, e.g., the DCT coefficients and changes in the motion vector of macro-blocks.
    Type: Application
    Filed: May 21, 2003
    Publication date: November 25, 2004
    Inventors: Fatih M. Porikli, Huifang Sun, Ajay Divakaran
  • Patent number: 6823011
    Abstract: A method detects an unusual event in a video. Motion vectors are extracted from each frame in a video acquired by a camera of a scene. Zero run-length parameters are determined for each frame from the motion vectors. The zero run-length parameters are summed over predetermined time intervals of the video, and a distance is determined between the sum of the zero run-lengths of a current time interval and the sum of the zero run-lengths of a previous time interval. Then, the unusual event is detected if the distance is greater than a predetermined threshold.
    Type: Grant
    Filed: November 19, 2001
    Date of Patent: November 23, 2004
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ajay Divakaran, Regunathan Radhakrishnan
  • Patent number: 6813313
    Abstract: A system and method analyzes a compressed video including a sequence of frames. The amount of a dominant feature in each frame of the compressed video is measured. A label is associated with each frame according the measured amount of the dominant feature. Views in the video are identified according to the labels, and the video is segmented into actions according to the views. The video can then be analyzed according to the action to determine significant events in the video.
    Type: Grant
    Filed: April 20, 2001
    Date of Patent: November 2, 2004
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Peng Xu, Shih-Fu Chang, Ajay Divakaran
  • Publication number: 20040167767
    Abstract: A method extracts highlights from an audio signal of a sporting event. The audio signal can be part of a sports videos. First, sets of features are extracted from the audio signal. The sets of features are classified according to the following classes: applause, cheering, ball hit, music, speech and speech with music. Adjacent sets of identically classified features are grouped. Portions of the audio signal corresponding to groups of features classified as applause or cheering and with a duration greater than a predetermined threshold are selected as highlights.
    Type: Application
    Filed: February 25, 2003
    Publication date: August 26, 2004
    Inventors: Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran
  • Patent number: 6778708
    Abstract: A compressed bit-stream represents a corresponding sequence having intra-coded frames and inter-coded frames. The compressed bit-stream includes bits associated with each of the inter-coded frames representing a displacement from the associated inter-coded frame to a closest matching of the intra-coded frames. A magnitude of the displacement of a first of the inter-coded frames is determined based on the bits in the compressed bit-stream associated with that inter-coded frame. The inter-coded frame is then identified based on the determined displacement magnitude. The inter-coded frame includes macro-blocks. Each macro-block is associated with a respective portion of the inter-coded frame bits which represent the displacement from that macro-block to the closest matching intra-coded frame. The displacement magnitude is an average of the displacement magnitudes of all the macro-blocks associated with the inter-coded frame.
    Type: Grant
    Filed: July 1, 1999
    Date of Patent: August 17, 2004
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ajay Divakaran, Huifang Sun
  • Publication number: 20040143434
    Abstract: A method segments and summarizes a news video using both audio and visual features extracted from the video. The summaries can be used to quickly browse the video to locate topics of interest. A generalized sound recognition hidden Markov model (HMM) framework for joint segmentation and classification of the audio signal of the news video is used. The HMM not only provides a classification label for audio segment, but also compact state duration histogram descriptors.
    Type: Application
    Filed: January 17, 2003
    Publication date: July 22, 2004
    Inventors: Ajay Divakaran, Regunathan Radhakrishnan
  • Patent number: 6763069
    Abstract: A method extracts high-level features from a video including a sequence of frames. Low-level features are extracted from each frame of the video. Each frame of the video is labeled according to the extracted low-level features to generate sequences of labels. Each sequence of labels is associated with one of the extracted low-level feature. The sequences of labels are analyzed using learning machine learning techniques to extract high-level features of the video.
    Type: Grant
    Filed: July 6, 2000
    Date of Patent: July 13, 2004
    Assignee: Mitsubishi Electric Research Laboratories, Inc
    Inventors: Ajay Divakaran, Anthony Vetro, Huifang Sun, Peng Xu, Shih-Fu Chang
  • Publication number: 20040086180
    Abstract: A method discovers patterns in unknown content of a video. The video is partitioned into sets of disjoint segments. Each set includes all frames of the video, and each set is partitioned according to a selected low-level feature of the video. The disjoint segments are grouped into corresponding sets of clusters, each cluster including similar segments. The clusters are then labeled, and association rules are identified among the labels to discover high-level patterns in the unknown content of the video.
    Type: Application
    Filed: November 1, 2002
    Publication date: May 6, 2004
    Inventor: Ajay Divakaran
  • Publication number: 20040085339
    Abstract: A method summarizes unknown content of a video. First, low-level features of the video are selected. The video is then partitioned into segments according to the low-level features. The segments are grouped into disjoint clusters where each cluster contains similar segments. The clusters are labeled according to the low-level features, and parameters characterizing the clusters are assigned. High-level patterns among the labels are found, and the these patterns are used to extract frames from the video according to form a content-adaptive summary of the unknown content of the video.
    Type: Application
    Filed: November 1, 2002
    Publication date: May 6, 2004
    Inventors: Ajay Divakaran, Kadir A. Peker
  • Publication number: 20040085323
    Abstract: A method mines unknown content of a video by first selecting one or more low-level features of the video. For each selected feature, or combination of features, time series data is generated. The time series data is then self-correlated to identify similar segments of the video according to the low-level features. The similar segments are grouped into clusters to discover high-level patterns in the unknown content of video.
    Type: Application
    Filed: November 1, 2002
    Publication date: May 6, 2004
    Inventors: Ajay Divakaran, Kadir A. Peker
  • Patent number: 6697523
    Abstract: A method extracts an intensity of motion activity from shots in a compressed video. The method then uses the intensity of motion activity to segment the video into easy and difficult segments to summarize. Easy to summarize segments are represented by any frames selected from the easy to summarize segments, while a color based summarization process extracts generates sequences of frames from each difficult to summarize segment. The selected and generated frames of each segment in each shot are combined to form the summary of the compressed video.
    Type: Grant
    Filed: August 9, 2000
    Date of Patent: February 24, 2004
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ajay Divakaran, Kadir A. Peker, Huifang Sun
  • Publication number: 20040008789
    Abstract: A method segments a compressed video by extracting audio and visual features from the compressed video. The audio features are clustered according to K-means clustering in a set of classes, and the compressed video is then partitioned into first segments according to the set of classes. The visual features are then used to partitioning each first segment into second segments using motion analysis. Summaries of the second segments can be provided to assist in the browsing of the compressed video.
    Type: Application
    Filed: July 10, 2002
    Publication date: January 15, 2004
    Inventors: Ajay Divakaran, Regunathan Radhakrishnan, Michael A. Casey