Patents by Inventor Gautham J. Mysore

Gautham J. Mysore has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9514722
    Abstract: Techniques are disclosed for automatic detection of dense ornamentation in music. Input data representing a piece of digitally encoded music in a time domain is converted into a spectrogram representing time-frequency coefficients in a frequency domain. The spectrogram includes column vectors of the time-frequency coefficients that correspond to time periods spanning different portions of the piece of music. A one-dimensional onset detection array is calculated based on a subset of the column vectors. Using the spectrogram and the onset detection array, a two-dimensional self-similarity matrix (SSM) is calculated based on pair-wise comparisons of elements in the onset detection array. As a result, an irregular pattern score representing the presence of dense ornamentation in the piece of music can be calculated based on a magnitude difference between a beat pattern in the music and each column of the slim SSM.
    Type: Grant
    Filed: November 10, 2015
    Date of Patent: December 6, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Minje Kim, Gautham J. Mysore, Paris Smaragdis, Peter Merrill
  • Patent number: 9451304
    Abstract: Sound feature priority alignment techniques are described. In one or more implementations, features of sound data are identified from a plurality of recordings. Values are calculated for frames of the sound data from the plurality of recordings. The values are based on similarity of the frames of the sound data from the plurality of recordings to each other, the similarity based on the identified features and a priority that is assigned based on the identified features of respective frames. The sound data from the plurality of recordings is then aligned based at least in part on the calculated values.
    Type: Grant
    Filed: November 29, 2012
    Date of Patent: September 20, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Patent number: 9449085
    Abstract: Pattern matching of sound data using hashing is described. In one or more implementations, a query formed from one or more spectrograms of sound data is hashed and used to locate one or more labels in a database of sound signals. Each of the labels is located using a hash of an entry in the database. At least one of the located one or more labels is chosen as corresponding to the query.
    Type: Grant
    Filed: November 14, 2013
    Date of Patent: September 20, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Minje Kim, Paris Smaragdis, Gautham J. Mysore
  • Patent number: 9437208
    Abstract: Sound decomposition models are described. In one or more implementations, a plurality of individual models is generated for respective ones of a plurality of sound sources. The plurality of models is collected to form a universal audio model that is configured to support sound decomposition of sound data through use of one or more of the models. The plurality of models is not generated using a sound source that originated at least a portion of the sound data.
    Type: Grant
    Filed: June 3, 2013
    Date of Patent: September 6, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Dennis L. Sun, Gautham J. Mysore
  • Publication number: 20160232914
    Abstract: Sound enhancement techniques through dereverberation are described. In one or more implementations, a method is described of enhancing sound data through removal of reverberation from the sound data by one or more computing devices. The method includes obtaining a model that describes primary sound data that is to be utilized as a prior that assumes no prior knowledge about specifics of the sound data from which the reverberation is to be removed. A reverberation kernel is computed having parameters that, when applied to the model that describes the primary sound data, corresponds to the sound data from which the reverberation is to be removed. The reverberation is removed from the sound data using the reverberation kernel.
    Type: Application
    Filed: February 5, 2015
    Publication date: August 11, 2016
    Inventors: Dawen Liang, Matthew Douglas Hoffman, Gautham J. Mysore
  • Publication number: 20160196828
    Abstract: Acoustic matching and splicing of sound tracks is described. In one or more implementations, a method to acoustically match and splice first and second sound tracks by one or more computing devices is described. The method includes source separating the first and second sound tracks into first track primary and background sound data and second track primary and background sound data. Features extracted from the first and second primary sound data are matched, one to another, to generate first and second primary matching masks. Features extracted from the first and second background sound data are matched, one to another, to generate first and second background matching masks, which are applied to respective separated sound data. The applied first track primary and background sound data and the applied second track primary and background sound data are spliced to generate a spliced sound track.
    Type: Application
    Filed: January 7, 2015
    Publication date: July 7, 2016
    Inventors: François G. Germain, Gautham J. Mysore
  • Patent number: 9355649
    Abstract: Sound alignment techniques that employ timing information are described. In one or more implementations, features and timing information of sound data generated from a first sound signal are identified and used to identify features of sound data generated from a second sound signal. The identified features may then be utilized to align portions of the sound data from the first and second sound signals to each other.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: May 31, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Patent number: 9351093
    Abstract: Multichannel sound source identification and location techniques are described. In one or more implementations, source separation is performed using a collaborative technique for a plurality of sound data that was captured by respective ones of a plurality of sound capture devices of an audio scene. The source separation is performed by recognizing spectral and temporal aspects from the plurality of sound data and sharing the recognized spectral and temporal aspects, one with another, to identify one or more sound sources in the audio scene. A relative position of the identified one or more sound sources to the plurality of sound capture devices is determined based on the source separation.
    Type: Grant
    Filed: December 24, 2013
    Date of Patent: May 24, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Minje Kim, Gautham J. Mysore, Paris Smaragdis
  • Patent number: 9318106
    Abstract: Joint sound model generation techniques are described. In one or more implementations, a plurality of models of sound data received from a plurality of different sound scenes are jointly generated. The joint generating includes learning information as part of generating a first said model of sound data from a first one of the sound scenes and sharing the learned information for use in generating a second one of the models of sound data from a second one of the sound scenes.
    Type: Grant
    Filed: August 8, 2013
    Date of Patent: April 19, 2016
    Assignee: Adobe Systems Incorporated
    Inventors: Dennis L. Sun, Gautham J. Mysore
  • Publication number: 20160007130
    Abstract: Performance metric based stopping criteria for iterative algorithm techniques are described. In one or more implementations, a training dataset is processed by one or more computing devices using an iterative algorithm having a cost function. The processing includes, for a plurality of iterations of the iterative algorithm, computing a cost for the iterative algorithm using the cost function and a value for each of a plurality of performance metrics that are usable to infer accuracy of the iterative algorithm for a respective one of the iterations. Responsive to the processing, a particular one of the plurality of iterations is identified as a stopping criterion based at least in part on the computed values for the plurality of performance metrics and the stopping criterion is output to configure the iterative algorithm to use the stopping criterion for subsequent processing of data by the iterative algorithm.
    Type: Application
    Filed: July 7, 2014
    Publication date: January 7, 2016
    Inventors: François G. Germain, Gautham J. Mysore
  • Patent number: 9201580
    Abstract: Sound alignment user interface techniques are described. In one or more implementations, a user interface is output having a first representation of sound data generated from a first sound signal and a second representation of sound data generated from a second sound signal. One or more inputs are received, via interaction with the user interface, that indicate that a first point in time in the first representation corresponds to a second point in time in the second representation. Aligned sound data is generated from the sound data from the first and second sound signals based at least in part on correspondence of the first point in time in the sound data generated from the first sound signal to the second point in time in the sound data generated from the second sound signal.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: December 1, 2015
    Assignee: Adobe Systems Incorporated
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Patent number: 9165565
    Abstract: A sound mixture may be received that includes a plurality of sources. A model may be received that includes a dictionary of spectral basis vectors for the plurality of sources. A weight may be estimated for each of the plurality of sources in the sound mixture based on the model. In some examples, such weight estimation may be performed using a source separation technique without actually separating the sources.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: October 20, 2015
    Assignee: Adobe Systems Incorporated
    Inventors: Gautham J. Mysore, Paris Smaragdis, Juhan Nam
  • Publication number: 20150242180
    Abstract: Sound processing techniques using recurrent neural networks are described. In one or more implementations, temporal dependencies are captured in sound data that are modeled through use of a recurrent neural network (RNN). The captured temporal dependencies are employed as part of feature extraction performed using nonnegative matrix factorization (NMF). One or more sound processing techniques are performed on the sound data based at least in part on the feature extraction.
    Type: Application
    Filed: February 21, 2014
    Publication date: August 27, 2015
    Inventors: Nicolas Maurice Boulanger-Lewandowski, Gautham J. Mysore, Matthew Douglas Hoffman
  • Publication number: 20150181359
    Abstract: Multichannel sound source identification and location techniques are described. In one or more implementations, source separation is performed using a collaborative technique for a plurality of sound data that was captured by respective ones of a plurality of sound capture devices of an audio scene. The source separation is performed by recognizing spectral and temporal aspects from the plurality of sound data and sharing the recognized spectral and temporal aspects, one with another, to identify one or more sound sources in the audio scene. A relative position of the identified one or more sound sources to the plurality of sound capture devices is determined based on the source separation.
    Type: Application
    Filed: December 24, 2013
    Publication date: June 25, 2015
    Inventors: Minje Kim, Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20150142450
    Abstract: Sound processing using a product-of-filters model is described. In one or more implementations, a model is formed by one or more computing devices for a time frame of sound data as a product of filters. The model is utilized by the one or more computing devices to perform one or more sound processing techniques on the time frame of the sound data.
    Type: Application
    Filed: November 15, 2013
    Publication date: May 21, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Dawen Liang, Matthew Douglas Hoffman, Gautham J. Mysore
  • Publication number: 20150142433
    Abstract: Pattern identification using convolution is described. In one or more implementations, a representation of a pattern is obtained that is described using data points that include frequency coordinates, time coordinates, and energy values. An identification is made as to whether sound data described using irregularly positioned data points includes the pattern, the identifying including use of a convolution of the frequency or time coordinates to determine correspondence with the representation of the pattern.
    Type: Application
    Filed: November 20, 2013
    Publication date: May 21, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Minje Kim, Paris Smaragdis, Gautham J. Mysore
  • Publication number: 20150134691
    Abstract: Pattern matching of sound data using hashing is described. In one or more implementations, a query formed from one or more spectrograms of sound data is hashed and used to locate one or more labels in a database of sound signals. Each of the labels is located using a hash of an entry in the database. At least one of the located one or more labels is chosen as corresponding to the query.
    Type: Application
    Filed: November 14, 2013
    Publication date: May 14, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Minje Kim, Paris Smaragdis, Gautham J. Mysore
  • Patent number: 8965832
    Abstract: A sound mixture may be received that includes a plurality of sources. A model may be received for one of the sources that includes a dictionary of spectral basis vectors corresponding to that one source. At least one feature of the one source in the sound mixture may be estimated based on the model. In some examples, the estimation may be constrained according to temporal data.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: February 24, 2015
    Assignee: Adobe Systems Incorporated
    Inventors: Paris Smaragdis, Gautham J. Mysore
  • Publication number: 20150046377
    Abstract: Joint sound model generation techniques are described. In one or more implementations, a plurality of models of sound data received from a plurality of different sound scenes are jointly generated. The joint generating includes learning information as part of generating a first said model of sound data from a first one of the sound scenes and sharing the learned information for use in generating a second one of the models of sound data from a second one of the sound scenes.
    Type: Application
    Filed: August 8, 2013
    Publication date: February 12, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Dennis L. Sun, Gautham J. Mysore
  • Patent number: 8954175
    Abstract: A system and method are described for selecting a target sound object from a sound mixture. In embodiments, a sound mixture comprises a plurality of sound objects superimposed in time. A user can select one of these sound objects by providing reference audio data corresponding to a reference sound object. The system analyzes the audio data and the reference audio data to identify a portion of the audio data corresponding to a target sound object in the mixture that is most similar to the reference sound object. The analysis may include decomposing the reference audio data into a plurality of reference components and the sound mixture into a plurality of components guided by the reference components. The target sound object can be re-synthesized from the target components.
    Type: Grant
    Filed: August 26, 2009
    Date of Patent: February 10, 2015
    Assignee: Adobe Systems Incorporated
    Inventors: Paris Smaragdis, Gautham J. Mysore
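
Several abstracts in this listing (for example, patent 9165565) describe estimating a weight for each source in a sound mixture from a dictionary of spectral basis vectors, without separating the sources. The sketch below is only an illustration of that general idea and is not taken from any of the patents. It assumes a single magnitude-spectrum frame, a fixed dictionary, and the standard multiplicative update for nonnegative matrix factorization with a Euclidean cost, a technique chosen here for simplicity. The function name, the data layout, and the toy numbers are all hypothetical.

```python
from typing import Dict, List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def estimate_source_weights(
    mixture: Vector,
    dictionary: Dict[str, List[Vector]],
    iterations: int = 200,
    eps: float = 1e-12,
) -> Dict[str, float]:
    """Estimate each source's share of one nonnegative spectrum frame.

    `dictionary` maps a source name to its spectral basis vectors.
    The dictionary stays fixed; only the activations are updated
    (hypothetical helper, not an API from the patents).
    """
    labels = [name for name, bases in dictionary.items() for _ in bases]
    bases = [b for group in dictionary.values() for b in group]
    h = [1.0] * len(bases)  # one activation per basis vector

    # W^T v does not change between iterations, so compute it once.
    numer = [dot(b, mixture) for b in bases]

    for _ in range(iterations):
        # Current reconstruction W h of the mixture frame.
        recon = [
            sum(h[k] * bases[k][f] for k in range(len(bases)))
            for f in range(len(mixture))
        ]
        # Multiplicative update: h <- h * (W^T v) / (W^T W h).
        h = [
            h[k] * numer[k] / (dot(bases[k], recon) + eps)
            for k in range(len(bases))
        ]

    # Sum activations per source and normalize to fractions.
    totals = {name: 0.0 for name in dictionary}
    for name, activation in zip(labels, h):
        totals[name] += activation
    total = sum(totals.values()) or 1.0
    return {name: value / total for name, value in totals.items()}


if __name__ == "__main__":
    # Toy dictionary with one basis vector per source (made-up values).
    toy_dictionary = {
        "speech": [[1.0, 0.0, 0.0]],
        "noise": [[0.0, 0.0, 1.0]],
    }
    print(estimate_source_weights([3.0, 0.0, 1.0], toy_dictionary))
```

With the toy dictionary above, the two basis vectors do not overlap, so the estimated shares come out at roughly 0.75 for "speech" and 0.25 for "noise". Real dictionaries would be learned from training recordings of each source, and the weights would typically be estimated over many frames rather than one.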