Patents by Inventor Paris Smaragdis

Paris Smaragdis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20150006168
    Abstract: Variable sound decomposition masking techniques are described. In one or more implementations, a mask is generated that incorporates a user input as part of the mask, the user input is usable at least in part to define a threshold that is variable based on the user input and configured for use in performing a sound decomposition process. The sound decomposition process is performed using the mask to assign portions of sound data to respective ones of a plurality of sources of the sound data.
    Type: Application
    Filed: June 28, 2013
    Publication date: January 1, 2015
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Patent number: 8924345
    Abstract: Clustering and synchronizing content may include extracting audio features for each of a plurality of files that include audio content. The plurality of files may be clustered into one or more clusters. Clustering may include clustering based on a histogram that may be generated for each file pair of the plurality of files. Within each of the clusters, the files of the cluster may be time aligned.
    Type: Grant
    Filed: December 22, 2011
    Date of Patent: December 30, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Nicholas James Bryan, Paris Smaragdis, Gautham J. Mysore
  • Patent number: 8843364
    Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. The modeling may be constrained according to high level information. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source separation/extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: September 23, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Patent number: 8812322
    Abstract: Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals present within a mixture, where one or more of the signals may be emitted by one or more different sources. In audio-related applications, for instance, a signal mixture may include speech (e.g., from a human speaker) and noise (e.g., background noise). In some cases, speech may be separated from noise using a speech model developed from training data. A noise model may be created, for example, during the separation process (e.g., “on-the-fly”) and in the absence of corresponding training data.
    Type: Grant
    Filed: May 27, 2011
    Date of Patent: August 19, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Patent number: 8775167
    Abstract: Noise robust template matching may be performed. First features of a first signal may be computed. Based at least on a portion of the first features, second features of a second signal may be computed. A new signal may be generated based on at least another portion of the first features and on at least a portion of the second features.
    Type: Grant
    Filed: December 22, 2011
    Date of Patent: July 8, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Gautham J. Mysore, Paris Smaragdis, Brian John King
  • Publication number: 20140148933
    Abstract: Sound feature priority alignment techniques are described. In one or more implementations, features of sound data are identified from a plurality of recordings. Values are calculated for frames of the sound data from the plurality of recordings. The values are based on similarity of the frames of the sound data from the plurality of recordings to each other, the similarity based on the identified features and a priority that is assigned based on the identified features of respective frames. The sound data from the plurality of recordings is then aligned based at least in part on the calculated values.
    Type: Application
    Filed: November 29, 2012
    Publication date: May 29, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20140142947
    Abstract: Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.
    Type: Application
    Filed: November 20, 2012
    Publication date: May 22, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20140140517
    Abstract: Sound data identification techniques are described. In one or more implementations, common sound data and uncommon sound data are identified from a plurality of sound data from a plurality of recordings of an audio source using a collaborative technique. The identification may include recognition of spectral and temporal aspects of the plurality of the sound data from the plurality of the recordings and sharing of the recognized spectral and temporal aspects to identify the common sound data as common to the plurality of recordings and the uncommon sound data as not common to the plurality of recordings.
    Type: Application
    Filed: November 19, 2012
    Publication date: May 22, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Minje Kim, Paris Smaragdis
  • Publication number: 20140135962
    Abstract: Sound alignment techniques that employ timing information are described. In one or more implementations, features and timing information of sound data generated from a first sound signal are identified and used to identify features of sound data generated from a second sound signal. The identified features may then be utilized to align portions of the sound data from the first and second sound signals to each other.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 15, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20140136976
    Abstract: Sound alignment user interface techniques are described. In one or more implementations, a user interface is output having a first representation of sound data generated from a first sound signal and a second representation of sound data generated from a second sound signal. One or more inputs are received, via interaction with the user interface, that indicate that a first point in time in the first representation corresponds to a second point in time in the second representation. Aligned sound data is generated from the sound data from the first and second sound signals based at least in part on correspondence of the first point in time in the sound data generated from the first sound signal to the second point in time in the sound data generated from the second sound signal.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 15, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20140133675
    Abstract: Time interval sound alignment techniques are described. In one or more implementations, one or more inputs are received via interaction with a user interface that indicate that a first time interval in a first representation of sound data generated from a first sound signal corresponds to a second time interval in a second representation of sound data generated from a second sound signal. A stretch value is calculated based on an amount of time represented in the first and second time intervals, respectively. Aligned sound data is generated from the sound data for the first and second time intervals based on the calculated stretch value.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 15, 2014
    Applicant: Adobe Systems Incorporated
    Inventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
  • Patent number: 8724798
    Abstract: A method and apparatus for canceling an echo in audio communication is disclosed. The method comprises receiving an audio signal from a network and subsequently detecting a mixture audio signal comprising a target audio signal and an echo audio signal, the echo signal corresponding to the received audio signal. The method then comprises estimating the target audio signal by determining magnitude spectrograms for the mixture and received audio signals respectively, estimating a magnitude spectrogram of the target audio signal dependent on those of the mixture and received audio signal, and generating an output audio signal that estimates the target audio signal, the output audio signal being dependent on the estimated magnitude spectrogram.
    Type: Grant
    Filed: November 20, 2009
    Date of Patent: May 13, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Paris Smaragdis, Gautham J. Mysore
  • Patent number: 8699858
    Abstract: A computer-implemented method includes segmenting a plurality of video frames of a sequence of video frames into a first portion that includes a selected visual object represented in the video frame and a second portion that includes a background represented in the video frame. The selected visual object is selected by using a selection envelope.
    Type: Grant
    Filed: November 26, 2008
    Date of Patent: April 15, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Ce Liu, Sylvain Paris, Paris Smaragdis, Wojciech Matusik
  • Patent number: 8611558
    Abstract: A method and system is presented for sampling analog signals in a manner that avoids the effects of signal clipping due to a limited dynamic range. A method and device for sampling an analog input using multiple gains, or gain mask, is described. By using different gains during different time quanta, a subset of the sampled points may effectively be attenuated before being sampled and converted to a digital representation. If clipping occurs during the sampling process, the true values of the clipped samples may be interpolated using the amplitudes of the non-clipped samples, which may not have been attenuated. Such interpolation may include constructing and/or solving a constraint optimization problem using linear programming. In one embodiment, such a problem may be constructed and/or solved by using sign information from the clipped samples and/or by imposing a sparsity assumption on the signals during the reconstruction process.
    Type: Grant
    Filed: February 26, 2009
    Date of Patent: December 17, 2013
    Assignee: Adobe Systems Incorporated
    Inventor: Paris Smaragdis
  • Patent number: 8554553
    Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.
    Type: Grant
    Filed: February 21, 2011
    Date of Patent: October 8, 2013
    Assignee: Adobe Systems Incorporated
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20130226858
    Abstract: A sound mixture may be received that includes a plurality of sources. A model may be received for one of the source that includes a dictionary of spectral basis vectors corresponding to that one source. At least one feature of the one source in the sound mixture may be estimated based on the model. In some examples, the estimation may be constrained according to temporal data.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 29, 2013
    Inventors: Paris Smaragdis, Gautham J. Mysore
  • Publication number: 20130226558
    Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. The modeling may be constrained according to high level information. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source separation/extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 29, 2013
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20130129115
    Abstract: A method and system is presented for sampling analog signals in a manner that avoids the effects of signal clipping due to a limited dynamic range. A method and device for sampling an analog input using multiple gains, or gain mask, is described. By using different gains during different time quanta, a subset of the sampled points may effectively be attenuated before being sampled and converted to a digital representation. If clipping occurs during the sampling process, the true values of the clipped samples may be interpolated using the amplitudes of the non-clipped samples, which may not have been attenuated. Such interpolation may include constructing and/or solving a constraint optimization problem using linear programming. In one embodiment, such a problem may be constructed and/or solved by using sign information from the clipped samples and/or by imposing a sparsity assumption on the signals during the reconstruction process.
    Type: Application
    Filed: February 26, 2009
    Publication date: May 23, 2013
    Inventor: Paris Smaragdis
  • Publication number: 20130132077
    Abstract: Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals present within a mixture, where one or more of the signals may be emitted by one or more different sources. In audio-related applications, for instance, a signal mixture may include speech (e.g., from a human speaker) and noise (e.g., background noise). In some cases, speech may be separated from noise using a speech model developed from training data. A noise model may be created, for example, during the separation process (e.g., “on-the-fly”) and in the absence of corresponding training data.
    Type: Application
    Filed: May 27, 2011
    Publication date: May 23, 2013
    Inventors: Gautham J. Mysore, Paris Smaragdis
  • Publication number: 20130132082
    Abstract: Methods and systems for recognition of concurrent, superimposed, or otherwise overlapping signals are described. A Markov Selection Model is introduced that, together with probabilistic decomposition methods, enable recognition of simultaneously emitted signals from various sources. For example, a signal mixture may include overlapping speech from different persons. In some instances, recognition may be performed without the need to separate signals or sources. As such, some of the techniques described herein may be useful in automatic transcription, noise reduction, teaching, electronic games, audio search and retrieval, medical and scientific applications, etc.
    Type: Application
    Filed: February 21, 2011
    Publication date: May 23, 2013
    Inventor: Paris Smaragdis