Patents by Inventor Paris Smaragdis
Paris Smaragdis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20150006168Abstract: Variable sound decomposition masking techniques are described. In one or more implementations, a mask is generated that incorporates a user input as part of the mask, the user input is usable at least in part to define a threshold that is variable based on the user input and configured for use in performing a sound decomposition process. The sound decomposition process is performed using the mask to assign portions of sound data to respective ones of a plurality of sources of the sound data.Type: ApplicationFiled: June 28, 2013Publication date: January 1, 2015Inventors: Gautham J. Mysore, Paris Smaragdis
-
Patent number: 8924345Abstract: Clustering and synchronizing content may include extracting audio features for each of a plurality of files that include audio content. The plurality of files may be clustered into one or more clusters. Clustering may include clustering based on a histogram that may be generated for each file pair of the plurality of files. Within each of the clusters, the files of the cluster may be time aligned.Type: GrantFiled: December 22, 2011Date of Patent: December 30, 2014Assignee: Adobe Systems IncorporatedInventors: Nicholas James Bryan, Paris Smaragdis, Gautham J. Mysore
-
Patent number: 8843364Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. The modeling may be constrained according to high level information. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source separation/extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.Type: GrantFiled: February 29, 2012Date of Patent: September 23, 2014Assignee: Adobe Systems IncorporatedInventors: Gautham J. Mysore, Paris Smaragdis
-
Patent number: 8812322Abstract: Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals present within a mixture, where one or more of the signals may be emitted by one or more different sources. In audio-related applications, for instance, a signal mixture may include speech (e.g., from a human speaker) and noise (e.g., background noise). In some cases, speech may be separated from noise using a speech model developed from training data. A noise model may be created, for example, during the separation process (e.g., “on-the-fly”) and in the absence of corresponding training data.Type: GrantFiled: May 27, 2011Date of Patent: August 19, 2014Assignee: Adobe Systems IncorporatedInventors: Gautham J. Mysore, Paris Smaragdis
-
Patent number: 8775167Abstract: Noise robust template matching may be performed. First features of a first signal may be computed. Based at least on a portion of the first features, second features of a second signal may be computed. A new signal may be generated based on at least another portion of the first features and on at least a portion of the second features.Type: GrantFiled: December 22, 2011Date of Patent: July 8, 2014Assignee: Adobe Systems IncorporatedInventors: Gautham J. Mysore, Paris Smaragdis, Brian John King
-
Publication number: 20140148933Abstract: Sound feature priority alignment techniques are described. In one or more implementations, features of sound data are identified from a plurality of recordings. Values are calculated for frames of the sound data from the plurality of recordings. The values are based on similarity of the frames of the sound data from the plurality of recordings to each other, the similarity based on the identified features and a priority that is assigned based on the identified features of respective frames. The sound data from the plurality of recordings is then aligned based at least in part on the calculated values.Type: ApplicationFiled: November 29, 2012Publication date: May 29, 2014Applicant: ADOBE SYSTEMS INCORPORATEDInventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20140142947Abstract: Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.Type: ApplicationFiled: November 20, 2012Publication date: May 22, 2014Applicant: ADOBE SYSTEMS INCORPORATEDInventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20140140517Abstract: Sound data identification techniques are described. In one or more implementations, common sound data and uncommon sound data are identified from a plurality of sound data from a plurality of recordings of an audio source using a collaborative technique. The identification may include recognition of spectral and temporal aspects of the plurality of the sound data from the plurality of the recordings and sharing of the recognized spectral and temporal aspects to identify the common sound data as common to the plurality of recordings and the uncommon sound data as not common to the plurality of recordings.Type: ApplicationFiled: November 19, 2012Publication date: May 22, 2014Applicant: ADOBE SYSTEMS INCORPORATEDInventors: Minje Kim, Paris Smaragdis
-
Publication number: 20140135962Abstract: Sound alignment techniques that employ timing information are described. In one or more implementations, features and timing information of sound data generated from a first sound signal are identified and used to identify features of sound data generated from a second sound signal. The identified features may then be utilized to align portions of the sound data from the first and second sound signals to each other.Type: ApplicationFiled: November 13, 2012Publication date: May 15, 2014Applicant: ADOBE SYSTEMS INCORPORATEDInventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20140136976Abstract: Sound alignment user interface techniques are described. In one or more implementations, a user interface is output having a first representation of sound data generated from a first sound signal and a second representation of sound data generated from a second sound signal. One or more inputs are received, via interaction with the user interface, that indicate that a first point in time in the first representation corresponds to a second point in time in the second representation. Aligned sound data is generated from the sound data from the first and second sound signals based at least in part on correspondence of the first point in time in the sound data generated from the first sound signal to the second point in time in the sound data generated from the second sound signal.Type: ApplicationFiled: November 13, 2012Publication date: May 15, 2014Applicant: ADOBE SYSTEMS INCORPORATEDInventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20140133675Abstract: Time interval sound alignment techniques are described. In one or more implementations, one or more inputs are received via interaction with a user interface that indicate that a first time interval in a first representation of sound data generated from a first sound signal corresponds to a second time interval in a second representation of sound data generated from a second sound signal. A stretch value is calculated based on an amount of time represented in the first and second time intervals, respectively. Aligned sound data is generated from the sound data for the first and second time intervals based on the calculated stretch value.Type: ApplicationFiled: November 13, 2012Publication date: May 15, 2014Applicant: Adobe Systems IncorporatedInventors: Brian John King, Gautham J. Mysore, Paris Smaragdis
-
Patent number: 8724798Abstract: A method and apparatus for canceling an echo in audio communication is disclosed. The method comprises receiving an audio signal from a network and subsequently detecting a mixture audio signal comprising a target audio signal and an echo audio signal, the echo signal corresponding to the received audio signal. The method then comprises estimating the target audio signal by determining magnitude spectrograms for the mixture and received audio signals respectively, estimating a magnitude spectrogram of the target audio signal dependent on those of the mixture and received audio signal, and generating an output audio signal that estimates the target audio signal, the output audio signal being dependent on the estimated magnitude spectrogram.Type: GrantFiled: November 20, 2009Date of Patent: May 13, 2014Assignee: Adobe Systems IncorporatedInventors: Paris Smaragdis, Gautham J. Mysore
-
Patent number: 8699858Abstract: A computer-implemented method includes segmenting a plurality of video frames of a sequence of video frames into a first portion that includes a selected visual object represented in the video frame and a second portion that includes a background represented in the video frame. The selected visual object is selected by using a selection envelope.Type: GrantFiled: November 26, 2008Date of Patent: April 15, 2014Assignee: Adobe Systems IncorporatedInventors: Ce Liu, Sylvain Paris, Paris Smaragdis, Wojciech Matusik
-
Patent number: 8611558Abstract: A method and system is presented for sampling analog signals in a manner that avoids the effects of signal clipping due to a limited dynamic range. A method and device for sampling an analog input using multiple gains, or gain mask, is described. By using different gains during different time quanta, a subset of the sampled points may effectively be attenuated before being sampled and converted to a digital representation. If clipping occurs during the sampling process, the true values of the clipped samples may be interpolated using the amplitudes of the non-clipped samples, which may not have been attenuated. Such interpolation may include constructing and/or solving a constraint optimization problem using linear programming. In one embodiment, such a problem may be constructed and/or solved by using sign information from the clipped samples and/or by imposing a sparsity assumption on the signals during the reconstruction process.Type: GrantFiled: February 26, 2009Date of Patent: December 17, 2013Assignee: Adobe Systems IncorporatedInventor: Paris Smaragdis
-
Patent number: 8554553Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.Type: GrantFiled: February 21, 2011Date of Patent: October 8, 2013Assignee: Adobe Systems IncorporatedInventors: Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20130226858Abstract: A sound mixture may be received that includes a plurality of sources. A model may be received for one of the source that includes a dictionary of spectral basis vectors corresponding to that one source. At least one feature of the one source in the sound mixture may be estimated based on the model. In some examples, the estimation may be constrained according to temporal data.Type: ApplicationFiled: February 29, 2012Publication date: August 29, 2013Inventors: Paris Smaragdis, Gautham J. Mysore
-
Publication number: 20130226558Abstract: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. The modeling may be constrained according to high level information. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed herein may find a wide variety of applications. In audio-related fields, for example, these techniques may be useful in music recording and processing, source separation/extraction, noise reduction, teaching, automatic transcription, electronic games, audio search and retrieval, and many other applications.Type: ApplicationFiled: February 29, 2012Publication date: August 29, 2013Inventors: Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20130129115Abstract: A method and system is presented for sampling analog signals in a manner that avoids the effects of signal clipping due to a limited dynamic range. A method and device for sampling an analog input using multiple gains, or gain mask, is described. By using different gains during different time quanta, a subset of the sampled points may effectively be attenuated before being sampled and converted to a digital representation. If clipping occurs during the sampling process, the true values of the clipped samples may be interpolated using the amplitudes of the non-clipped samples, which may not have been attenuated. Such interpolation may include constructing and/or solving a constraint optimization problem using linear programming. In one embodiment, such a problem may be constructed and/or solved by using sign information from the clipped samples and/or by imposing a sparsity assumption on the signals during the reconstruction process.Type: ApplicationFiled: February 26, 2009Publication date: May 23, 2013Inventor: Paris Smaragdis
-
Publication number: 20130132077Abstract: Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals present within a mixture, where one or more of the signals may be emitted by one or more different sources. In audio-related applications, for instance, a signal mixture may include speech (e.g., from a human speaker) and noise (e.g., background noise). In some cases, speech may be separated from noise using a speech model developed from training data. A noise model may be created, for example, during the separation process (e.g., “on-the-fly”) and in the absence of corresponding training data.Type: ApplicationFiled: May 27, 2011Publication date: May 23, 2013Inventors: Gautham J. Mysore, Paris Smaragdis
-
Publication number: 20130132082Abstract: Methods and systems for recognition of concurrent, superimposed, or otherwise overlapping signals are described. A Markov Selection Model is introduced that, together with probabilistic decomposition methods, enable recognition of simultaneously emitted signals from various sources. For example, a signal mixture may include overlapping speech from different persons. In some instances, recognition may be performed without the need to separate signals or sources. As such, some of the techniques described herein may be useful in automatic transcription, noise reduction, teaching, electronic games, audio search and retrieval, medical and scientific applications, etc.Type: ApplicationFiled: February 21, 2011Publication date: May 23, 2013Inventor: Paris Smaragdis