Patents by Inventor Lie Lu
Lie Lu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20160078882Abstract: Embodiments for measuring content coherence and embodiments for measuring content similarity are described. Content coherence between a first audio section and a second audio section is measured. For each audio segment in the first audio section, a predetermined number of audio segments in the second audio section are determined. Content similarity between the audio segment in the first audio section and the determined audio segments is higher than that between the audio segment and all the other audio segments in the second audio section. An average of the content similarity between the audio segment in the first audio section and the determined audio segments is calculated. The content coherence is calculated as an average, the maximum or the minimum of the averages calculated for the audio segments in the first audio section. The content similarity may be calculated based on Dirichlet distribution.Type: ApplicationFiled: November 25, 2015Publication date: March 17, 2016Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Lie LU, Mingqing HU
-
Publication number: 20160056787Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.Type: ApplicationFiled: March 17, 2014Publication date: February 25, 2016Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Lie LU, Jun WANG, Alan SEEFELDT, Mingqing HU
-
Publication number: 20160049915Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.Type: ApplicationFiled: March 17, 2014Publication date: February 18, 2016Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Jun WANG, Lie LU, Alan SEEFELDT
-
Publication number: 20160019671Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.Type: ApplicationFiled: September 29, 2015Publication date: January 21, 2016Inventors: Claus Bauer, Lie Lu, Mingqing Hu
-
Patent number: 9218821Abstract: Embodiments for measuring content coherence and embodiments for measuring content similarity are described. Content coherence between a first audio section and a second audio section is measured. For each audio segment in the first audio section, a predetermined number of audio segments in the second audio section are determined. Content similarity between the audio segment in the first audio section and the determined audio segments is higher than that between the audio segment and all the other audio segments in the second audio section. An average of the content similarity between the audio segment in the first audio section and the determined audio segments is calculated. The content coherence is calculated as an average, the maximum or the minimum of the averages calculated for the audio segments in the first audio section. The content similarity may be calculated based on Dirichlet distribution.Type: GrantFiled: August 7, 2012Date of Patent: December 22, 2015Assignee: Dolby Laboratories Licensing CorporationInventors: Lie Lu, Mingqing Hu
-
Patent number: 9202255Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.Type: GrantFiled: April 1, 2013Date of Patent: December 1, 2015Assignee: Dolby Laboratories Licensing CorporationInventors: Claus Bauer, Lie Lu, Mingqing Hu
-
Publication number: 20150332680Abstract: Embodiments are directed a method of rendering object-based audio comprising determining an initial spatial position of objects having object audio data and associated metadata, determining a perceptual importance of the objects, and grouping the audio objects into a number of clusters based on the determined perceptual importance of the objects, such that a spatial error caused by moving an object from an initial spatial position to a second spatial position in a cluster is minimized for objects with a relatively high perceptual importance. The perceptual importance is based at least in part by a partial loudness of an object and content semantics of the object.Type: ApplicationFiled: November 25, 2013Publication date: November 19, 2015Applicant: Dolby Laboratories Licensing CorporationInventors: Brett G. CROCKETT, Alan J. SEEFELDT, Nicolas R. TSINGOS, Rhonda WILSON, Dirk Jeroen BREEBAART, Lie LU, Lianwu CHEN
-
Publication number: 20150254054Abstract: A method for audio signal processing is provided. The method includes acquiring a first set of metadata associated with consumption of an audio signal by a target user, acquiring a second set of metadata associated with a set of reference users and generating, at least partially based on the first and second sets of metadata, a recommended configuration of at least one parameter for the target user, the at least one parameter being for use in the consumption of the audio signal. Corresponding apparatus and computer program product are also disclosed.Type: ApplicationFiled: February 25, 2015Publication date: September 10, 2015Applicant: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Bin Cheng, Claus Bauer, Lie Lu, Guilin Ma
-
Publication number: 20140358265Abstract: Audio processing method and audio processing apparatus, and training method are described. According to embodiments of the application, an accent identifier is used to identify accent frames from a plurality of audio frames, resulting in an accent sequence comprised of probability scores of accent and/or non-accent decisions with respect to the plurality of audio frames. Then a tempo estimator is used to estimate a tempo sequence of the plurality of audio frames based on the accent sequence. The embodiments can be well adaptive to the change of tempo, and can be further used to tracking beats properly.Type: ApplicationFiled: May 20, 2014Publication date: December 4, 2014Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Jun Wang, Lie Lu
-
Patent number: 8892231Abstract: Embodiments for audio classification are described. An audio classification system includes at least one device which executes a process of audio classification on an audio signal. The at least one device can operate in at least two modes requiring different resources. The audio classification system also includes a complexity controller which determines a combination and instructs the at least one device to operate according to the combination. For each of the at least one device, the combination specifies one of the modes of the device, and the resources requirement of the combination does not exceed maximum available resources. By controlling the modes, the audio classification system has improved scalability to an execution environment.Type: GrantFiled: August 22, 2012Date of Patent: November 18, 2014Assignee: Dolby Laboratories Licensing CorporationInventors: Bin Cheng, Lie Lu
-
Publication number: 20140205103Abstract: Embodiments for measuring content coherence and embodiments for measuring content similarity are described. Content coherence between a first audio section and a second audio section is measured. For each audio segment in the first audio section, a predetermined number of audio segments in the second audio section are determined. Content similarity between the audio segment in the first audio section and the determined audio segments is higher than that between the audio segment and all the other audio segments in the second audio section. An average of the content similarity between the audio segment in the first audio section and the determined audio segments is calculated. The content coherence is calculated as an average, the maximum or the minimum of the averages calculated for the audio segments in the first audio section. The content similarity may be calculated based on Dirichlet distribution.Type: ApplicationFiled: August 7, 2012Publication date: July 24, 2014Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Lie Lu, Mingqing Hu
-
Publication number: 20140149468Abstract: Described is a technology by which a playback list comprising similar songs is automatically built based on automatically detected/generated song attributes, such as by extracting numeric features of each song. The attributes may be downloaded from a remote connection, and/or may be locally generated on the playback device. To build a playlist, a seed song's attributes may be compared against attributes of other songs to determine which other songs are similar to the seed song and thus included in the playlist. Another way to build a playlist is based on similarity of songs to a set of user provided-attributes, such as corresponding to moods or usage modes such as “resting” “reading” “jogging” or “driving” moods/modes. The playlist may be dynamically adjusted based on user interaction with the device, such as when a user skips a song, queues a song, or dequeues a song.Type: ApplicationFiled: February 3, 2014Publication date: May 29, 2014Applicant: MICROSOFT CORPORATIONInventors: Lie Lu, Frank Torsten Bernd Seide, Gabriel White
-
Patent number: 8642872Abstract: Described is a technology by which a playback list comprising similar songs is automatically built based on automatically detected/generated song attributes, such as by extracting numeric features of each song. The attributes may be downloaded from a remote connection, and/or may be locally generated on the playback device. To build a playlist, a seed song's attributes may be compared against attributes of other songs to determine which other songs are similar to the seed song and thus included in the playlist. Another way to build a playlist is based on similarity of songs to a set of user provided-attributes, such as corresponding to moods or usage modes such as “resting” “reading” “jogging” or “driving” moods/modes. The playlist may be dynamically adjusted based on user interaction with the device, such as when a user skips a song, queues a song, or dequeues a song.Type: GrantFiled: June 4, 2008Date of Patent: February 4, 2014Assignee: Microsoft CorporationInventors: Lie Lu, Frank Torsten Bernd Seide, Gabriel White
-
Patent number: 8595009Abstract: Methods and apparatuses for performing song detection on an audio signal are described. Clips of the audio signal are classified into classes comprising music. Class boundaries of music clips are detected as candidate boundaries of a first type. Combinations including non-overlapped sections are derived. Each section meets the following conditions: 1) including at least one music segment longer than a predetermined minimum song duration, 2) shorter than a predetermined maximum song duration, 3) both starting and ending with a music clip, and 4) a proportion of the music clips in each of the sections is greater than a predetermined minimum proportion. In this way, various possible song partitions in the audio signal can be obtained for investigation.Type: GrantFiled: July 26, 2012Date of Patent: November 26, 2013Assignee: Dolby Laboratories Licensing CorporationInventors: Lie Lu, Claus Bauer
-
Patent number: 8572088Abstract: Automated rich presentation of a semantic topic is described. In one aspect, respective portions of multimodal information corresponding to a semantic topic are evaluated to locate events associated with the semantic topic. The probability that a document belongs to an event is determined based on document inclusion of one or more of persons, times, locations, and keywords, and document distribution along a timeline associated with the event. For each event, one or more documents objectively determined to be substantially representative of the event are identified. One or more other types of media (e.g., video, images, etc.) related to the event are then extracted from the multimodal information. The representative documents and the other media are for presentation to a user in a storyboard.Type: GrantFiled: October 21, 2005Date of Patent: October 29, 2013Assignee: Microsoft CorporationInventors: Lie Lu, Wei-Ying Ma, Zhiwei Li
-
Publication number: 20130279740Abstract: Embodiments of identifying multimedia objects based on multimedia fingerprints are provided. Query fingerprints are derived from a multimedia object according to differing fingerprint algorithms. For each fingerprint algorithm, decisions are calculated through at least one classifier corresponding to the fingerprint algorithm based on the query fingerprint and reference fingerprints, the reference fingerprints being derived from reference multimedia objects according to the same fingerprint algorithm. Each of the decisions indicates a possibility that the query fingerprint and the reference fingerprint are not derived from the same multimedia content. For each of the reference multimedia objects, a distance is calculated as a weighted sum of the decisions relating to the reference fingerprints. The multimedia object is identified as matching the reference multimedia object with the smallest distance less than a threshold.Type: ApplicationFiled: April 1, 2013Publication date: October 24, 2013Applicant: Dolby Laboratories Licensing CorporationInventors: Claus Bauer, Lie Lu, Mingqing Hu
-
Publication number: 20130058488Abstract: Embodiments for audio classification are described. An audio classification system includes at least one device which executes a process of audio classification on an audio signal. The at least one device can operate in at least two modes requiring different resources. The audio classification system also includes a complexity controller which determines a combination and instructs the at least one device to operate according to the combination. For each of the at least one device, the combination specifies one of the modes of the device, and the resources requirement of the combination does not exceed maximum available resources. By controlling the modes, the audio classification system has improved scalability to an execution environment.Type: ApplicationFiled: August 22, 2012Publication date: March 7, 2013Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Bin Cheng, Lie Lu
-
Publication number: 20130046536Abstract: Methods and apparatuses for performing song detection on an audio signal are described. Clips of the audio signal are classified into classes comprising music. Class boundaries of music clips are detected as candidate boundaries of a first type. Combinations including non-overlapped sections are derived. Each section meets the following conditions: 1) including at least one music segment longer than a predetermined minimum song duration, 2) shorter than a predetermined maximum song duration, 3) both starting and ending with a music clip, and 4) a proportion of the music clips in each of the sections is greater than a predetermined minimum proportion. In this way, various possible song partitions in the audio signal can be obtained for investigation.Type: ApplicationFiled: July 26, 2012Publication date: February 21, 2013Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Lie Lu, Claus Bauer
-
Publication number: 20130046399Abstract: Methods and apparatus for detecting a repetitive pattern in a sequence of audio frames are described. Similarity values of a first similarity matrix with first resolution for the sequence are calculated. An adaptive threshold is estimated from the similarity values for classifying the similarity values into repetition or non-repetition. For each of one or more offsets of a second similarity matrix with second resolution higher that the first resolution, similarity values of the second similarity matrix corresponding to the offset are calculated. Then the calculated similarity values are binarized with the adaptive threshold to obtain binarized data. Finally, the repetitive pattern is detected from the binarized data. The requirement on memory may be reduced because less data are stored in detecting the repetitive pattern.Type: ApplicationFiled: August 1, 2012Publication date: February 21, 2013Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Lie Lu, Bin cheng
-
Publication number: 20120101815Abstract: Described is a technology by which a user hums, sings or otherwise plays a user-provided rendition of a ringtone (or ringback tone) through a mobile telephone to a ringtone search service (e.g., a WAP, interactive voice response or SMS-based search platform). The service matches features of the user's rendition against features of actual ringtones to determine one or more matching candidate ringtones for downloading. Features may include pitch contours (up or down), pitch intervals and durations of notes. Matching candidates may be ranked based on the determined similarity, possibly in conjunction with weighting criterion such as the popularity of the ringtone and/or the importance of the matched part. The candidate set may be augmented with other ringtones independent of the matching, such as the most popular ones downloaded by other users, ringtones from similar artists, and so forth.Type: ApplicationFiled: December 29, 2011Publication date: April 26, 2012Applicant: Microsoft CorporationInventors: Lie LU, Yutao XIE, Sing XIE, Jiafan OU, Ruihao WENG