Patents by Inventor Changxue Ma

Changxue Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20120095764
    Abstract: A method of performing a search of a database of speakers, includes: receiving a query speech sample spoken by a query speaker; deriving a query utterance from the query speech sample; extracting query utterance statistics from the query utterance; performing Kernelized Locality-Sensitive Hashing (KLSH) using a kernel function, the KLSH using as input the query utterance statistics and utterance statistics extracted from a plurality of utterances included in a database of speakers in order to select a subset of the plurality of utterances; and comparing, using an utterance comparison equation, the query utterance statistics to the utterance statistics for each utterance in the subset to generate a list of speakers from the database of utterances having a highest similarity to the query speaker.
    Type: Application
    Filed: October 19, 2010
    Publication date: April 19, 2012
    Applicant: MOTOROLA, INC.
    Inventors: WOOJAY JEON, YAN-MING CHEN, CHANGXUE MA, DUSAN MACHO
  • Patent number: 8049093
    Abstract: During operation, a “coarse search” stage applies variable-scale windowing on the query pitch contours to compare them with fixed-length segments of target pitch contours to find matching candidates while efficiently scanning over variable tempo differences and target locations. Because the target segments are of fixed-length, this has the effect of drastically reducing the storage space required in a prior-art method. Furthermore, by breaking the query contours into parts, rhythmic inconsistencies can be more flexibly handled. Normalization is also applied to the contours to allow comparisons independent of differences in musical key. In a “fine search” stage, a “segmental” dynamic time warping (DTW) method is applied that calculates a more accurate similarity score between the query and each candidate target with more explicit consideration toward rhythmic inconsistencies.
    Type: Grant
    Filed: December 30, 2009
    Date of Patent: November 1, 2011
    Assignee: Motorola Solutions, Inc.
    Inventors: Woojay Jeon, Changxue Ma
  • Patent number: 8041700
    Abstract: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.
    Type: Grant
    Filed: April 7, 2009
    Date of Patent: October 18, 2011
    Assignee: Motorola Mobility, Inc.
    Inventor: Changxue Ma
  • Patent number: 8019604
    Abstract: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with other content that is also stored within the device. The UDS engine retrieves a number of uniterms from the audio data and associates the uniterms with the stored content. When a voice search is initiated at the device, the UDS engine generates a statistical latent lattice model from the voice query and scores the uniterms from the audio database against the latent lattice model. Following a further refinement, the best group of uniterms is then determined and segments of the stored audio data and/or other content corresponding to the best group of uniterms are outputted.
    Type: Grant
    Filed: December 21, 2007
    Date of Patent: September 13, 2011
    Assignee: Motorola Mobility, Inc.
    Inventor: Changxue Ma
  • Patent number: 8015005
    Abstract: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.
    Type: Grant
    Filed: February 15, 2008
    Date of Patent: September 6, 2011
    Assignee: Motorola Mobility, Inc.
    Inventor: Changxue Ma
  • Patent number: 7983428
    Abstract: A communication device includes: (1) a wireless adapter at which a wireless headset is communicatively connected to the communication device and at which is received a first acoustic input that includes a speech input and a first ambient noise input; (2) a microphone that receives a second acoustic input, which includes a second ambient noise input; and (3) a dual-channel adaptive noise canceller that utilizes the second ambient noise input to filter the first ambient noise input out of the first acoustic input to generate an acoustic output that primarily comprises the speech input.
    Type: Grant
    Filed: May 9, 2007
    Date of Patent: July 19, 2011
    Assignee: Motorola Mobility, Inc.
    Inventors: Changxue Ma, Chen Liu
  • Publication number: 20110154977
    Abstract: During operation, a “coarse search” stage applies variable-scale windowing on the query pitch contours to compare them with fixed-length segments of target pitch contours to find matching candidates while efficiently scanning over variable tempo differences and target locations. Because the target segments are of fixed-length, this has the effect of drastically reducing the storage space required in a prior-art method. Furthermore, by breaking the query contours into parts, rhythmic inconsistencies can be more flexibly handled. Normalization is also applied to the contours to allow comparisons independent of differences in musical key. In a “fine search” stage, a “segmental” dynamic time warping (DTW) method is applied that calculates a more accurate similarity score between the query and each candidate target with more explicit consideration toward rhythmic inconsistencies.
    Type: Application
    Filed: December 30, 2009
    Publication date: June 30, 2011
    Applicant: MOTOROLA, INC.
    Inventors: Woojay Jeon, Changxue Ma
  • Publication number: 20110145214
    Abstract: A search system will receive a voice query and use speech recognition with a predefined vocabulary to generate a textual transcription of the voice query. Queries are sent to a text search engine, retrieving multiple web page results for each of these initial text queries. The collection of the keywords is extracted from the resulting web pages and is phonetically indexed to form a voice query dependent and phonetically searchable index database. Finally, a phonetically-based voice search engine is used to search the original voice query against the voice query dependent and phonetically searchable index database to find the keywords and/or key phrases that best match what was originally spoken. The keywords and/or key phrases that best match what was originally spoken are then used as a final text query for a search engine. Search results from the final text query are then presented to the user.
    Type: Application
    Filed: December 16, 2009
    Publication date: June 16, 2011
    Applicant: MOTOROLA, INC.
    Inventors: Fan Zhang, Yan-Ming Cheng, Changxue Ma, James R. Talley
  • Publication number: 20110144996
    Abstract: Disclosed is a method for parsing a verbal expression received from a user to determine whether or not the expression contains a multiple-goal command. Specifically, known techniques are applied to extract terms from the verbal expression. The extracted terms are assigned to categories. If two or more terms are found in the parsed verbal expression that are in associated categories and that do not overlap one another temporally, then the confidence levels of these terms are compared. If the confidence levels are similar, then the terms may be parallel entries in the verbal expression and may represent multiple goals. If a multiple-goal command is found, then the command is either presented to the user for review and possible editing or is executed. If the parsed multiple-goal command is presented to the user for review, then the presentation can be made via any appropriate interface including voice and text interfaces.
    Type: Application
    Filed: December 16, 2009
    Publication date: June 16, 2011
    Applicant: MOTOROLA, INC.
    Inventors: Changxue Ma, Yan-Ming Cheng
  • Publication number: 20110071826
    Abstract: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from the N-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings are sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.
    Type: Application
    Filed: September 23, 2009
    Publication date: March 24, 2011
    Applicant: MOTOROLA, INC.
    Inventors: Changxue Ma, Harry M. Bliss
  • Publication number: 20100257166
    Abstract: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.
    Type: Application
    Filed: April 7, 2009
    Publication date: October 7, 2010
    Applicant: MOTOROLA, INC.
    Inventor: Changxue MA
  • Publication number: 20100218141
    Abstract: An electronic device including a processor communicably coupled to a display component wherein the processor is configured to generate and display an interactive icon on the display component. The interactive icon includes a primary item and at least one alternative item, and the processor is configured to visually prioritize the presentation of the primary item on the display component relative to the presentation of the alternative item.
    Type: Application
    Filed: February 23, 2009
    Publication date: August 26, 2010
    Applicant: MOTOROLA, INC.
    Inventors: SHUANG XU, Changxue Ma
  • Publication number: 20100153112
    Abstract: Disclosed are editing methods that are added to speech-based searching to allow users to better understand textual queries submitted to a search engine and to easily edit their speech queries. According to some embodiments, the user begins to speak. The user's speech is translated into a textual query and submitted to a search engine. The results of the search are presented to the user. As the user continues to speak, the user's speech query is refined based on the user's further speech. The refined speech query is converted to a textual query which is again submitted to the search engine. The refined results are presented to the user. This process continues as long as the user continues to refine the query. Some embodiments present the textual query to the user and allow the user to use both speech-based and non-speech-based tools to edit the textual query.
    Type: Application
    Filed: December 16, 2008
    Publication date: June 17, 2010
    Applicant: MOTOROLA, INC.
    Inventors: W. Garland Phillips, Harry M. Bliss, Bashar Jano, Changxue Ma
  • Publication number: 20100137030
    Abstract: Disclosed is a technique for presenting audible items to a user in a manner that allows the user to easily distinguish them and to select from among them. A number of audible items are rendered simultaneously to the user. To prevent the sounds from blending together into a sonic mishmash, some of the items are “conditioned” while they are being rendered. For example, one audible item might be rendered more quietly than another, or one item can be moved up in register compared with another. Some embodiments combine audible conditioning with visual avatars portrayed on, for example, a display screen of a user device. During the rendering, each audible item is paired with an avatar, the pairing based on some suitable criterion, such as a type of conditioning applied to the audible item. Audible spatial placement is mimicked by visual placement of the avatars on the user's display screen.
    Type: Application
    Filed: December 2, 2008
    Publication date: June 3, 2010
    Applicant: MOTOROLA, INC.
    Inventor: Changxue Ma
  • Patent number: 7650445
    Abstract: A portable electronic communication device, designed for voice and data communication is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device.
    Type: Grant
    Filed: September 12, 2007
    Date of Patent: January 19, 2010
    Assignee: Motorola, Inc.
    Inventors: Changxue Ma, Wei Lin, Li-Xin Zhen
  • Publication number: 20090259469
    Abstract: A method and apparatus for performing speech recognition receives an audio signal, generates a sequence of frames of the audio signal, transforms each frame of the audio signal into a set of narrow band feature vectors using a narrow passband, couples the narrow band feature vectors to a speech model, and determines whether the audio signal is a wide band signal. When the audio signal is determined to be a wide band signal, a pass band parameter of each of one or more passbands that are outside the narrow passband is generated for each frame and the one or more band energy parameters are coupled to the speech model.
    Type: Application
    Filed: April 14, 2008
    Publication date: October 15, 2009
    Applicant: MOTOROLA, INC.
    Inventors: Changxue Ma, Yuan-Jun Wei
  • Publication number: 20090210226
    Abstract: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.
    Type: Application
    Filed: February 15, 2008
    Publication date: August 20, 2009
    Inventor: Changxue Ma
  • Publication number: 20090172546
    Abstract: A method, apparatus, and electronic device for voice navigation are disclosed. A voice input mechanism 310 may receive a verbal input from a user to a voice user interface program invisible to the user. A processor 104 may identify in a graphical user interface (GUI) a set of GUI items. The processor 104 may convert the set of GUI items to a set of voice searchable indices 400. The processor 104 may correlate a matching GUI item of the set of GUI items to a phonemic representation of the verbal input.
    Type: Application
    Filed: May 23, 2008
    Publication date: July 2, 2009
    Applicant: Motorola, Inc.
    Inventors: Yan Ming CHANG, Changxue Ma, Ted Mazurkiewicz
  • Publication number: 20090164218
    Abstract: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with other content that is also stored within the device. The UDS engine retrieves a number of uniterms from the audio data and associates the uniterms with the stored content. When a voice search is initiated at the device, the UDS engine generates a statistical latent lattice model from the voice query and scores the uniterms from the audio database against the latent lattice model. Following a further refinement, the best group of uniterms is then determined and segments of the stored audio data and/or other content corresponding to the best group of uniterms are outputted.
    Type: Application
    Filed: December 21, 2007
    Publication date: June 25, 2009
    Applicant: Motorola,Inc.
    Inventor: CHANGXUE MA
  • Publication number: 20090089059
    Abstract: A method and apparatus for enabling multimodal tags in a communication device is disclosed. The method comprises receiving a first training signal and receiving a second training signal in conjunction with the first training signal. A multimodal tag is created to represent a combination of the first training signal and the second training signal and a function is associated with the created multimodal tag.
    Type: Application
    Filed: September 28, 2007
    Publication date: April 2, 2009
    Applicant: MOTOROLA, INC.
    Inventors: Changxue Ma, Harry M. Bliss