Patents by Inventor Changxue Ma

Changxue Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS FOR CREATING AND SEARCHING A DATABASE OF SPEAKERS

Publication number: 20120095764

Abstract: A method of performing a search of a database of speakers, includes: receiving a query speech sample spoken by a query speaker; deriving a query utterance from the query speech sample; extracting query utterance statistics from the query utterance; performing Kernelized Locality-Sensitive Hashing (KLSH) using a kernel function, the KLSH using as input the query utterance statistics and utterance statistics extracted from a plurality of utterances included in a database of speakers in order to select a subset of the plurality of utterances; and comparing, using an utterance comparison equation, the query utterance statistics to the utterance statistics for each utterance in the subset to generate a list of speakers from the database of utterances having a highest similarity to the query speaker.

Type: Application

Filed: October 19, 2010

Publication date: April 19, 2012

Applicant: MOTOROLA, INC.

Inventors: WOOJAY JEON, YAN-MING CHEN, CHANGXUE MA, DUSAN MACHO

Method and apparatus for best matching an audible query to a set of audible targets

Patent number: 8049093

Abstract: During operation, a “coarse search” stage applies variable-scale windowing on the query pitch contours to compare them with fixed-length segments of target pitch contours to find matching candidates while efficiently scanning over variable tempo differences and target locations. Because the target segments are of fixed-length, this has the effect of drastically reducing the storage space required in a prior-art method. Furthermore, by breaking the query contours into parts, rhythmic inconsistencies can be more flexibly handled. Normalization is also applied to the contours to allow comparisons independent of differences in musical key. In a “fine search” stage, a “segmental” dynamic time warping (DTW) method is applied that calculates a more accurate similarity score between the query and each candidate target with more explicit consideration toward rhythmic inconsistencies.

Type: Grant

Filed: December 30, 2009

Date of Patent: November 1, 2011

Assignee: Motorola Solutions, Inc.

Inventors: Woojay Jeon, Changxue Ma
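The "fine search" idea in the abstract above can be illustrated with plain dynamic time warping over key-normalized pitch contours. This is a minimal sketch under stated assumptions, not the patented "segmental" DTW: the function names `normalize_key` and `dtw_distance`, the use of mean subtraction as the key normalization, and the semitone-valued sample contours are all hypothetical choices made for illustration.

```python
from typing import List


def normalize_key(contour: List[float]) -> List[float]:
    """Subtract the mean pitch so transposed melodies compare equal (assumed normalization)."""
    mean = sum(contour) / len(contour)
    return [p - mean for p in contour]


def dtw_distance(query: List[float], target: List[float]) -> float:
    """Classic DTW cost between two contours, using absolute pitch difference per step."""
    q, t = normalize_key(query), normalize_key(target)
    inf = float("inf")
    # cost[i][j] = best cumulative cost of aligning q[:i] with t[:j]
    cost = [[inf] * (len(t) + 1) for _ in range(len(q) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(q) + 1):
        for j in range(1, len(t) + 1):
            d = abs(q[i - 1] - t[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[len(q)][len(t)]


# Hypothetical contours in semitones: the query is the target transposed up by
# 5 semitones, with one note held longer (a rhythmic inconsistency).
target = [60.0, 62.0, 64.0, 62.0, 60.0]
query = [65.0, 67.0, 67.0, 69.0, 67.0, 65.0]
other = [60.0, 60.0, 72.0, 72.0, 60.0]

print("query vs target:", dtw_distance(query, target))
print("query vs other: ", dtw_distance(query, other))
```

In this sketch the transposed, slightly stretched query scores much closer to `target` than to `other`, which is the behavior the abstract attributes to key normalization combined with time warping.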
Content item retrieval based on a free text entry

Patent number: 8041700

Abstract: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.

Type: Grant

Filed: April 7, 2009

Date of Patent: October 18, 2011

Assignee: Motorola Mobility, Inc.

Inventor: Changxue Ma
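The letter-by-letter loop described in the abstract above can be sketched as follows. The scoring rule is an assumption, since the abstract does not give one: a word scores the fraction of its letters already typed if it starts with the typed prefix, and a title takes the score of its best word. The names `score_words` and `rank_titles` and the sample catalogue are hypothetical.

```python
from typing import Dict, List


def score_words(prefix: str, vocabulary: List[str]) -> Dict[str, float]:
    """Score each word against the letters typed so far (assumed rule: prefix match,
    scored as the fraction of the word that has already been typed)."""
    prefix = prefix.lower()
    scores = {}
    for word in vocabulary:
        if prefix and word.lower().startswith(prefix):
            scores[word] = len(prefix) / len(word)
    return scores


def rank_titles(prefix: str, titles: List[str], top_n: int = 3) -> List[str]:
    """Associate words with their titles, score each title by its best word,
    and return the highest-scored titles."""
    vocabulary = sorted({w for title in titles for w in title.split()})
    word_scores = score_words(prefix, vocabulary)
    title_scores = {}
    for title in titles:
        best = max((word_scores.get(w, 0.0) for w in title.split()), default=0.0)
        if best > 0.0:
            title_scores[title] = best
    # Sort by descending score, then alphabetically for a stable display order.
    return sorted(title_scores, key=lambda t: (-title_scores[t], t))[:top_n]


titles = ["Blue Moon", "Moonlight Sonata", "Blues for Alice"]  # hypothetical catalogue
for typed in ["m", "mo", "moonl"]:  # more letters arrive on each pass
    print(typed, rank_titles(typed, titles))
```

Each additional letter narrows the candidate set and re-scores the survivors, which mirrors the "another letter will again be received and the process repeated" step.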
Method and apparatus for uniterm discovery and voice-to-voice search on mobile device

Patent number: 8019604

Abstract: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with other content that is also stored within the device. The UDS engine retrieves a number of uniterms from the audio data and associates the uniterms with the stored content. When a voice search is initiated at the device, the UDS engine generates a statistical latent lattice model from the voice query and scores the uniterms from the audio database against the latent lattice model. Following a further refinement, the best group of uniterms is then determined and segments of the stored audio data and/or other content corresponding to the best group of uniterms are outputted.

Type: Grant

Filed: December 21, 2007

Date of Patent: September 13, 2011

Assignee: Motorola Mobility, Inc.

Inventor: Changxue Ma

Method and apparatus for voice searching for stored content using uniterm discovery

Patent number: 8015005

Abstract: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.

Type: Grant

Filed: February 15, 2008

Date of Patent: September 6, 2011

Assignee: Motorola Mobility, Inc.

Inventor: Changxue Ma

Noise reduction on wireless headset input via dual channel calibration within mobile phone

Patent number: 7983428

Abstract: A communication device includes: (1) a wireless adapter at which a wireless headset is communicatively connected to the communication device and at which is received a first acoustic input that includes a speech input and a first ambient noise input; (2) a microphone that receives a second acoustic input, which includes a second ambient noise input; and (3) a dual-channel adaptive noise canceller that utilizes the second ambient noise input to filter the first ambient noise input out of the first acoustic input to generate an acoustic output that primarily comprises the speech input.

Type: Grant

Filed: May 9, 2007

Date of Patent: July 19, 2011

Assignee: Motorola Mobility, Inc.

Inventors: Changxue Ma, Chen Liu
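A dual-channel adaptive noise canceller of the kind named in the abstract above is often built around a least-mean-squares (LMS) filter. The abstract does not say which adaptation algorithm the patent uses, so LMS is an assumption here, as are the function name `lms_cancel`, the tap count, the step size, and the synthetic signals. The sketch treats the headset channel as speech plus noise and the phone microphone as a speech-free noise reference.

```python
import math
import random
from typing import List


def lms_cancel(primary: List[float], reference: List[float],
               taps: int = 4, mu: float = 0.01) -> List[float]:
    """Subtract an adaptively filtered copy of `reference` from `primary`.

    primary   -- headset channel: speech plus ambient noise
    reference -- phone microphone channel: ambient noise only (assumed speech-free)
    """
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    output = []
    for d, x in zip(primary, reference):
        history = [x] + history[:-1]
        noise_estimate = sum(w * h for w, h in zip(weights, history))
        e = d - noise_estimate  # the error signal doubles as the cleaned output
        weights = [w + mu * e * h for w, h in zip(weights, history)]
        output.append(e)
    return output


def mse(a: List[float], b: List[float]) -> float:
    """Mean squared difference between two equal-length signals."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


# Synthetic demo: a sine "speech" signal plus scaled noise on the headset channel.
random.seed(0)
n = 4000
speech = [math.sin(2 * math.pi * 0.01 * i) for i in range(n)]
noise = [random.uniform(-1.0, 1.0) for _ in range(n)]
headset = [s + 0.5 * v for s, v in zip(speech, noise)]
cleaned = lms_cancel(headset, noise)

half = n // 2  # compare only after the filter has had time to adapt
print("residual noise power before:", mse(headset[half:], speech[half:]))
print("residual noise power after: ", mse(cleaned[half:], speech[half:]))
```

The error signal serves as the output because, once the weights converge, whatever remains after subtracting the noise estimate is mostly the speech component.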
METHOD AND APPARATUS FOR BEST MATCHING AN AUDIBLE QUERY TO A SET OF AUDIBLE TARGETS

Publication number: 20110154977

Abstract: During operation, a “coarse search” stage applies variable-scale windowing on the query pitch contours to compare them with fixed-length segments of target pitch contours to find matching candidates while efficiently scanning over variable tempo differences and target locations. Because the target segments are of fixed-length, this has the effect of drastically reducing the storage space required in a prior-art method. Furthermore, by breaking the query contours into parts, rhythmic inconsistencies can be more flexibly handled. Normalization is also applied to the contours to allow comparisons independent of differences in musical key. In a “fine search” stage, a “segmental” dynamic time warping (DTW) method is applied that calculates a more accurate similarity score between the query and each candidate target with more explicit consideration toward rhythmic inconsistencies.

Type: Application

Filed: December 30, 2009

Publication date: June 30, 2011

Applicant: MOTOROLA, INC.

Inventors: Woojay Jeon, Changxue Ma

VOICE WEB SEARCH

Publication number: 20110145214

Abstract: A search system will receive a voice query and use speech recognition with a predefined vocabulary to generate a textual transcription of the voice query. Queries are sent to a text search engine, retrieving multiple web page results for each of these initial text queries. The collection of the keywords is extracted from the resulting web pages and is phonetically indexed to form a voice query dependent and phonetically searchable index database. Finally, a phonetically-based voice search engine is used to search the original voice query against the voice query dependent and phonetically searchable index database to find the keywords and/or key phrases that best match what was originally spoken. The keywords and/or key phrases that best match what was originally spoken are then used as a final text query for a search engine. Search results from the final text query are then presented to the user.

Type: Application

Filed: December 16, 2009

Publication date: June 16, 2011

Applicant: MOTOROLA, INC.

Inventors: Fan Zhang, Yan-Ming Cheng, Changxue Ma, James R. Talley

ANALYZING AND PROCESSING A VERBAL EXPRESSION CONTAINING MULTIPLE GOALS

Publication number: 20110144996

Abstract: Disclosed is a method for parsing a verbal expression received from a user to determine whether or not the expression contains a multiple-goal command. Specifically, known techniques are applied to extract terms from the verbal expression. The extracted terms are assigned to categories. If two or more terms are found in the parsed verbal expression that are in associated categories and that do not overlap one another temporally, then the confidence levels of these terms are compared. If the confidence levels are similar, then the terms may be parallel entries in the verbal expression and may represent multiple goals. If a multiple-goal command is found, then the command is either presented to the user for review and possible editing or is executed. If the parsed multiple-goal command is presented to the user for review, then the presentation can be made via any appropriate interface including voice and text interfaces.

Type: Application

Filed: December 16, 2009

Publication date: June 16, 2011

Applicant: MOTOROLA, INC.

Inventors: Changxue Ma, Yan-Ming Cheng

METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY

Publication number: 20110071826

Abstract: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example, a set of search strings may be created from the n-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings is sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.

Type: Application

Filed: September 23, 2009

Publication date: March 24, 2011

Applicant: MOTOROLA, INC.

Inventors: Changxue Ma, Harry M. Bliss
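The search-string step in the abstract above (unigrams and bigrams, ordered and truncated by recognizer confidence) can be sketched in a few lines. This is an illustration only: a single recognized word sequence stands in for a full word lattice, a bigram's confidence is assumed to be the product of its two word confidences, and the function name `build_search_strings` and the sample recognizer output are hypothetical. The later semantic reordering of search results is not covered.

```python
from typing import List, Tuple


def build_search_strings(words: List[Tuple[str, float]], max_strings: int = 4) -> List[str]:
    """Build unigram and bigram search strings, ordered by confidence and truncated.

    `words` is one recognized path of (word, confidence) pairs, a simplified
    stand-in for a word lattice. A bigram's confidence is assumed to be the
    product of its two word confidences.
    """
    candidates = [(conf, word) for word, conf in words]
    for (w1, c1), (w2, c2) in zip(words, words[1:]):
        candidates.append((c1 * c2, f"{w1} {w2}"))
    # Highest confidence first; ties broken alphabetically for determinism.
    candidates.sort(key=lambda item: (-item[0], item[1]))
    return [text for _, text in candidates[:max_strings]]


recognized = [("pizza", 0.9), ("near", 0.6), ("chicago", 0.8)]  # hypothetical recognizer output
print(build_search_strings(recognized))
```

Truncating after sorting means low-confidence n-grams, which are the ones most likely to be recognition errors, never reach the search engine.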
CONTENT ITEM RETRIEVAL BASED ON A FREE TEXT ENTRY

Publication number: 20100257166

Abstract: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.

Type: Application

Filed: April 7, 2009

Publication date: October 7, 2010

Applicant: MOTOROLA, INC.

Inventor: Changxue MA

VIRTUAL SPHERE INPUT CONTROLLER FOR ELECTRONICS DEVICE

Publication number: 20100218141

Abstract: An electronic device including a processor communicably coupled to a display component wherein the processor is configured to generate and display an interactive icon on the display component. The interactive icon includes a primary item and at least one alternative item, and the processor is configured to visually prioritize the presentation of the primary item on the display component relative to the presentation of the alternative item.

Type: Application

Filed: February 23, 2009

Publication date: August 26, 2010

Applicant: MOTOROLA, INC.

Inventors: SHUANG XU, Changxue Ma

PROGRESSIVELY REFINING A SPEECH-BASED SEARCH

Publication number: 20100153112

Abstract: Disclosed are editing methods that are added to speech-based searching to allow users to better understand textual queries submitted to a search engine and to easily edit their speech queries. According to some embodiments, the user begins to speak. The user's speech is translated into a textual query and submitted to a search engine. The results of the search are presented to the user. As the user continues to speak, the user's speech query is refined based on the user's further speech. The refined speech query is converted to a textual query which is again submitted to the search engine. The refined results are presented to the user. This process continues as long as the user continues to refine the query. Some embodiments present the textual query to the user and allow the user to use both speech-based and non-speech-based tools to edit the textual query.

Type: Application

Filed: December 16, 2008

Publication date: June 17, 2010

Applicant: MOTOROLA, INC.

Inventors: W. Garland Phillips, Harry M. Bliss, Bashar Jano, Changxue Ma

FILTERING A LIST OF AUDIBLE ITEMS

Publication number: 20100137030

Abstract: Disclosed is a technique for presenting audible items to a user in a manner that allows the user to easily distinguish them and to select from among them. A number of audible items are rendered simultaneously to the user. To prevent the sounds from blending together into a sonic mishmash, some of the items are “conditioned” while they are being rendered. For example, one audible item might be rendered more quietly than another, or one item can be moved up in register compared with another. Some embodiments combine audible conditioning with visual avatars portrayed on, for example, a display screen of a user device. During the rendering, each audible item is paired with an avatar, the pairing based on some suitable criterion, such as a type of conditioning applied to the audible item. Audible spatial placement is mimicked by visual placement of the avatars on the user's display screen.

Type: Application

Filed: December 2, 2008

Publication date: June 3, 2010

Applicant: MOTOROLA, INC.

Inventor: Changxue Ma

System and method for enabling a mobile device as a portable character input peripheral device

Patent number: 7650445

Abstract: A portable electronic communication device, designed for voice and data communication, is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device.

Type: Grant

Filed: September 12, 2007

Date of Patent: January 19, 2010

Assignee: Motorola, Inc.

Inventors: Changxue Ma, Wei Lin, Li-Xin Zhen

METHOD AND APPARATUS FOR SPEECH RECOGNITION

Publication number: 20090259469

Abstract: A method and apparatus for performing speech recognition receives an audio signal, generates a sequence of frames of the audio signal, transforms each frame of the audio signal into a set of narrow band feature vectors using a narrow passband, couples the narrow band feature vectors to a speech model, and determines whether the audio signal is a wide band signal. When the audio signal is determined to be a wide band signal, a pass band parameter of each of one or more passbands that are outside the narrow passband is generated for each frame and the one or more band energy parameters are coupled to the speech model.

Type: Application

Filed: April 14, 2008

Publication date: October 15, 2009

Applicant: MOTOROLA, INC.

Inventors: Changxue Ma, Yuan-Jun Wei

Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery

Publication number: 20090210226

Abstract: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.

Type: Application

Filed: February 15, 2008

Publication date: August 20, 2009

Inventor: Changxue Ma

SEARCH-BASED DYNAMIC VOICE ACTIVATION

Publication number: 20090172546

Abstract: A method, apparatus, and electronic device for voice navigation are disclosed. A voice input mechanism 310 may receive a verbal input from a user to a voice user interface program invisible to the user. A processor 104 may identify in a graphical user interface (GUI) a set of GUI items. The processor 104 may convert the set of GUI items to a set of voice searchable indices 400. The processor 104 may correlate a matching GUI item of the set of GUI items to a phonemic representation of the verbal input.

Type: Application

Filed: May 23, 2008

Publication date: July 2, 2009

Applicant: Motorola, Inc.

Inventors: Yan Ming CHANG, Changxue Ma, Ted Mazurkiewicz

METHOD AND APPARATUS FOR UNITERM DISCOVERY AND VOICE-TO-VOICE SEARCH ON MOBILE DEVICE

Publication number: 20090164218

Abstract: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with other content that is also stored within the device. The UDS engine retrieves a number of uniterms from the audio data and associates the uniterms with the stored content. When a voice search is initiated at the device, the UDS engine generates a statistical latent lattice model from the voice query and scores the uniterms from the audio database against the latent lattice model. Following a further refinement, the best group of uniterms is then determined and segments of the stored audio data and/or other content corresponding to the best group of uniterms are outputted.

Type: Application

Filed: December 21, 2007

Publication date: June 25, 2009

Applicant: Motorola, Inc.

Inventor: CHANGXUE MA

METHOD AND APPARATUS FOR ENABLING MULTIMODAL TAGS IN A COMMUNICATION DEVICE

Publication number: 20090089059

Abstract: A method and apparatus for enabling multimodal tags in a communication device is disclosed. The method comprises receiving a first training signal and receiving a second training signal in conjunction with the first training signal. A multimodal tag is created to represent a combination of the first training signal and the second training signal and a function is associated with the created multimodal tag.

Type: Application

Filed: September 28, 2007

Publication date: April 2, 2009

Applicant: MOTOROLA, INC.

Inventors: Changxue Ma, Harry M. Bliss
