Patents by Inventor Chienchung Chang
Chienchung Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20060140493
Abstract: This disclosure is directed to encoding techniques that can be used to improve encoding of digital video data. The techniques can be implemented by an encoder of a digital video device in order to reduce the number of computations and possibly reduce power consumption during video encoding. More specifically, video encoding techniques are described which utilize one or more programmable thresholds in order to terminate the execution of various computations when the computations would be unlikely to improve the encoding. By terminating computations prematurely, the amount of processing required for video encoding can be reduced, and power can be conserved.
Type: Application
Filed: February 28, 2006
Publication date: June 29, 2006
Inventors: Anthony Mauro, Chienchung Chang, King-Chung Lai
-
Patent number: 7039246
Abstract: This disclosure is directed to encoding techniques that can be used to improve encoding of digital video data. The techniques can be implemented by an encoder of a digital video device in order to reduce the number of computations and possibly reduce power consumption during video encoding. More specifically, video encoding techniques are described which utilize one or more programmable thresholds in order to terminate the execution of various computations when the computations would be unlikely to improve the encoding. By terminating computations prematurely, the amount of processing required for video encoding can be reduced, and power can be conserved.
Type: Grant
Filed: May 3, 2002
Date of Patent: May 2, 2006
Assignee: Qualcomm Incorporated
Inventors: Anthony Patrick Mauro, II, Chienchung Chang, King-Chung Lai
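Illustrative sketch (not taken from the patent): the abstract does not say which computations are terminated or how the thresholds are set. As an assumption, the example below applies the idea to Sum of Absolute Difference (SAD) block matching, abandoning a candidate as soon as its running total exceeds a threshold. The function names `sad_with_early_exit` and `best_match` are hypothetical.

```python
def sad_with_early_exit(block, candidate, threshold):
    """Sum of absolute differences between two equal-sized pixel blocks.

    Returns None as soon as the running total exceeds `threshold`,
    because the candidate can no longer improve on the best match.
    """
    total = 0
    for row_a, row_b in zip(block, candidate):
        for a, b in zip(row_a, row_b):
            total += abs(a - b)
        # Assumption: check once per row rather than once per pixel.
        if total > threshold:
            return None
    return total


def best_match(block, candidates):
    """Return (index, SAD) of the closest candidate block."""
    best_index, best_sad = None, float("inf")
    for index, candidate in enumerate(candidates):
        # The threshold tightens as better matches are found.
        sad = sad_with_early_exit(block, candidate, best_sad)
        if sad is not None and sad < best_sad:
            best_index, best_sad = index, sad
    return best_index, best_sad
```

Here the threshold is simply the best SAD found so far; a programmable threshold, as the abstract describes, could equally be a fixed value supplied by the caller.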
-
Patent number: 7024359
Abstract: A voice recognition system applies speaker-dependent modification functions to acoustic feature vectors prior to voice recognition pattern matching against a speaker-independent acoustic model. An adaptation engine matches a set of acoustic feature vectors X with an adaptation model to select a speaker-dependent feature vector modification function f( ), which is then applied to X to form a modified set of acoustic feature vectors f(X). Voice recognition is then performed by correlating the modified acoustic feature vectors f(X) with a speaker-independent acoustic model.
Type: Grant
Filed: January 31, 2001
Date of Patent: April 4, 2006
Assignee: Qualcomm Incorporated
Inventors: Chienchung Chang, Naren Malayath, Byron Yoshio Yafuso
-
Patent number: 6941265
Abstract: Generally stated, a method and an accompanying apparatus provide for a voice recognition system (300) with a programmable front end processing unit (400). The front end processing unit (400) requests and receives different configuration files at different times for processing voice data in the voice recognition system (300). The configuration files are communicated to the front end unit via a communication link (310) for configuring the front end processing unit (400). A microprocessor may provide the front end configuration files on the communication link at different times.
Type: Grant
Filed: December 14, 2001
Date of Patent: September 6, 2005
Assignee: Qualcomm Inc
Inventors: Ning Bi, Andrew DeJaco, Xin Zhong, Chienchung Chang, Chuck Han, Hari Garudadri, Naren Malayath, Suhail Jalil
-
Patent number: 6836758
Abstract: A method and system for speech recognition combines different types of engines in order to recognize user-defined digits and control words, predefined digits and control words, and nametags. Speaker-independent engines are combined with speaker-dependent engines. A Hidden Markov Model (HMM) engine is combined with Dynamic Time Warping (DTW) engines.
Type: Grant
Filed: January 9, 2001
Date of Patent: December 28, 2004
Assignee: Qualcomm Incorporated
Inventors: Ning Bi, Andrew P. DeJaco, Harinath Garudadri, Chienchung Chang, William Yee-Ming Huang, Narendranath Malayath, Suhail Jalil, David Puig Oses, Yingyong Qi
-
Publication number: 20040151242
Abstract: A device is described that includes an encoder/decoder (CODEC) in which functionality is partitioned between a video front end (VFE) and a video back end (VBE). The VFE encapsulates functionality and image processing operations to support a variety of applications, and presents a flexible inter-processor by which an external master device can easily control these operations. The video back end (VBE) operates as an encoder and/or a decoder to generate encoded and/or decoded video sequences. The VFE and VBE may execute within an operating environment provided by a slave device.
Type: Application
Filed: January 30, 2003
Publication date: August 5, 2004
Inventor: Chienchung Chang
-
Publication number: 20040008780
Abstract: This disclosure describes video encoding techniques capable of reducing the number of processing cycles and memory transfers necessary to encode a video sequence. In this manner, the disclosed video encoding techniques may increase video encoding speed and reduce power consumption. In general, the video encoding techniques make use of a candidate memory that stores video blocks in columns corresponding to a search space for a motion estimation routine. A memory control unit addresses the candidate memory to retrieve multiple pixels in parallel for simultaneous comparison to pixels in a video block to be encoded, e.g., using Sum of Absolute Difference (SAD) or Sum of Squared Difference (SSD) techniques. A difference processor performs the parallel calculations. In addition, for subsequent video blocks to be encoded, the candidate memory can be incrementally updated by loading a new column of video blocks, rather than reloading the entire search space.
Type: Application
Filed: February 21, 2003
Publication date: January 15, 2004
Inventors: King-Chung Lai, Gilbert Christopher Sih, Chienchung Chang, Anthony Patrick Mauro
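Illustrative sketch (not taken from the application): the incremental update described in the abstract can be pictured as a sliding window of columns, where moving to the next video block loads one new column and drops the oldest one instead of reloading the whole search space. The `CandidateMemory` class and its `columns_loaded` counter are hypothetical names, and a software deque is only an assumption standing in for the hardware memory.

```python
from collections import deque


class CandidateMemory:
    """Toy model of a search space stored as a fixed number of columns."""

    def __init__(self, columns):
        columns = list(columns)
        # maxlen makes append() discard the leftmost (oldest) column.
        self.columns = deque(columns, maxlen=len(columns))
        # Count column loads to compare against a full reload.
        self.columns_loaded = len(columns)

    def advance(self, new_column):
        """Shift the search space one column to the right."""
        self.columns.append(new_column)
        self.columns_loaded += 1
```

With a search space three columns wide, advancing by one block costs one column load rather than three, which is where the reduction in memory transfers would come from under this model.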
-
Publication number: 20040008779
Abstract: This disclosure describes video encoding techniques capable of reducing the number of processing cycles and memory transfers necessary to encode a video sequence. In this manner, the disclosed video encoding techniques may increase video encoding speed and reduce power consumption. In general, the video encoding techniques make use of a candidate memory that stores video blocks in columns corresponding to a search space for a motion estimation routine. A memory control unit addresses the candidate memory to retrieve multiple pixels in parallel for simultaneous comparison to pixels in a video block to be encoded, e.g., using Sum of Absolute Difference (SAD) or Sum of Squared Difference (SSD) techniques. A difference processor performs the parallel calculations. In addition, for subsequent video blocks to be encoded, the candidate memory can be incrementally updated by loading a new column of video blocks, rather than reloading the entire search space.
Type: Application
Filed: February 21, 2003
Publication date: January 15, 2004
Inventors: King Chung Lai, Gilbert Christopher Sih, Chienchung Chang, Anthony Patrick Mauro
-
Publication number: 20040001545
Abstract: Video encoding techniques are described that involve determining a number of processing cycles used during encoding of a video frame, and encoding the video frame without using motion estimation techniques when a number of processing cycles used during encoding exceeds a threshold. For example, the threshold may define a number of processing cycles available for encoding of the given video frame, and the determined number of processing cycles may be a counted number of cycles used during the encoding process of the given video frame. If the number of processing cycles used during encoding exceeds the threshold, motion estimation techniques can be discontinued in favor of less computationally intensive encoding techniques such as texture encoding.
Type: Application
Filed: June 28, 2002
Publication date: January 1, 2004
Inventor: Chienchung Chang
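Illustrative sketch (not taken from the application): one way to read the abstract is as a per-frame cycle budget, where blocks are encoded with motion estimation until the counted cycles exceed the threshold and with cheaper texture encoding afterwards. The function name `encode_frame` and the per-block costs `motion_cost` and `texture_cost` are made-up placeholders, not figures from the source.

```python
def encode_frame(blocks, cycle_budget, motion_cost=10, texture_cost=3):
    """Choose an encoding mode per block under a cycle budget.

    Returns the list of chosen modes and the total cycles counted.
    The costs are hypothetical stand-ins for a real cycle counter.
    """
    cycles_used = 0
    modes = []
    for _ in blocks:
        if cycles_used > cycle_budget:
            # Budget exceeded: skip motion estimation for the rest.
            modes.append("texture")
            cycles_used += texture_cost
        else:
            modes.append("motion")
            cycles_used += motion_cost
    return modes, cycles_used
```

For five blocks and a budget of 25 cycles, the first three blocks use motion estimation (30 cycles counted) and the remaining two fall back to texture encoding.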
-
Publication number: 20030231712
Abstract: Video encoding techniques are described. In one example, a video encoding technique includes identifying a pixel location associated with a video block in a search space based on motion vectors associated with a set of video blocks within a video frame to be encoded, wherein the video blocks in the set are spatially located at defined locations relative to a current video block of the video frame to be encoded. A motion estimation routine can then be initialized for the current video block at the identified pixel location. By identifying a pixel location associated with a video block in a search space based on motion vectors associated with a set of video blocks within a video frame, the phenomenon of spatial redundancy can be more readily exploited to accelerate and improve the encoding process.
Type: Application
Filed: June 18, 2002
Publication date: December 18, 2003
Inventors: Naren Malayath, Chienchung Chang, Suhail Jalil
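Illustrative sketch (not taken from the application): the abstract does not say how the neighbors' motion vectors are combined into a starting location. As an assumption, the example below takes the component-wise median of the neighboring vectors and offsets the current block's position by it. The function name `initial_search_location` is hypothetical.

```python
from statistics import median


def initial_search_location(block_origin, neighbor_vectors):
    """Pick the pixel location at which to start motion estimation.

    block_origin: (x, y) of the current block.
    neighbor_vectors: (dx, dy) motion vectors of already-encoded
    neighboring blocks, e.g. left, top, and top-right.
    """
    if not neighbor_vectors:
        # No neighbors yet: start the search at the block itself.
        return block_origin
    dx = median(v[0] for v in neighbor_vectors)
    dy = median(v[1] for v in neighbor_vectors)
    # int() truncates the averaged median of an even-sized set.
    return (block_origin[0] + int(dx), block_origin[1] + int(dy))
```

Starting near where neighboring blocks moved tends to shorten the search when adjacent blocks share motion, which is the spatial redundancy the abstract refers to.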
-
Publication number: 20030206658
Abstract: This disclosure is directed to encoding techniques that can be used to improve encoding of digital video data. The techniques can be implemented by an encoder of a digital video device in order to reduce the number of computations and possibly reduce power consumption during video encoding. More specifically, video encoding techniques are described which utilize one or more programmable thresholds in order to terminate the execution of various computations when the computations would be unlikely to improve the encoding. By terminating computations prematurely, the amount of processing required for video encoding can be reduced, and power can be conserved.
Type: Application
Filed: May 3, 2002
Publication date: November 6, 2003
Inventors: Anthony Patrick Mauro, Chienchung Chang, King-Chung Lai
-
Publication number: 20030171924
Abstract: Generally stated, a method and an accompanying apparatus provide for a voice recognition system (300) with a programmable front end processing unit (400). The front end processing unit (400) requests and receives different configuration files at different times for processing voice data in the voice recognition system (300). The configuration files are communicated to the front end unit via a communication link (310) for configuring the front end processing unit (400). A microprocessor may provide the front end configuration files on the communication link at different times.
Type: Application
Filed: December 14, 2001
Publication date: September 11, 2003
Applicant: Qualcomm, Incorporated
Inventors: Ning Bi, Andrew P. DeJaco, Xin Zhong, Chienchung Chang, Chuck Han, Hari Garudadri, Naren Malayath, Suhail Jalil
-
Publication number: 20030154080
Abstract: Method and apparatus for modifying a speech input to generate a desired speech output. A voice recognition unit analyzes the speech input and generates a textual output. The output is used by a speech synthesis unit to generate an output speech signal using speech characteristics and parameters stored in a database. The database may be preconfigured to store speech characteristics of a variety of types. A training unit allows the user to enter speech samples which are used to generate inputs to the database. The training unit acquires the speech samples and generates speech units, such as diphones, which may be modified to achieve a desired result. In one embodiment, the modified speech output is applied to an STU program, and feedback is provided indicating the error rate.
Type: Application
Filed: February 14, 2002
Publication date: August 14, 2003
Inventors: Sandra L. Godsey, Chienchung Chang, Carola R. Emelius-Swartz
-
Publication number: 20030133565
Abstract: An improved echo cancellation system (400) includes a double talk detector (406) configured for detecting a double talk condition by monitoring voice energy in a first frequency band (503). An adaptive filter (420) is configured for producing an echo signal based on a set of coefficients, and holds the set of coefficients constant when the double talk detector (406) detects the double talk condition. A microphone system (402) inputs audible signals (404) in a second frequency band (502) that is wider and overlaps the first frequency band (503). The echo signal is used to cancel echo in the input signal. A loud speaker (401) is configured for playing voice data in a third frequency band (501) essentially equal to a difference of the first and second frequency bands (502 and 503). The first and third frequency bands (503 and 501) essentially make up the second frequency band (502).
Type: Application
Filed: January 15, 2002
Publication date: July 17, 2003
Inventors: Chienchung Chang, Narendranath Malayath
-
Publication number: 20030135371
Abstract: An improved system for an interactive voice recognition system (400) includes a voice prompt generator (401) for generating voice prompt in a first frequency band (501). A speech detector (406) detects presence of speech energy in a second frequency band (502). The first and second frequency bands (501, 502) are essentially conjugate frequency bands. A voice data generator (412) generates voice data based on an output of the voice prompt generator (401) and audible speech of a voice response generator (402). A control signal (422) controls the voice prompt generator (401) based on whether the speech detector (406) detects presence of speech energy in the second frequency band (502). A back end (405) of the interactive voice recognition system (400) is configured to operate on an extracted front end voice feature based on whether the speech detector (406) detects presence of speech energy in the second frequency band (502).
Type: Application
Filed: January 15, 2002
Publication date: July 17, 2003
Inventors: Chienchung Chang, Narendranath Malayath
-
Patent number: 6594628
Abstract: A voice recognition system is disclosed wherein a feature extraction apparatus is located in a remote station. The feature extraction apparatus extracts features from an input speech frame and then provides the extracted features to a central processing station. In the central processing station, the features are provided to a word decoder which determines the syntax of the input speech frame.
Type: Grant
Filed: April 2, 1997
Date of Patent: July 15, 2003
Assignee: Qualcomm, Incorporated
Inventors: Paul E. Jacobs, Chienchung Chang
-
Patent number: 6574596
Abstract: A voice recognition rejection scheme for capturing an utterance includes the steps of accepting the utterance, applying an N-best algorithm to the utterance, or rejecting the utterance. The utterance is accepted if a first predefined relationship exists between one or more closest comparison results for the utterance with respect to a stored word and one or more differences between the one or more closest comparison results and one or more other comparison results between the utterance and one or more other stored words. An N-best algorithm is applied to the utterance if a second predefined relationship exists between the one or more closest comparison results and the one or more differences between the one or more closest comparison results and the one or more other comparison results.
Type: Grant
Filed: February 8, 1999
Date of Patent: June 3, 2003
Assignee: Qualcomm Incorporated
Inventors: Ning Bi, Chienchung Chang, Harinath Garudadri, Andrew P. Dejaco
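Illustrative sketch (not taken from the patent): the abstract leaves the two "predefined relationships" unspecified. As an assumption, the example below compares the margin between the best and second-best distances against two fractions of the best distance, so a poorer best match needs a wider margin to be accepted. The function name `classify_utterance` and the parameters `accept_ratio` and `nbest_ratio` are hypothetical, as are their default values.

```python
def classify_utterance(distances, accept_ratio=0.5, nbest_ratio=0.2):
    """Decide whether to accept, run N-best on, or reject an utterance.

    distances: mapping from stored word to comparison distance
    (smaller is closer); it must contain at least two words.
    """
    ranked = sorted(distances.items(), key=lambda item: item[1])
    best_word, best = ranked[0]
    runner_up = ranked[1][1]
    margin = runner_up - best
    if margin >= accept_ratio * best:
        # Clear winner: accept the closest stored word.
        return ("accept", best_word)
    if margin >= nbest_ratio * best:
        # Ambiguous: hand the top candidates to an N-best step.
        return ("n-best", [word for word, _ in ranked[:3]])
    return ("reject", None)
```

The three outcomes mirror the accept, N-best, and reject steps named in the abstract; only the form of the two tests is invented here.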
-
Publication number: 20020178004
Abstract: A voice recognition system applies user inputs to adapt speaker-dependent voice recognition templates using implicit user confirmation during a transaction. In one embodiment, the user confirms the vocabulary word to complete a transaction, such as entry of a password, and in response a template database is updated. User utterances are used to generate test templates that are compared to the template database. Scores are generated for each test template and a winner selected. The template database includes one set of speaker independent templates and two sets of speaker dependent templates.
Type: Application
Filed: May 23, 2001
Publication date: November 28, 2002
Inventors: Chienchung Chang, Narendranath Malayath
-
Publication number: 20020143540
Abstract: A voice recognition (VR) system is disclosed that utilizes a combination of speaker independent (SI) and speaker dependent (SD) acoustic models. At least one SI acoustic model is used in combination with at least one SD acoustic model to provide a level of speech recognition performance that at least equals that of a purely SI acoustic model. The disclosed hybrid SI/SD VR system continually uses unsupervised training to update the acoustic templates in the one or more SD acoustic models.
Type: Application
Filed: March 28, 2001
Publication date: October 3, 2002
Inventors: Narendranath Malayath, Andrew P. DeJaco, Chienchung Chang, Suhail Jalil, Ning Bi, Harinath Garudadri
-
Patent number: 6449496
Abstract: A method and apparatus providing a user interface within a phone that responds to a limited vocabulary of user trained voice commands. The interface allows users to perform all phone handset dialing functions using voice commands. Additionally, users will be able to create and modify entries within a voice recognition phonebook, whereby a number within the voice recognition phonebook can be called by saying the name associated with the number. The user interface provides a combination of voice and LCD displayed user prompts and responses to voice input. The interface responds to user voice commands and performs the command functions based upon matches to previously user trained voice command vocabulary words stored in memory.
Type: Grant
Filed: February 8, 1999
Date of Patent: September 10, 2002
Assignee: Qualcomm Incorporated
Inventors: Scott D. Beith, Ning Bi, Chienchung Chang, Karthick Chinnaswami, Andrew P. DeJaco, Jason B. Kenagy, Robert Opalsky, George Pan