Viterbi Trellis Patents (Class 704/242)

Soft decision enhancement

Patent number: 8340204

Abstract: A Viterbi trellis processing technique in which soft decisions and hard decisions are derived from a received signal and the soft decisions are enhanced by being modified using the hard decisions. A log likelihood ratio for a bit of the received signal can be derived by grouping candidate metrics associated with the decision that the bit has a first state, grouping candidate metrics associated with the decision that the bit has a second state, applying respective functions to the groups and calculating the difference of the function values.

Type: Grant

Filed: August 5, 2005

Date of Patent: December 25, 2012

Assignees: MStar Semiconductor, Inc., MStar Software R&D (Shenzhen) Ltd., MStar France SAS, MStar Semiconductor, Inc.

Inventors: Navid Fatemi-Ghomi, Cyril Valadon
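
The log likelihood ratio step described in the abstract above can be sketched roughly as follows: candidate metrics are grouped by the bit state they imply, a function is applied to each group, and the two function values are subtracted. The use of `min` as the group function (treating a lower metric as more likely), the sign convention, and every name in the sketch are assumptions made for illustration; the abstract does not specify them.

```python
def llr_from_candidate_metrics(candidates, group_fn=min):
    """Return a log-likelihood-ratio style value for one bit.

    candidates: iterable of (bit, metric) pairs, where bit is 0 or 1 and
    metric is a cost (assumption: lower cost means more likely).
    group_fn: function applied to each group of metrics (assumption: min).
    """
    zeros = [metric for bit, metric in candidates if bit == 0]
    ones = [metric for bit, metric in candidates if bit == 1]
    if not zeros or not ones:
        raise ValueError("need at least one candidate metric for each bit state")
    # Assumed sign convention: a positive result favours bit == 1.
    return group_fn(zeros) - group_fn(ones)


# Hypothetical candidate metrics for a single bit.
example = [(0, 5.0), (1, 2.0), (0, 3.5), (1, 4.0)]
llr = llr_from_candidate_metrics(example)  # 3.5 - 2.0 = 1.5
```

A different `group_fn`, such as a log-sum-exp over negated metrics, could be passed in without changing the grouping and differencing structure.
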
Viterbi decoder and speech recognition method using same using non-linear filter for observation probabilities

Patent number: 8332222

Abstract: A Viterbi decoder includes: an observation vector sequence generator for generating an observation vector sequence by converting an input speech to a sequence of observation vectors; a local optimal state calculator for obtaining a partial state sequence having a maximum similarity up to a current observation vector as an optimal state; an observation probability calculator for obtaining, as a current observation probability, a probability for observing the current observation vector in the optimal state; a buffer for storing therein a specific number of previous observation probabilities; a non-linear filter for calculating a filtered probability by using the previous observation probabilities stored in the buffer and the current observation probability; and a maximum likelihood calculator for calculating a partial maximum likelihood by using the filtered probability.

Type: Grant

Filed: July 21, 2009

Date of Patent: December 11, 2012

Assignee: Electronics and Telecommunications Research Institute

Inventors: Hoon Chung, Jeon Gue Park, Yunkeun Lee, Ho-Young Jung, Hyung-Bae Jeon, Jeom Ja Kang, Sung Joo Lee, Euisok Chung, Ji Hyun Wang, Byung Ok Kang, Ki-young Park, Jong Jin Kim
Modified trace-back using soft output viterbi algorithm (SOVA)

Patent number: 8321771

Abstract: Systems and methods are provided for generating error events for decoded bits using a soft output Viterbi algorithm (SOVA). A winning path through a trellis can be determined and decoded information can be generated. Path metric differences can be computed within the trellis based on the winning path. A plurality of error event masks and error event metrics can be generated based on the decoded information and the path metric differences.

Type: Grant

Filed: October 2, 2009

Date of Patent: November 27, 2012

Assignee: Marvell International Ltd.

Inventor: Manoj Kumar Yadav
Multi-channel sequential Viterbi decoder

Patent number: 8290097

Abstract: A multi-channel sequential Viterbi decoder includes: an input data buffer, a “Read Single Data Word from Input Data Buffer” signal driver, a processing unit selector, a decoder channel parameters registers unit, a processing unit for the “Reset Path Metrics” command, a processing unit for the “Set Path Metric Value for the Given Path Number” command, a processing unit for the “Get Single Bit from the Path with Given Number” command, a processing unit for the “Process Input Samples” command, a decoding paths and path metrics RAM, a unit for generating current decoder channel base address for the decoding paths and path metrics RAM, a unit for generating cell address for the decoding path and path metric RAM, and a data buffers unit for decoder channels output.

Type: Grant

Filed: April 19, 2010

Date of Patent: October 16, 2012

Assignee: Topcon Positioning Systems, Inc.

Inventors: Timur G. Kelin, Dmitry D. Murzinov, Dmitry A. Pyatkov
Viterbi pack instruction

Patent number: 8290095

Abstract: A Viterbi pack instruction is disclosed that masks the contents of a first predicate register with a first masking value and masks the contents of a second predicate register with a second masking value. The resulting masked data is written to a destination register. The Viterbi pack instruction may be implemented in hardware, firmware, software, or any combination thereof.

Type: Grant

Filed: March 23, 2006

Date of Patent: October 16, 2012

Assignee: QUALCOMM Incorporated

Inventors: Mao Zeng, Lucian Codrescu
Uncertainty interval content sensing within communications

Patent number: 8209175

Abstract: Repetition of content words in a communication is used to increase the certainty, or, alternatively, reduce the uncertainty, that the content words were actual words from the communication. Reducing the uncertainty of a particular content word of a communication in turn increases the likelihood that the content word is relevant to the communication. Reliable, relevant content words mined from a communication can be used for, e.g., automatic internet searches for documents and/or web sites pertinent to the communication. Reliable, relevant content words mined from a communication can also, or alternatively, be used to automatically generate one or more documents from the communication, e.g., communication summaries, communication outlines, etc.

Type: Grant

Filed: June 8, 2006

Date of Patent: June 26, 2012

Assignee: Microsoft Corporation

Inventors: Kunal Mukerjee, Rafael Ballesteros
Multi-resolution hidden markov model using class specific features

Patent number: 8200489

Abstract: A method for classifying data includes selecting an elemental size and features for the data that are representative of possible subclasses. Resolution widths are selected in conjunction with these features. Models associated with symbols are developed from these resolution widths and features. Data is compared with these models to give a likelihood that the model applies. The best model is determined and a signal is provided related to the symbol associated with the best model.

Type: Grant

Filed: January 29, 2009

Date of Patent: June 12, 2012

Assignee: The United States of America as represented by the Secretary of the Navy

Inventor: Paul M. Baggenstoss
Efficient decoding of spatially-multiplexed signals

Patent number: 8194801

Abstract: A method for communication includes receiving a spatially-multiplexed signal using multiple receivers to produce multiple respective received signals. The spatially-multiplexed signal includes multiple simultaneously-transmitted symbols, which are selected from respective sets of constellation symbols, each constellation symbol representing a respective set of values of a group of data bits. Combinations of the constellation symbols are traversed iteratively. Each combination includes one constellation symbol from each of the sets of the constellation symbols and represents N data bits. The traversed combinations are searched for a combination that matches the received signals. During traversal of the combinations, at least 2N measures of likelihood regarding the values of the data bits represented by each traversed combination are accumulated. The accumulated measures of likelihood are processed to produce soft bit metrics.

Type: Grant

Filed: September 21, 2011

Date of Patent: June 5, 2012

Assignee: Altair Semiconductor Ltd.

Inventors: Yigal Bitran, Itay Lusky, Ariel Yagil
System and method for providing large vocabulary speech processing based on fixed-point arithmetic

Patent number: 8195462

Abstract: Disclosed herein is a system, method and computer-readable medium storing instructions for controlling a computing device according to the method. As an example embodiment, the method uses a speech recognition decoder that operates or uses fixed point arithmetic. The exemplary method comprises representing arc costs associated with at least one finite state transducer (FST) in fixed point, representing parameters associated with a hidden Markov model (HMM) in fixed point and processing speech data in the speech recognition decoder using fixed point arithmetic for the fixed point FST arc costs and the fixed point HMM parameters. The method may also include computing at the decoder sentence hypothesis probabilities with fixed point arithmetic as type Q-2e numbers.

Type: Grant

Filed: February 16, 2006

Date of Patent: June 5, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Charles Douglas Blewett, Enrico Luigi Bocchieri
Information communication terminal, information communication system, information communication method, and storage medium for storing an information communication program thereof for recognizing speech information

Patent number: 8126712

Abstract: An information communication terminal (100) that includes: a speech recognition module (6) for recognizing speech information to identify a plurality of words in the recognized speech information; a storage medium (20) for storing keyword extraction condition setting data (24) in which a condition for extracting a keyword is set; a keyword extraction module (8) for reading the keyword extraction condition setting data (24) to extract a plurality of keywords from the plurality of words; a related information acquisition module (11) for acquiring related information related to a plurality of keywords; and a related information output module (14) for providing related information to a monitor (2).

Type: Grant

Filed: February 8, 2006

Date of Patent: February 28, 2012

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takeya Mukaigaito, Shinya Takada, Daigoro Yokozeki, Miki Sakai, Rie Sakai, Katsuya Arai, Takuo Nishihara, Takahiko Murayama
Adjusting a speech engine for a mobile computing device based on background noise

Patent number: 8121837

Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

Type: Grant

Filed: April 24, 2008

Date of Patent: February 21, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Paritosh D. Patel
Information Processing Apparatus, Information Processing Method, and Program

Publication number: 20120035927

Abstract: An information processing apparatus includes a plurality of information input units that inputs observation information of a real space, an event detection unit that generates event information including estimated position information and estimated identification (ID) information of a user present in the real space based on analysis of the information input from the information input unit, and an information integration processing unit that inputs the event information, and generates target information including a position and user ID information of each user based on the input event information and signal information representing a probability value for an event generating source. Here, the information integration processing unit includes an utterance source probability calculation unit having an identifier, and calculates an utterance source probability based on input information using the identifier in the utterance source probability calculation unit.

Type: Application

Filed: July 1, 2011

Publication date: February 9, 2012

Inventors: Keiichi Yamada, Tsutomu Sawada
Model development authoring, generation and execution based on data and processor dependencies

Patent number: 8086455

Abstract: A recognition (e.g., speech, handwriting, etc.) model build process that is declarative and data-dependence-based. Process steps are defined in a declarative language as individual processors having input/output data relationships and data dependencies of predecessors and subsequent process steps. A compiler is utilized to generate the model building sequence. The compiler uses the input data and output data files of each model build processor to determine the sequence of model building and automatically orders the processing steps based on the declared input/output relationship (the user does not need to determine the order of execution). The compiler also automatically detects ill-defined processes, including cyclic definition and data being produced by more than one action. The user can add, change and/or modify a process by editing a declaration file, and rerunning the compiler, thereby a new process is automatically generated.

Type: Grant

Filed: January 9, 2008

Date of Patent: December 27, 2011

Assignee: Microsoft Corporation

Inventors: Yifan Gong, Ye Tian
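
The dependency-driven ordering described in the abstract above can be illustrated with a small sketch: each processor declares its input and output files, the execution order is derived from those declarations, and cyclic definitions or data produced by more than one step are reported as errors. The dictionary layout, the function name, and the file names are hypothetical; the sketch relies on Python's standard `graphlib` module (Python 3.9+) and is not the declarative language or compiler of the patent.

```python
from graphlib import CycleError, TopologicalSorter


def order_processors(processors):
    """Order build steps from their declared inputs and outputs.

    processors: dict mapping a step name to {"inputs": [...], "outputs": [...]}
    (a hypothetical stand-in for a declaration file).
    """
    producer = {}
    for name, spec in processors.items():
        for out in spec["outputs"]:
            if out in producer:
                # Ill-defined process: the same data produced by two steps.
                raise ValueError(
                    f"{out!r} is produced by both {producer[out]!r} and {name!r}"
                )
            producer[out] = name

    # Each step depends on whichever step produces one of its inputs.
    graph = {
        name: {producer[i] for i in spec["inputs"] if i in producer}
        for name, spec in processors.items()
    }
    try:
        return list(TopologicalSorter(graph).static_order())
    except CycleError as exc:
        # Ill-defined process: cyclic definition.
        raise ValueError(f"cyclic definition: {exc.args[1]}") from exc


steps = {
    "train": {"inputs": ["features.dat"], "outputs": ["model.bin"]},
    "extract": {"inputs": ["audio.raw"], "outputs": ["features.dat"]},
    "evaluate": {"inputs": ["model.bin", "features.dat"], "outputs": ["report.txt"]},
}
build_order = order_processors(steps)  # ['extract', 'train', 'evaluate']
```

Editing the `steps` declaration and calling `order_processors` again yields a new sequence, which mirrors the edit-and-recompile workflow the abstract describes.
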
Speech recognition circuit using parallel processors

Patent number: 8036890

Abstract: A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality of lexical tree data structures. Each lexical tree data structure comprises a model of words having common prefix components. An initial component of each lexical tree structure is unique. A plurality of lexical tree processors are connected in parallel to the input buffer for processing the speech parameters in parallel to perform parallel lexical tree processing for word recognition by accessing the lexical data in the lexical memory. A results memory is connected to the lexical tree processors for storing processing results from the lexical tree processors and lexical tree identifiers to identify lexical trees to be processed by the lexical tree processors.

Type: Grant

Filed: September 4, 2009

Date of Patent: October 11, 2011

Assignee: Zentian Limited

Inventor: Mark Catchpole
Technique for training a phonetic decision tree with limited phonetic exceptional terms

Patent number: 8027834

Abstract: The present invention discloses a method for training an exception-limited phonetic decision tree. An initial subset of data can be selected and used for creating an initial phonetic decision tree. Additional terms can then be incorporated into the subset. The enlarged subset can be used to evaluate the phonetic decision tree with the results being categorized as either correctly or incorrectly phonetized. An exception-limited phonetic tree can be generated from the set of correctly phonetized terms. If the termination conditions for the method have been determined to be unsatisfactorily met, then steps of the method can be repeated.

Type: Grant

Filed: June 25, 2007

Date of Patent: September 27, 2011

Assignee: Nuance Communications, Inc.

Inventor: Steven M. Hancock
Method and system of optimal selection strategy for statistical classifications

Patent number: 8024188

Abstract: An optimal selection or decision strategy is described through an example that includes use in dialog systems. The selection strategy or method includes receiving multiple predictions and multiple probabilities. The received predictions predict the content of a received input and each of the probabilities corresponds to one of the predictions. In an example dialog system, the received input includes an utterance. The selection method includes dynamically selecting a set of predictions from the received predictions by generating ranked predictions. The ranked predictions are generated by ordering the plurality of predictions according to descending probability.

Type: Grant

Filed: August 24, 2007

Date of Patent: September 20, 2011

Assignee: Robert Bosch GmbH

Inventors: Junling Hu, Fabrizio Morbini, Fuliang Weng, Xue Liu
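
A minimal sketch of the ranking step in the abstract above: predictions are paired with their probabilities and ordered by descending probability. The second function, which keeps ranked predictions until their combined probability reaches a threshold, is only one assumed way of selecting a set from the ranked list; the abstract does not state the selection rule, and all names and values here are hypothetical.

```python
def rank_predictions(predictions, probabilities):
    """Pair each prediction with its probability, highest probability first."""
    if len(predictions) != len(probabilities):
        raise ValueError("each prediction needs exactly one probability")
    return sorted(
        zip(predictions, probabilities), key=lambda pair: pair[1], reverse=True
    )


def select_top(ranked, min_total_probability):
    """Assumed rule: keep ranked predictions until enough probability is covered."""
    selected, total = [], 0.0
    for prediction, probability in ranked:
        selected.append(prediction)
        total += probability
        if total >= min_total_probability:
            break
    return selected


# Hypothetical recognizer output for one utterance.
ranked = rank_predictions(["weather", "whether", "feather"], [0.3, 0.6, 0.1])
chosen = select_top(ranked, 0.8)  # ['whether', 'weather']
```
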
Method and apparatus for uniterm discovery and voice-to-voice search on mobile device

Patent number: 8019604

Abstract: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with other content that is also stored within the device. The UDS engine retrieves a number of uniterms from the audio data and associates the uniterms with the stored content. When a voice search is initiated at the device, the UDS engine generates a statistical latent lattice model from the voice query and scores the uniterms from the audio database against the latent lattice model. Following a further refinement, the best group of uniterms is then determined and segments of the stored audio data and/or other content corresponding to the best group of uniterms are outputted.

Type: Grant

Filed: December 21, 2007

Date of Patent: September 13, 2011

Assignee: Motorola Mobility, Inc.

Inventor: Changxue Ma
Audio source separation based on flexible pre-trained probabilistic source models

Patent number: 8014536

Abstract: Improved audio source separation is provided by providing an audio dictionary for each source to be separated. Thus the invention can be regarded as providing “partially blind” source separation as opposed to the more commonly considered “blind” source separation problem, where no prior information about the sources is given. The audio dictionaries are probabilistic source models, and can be derived from training data from the sources to be separated, or from similar sources. Thus a library of audio dictionaries can be developed to aid in source separation. An unmixing and deconvolutive transformation can be inferred by maximum likelihood (ML) given the received signals and the selected audio dictionaries as input to the ML calculation. Optionally, frequency-domain filtering of the separated signal estimates can be performed prior to reconstructing the time-domain separated signal estimates. Such filtering can be regarded as providing an “audio skin” for a recovered signal.

Type: Grant

Filed: December 1, 2006

Date of Patent: September 6, 2011

Assignee: Golden Metallic, Inc.

Inventor: Hagai Thomas Attias
Speech analysis using statistical learning

Patent number: 8005676

Abstract: Included are embodiments for providing speech analysis. At least one embodiment of a method includes receiving audio data associated with a communication and providing the at least one phoneme in a phonetic transcript, the phonetic transcript including at least one character from a phonetic alphabet.

Type: Grant

Filed: September 29, 2006

Date of Patent: August 23, 2011

Assignee: Verint Americas, Inc.

Inventors: Gary Duke, Joseph Watson
Device and method of modeling acoustic characteristics with HMM and collating the same with a voice characteristic vector sequence

Patent number: 7895040

Abstract: According to an embodiment, voice recognition apparatus includes units of: acoustic processing, voice interval detecting, dictionary, collating, search target selecting, storing and determining, and voice recognition method includes processes of: selecting a search range on basis of a beam search, setting and storing a standard frame, storing an output probability of a certain transition path, determining whether or not the output probability of a certain path is stored. Number of times of calculation of the output probability is reduced by selecting the search range on basis of the beam search, calculating the output probability of the certain transition path only once in an interval from when the standard frame is set to when the standard frame is renewed, and storing and using thus calculated value as an approximate value of the output probability in subsequent frames.

Type: Grant

Filed: March 30, 2007

Date of Patent: February 22, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventors: Masaru Sakai, Shinichi Tanaka
Time synchronous decoding for long-span hidden trajectory model

Patent number: 7877256

Abstract: A time-synchronous lattice-constrained search algorithm is developed and used to process a linguistic model of speech that has a long-contextual-span capability. In the algorithm, hypotheses are represented as traces that include an indication of a current frame, previous frames and future frames. Each frame can include an associated linguistic unit such as a phone or units that are derived from a phone. Additionally, pruning strategies can be applied to speed up the search. Further, word-ending recombination methods are developed to speed up the computation. These methods can effectively deal with an exponentially increased search space.

Type: Grant

Filed: February 17, 2006

Date of Patent: January 25, 2011

Assignee: Microsoft Corporation

Inventors: Xiaolong Li, Li Deng, Dong Yu, Alejandro Acero
METHOD AND APPARATUS FOR IMPROVING MEMORY LOCALITY FOR REAL-TIME SPEECH RECOGNITION

Publication number: 20100332228

Abstract: According to some embodiments, a method and apparatus are provided to buffer N audio frames of a plurality of audio frames associated with an audio signal, pre-compute scores for a subset of context dependent models (CDMs), and perform a graphical model search associated with the N audio frames where a score of a context independent model (CIM) associated with a CDM is used in lieu of a score for the CDM when a score for the CDM is needed and has not been pre-computed.

Type: Application

Filed: June 25, 2009

Publication date: December 30, 2010

Inventors: Michael Eugene Deisher, Tao Ma
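
The fallback described in the abstract above can be sketched as a simple lookup: when a score for a context dependent model (CDM) has been pre-computed it is used, and otherwise the score of the associated context independent model (CIM) is used in its place. The dictionary layout, the model names, and the score values are hypothetical and chosen only for illustration.

```python
def model_score(frame, cdm_id, precomputed_cdm_scores, cdm_to_cim, cim_scores):
    """Return the CDM score if pre-computed, else the associated CIM score.

    precomputed_cdm_scores: dict keyed by (frame, cdm_id)
    cdm_to_cim: dict mapping each CDM to its associated CIM
    cim_scores: dict keyed by (frame, cim_id)
    """
    key = (frame, cdm_id)
    if key in precomputed_cdm_scores:
        return precomputed_cdm_scores[key]
    # Score not pre-computed: use the CIM score in lieu of the CDM score.
    return cim_scores[(frame, cdm_to_cim[cdm_id])]


# Hypothetical models: two triphone-like CDMs that share one CIM.
cdm_to_cim = {"k-ae+t": "ae", "b-ae+t": "ae"}
cim_scores = {(0, "ae"): -4.2}
precomputed = {(0, "k-ae+t"): -3.1}

hit = model_score(0, "k-ae+t", precomputed, cdm_to_cim, cim_scores)   # -3.1
miss = model_score(0, "b-ae+t", precomputed, cdm_to_cim, cim_scores)  # -4.2
```
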
Enhancement to Viterbi speech processing algorithm for hybrid speech models that conserves memory

Patent number: 7805305

Abstract: The present invention discloses a method for semantically processing speech for speech recognition purposes. The method can reduce an amount of memory required for a Viterbi search of an N-gram language model having a value of N greater than two and also having at least one embedded grammar that appears in multiple contexts to a memory size of approximately a bigram model search space with respect to the embedded grammar. The method also reduces needed CPU requirements. Achieved reductions can be accomplished by representing the embedded grammar as a recursive transition network (RTN), where only one instance of the recursive transition network is used for the contexts. Other than the embedded grammars, a Hidden Markov Model (HMM) strategy can be used for the search space.

Type: Grant

Filed: October 12, 2006

Date of Patent: September 28, 2010

Assignee: Nuance Communications, Inc.

Inventors: Daniel E. Badt, Tomas Beran, Radek Hampl, Pavel Krbec, Jan Sedivy
Systems and methods for fast and memory efficient machine translation using statistical integrated phase lattice

Patent number: 7797148

Abstract: A phrase-based translation system and method includes a statistically integrated phrase lattice (SIPL) (H) which represents an entire translational model. An input (I) is translated by determining a best path through an entire lattice (S) by performing an efficient composition operation between the input and the SIPL. The efficient composition operation is performed by a multiple level search where each operand in the efficient composition operation represents a different search level.

Type: Grant

Filed: June 4, 2008

Date of Patent: September 14, 2010

Assignee: International Business Machines Corporation

Inventors: Stanley Chen, Yuqing Gao, Bowen Zhou
Adaptive deinterleaver memory allocation

Patent number: 7782982

Abstract: A receiver for use in a wireless network comprising a communications channel and a method of allocating deinterleaver memory usage in the receiver, wherein the receiver comprises a processor adapted to organize subchannels of the communications channel and set a number (N) of data bits per soft decision, wherein the soft decision is represented by N data bits; an address decoder adapted to decode the subchannels; a demapper adapted to receive QAM symbols and demap the QAM symbols to soft decisions; a deinterleaver adapted to perform deinterleaving on the soft decisions, wherein the deinterleaver comprises a memory component having a storage size that is a function of the number (N) of bits per soft decision; and a Viterbi decoder adapted to decode the deinterleaved soft decisions.

Type: Grant

Filed: October 25, 2007

Date of Patent: August 24, 2010

Assignee: Newport Media, Inc.

Inventor: Nabil Yousef
Speaker Recognition in a Speech Recognition System

Publication number: 20100198598

Abstract: A method for recognizing a speaker of an utterance in a speech recognition system is disclosed. A likelihood score for each of a plurality of speaker models for different speakers is determined. The likelihood score indicating how well the speaker model corresponds to the utterance. For each of the plurality of speaker models, a probability that the utterance originates from that speaker is determined. The probability is determined based on the likelihood score for the speaker model and requires the estimation of a distribution of likelihood scores expected based at least in part on the training state of the speaker.

Type: Application

Filed: February 4, 2010

Publication date: August 5, 2010

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Tobias Herbig, Franz Gerl
Methods of channel coding for communication systems

Patent number: 7764743

Abstract: A method of encoding data for transmission to one or more users selects a given number of bits of data from a transport block to be subject to hybrid ARQ functionality for channel coding. Only the selected bits are channel coded in a HARQ block for subsequent transmission using a given set of channelization codes to one or more users.

Type: Grant

Filed: August 5, 2005

Date of Patent: July 27, 2010

Assignee: Alcatel-Lucent USA Inc.

Inventor: Emad N. Farag
Method for the soft bit metric calculation with linear MIMO detection for LDPC codes

Patent number: 7751506

Abstract: A MIMO receiver implements a method for the soft bit metric calculation with linear MIMO detection for LDPC codes, after linear matrix inversion MIMO detection. In the receiver, a detector detects the estimated symbol and the noise variance. Further, a soft metric calculation unit computes the distance between the estimated symbol and the constellation point, and then divides the distance by the noise variance to determine the soft bit metrics.

Type: Grant

Filed: December 1, 2005

Date of Patent: July 6, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Huaning Niu, Chiu Ngo
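
As a rough illustration of the soft metric calculation in the abstract above, the sketch below computes, for each bit position, the distance from the estimated symbol to the nearest constellation point carrying each bit value and divides the difference by the noise variance. The squared Euclidean distance, the nearest-point (max-log style) simplification, the sign convention, and the QPSK mapping are all assumptions; the abstract only states that a distance is computed and divided by the noise variance.

```python
def soft_bit_metrics(symbol_estimate, noise_variance, constellation):
    """Return one soft metric per bit of the detected symbol.

    constellation: dict mapping a tuple of bits to a complex constellation point.
    Assumed sign convention: a positive metric favours bit value 0.
    """
    if noise_variance <= 0:
        raise ValueError("noise variance must be positive")
    n_bits = len(next(iter(constellation)))
    metrics = []
    for i in range(n_bits):
        # Squared distance to the nearest point whose i-th bit is 0, then 1.
        d0 = min(abs(symbol_estimate - p) ** 2
                 for bits, p in constellation.items() if bits[i] == 0)
        d1 = min(abs(symbol_estimate - p) ** 2
                 for bits, p in constellation.items() if bits[i] == 1)
        metrics.append((d1 - d0) / noise_variance)
    return metrics


# Hypothetical Gray-mapped QPSK constellation.
QPSK = {
    (0, 0): 1 + 1j,
    (0, 1): 1 - 1j,
    (1, 0): -1 + 1j,
    (1, 1): -1 - 1j,
}
metrics = soft_bit_metrics(1 + 1j, noise_variance=1.0, constellation=QPSK)
```

A larger noise variance shrinks every metric toward zero, which is how a decoder downstream would learn that the detected symbol is less reliable.
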
VITERBI DECODER AND SPEECH RECOGNITION METHOD USING SAME

Publication number: 20100161329

Abstract: A Viterbi decoder includes: an observation vector sequence generator for generating an observation vector sequence by converting an input speech to a sequence of observation vectors; a local optimal state calculator for obtaining a partial state sequence having a maximum similarity up to a current observation vector as an optimal state; an observation probability calculator for obtaining, as a current observation probability, a probability for observing the current observation vector in the optimal state; a buffer for storing therein a specific number of previous observation probabilities; a non-linear filter for calculating a filtered probability by using the previous observation probabilities stored in the buffer and the current observation probability; and a maximum likelihood calculator for calculating a partial maximum likelihood by using the filtered probability.

Type: Application

Filed: July 21, 2009

Publication date: June 24, 2010

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Hoon CHUNG, Jeon Gue PARK, Yunkeun LEE, Ho-Young JUNG, Hyung-Bae JEON, Jeom Ja KANG, Sung Joo LEE, Euisok CHUNG, Ji Hyun WANG, Byung Ok KANG, Ki-young PARK, Jong Jin KIM
Quadrature amplitude modulation trellis coded modulation decoding apparatus and method thereof

Patent number: 7738580

Abstract: A quadrature amplitude modulation trellis coded modulation (QAM-TCM) decoding apparatus and the related method that receives and decodes a QAM signal. The QAM-TCM decoding apparatus includes an in-phase least significant bit (LSB) decoding path, which includes an in-phase Viterbi decoder for executing a decoding procedure on at least one LSB corresponding to an in-phase component of the QAM signal, a quadrature-phase LSB decoding path, which includes a quadrature-phase Viterbi decoder for executing a decoding procedure on at least one LSB corresponding to a quadrature-phase component of the QAM signal, and a most significant bit (MSB) decoding path for executing a decoding procedure on MSB portions corresponding to the in-phase or the quadrature-phase of the QAM signal.

Type: Grant

Filed: June 6, 2006

Date of Patent: June 15, 2010

Assignee: Realtek Semiconductor Corp.

Inventors: Jung-Tang Chiang, Hou-Wei Lin
Pattern matching method and apparatus and speech information retrieval system

Patent number: 7739111

Abstract: A pattern matching method for matching between a first symbol sequence and a second symbol sequence which is shorter than the first symbol sequence is provided. The method includes the steps of performing DP matching between the first and second symbol sequences to create a matrix of the DP matching transition, detecting the maximum length of lengths of consecutive correct answers based on the matrix of the DP matching transition, and calculating similarity based on the maximum length.

Type: Grant

Filed: August 9, 2006

Date of Patent: June 15, 2010

Assignee: Canon Kabushiki Kaisha

Inventor: Kazue Kaneko
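
The steps in the abstract above can be sketched as follows: a DP (dynamic programming) alignment between the longer and the shorter symbol sequence records which transition was taken at each cell, the recorded transitions are traced back to find the longest run of consecutive correct matches, and a similarity is derived from that run length. The unit edit costs, the tie-breaking order, and the normalization by the length of the shorter sequence are assumptions for illustration and are not taken from the patent.

```python
def longest_match_run(long_seq, short_seq):
    """Align two sequences by DP and return the longest run of consecutive matches."""
    n, m = len(long_seq), len(short_seq)
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    move = [[None] * (m + 1) for _ in range(n + 1)]  # matrix of DP transitions
    for i in range(1, n + 1):
        cost[i][0], move[i][0] = i, "skip_long"
    for j in range(1, m + 1):
        cost[0][j], move[0][j] = j, "skip_short"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = long_seq[i - 1] == short_seq[j - 1]
            options = [
                (cost[i - 1][j - 1] + (0 if same else 1),
                 "match" if same else "substitute"),
                (cost[i - 1][j] + 1, "skip_long"),
                (cost[i][j - 1] + 1, "skip_short"),
            ]
            cost[i][j], move[i][j] = min(options, key=lambda option: option[0])

    # Trace back through the transitions, counting consecutive matches.
    best = run = 0
    i, j = n, m
    while i > 0 or j > 0:
        step = move[i][j]
        run = run + 1 if step == "match" else 0
        best = max(best, run)
        if step in ("match", "substitute"):
            i, j = i - 1, j - 1
        elif step == "skip_long":
            i -= 1
        else:
            j -= 1
    return best


def similarity(long_seq, short_seq):
    """Assumed normalization: longest match run divided by the shorter length."""
    if not short_seq:
        return 0.0
    return longest_match_run(long_seq, short_seq) / len(short_seq)


full = similarity("xxabcdxx", "abcd")  # 1.0: the whole query matches in one run
split = similarity("abxcd", "abcd")    # 0.5: the run is broken by the extra symbol
```
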
Path memory circuit

Patent number: 7734992

Abstract: A path memory circuit for use in a Viterbi decoding process performed based on state transitions through a number n (n is a positive integer) of states. The path memory circuit includes a memory area A formed by the storage circuits of the first to ith (i is an integer from 0 to M) stages; a memory area B formed by the selective storage circuits that select and hold a decoding result for any state k (k is integer from 1 to n) of the storage circuits from the i+1th stage to the Mth stage; and a memory area C formed by the selective storage circuits other than the memory area A and the memory area B.

Type: Grant

Filed: December 7, 2004

Date of Patent: June 8, 2010

Assignee: Panasonic Corporation

Inventor: Yukio Arima
ONLINE ARABIC HANDWRITING RECOGNITION

Publication number: 20100128985

Abstract: Method for online character recognition of Arabic text, the method including receiving handwritten Arabic text from a user in the form of handwriting strokes, sampling the handwriting strokes to acquire a sequence of two dimensional point representations thereof, with associated temporal data, geometrically pre processing and extracting features on the point representations, detecting delayed strokes and word parts in the pre processed point representations, projecting the delayed strokes onto the body of the word parts, constructing feature vector representations for each word part, thereby generating an observation sequence, and determining the word with maximum probability given the observation sequence, resulting in a list of word probabilities.

Type: Application

Filed: July 26, 2007

Publication date: May 27, 2010

Applicant: BGN TECHNOLOGIES LTD.

Inventors: Jihad El-Sana, Fadi Biadsy
Speech recognition system and technique

Patent number: 7711561

Abstract: The present invention relates to speech recognition systems, particularly speech-to-text systems and software and decoders for the same.

Type: Grant

Filed: April 15, 2004

Date of Patent: May 4, 2010

Assignee: Kabushiki Kaisha Toshiba

Inventors: Wide Hogenhout, Kean Kheong Chin
Method of multilingual speech recognition by reduction to single-language recognizer engine components

Patent number: 7689404

Abstract: In some speech recognition applications, not only is the language of the utterance not known in advance, but a single utterance may also contain words in more than one language. At the same time, it is impractical to build speech recognizers for all expected combinations of languages. Moreover, business needs may require a new combination of languages to be supported in a short period of time. The invention addresses this issue by a novel way of combining and controlling the components of the single-language speech recognizers to produce multilingual speech recognition functionality capable of recognizing multilingual utterances at a modest increase of computational complexity.

Type: Grant

Filed: February 16, 2005

Date of Patent: March 30, 2010

Inventor: Arkady Khasin
Adaptive modulation method and coding rate control method

Patent number: 7656960

Abstract: In a radio communication system, transmitter and receiver stations share information on a maximum number of bits communicated per symbol. The transmitter station encodes a signal with sufficient error correcting capabilities to create a codeword. The transmitter station allocates the bits from the codeword to each symbol, modulates the symbols using a modulation type which processes symbols each having a number of bits equal to or smaller than the maximum number of bits per symbol, and transmits the modulated symbols. The receiver station demodulates the symbols using a modulation type which processes a larger number of bits per symbol as the transmission path quality is higher from among modulation types which process symbols having a number of bits equal to or smaller than the maximum number of bits per symbol.

Type: Grant

Filed: February 25, 2005

Date of Patent: February 2, 2010

Assignee: Hitachi, Ltd.

Inventors: Satoshi Tamaki, Takashi Yano, Seishi Hanaoka, Toshiyuki Saito
Method and system for decoding WCDMA AMR speech data using redundancy

Patent number: 7643993

Abstract: A method and system for decoding WCDMA AMR speech data using redundancy may include generating at least one bit-sequence for at least one of a plurality of channels that comprises received WCDMA speech data. The bit-sequence may be generated by using a decoding algorithm and may be decrypted to recover the data that may have been encrypted before being transmitted. At least one bit-sequence may be selected for each of the channels by using redundancy in the received WCDMA speech data. The redundancy in the received WCDMA speech data may be, for example, CRC. The bit-sequence for each of the channels may be combined to form at least one speech stream. A speech stream may be selected based on speech constraints, which may comprise gain continuity and/or pitch continuity. The selected speech stream may be communicated to a voice decoder.

Type: Grant

Filed: January 5, 2006

Date of Patent: January 5, 2010

Assignee: Broadcom Corporation

Inventor: Arie Heiman
SPEECH RECOGNITION CIRCUIT USING PARALLEL PROCESSORS

Publication number: 20090326941

Abstract: A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality of lexical tree data structures. Each lexical tree data structure comprises a model of words having common prefix components. An initial component of each lexical tree structure is unique. A plurality of lexical tree processors are connected in parallel to the input buffer for processing the speech parameters in parallel to perform parallel lexical tree processing for word recognition by accessing the lexical data in the lexical memory. A results memory is connected to the lexical tree processors for storing processing results from the lexical tree processors and lexical tree identifiers to identify lexical trees to be processed by the lexical tree processors.

Type: Application

Filed: September 4, 2009

Publication date: December 31, 2009

Inventor: Mark Catchpole
Large-vocabulary speech recognition method, apparatus, and medium based on multilayer central lexicons

Patent number: 7627474

Abstract: A speech recognition method including: layering a central lexicon in a tree structure with respect to recognition-subject vocabularies; performing multi-pass symbol matching between a recognized phoneme sequence and a phonetic sequence of the central lexicon layered in the tree structure; and selecting a final speech recognition result via a Viterbi search process using a detailed acoustic model with respect to candidate vocabularies selected by the multi-pass symbol matching.

Type: Grant

Filed: August 28, 2006

Date of Patent: December 1, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Nam Hoon Kim, In Jeong Choi, Ick Sang Han, Sang Bae Jeong
Systems and modules for use with trellis-based decoding

Patent number: 7623585

Abstract: Systems and modules for use in trellis-based decoding of convolutionally encoded sets of data bits. A first calculation module receives an encoded set of data bits and calculates a signal distance or a measure of the differences between the encoded set and each one of a group of predetermined states. The first calculation module consists of multiple parallel calculation submodules with each submodule being tasked to perform an XOR operation between the encoded set and one of the predetermined states. Multiple parallel second calculation modules each receiving the output of the first calculation module, calculate cumulative signal distances using the output of the first calculation module. Each second calculation module outputs its lowest valued cumulative signal distance and this may be used as input to a memory system for storing a database used in further decoding of the encoded data.

Type: Grant

Filed: February 28, 2003

Date of Patent: November 24, 2009

Inventor: Maher Amer
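
Illustrative note on patent 7623585: the abstract above describes XOR-based signal distances followed by selection of the lowest cumulative distance. The following is a minimal software sketch of that idea, not the patented hardware. The function names, the 4-state trellis, and the starting metrics are hypothetical values chosen for illustration.

```python
from typing import List, Tuple


def hamming_xor(a: int, b: int) -> int:
    """Signal distance between two bit patterns: number of 1 bits in a XOR b."""
    return bin(a ^ b).count("1")


def branch_metrics(received: int, expected: List[int]) -> List[int]:
    """Distance from the received symbol to each predetermined pattern."""
    return [hamming_xor(received, e) for e in expected]


def acs_step(
    path_metrics: List[int],
    predecessors: List[List[Tuple[int, int]]],
    bm: List[int],
) -> Tuple[List[int], List[int]]:
    """One add-compare-select step.

    predecessors[s] lists (previous_state, pattern_index) pairs that lead
    into state s. For each state, keep the lowest cumulative distance and
    remember which previous state produced it.
    """
    new_metrics, survivors = [], []
    for preds in predecessors:
        best_prev, best_cost = min(
            ((p, path_metrics[p] + bm[pat]) for p, pat in preds),
            key=lambda pc: pc[1],
        )
        new_metrics.append(best_cost)
        survivors.append(best_prev)
    return new_metrics, survivors


# Hypothetical 4-state trellis with 2-bit output patterns (illustration only).
EXPECTED = [0b00, 0b01, 0b10, 0b11]
PREDECESSORS = [
    [(0, 0), (1, 3)],
    [(2, 2), (3, 1)],
    [(0, 3), (1, 0)],
    [(2, 1), (3, 2)],
]

bm = branch_metrics(0b10, EXPECTED)
new_metrics, survivors = acs_step([0, 2, 1, 3], PREDECESSORS, bm)
print(bm, new_metrics, survivors)
```

In this sketch the survivor list plays the role of the data that a decoder would store for later traceback; ties are resolved in favour of the first listed predecessor, which is an arbitrary choice.
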
Method and apparatus for performing channel compensation and symbol demodulation for coherent demodulation in an OFDM system

Patent number: 7609615

Abstract: A method and apparatus for performing channel compensation and symbol demodulation using an estimated channel impulse response during coherent demodulation of a received Orthogonal Frequency Division Multiplexing (OFDM) signal are provided. In the apparatus, an FFT processor FFT-processes a received signal. A channel compensator generates a channel-compensated signal by multiplying the FFT-processed received signal by an estimated channel impulse response and calculates the power of the estimated channel impulse response. A symbol demodulator sets the power of the estimated channel impulse response as a reference point defining a minimum distance between signal points in a signal constellation, and decides soft metric values for channel decoding using the reference point and I-channel and Q-channel signal components of the channel-compensated signal. A decoder recovers information bits by decoding the soft metric values.

Type: Grant

Filed: June 20, 2006

Date of Patent: October 27, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Eun-Jeong Yim, Ji-Won Ha, Hee-Jin Roh, Sung-Jin Park
Front-End Noise Reduction for Speech Recognition Engine

Publication number: 20090248411

Abstract: VoIP phones according to the present invention include a microphone, which may be internal or external, and allow the user to communicate unobtrusively, check voice mail and conduct other activities in an environment which can be noisy in general and extremely noisy sometimes. Speech recognition functionality may also be used to generate and send touch tone or DTMF tones such as in response to call trees or voice recognition functionality used by airlines, credit card companies, voice mail systems, and other applications. A system and method of audio processing which provides enhanced speech recognition is provided. Audio input is received at the microphone which is processed by adaptive noise cancellation to generate an enhanced audio signal. The operation of the speech recognition engine and the adaptive noise canceller may be advantageously controlled based on Voice Activity Detection (VAD).

Type: Application

Filed: March 27, 2009

Publication date: October 1, 2009

Inventors: Alon Konchitsky, Alberto D. Berstein, Hariharan Ganapathy Kathirvelu, Sandeep Kulakcherla, William Martin Ribble
Viterbi pretraceback for partial cascade processing

Patent number: 7594162

Abstract: This invention modifies Viterbi decoding to improve BER. Within the state metric unit cascade block, this invention forces the unused ACS units' decision bits to a 0 for the top rail and a 1 for the bottom rail. This invention modifies the final maximum state index with the selected decision bits from the unused ACS units. This invention uses the modified final maximum state index as the initial conditions for the k-1 traceback shift register. This invention also uses the final maximum state index to mask the generated pretraceback decision bits generated from the last block of ACS units.

Type: Grant

Filed: May 11, 2006

Date of Patent: September 22, 2009

Assignee: Texas Instruments Incorporated

Inventor: Tod D. Wolf
Speaker clustering and adaptation method based on the HMM model variation information and its apparatus for speech recognition

Patent number: 7590537

Abstract: A speech recognition method and apparatus perform speaker clustering and speaker adaptation using average model variation information over speakers while analyzing the quantity variation amount and the directional variation amount. In the speaker clustering method, a speaker group model variation is generated based on the model variation between a speaker-independent model and a training speaker ML model. In the speaker adaptation method, the model is found for which the model variation between a test speaker ML model and the ML model of the speaker group to which the test speaker belongs is most similar to a training speaker group model variation, and speaker adaptation is performed on the found model. Herein, the model variation in the speaker clustering and the speaker adaptation is calculated while analyzing both the quantity variation amount and the directional variation amount. The present invention may be applied to any speaker adaptation algorithm of MLLR and MAP.

Type: Grant

Filed: December 27, 2004

Date of Patent: September 15, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Namhoon Kim, Injeong Choi, Yoonkyung Song
Method for detecting signal in multiple input multiple output system and receiving device of multiple input multiple output system

Patent number: 7590201

Abstract: The present invention relates to a method for detecting a signal in an MIMO system. In the method, a received signal is detected in a zero forcing (ZF) method, and a first detection interval is established from the signal detected in the ZF method. The received signal is detected within the first detection interval in a maximum likelihood (ML) method, and a second detection interval is established from the signals respectively detected in the ZF method and the ML method. A final solution is determined by detecting the received signal within the second detection interval in the ML method.

Type: Grant

Filed: August 4, 2005

Date of Patent: September 15, 2009

Assignee: Electronics and Telecommunications Research Institute

Inventors: Heejung Yu, Taehyun Jeon, Myung-Soon Kim, Eun-Young Choi, Sok-Kyu Lee, Deuk-Su Lyu
Speech recognition circuit using parallel processors

Patent number: 7587319

Abstract: A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality of lexical tree data structures. Each lexical tree data structure comprises a model of words having common prefix components. An initial component of each lexical tree structure is unique. A plurality of lexical tree processors are connected in parallel to the input buffer for processing the speech parameters in parallel to perform parallel lexical tree processing for word recognition by accessing the lexical data in the lexical memory. A results memory is connected to the lexical tree processors for storing processing results from the lexical tree processors and lexical tree identifiers to identify lexical trees to be processed by the lexical tree processors.

Type: Grant

Filed: February 4, 2003

Date of Patent: September 8, 2009

Assignee: Zentian Limited

Inventor: Mark Catchpole
Language model for use in speech recognition

Patent number: 7584102

Abstract: Building a language model for use in speech recognition includes identifying without user interaction a source of text related to a user. Text is retrieved from the identified source of text and a language model related to the user is built from the retrieved text.

Type: Grant

Filed: November 15, 2002

Date of Patent: September 1, 2009

Assignee: Scansoft, Inc.

Inventors: Kwangil Hwang, Eric Fieleke
Viterbi path generation for a Dynamic Bayesian Network

Patent number: 7584408

Abstract: Methods, systems, and apparatus are provided to generate a Viterbi path for a DBN. The DBN is converted to a chain of junction trees, where each tree represents a decision-making process. The trees are forwardly iterated and the Viterbi path is generated during the forward iteration (forward pass). This is achieved by maintaining backpointers to previously processed junction trees during the forward pass and dynamically assembling the Viterbi path with each pair of junction trees during the forward pass.

Type: Grant

Filed: February 13, 2006

Date of Patent: September 1, 2009

Assignee: Intel Corporation

Inventors: Wei Hu, Yimin Zhang
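
Illustrative note on patent 7584408: the abstract above describes building the Viterbi path during the forward pass. The sketch below shows that idea on a plain hidden Markov chain, which is a simplification and not the junction-tree method of the patent. Each state carries its best path so far, so no separate backward pass is needed. The function name and the toy probabilities are assumptions, and all probabilities are assumed to be nonzero so that logarithms are defined.

```python
import math


def viterbi_forward(observations, states, start_p, trans_p, emit_p):
    """Return (best_path, probability), assembling paths in the forward pass."""
    # scores[s]: best log-probability of any path that ends in state s.
    scores = {
        s: math.log(start_p[s]) + math.log(emit_p[s][observations[0]])
        for s in states
    }
    # paths[s]: the state sequence that achieves scores[s].
    paths = {s: [s] for s in states}

    for obs in observations[1:]:
        new_scores, new_paths = {}, {}
        for s in states:
            # Pick the predecessor that maximizes the score of reaching s.
            prev = max(states, key=lambda p: scores[p] + math.log(trans_p[p][s]))
            new_scores[s] = (
                scores[prev] + math.log(trans_p[prev][s]) + math.log(emit_p[s][obs])
            )
            # Extend the predecessor's path instead of tracing back later.
            new_paths[s] = paths[prev] + [s]
        scores, paths = new_scores, new_paths

    best = max(states, key=lambda s: scores[s])
    return paths[best], math.exp(scores[best])


# Toy model (hypothetical values for illustration only).
STATES = ["Healthy", "Fever"]
START = {"Healthy": 0.6, "Fever": 0.4}
TRANS = {
    "Healthy": {"Healthy": 0.7, "Fever": 0.3},
    "Fever": {"Healthy": 0.4, "Fever": 0.6},
}
EMIT = {
    "Healthy": {"normal": 0.5, "cold": 0.4, "dizzy": 0.1},
    "Fever": {"normal": 0.1, "cold": 0.3, "dizzy": 0.6},
}

path, prob = viterbi_forward(["normal", "cold", "dizzy"], STATES, START, TRANS, EMIT)
print(path, prob)
```

Carrying whole paths costs more memory than storing one backpointer per state and step; it is used here only because it keeps the forward-only assembly easy to read.
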
Vocabulary-independent search of spontaneous speech

Patent number: 7584098

Abstract: A method of identifying a location of a query string in an audio signal is provided. Under the method, a segment of the audio signal is selected. A score for a query string in the segment of the audio signal is determined by determining the product of probabilities of overlapping sequences of tokens. The score is then used to decide if the segment of the audio signal is likely to contain the query string.

Type: Grant

Filed: November 29, 2004

Date of Patent: September 1, 2009

Assignee: Microsoft Corporation

Inventors: Roger Peng Yu, Frank Torsten Seide
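
Illustrative note on patent 7584098: the abstract above scores a query as a product of probabilities of overlapping token sequences. The sketch below shows one way such a product could be computed, in the log domain to avoid underflow. The window length, the probability table, the floor value, and the function names are all assumptions for illustration; the abstract does not specify them.

```python
import math


def overlapping_ngrams(tokens, n):
    """All length-n windows over the token list, each shifted by one token."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def query_log_score(tokens, ngram_prob, n=3, floor=1e-6):
    """Sum of log-probabilities, i.e. the log of the product of probabilities.

    ngram_prob maps an n-gram tuple to its (hypothetical) probability of
    occurring in the audio segment; unseen n-grams get a small floor value.
    """
    return sum(
        math.log(ngram_prob.get(gram, floor))
        for gram in overlapping_ngrams(tokens, n)
    )


# Hypothetical phoneme-level probabilities for one audio segment.
SEGMENT_PROBS = {("k", "ae", "t"): 0.5, ("ae", "t", "s"): 0.4}

score = query_log_score(["k", "ae", "t", "s"], SEGMENT_PROBS)
print(math.exp(score))
```

A caller would compare the score against a threshold of its own choosing to decide whether the segment is likely to contain the query.
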
Base station transceiver in CDMA mobile communication system

Patent number: RE42557

Abstract: The base station transceiver in a CDMA mobile communication system is configured to have a separate hardware component for performing only a Viterbi decoding apart from a single hardware (H/W) unit that performs a composite function of modulation/demodulation and Viterbi encoding/decoding. The modulator and the Viterbi encoder are provided in one hardware by sectors; more than one demodulator being provided by sectors for demodulating signals from multiple users. And, more than one Viterbi decoder is separately provided for performing a Viterbi decoding of the signals demodulated at the plural demodulators constituted in each sector, thereby facilitating a decoding of demodulated signals received from multiple users and enhancing efficiency of the hardware.

Type: Grant

Filed: November 13, 2009

Date of Patent: July 19, 2011

Assignee: Transpacific Bluetooth, LLC

Inventors: Woon Hee Hwang, Jae Hong Park, Hyun Soo Paik