Silence Decision Patents (Class 704/215)
  • Publication number: 20040088168
    Abstract: A system and method for voice transmission over high level network protocols. On the Internet and the World Wide Web, such high level protocols are HTTP/TCP. The restrictions imposed by firewalls and proxy servers are avoided by using HTTP level connections to transmit voice data. In addition, packet delivery guarantees are obtained by using TCP instead of UDP. Variable compression based on silence detection takes advantage of the natural silences and pauses in human speech, thus reducing the delays in transmission caused by using HTTP/TCP. The silence detection includes the ability to bookend the voice data sent with small portions of silence to insure that the voice sounds natural. Finally, the voice data is transmitted to each client computer, independently from a common circular list of voice data, thus insuring that all clients will stay current with the most recent voice data. The combination of these features enables simple, seamless, and interactive Internet conferencing.
    Type: Application
    Filed: July 21, 2003
    Publication date: May 6, 2004
    Applicant: Raindance Communications, Inc.
    Inventors: Andrew W. Scherpbier, Mark Randle Boyns
  • Patent number: 6725191
    Abstract: A method for transmitting speech of a first person communicating with a second person via a packet switched network comprising: generating a stream of samples of the first person's speech during the communication; parsing the sample stream into audio frames; determining which audio frames correspond to periods when the first person is speaking and which correspond to periods when the first person is silent; transmitting audio frames corresponding to silent periods and speaking periods of the first person's speech; and transmitting at least some of the audio frames corresponding to speaking periods, but none of the audio frames corresponding to silent periods, at least twice.
    Type: Grant
    Filed: July 19, 2001
    Date of Patent: April 20, 2004
    Assignee: VocalTec Communications Limited
    Inventor: Ofir Mecayten
  • Patent number: 6718298
    Abstract: Speech recording is effected in a GSM phone handset (100) by storing in a memory (116) speech frames during the presence of speech, one or more SID frames during the absence of speech, and data representative of the duration of the absence of speech. In this way memory (116) does not store silent speech frames, and utilisation of memory space is therefore particularly efficient. In addition, items such as a voice activity detector and a comfort noise estimator, which are already provided in the handset as part of the GSM system, are “re-used” by the invention, thereby making efficient use of already-provided hardware/software.
    Type: Grant
    Filed: October 17, 2000
    Date of Patent: April 6, 2004
    Assignee: Agere Systems Inc.
    Inventor: Rupinder Judge
  • Patent number: 6718303
    Abstract: An apparatus for automatically generating punctuation marks in a continuous speech recognition system, comprises means (1,2,3,5) for recognizing user speech and converting the user speech into words, characterized in that means (1,2,3,5) for recognizing user speech is further used to recognize pseudo noises in the user speech; and the apparatus characterized by further comprising: means (9) for marking pseudo noises in output results of means (1,2,3,5) for recognizing user speech; means (10,14,13) for generating punctuation marks by finding most likely pseudo punctuation marks at locations of pseudo noises marked by the means (9) for marking pseudo noises based on a language model containing pseudo punctuation marks.
    Type: Grant
    Filed: May 13, 1999
    Date of Patent: April 6, 2004
    Assignee: International Business Machines Corporation
    Inventors: Donald T. Tang, Xiao Jin Zhu, Li Oin Shen
  • Publication number: 20040024593
    Abstract: In an acoustic signal encoding apparatus (100), a tonal noise verification unit (110) verifies whether the input acoustic time-domain signals are tonal or noisy. If the input acoustic time-domain signals are tonal, tonal component signals are extracted by a tonal component extraction unit (121), and tonal component parameters are normalized and quantized in a normalization/quantization unit (122). The residual time-domain signals, obtained on extracting the tonal component signals from the acoustic time-domain signals, are transformed by an orthogonal transforming unit (131) into the spectral information, which spectral information is normalized and quantized by a normalization/quantization unit (132). A code string generating unit (140) generates a code string from the quantized tonal component parameters and the quantized residual component spectral information.
    Type: Application
    Filed: February 18, 2003
    Publication date: February 5, 2004
    Inventors: Minoru Tsuji, Shiro Suzuki, Keisuke Toyama
  • Patent number: 6658064
    Abstract: The invention relates to a method for transmitting background noise information including a silence descriptor identifier and background noise parameters in a communication system in which the information to be transmitted is formed into data frames. The data frames are subjected to channel coding to form channel-coded frames. The channel-coded frames are interleaved to be transmitted in two or more data transmission frames, and information of two channel-coded frames is transmitted in each data transmission frame. A first silence descriptor frame is formed provided with the silence descriptor identifier. The first silence descriptor frame is subjected to channel coding to form a channel-coded silence descriptor frame. The channel-coded silence descriptor frame is transmitted in two or more data transmission frames, and at least one data transmission frame transmitting part of the channel-coded silence descriptor frame is also used to transmit at least the background noise parameters.
    Type: Grant
    Filed: August 31, 1999
    Date of Patent: December 2, 2003
    Assignee: Nokia Mobile Phones Limited
    Inventors: Jani Rotola-Pukkila, Hannu Mikkola, Pekka Kapanen, Janne Vainio
  • Patent number: 6658380
    Abstract: A digital speech signal processed by successive frames is subjected to noise suppression taking account of estimates of the noise included in the signal, updated for each frame in a manner dependent on at least one degree of vocal activity. A priori noise suppression is applied to the speech signal of each frame on the basis of estimates of the noise obtained on processing at least one preceding frame, and the energy variations of the a priori noise-suppressed signal are analyzed to detect the degree of vocal activity of said frame.
    Type: Grant
    Filed: June 2, 2000
    Date of Patent: December 2, 2003
    Assignee: Matra Nortel Communications
    Inventors: Philip Lockwood, Stéphane Lubiarz
  • Publication number: 20030212550
    Abstract: According to one embodiment of the invention, an apparatus is provided which includes an encoder to encode input speech signals. The speech signals contain frames of talk spurts and silence gaps. The apparatus further includes a voice activity detector coupled to the encoder, the voice activity detector to detect whether a current frame of the input speech signals is the first active frame of a talk spurt. In response to the voice activity detector detecting that the current frame is the first active frame of a talk spurt, the encoder is reset and the encoder states are initialized.
    Type: Application
    Filed: May 10, 2002
    Publication date: November 13, 2003
    Inventor: Anil W. Ubale
  • Patent number: 6618701
    Abstract: A communications transmitter which operates as a mobile telephone incorporates a noise suppressor (100, FIG. 1) which reduces the background noise in the transmitted voice signal. An external voice activity detector (150), which operates in conjunction with a noise suppressor (100) estimates the signal power of the incoming voice signal and compares this to an estimated noise floor. As a result of this comparison, a voice activity factor is applied to an updated noise floor estimate to create a voice activity threshold estimate. The voice activity threshold estimate is then used to decide whether or not to the force noise suppressor (100) to perform an update of a noise content estimate of the incoming voice signal.
    Type: Grant
    Filed: April 19, 1999
    Date of Patent: September 9, 2003
    Assignee: Motorola, Inc.
    Inventors: James Brian Piket, Ernest Pei-Ching Chen, Christopher Wayne Springfield
  • Publication number: 20030135363
    Abstract: An overflow problem of LSF quantization in G.729 Annex B speech encoding. Preferred embodiments fix the problem with default or limited random variable assignments or flagging the overflow and adjusting the frame encoding such as by limiting spectral components or changing quantization targets.
    Type: Application
    Filed: November 4, 2002
    Publication date: July 17, 2003
    Inventors: Dunling Li, Gokhan Sisli, John T. Dowdal, Zoran Mladenovic
  • Patent number: 6587830
    Abstract: A medical record processor and method include a voice processor for receiving unprompted and unstructured free dictation, for processing the dictation, and for generating voice data comprising a plurality of terms therefrom. A key term comparator identifies a key medical term in the generated voice data by comparing each term in the generated voice data with a plurality of key medical terms. The identified key medical terms are separated from non-key medical terms in the generated voice data, and a non-key medical term is eliminated from the generated voice data. The identified key medical term is matched with a patient condition, and additional data are extracted from a knowledge database on the patient condition. A draft medical record is generated that includes the identified key medical term and the additional data.
    Type: Grant
    Filed: September 21, 2001
    Date of Patent: July 1, 2003
    Assignee: Medical Manager Health Systems, Inc.
    Inventor: Michael A. Singer
  • Publication number: 20030120484
    Abstract: A method and system for utilizing information content in speech and a transition hangover between speech and noise to generate comfort noise on the decoder side. This adaptation to noise may be accomplished using various algorithms of estimating the spectrum of color noise. According to an embodiment of the present invention, an adaptation algorithm may be implemented that adapts with time, rather than a block based algorithm to prevent the repeated generation of artifacts present in the block that are being adapting to. The method and system of the present invention generates comfort noise in the absence of silent insertion descriptions containing spectrum information thereby saving bandwidth and generating colored comfort noise that reflects the spectrum of the actual noise.
    Type: Application
    Filed: January 3, 2002
    Publication date: June 26, 2003
    Inventors: David Wong, Matthew Randmaa, Vasudev S. Nayak
  • Publication number: 20030093267
    Abstract: A buffering process for real-time digital audio is provided to effect of network “jitter” from inconsistent network packet delivery rates. The buffering algorithm is particularly useful for audio data including distinct bursts separated by silence, such as speech. The process holds incoming audio packets in a queue until either: (a) the buffer contents meet a predetermined threshold; or (b) the end packet of a burst is received. The result is that silent periods between bursts may expand or decrease relative to the original audio pattern, allowing cumulative jitter to be played out as silence. The threshold is sized such that the deviation in silence is unnoticeable by a listener. In an optional embodiment, the process periodically adjusts the threshold to adapt to network conditions.
    Type: Application
    Filed: November 15, 2001
    Publication date: May 15, 2003
    Applicant: Microsoft Corporation
    Inventors: Ivan J. Leichtling, Ido Ben-Shachar
  • Publication number: 20030088404
    Abstract: With respect to data having periodicity to be compressed, windows of the same size are set for every two sections according to an interval of peaks appearing substantially periodically and processing for sorting sample data alternately among the set windows of the same size is sequentially performed, whereby a frequency of data having periodicity is replaced with an approximately half frequency without damaging reproducibility to original data at all to make it possible to apply compression processing to data of the replaced low frequency. If this sorting processing is applied to compression processing having a characteristic that a compression ratio is not increased in a high-frequency region, it becomes possible to improve a compression ratio without damaging a quality of reproduced data by decompression at all.
    Type: Application
    Filed: December 16, 2002
    Publication date: May 8, 2003
    Inventor: Yukio Koyanagi
  • Publication number: 20030065508
    Abstract: A first CN code (silence code) obtained by encoding a silence signal, which is contained in an input signal, by a silence compression function of a first speech encoding scheme is transcoded to a second CN code of a second speech encoding scheme without decoding the first CN code to a CN signal. For example, the first CN code is demultiplexed into a plurality of first element codes by a code demultiplexer, the first element codes are each transcoded to a plurality of second element codes that constitute the second CN code, and the second element codes obtained by this transcoding are multiplexed to output the second CN code.
    Type: Application
    Filed: March 27, 2002
    Publication date: April 3, 2003
    Inventors: Yoshiteru Tsuchinaga, Yasuji Ota, Masanao Suzuki
  • Patent number: 6535844
    Abstract: A method and apparatus for detecting silence in voice packets. A packet energy calculator calculates a smoothed energy value for each packet of voice data to be transmitted. A noise level detector adaptively calculates noise values during periods of said silence. A silent packet detector compares the energy value to the noise value and if it is less than the noise value and less than a predetermined silence ceiling value then silence is indicated. Also, if the energy value is less than a predetermined silence noise value then silence is also indicated.
    Type: Grant
    Filed: May 30, 2000
    Date of Patent: March 18, 2003
    Assignee: Mitel Corporation
    Inventors: Robert Geoffrey Wood, Franck Beaucoup
  • Patent number: 6505153
    Abstract: Disclosed is a five-step process for producing closed captions for a television program, subtitles for a movie or other uses for time-aligned transcripts. An operator transcribes the audio track while listening to the recorded material. The system helps him/her to work efficiently and produce precisely aligned captions. The first step consists of identifying the portions of the input audio that contain spoken text. Only the spoken parts are further processed by the invention system. The other parts may be used to generate non-spoken captions. The second step controls the rate of speech depending on how fast the operator types. While the operator types, the third module records the time the words were typed in. This provides a rough time alignment for the transcribed text. Then the fourth module realigns precisely the transcribed text on the audio track. A final module segments the transcribed text into captions, based on acoustic clues and natural language constraints.
    Type: Grant
    Filed: May 22, 2000
    Date of Patent: January 7, 2003
    Assignee: Compaq Information Technologies Group, L.P.
    Inventors: Jean-Manuel Van Thong, Michael Swain, Beth Logan
  • Patent number: 6490554
    Abstract: The invention relates to a voice activity detecting device and a voice activity detecting method. An object of the invention is to adapt to various characteristics of noise which may possibly be superimposed on an aural signal to thereby reliably discriminate between an active voice segment and a non-active voice segment. For this purpose, the voice activity detecting device comprises: a speech-segment inferring section 11 for determining the probability that each of active voice frames given in order of time sequence belongs to the active voice segment, based on the statistical characteristic of the aural signal; a quality monitoring section 12 for monitoring the quality of the aural signal for each active voice frame, and a speech-segment determining section 13 for weighting the determined probability with the above quality to obtain for each active voice frame the accuracy that the active voice frame belongs to the active voice segment.
    Type: Grant
    Filed: March 28, 2002
    Date of Patent: December 3, 2002
    Assignee: Fujitsu Limited
    Inventors: Kaori Endo, Yasuji Ota
  • Patent number: 6490556
    Abstract: A half duplex switching device includes an input connection for receiving an input audio signal, and classification module coupled to the input connection. The classification module provides an output which indicates a classification of the input signal based upon a density of the input audio signal, an energy level of the input audio signal, and classification data provided with the input audio signal. A switching device is coupled to the classification module and determines if the received input audio signal contains speech signals based upon the output of the classification module. The communication receiving device can be used in both communication systems which provide continuous speech signals, and communication systems which remove silence and only provide speech signals.
    Type: Grant
    Filed: May 28, 1999
    Date of Patent: December 3, 2002
    Assignee: Intel Corporation
    Inventors: David L. Graumann, Claudia M. Henry
  • Publication number: 20020120439
    Abstract: In a voice communication system having a transmitter and receiver on opposing sides of an interface, the transmitter is switched on to transmit speech components and is switched off during speech pauses. To provide comfort noise at the receiver, and thus avoid annoying effects caused by continual switching of the transmitter, a comfort noise generator disposed to produce comfort noise of an adjustable amplitude is located on the receiver side of the interface. A first subsystem responsive to operation of the transmitter provides a flag to the receiver to commence operation of the comfort noise generator, when the transmitter discontinues transmission in response to a speech pause. A second subsystem transmits a succession of amplitude parameters through the interface to selectively adjust the amplitude of the generator in corresponding relationship with a noise level at the transmitter.
    Type: Application
    Filed: February 28, 2001
    Publication date: August 29, 2002
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Fisseha Mekuria, Joakim Persson
  • Publication number: 20020120440
    Abstract: A method and apparatus for detecting and transmitting voice signals in a packet voice network system. The method and apparatus make use of a voice activity detection (VAD) unit at a transmitter, for determining if an input signal contains active audio information or passive audio information, where the input signal includes a plurality of frames. For one or more frames of the input signal containing active audio information, the VAD computes a hangover time period. This computation includes determining whether the hangover time period has a fixed duration or a variable duration on the basis of characteristics of the active audio information contained in the one or more frames. When the VAD detects a frame containing passive audio information subsequent to the one or more frames containing active audio information, the input signal is suppressed after the expiry of the computed hangover time period from the detection of the passive audio information.
    Type: Application
    Filed: December 26, 2001
    Publication date: August 29, 2002
    Inventor: Shude Zhang
  • Patent number: 6385447
    Abstract: A system and method employing an access terminal for maintaining discontinuous communications including a gateway receiver for receiving the discontinuous information, a radio frequency (RF) communication link via geosynchronous earth orbit satellite for conveying multiple communication channels using time division multiple access (TDMA), the access terminal initiating information communication with the receiver via at least one of the multiple communication channels. The access terminal further includes a memory for storing protocol processing information and a transmitter for establishing the radio frequency communication link to the receiver of the terrestrial gateway system. The access terminal memory provides for destroying of a signal pattern or protocol assigned to the access terminal by the gateway receiver or transmission of keep-alive bursts by the transmitter during periods of inactivity to maintain information communication with the receiver.
    Type: Grant
    Filed: July 13, 1998
    Date of Patent: May 7, 2002
    Assignee: Hughes Electronics Corporation
    Inventors: Mohammad Soleimani, Moe Rahnema, Jean-Aicard Fabien, David Roos, Anthony Noerpel, Michael Parr
  • Patent number: 6374213
    Abstract: Frame power of an input signal is calculated to discriminate speech frame intervals from non-speech intervals, by thresholding current frame power using an adaptive speech-detection threshold based on the past maximum frame power value and the difference between past maximum and the minimum frame power values, adaptively updated using a predetermined number of frames prior to the current one.
    Type: Grant
    Filed: February 12, 2001
    Date of Patent: April 16, 2002
    Assignee: Nippon Hoso Kyokai
    Inventors: Atsushi Imai, Nobumasa Seiyama, Tohru Takagi
  • Patent number: 6370392
    Abstract: Various methods for effectively detecting the discontinuous transmission (DTX) mode are provided both from the physical layer perspective and from the medium access layer (MAC) perspective. The DTX mode transmission on reverse supplemental channels is detected by comparing a transmitted and a received power ratio of a pilot channel and a supplemental channel. For a reverse link communication, the DTX mode transmission is detected on dedicated control channels. The DTX mode transmission on a forward link supplemental channel can also be detected by monitoring both the transmitted and received power ratios of a primary power control channel and the supplemental channel. As an alternative to or in combination with the above mentioned methods, in order to monitor the DTX mode, a new network parameter is introduced which requires a change in the IS-2000-3 standard. This parameter acts as an indicator signaling to a receiver, from the MAC layer, that the transmitter has entered into the DTX mode.
    Type: Grant
    Filed: June 14, 2000
    Date of Patent: April 9, 2002
    Assignee: Nortel Networks Limited
    Inventors: Jun Li, Chung-Ching Wang
  • Patent number: 6370500
    Abstract: A technique is used in a speech encoder (107) that reduces non-speech activity of a low bit rate digital voice message. Speech model parameters that include quantized speech spectral parameter vectors are generated in a sequence of frames. A determination is made as to which frames of the sequence of frames are voiced frames and which frames are unvoiced frames. A consecutive sequence of frames of unvoiced frames is identified (2330) as an unvoiced burst when a length, NUV, of the consecutive sequence of frames exceeds a predetermined length, Ns. A non-speech activity portion of the unvoiced burst is identified (2335-2365) and removed.
    Type: Grant
    Filed: September 30, 1999
    Date of Patent: April 9, 2002
    Assignee: Motorola, Inc.
    Inventors: Jian-Cheng Huang, Sunil Satyamurti, Floyd Simpson, Kenneth Finlon
  • Patent number: 6347081
    Abstract: A method is disclosed for reducing the power during discontinuous transmission in a digital cellular systems. In order to reduce battery usage and decrease interference to other users, a method is disclosed which employs the transmission of a silence descriptor frame and the following reduction of power once the frame is properly received in the receiver. While the power is reduced there is still the ability to receive and decode inactivity frames.
    Type: Grant
    Filed: July 15, 1998
    Date of Patent: February 12, 2002
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventor: Stefan Bruhn
  • Publication number: 20020007270
    Abstract: A first filter (2061 in FIG. 1) calculates a long-time average of first change quantities based on a difference between a line spectral frequency of an input voice signal and a long-time average thereof. A second filter (2062 in FIG. 1) calculates a long-time average of second change quantities based on a difference between a whole band energy of the input voice signal and a long-time average thereof. A third filter (2063 in FIG. 1) calculates a long-time average of third change quantities based on a difference between a low band energy of the input voice signal and a long-time average thereof. A fourth filter (2064 in FIG. 1) calculates a long-time average of fourth change quantities based on a difference between a zero cross number of the input voice signal and a long-time average thereof. A voice/non-voice determining circuit (1040 in FIG.
    Type: Application
    Filed: May 31, 2001
    Publication date: January 17, 2002
    Applicant: NEC Corporation
    Inventor: Atsushi Murashima
  • Patent number: 6324188
    Abstract: An input voice signal is passed from a voice signal encoding unit to a voice signal compressing unit so that it is encoded and compressed. No transmission of a silence signal within the voice signal to a multiplexing unit is performed in a silence signal eliminating unit. A multiplexing unit multiplexes the voice signal with a data to be transmitted which has been passed via a data terminal control unit and a data terminal control unit from a data terminal and is transmitted to a communication line via the transmission control unit. Lapse of a time-out period of time which is present in a timer is detected in a silence signal inserting unit so that a silence signal is generated and is reproduced in a voice signal decompressing unit and a voice signal decoding unit.
    Type: Grant
    Filed: April 29, 1998
    Date of Patent: November 27, 2001
    Assignee: Sharp Kabushiki Kaisha
    Inventor: Keisuke Tsuji
  • Patent number: 6324506
    Abstract: An apparatus for automatically generating punctuation marks in a continuous speech recognition system, comprises means (1,2,3,5) for recognizing user speech and converting the user speech into words, characterized in that means (1,2,3,5) for recognizing user speech is further used to recognize pseudo noises in the user speech; and the apparatus characterized by further comprising: means (9) for marking pseudo noises in output results of means (1,2,3,5) for recognizing user speech; means (10,14,13) for generating punctuation marks by finding most likely pseudo punctuation marks at locations of pseudo noises marked by the means (9) for marking pseudo noises based on a language model containing pseudo punctuation marks.
    Type: Grant
    Filed: May 13, 1999
    Date of Patent: November 27, 2001
    Assignee: International Business Machines Corporation
    Inventors: Donald T. Tang, Xiao Jin Zhu, Li Oin Shen
  • Patent number: 6314394
    Abstract: A method of reducing undesired components from a signal that includes a desired component and undesirable components utilizes an autoregressive model technique. An autoregressive module determines a power spectral density approximation of the signal. An error component of the power spectral density approximation includes the desired component. Portions of the error component having frequencies outside of the expected range of the desired component preferably are filtered so that the result is the desired component with the undesired component removed. The invention is useful, for example, for reducing undesirable noise components from sound signals.
    Type: Grant
    Filed: May 27, 1999
    Date of Patent: November 6, 2001
    Assignee: Lear Corporation
    Inventor: Alan M. Finn
  • Patent number: 6304848
    Abstract: An apparatus and method for forming medical records and a medical record formed thereby are provided. The apparatus preferably includes a voice input device for inputting free dictation of medical personnel, a voice processor responsive to the voice input device for processing the inputted voice and generating voice data therefrom, and a key term recognizer responsive to the voice processor for recognizing key medical terms freely dictated by the medical personnel. The key term recognizer preferably includes a key medical term database for storing a plurality of key medical terms therein. The apparatus preferably also includes a medical term matcher responsive to the key word recognizer for matching recognized key medical terms used by the medical personnel at least with patient conditions to thereby add additional data thereto and a medical record creator responsive to the medical term matcher for creating an actual patient medical record therefrom.
    Type: Grant
    Filed: August 13, 1998
    Date of Patent: October 16, 2001
    Assignee: Medical Manager Corp.
    Inventor: Michael A. Singer
  • Publication number: 20010016811
    Abstract: Silence description coding for multi-rate speech coding systems that employ discontinued transmission. Speech coding systems include multi-rate speech codecs having an encoder and a decoder. The silence description coding is performed in either the encoder or the decoder of the multi-rate speech codec. It may also be performed in a distributed manner wherein it is performed partially in the encoder and partially in the decoder. The silence description coding is performed on a speech signal having a substantially non-speech-like characteristic. Voice activity detection classifies the speech signal as being either substantially speech-like or substantially non-speech-like. The silence description coding is selected from a plurality of coding modes. In certain embodiments of the invention, the silence description coding is a source coding mode that operates at a bit rate that fits within a bit rate budget as determined by all of the available source coding modes within the plurality of coding modes.
    Type: Application
    Filed: April 24, 2001
    Publication date: August 23, 2001
    Applicant: Conexant Systems, Inc.
    Inventors: Jes Thyssen, Huan-Yu Su, Adil Benyassine, Eyal Shlomot
  • Patent number: 6269331
    Abstract: A comfort noise block, that include a hangover period and comfort noise parameters, is transmitted in such a manner that it is not interrupted by other messages, such as FACCH messages. This is accomplished in a mobile station by a determination of whether any FACCH messages are required to be transmitted. If such FACCH messages exist, a further determination may be made as to which transmission can be made in the shortest time (i.e., the FACCH message or messages or the comfort noise parameters message), and this transmission is made first. In any event the comfort noise parameters block is transmitted without interruption. In a further embodiment of this invention the comfort noise parameters message is transmitted by being concatenated with another message, such as a neighbor channel measurement results message, so as to reduce overhead, conserve bandwidth, and reduce power consumption.
    Type: Grant
    Filed: September 25, 1997
    Date of Patent: July 31, 2001
    Assignee: Nokia Mobile Phones Limited
    Inventors: Seppo Alanara, Pekka Kapanen
  • Patent number: 6230134
    Abstract: A method of separating and identifying verbal announcements in a wireless transmission according to the GSM standard, in which verbal announcements from the audio stream can be associated with digital signals with the help of an ID number, so that the announcements can be separated.
    Type: Grant
    Filed: March 18, 1999
    Date of Patent: May 8, 2001
    Assignee: Robert Bosch GmbH
    Inventors: Andreas Kynast, Andreas Vahle, Oliver Ahrens, Ulrich Kersken, Karin Hempel
  • Patent number: 6226607
    Abstract: A method and apparatus for eighth-rate random number generation for speech coders includes a random number generator configured to generate values of a first random variable. A lookup table is used to store values of a second random variable. The lookup table is addressed with the values of the first random variable. The second random variable is an inverse transform of a cumulative distribution function of the first random variable. An codec encodes input silence frames with the values of the first and second random variables, and regenerates the silence frames with the values of the first and second random variables. The speech coder may be an enhanced variable rate coder, and the silence frames may be encoded at eighth rate. The random variables are advantageously Gaussian random variables with values that are uniformly distributed between zero and one.
    Type: Grant
    Filed: February 8, 1999
    Date of Patent: May 1, 2001
    Assignee: Qualcomm Incorporated
    Inventors: Chienchung Chang, Toa Shen
  • Patent number: 6173265
    Abstract: A voice recording and/or reproducing device includes a plurality of coders having different bit rates for coding voice to provide coded voice data, a voice recording mode change over switch for selecting one of the plurality of coders, and a system controller. The system controller stores coding selection data obtained by the change over of the voice recording mode and coded voice data obtained from the selected coder, to a storing medium, and reduces a deterioration of the voice due to the change over. The voice recording and/or reproducing device also includes a detector for detecting the coding selection data, and a plurality of decoders for decoding the coded voice data at the bit rate corresponding to the detected coding selection data.
    Type: Grant
    Filed: December 23, 1996
    Date of Patent: January 9, 2001
    Assignee: Olympus Optical Co., Ltd.
    Inventor: Hidetaka Takahashi
  • Patent number: 6161087
    Abstract: A method for playback of speech in an audio recording. The method comprises performing full word-level recognition of the speech including recognition of silent pauses and filled pauses, suppressing playback of the filled pauses and silent pauses, alerting a listener of the audio recording to locations of suppressed filled pauses and silent pauses during playback of the audio recording, and accepting a user command to disable suppression of any filled pause or silent pause during playback of the audio recording.
    Type: Grant
    Filed: October 5, 1998
    Date of Patent: December 12, 2000
    Assignee: Lernout & Hauspie Speech Products N.V.
    Inventors: Colin W. Wightman, Joan Bachenko
  • Patent number: 6157906
    Abstract: A digital signal processor (100) receives a digitally vocoded signal (102), and calculates a staggered average value (404) from the frame energy of each received frame, or the product of the frame energy and a voicing value. While the staggered average value is above a threshold voice indicator value, speech is declared present.
    Type: Grant
    Filed: July 31, 1998
    Date of Patent: December 5, 2000
    Assignee: Motorola, Inc.
    Inventors: Richard Brent Nicholls, Chin Pan Wong, Martin Thuo Karanja, Patrick Joseph Doran, David James Graham
  • Patent number: 6138091
    Abstract: This invention relates to a method by means of which more than one audio signal can be recorded in compressed form in a memory element, and to a system implementing such a method. In the system according to the invention, audio signal samples are recorded only when voice is detected in the audio signals. The system according to the invention saves memory capacity required by the recording by combining the audio signal samples when voice is detected in samples of more than one audio signal. Furthermore, an audio signal is not recorded when no voice is detected in the signal. The invention also reduces the average computing capacity needed and thus power consumption, since signal combination, or mixing, is advantageously performed only when voice is detected in the samples of more than one audio signal.
    Type: Grant
    Filed: December 17, 1997
    Date of Patent: October 24, 2000
    Assignee: Nokia Mobile Phones Ltd.
    Inventors: Tero Haataja, Ari Sinisalo
  • Patent number: 6098045
    Abstract: The present invention provides a sound compression/decompression system and method which processes sound data, frame by frame, while transmitting information on special processing at the same time without degrading the bit rate. When a frame for special processing is transmitted in the form of a bit stream, this system sets the index value of the bit stream to a special value not used for normal processing, thus preventing the bit rate from being degraded. This system also contains information on special-processing in the non-index bits of the bit stream to implement various types of special processings. For example, a plurality of consecutive unvoiced frames may be transmitted in a one-frame bit stream.
    Type: Grant
    Filed: August 7, 1998
    Date of Patent: August 1, 2000
    Assignee: NEC Corporation
    Inventors: Takashi Yokomizo, Masahiro Serizawa
  • Patent number: 6088601
    Abstract: It is judged before encoding sound data whether one frame of the sound data for transmission is silent, and if judged as silent, switching circuits 202 and 203, which is for stopping/resuming electric power supply to a circuit 201 for encoding the sound data, are turned off by a power control circuit 30A during an encoding period of one frame. The power control circuit 30A consists of a counter 32 initialized by a frame synchronization pulse FSYNC and counting a sound sampling clock CLKS, a constant setting part 34, a comparator 33 activating a coincident signal EQ when a count of the counter 32 coincides with a set value, and a flip-flop 31 set by a no-voice detect pulse NOV and reset when EQ is activated.
    Type: Grant
    Filed: February 18, 1998
    Date of Patent: July 11, 2000
    Assignee: Fujitsu Limited
    Inventor: Satoru Miyagi
  • Patent number: 6070135
    Abstract: A method and apparatus for discriminating non-sounds and voiceless sounds of speech signals, recorded on a recording medium, from each other when playing back the speech signals at a varied play-back speed. The method includes the steps of setting, as a reference voltage level, an optional value between a voltage level corresponding to non-sounds and a voltage level corresponding to voiceless sounds, detecting a pitch component of each waveform of the speech signals, comparing the absolute value of a voltage level of the detected pitch component with the reference voltage level, and distinguishing and outputting a portion of the speech signal associated with the detected pitch component based on the result of the comparison.
    Type: Grant
    Filed: August 12, 1996
    Date of Patent: May 30, 2000
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Chul Hong Kim, Jum Han Bae
  • Patent number: 6049765
    Abstract: A silence compression system that improves data compression in a digital speech storage device, such as a digital telephone answering machine, without undue clipping of voice signals. Instead of employing only real-time compression, the inventive silence system analyzes and compresses or re-compresses digital speech samples stored previously, when the voice messaging system is off-line or otherwise in a low priority state. A method of silence compression comprises receiving real-time speech samples, storing the same in memory, and analyzing the stored speech samples at a later time to determine thresholds for periods of silence. The periods of silence are then compressed, and the silence compressed voice message is restored in memory. In this fashion, the processor is not required to make a silence period determination on-the-fly simultaneous with encoding and compression of the real-time voice message, and thus is not subjected to heavy processor loads typically encountered in real time.
    Type: Grant
    Filed: December 22, 1997
    Date of Patent: April 11, 2000
    Assignee: Lucent Technologies Inc.
    Inventors: Vasu Iyengar, Syed S. Ali
  • Patent number: 6038529
    Abstract: A signal communication apparatus and method enables direct communication between communication systems of the silence compression and the non-silence compression type. The transmission and reception can each discriminate whether an audio signal is in a sound-present period or in a sound-absent period and this discrimination is output as period identification data. The audio signal is encoded, and the encoded data is selected and transmitted when the period identification data represents a sound-present period. Blank data prepared in advance are selected when the period identification data represents a sound-absent period. Encoded audio signal data of a variable bit rate are received, and the encoded data is selected, decoded, and output at a fixed bit rate when the period identification data represents a sound-present period. Data prepared in advance are outputted when the period identification data represents a sound-absent period.
    Type: Grant
    Filed: July 29, 1997
    Date of Patent: March 14, 2000
    Assignee: NEC Corporation
    Inventor: Ryoichi Harada
  • Patent number: 6029127
    Abstract: An audio data compression method improves over existing standards because of its encoding strategy for silence. The method analyzes the audio input to an encoder. If the audio is for an analyzed time frame is silence, a single byte output is generated by the encoder. If the next frame is silence, no output is generated. When a receiver receives the compressed data, and detects a one-byte silence signal, it can capture that signal and repeat it to a decoder. When the compressed signal reaches the decoder, it is decompressed into an analog signal.
    Type: Grant
    Filed: March 28, 1997
    Date of Patent: February 22, 2000
    Assignee: International Business Machines Corporation
    Inventors: Jeffrey T. Delargy, Mark S. Kressin
  • Patent number: 6006176
    Abstract: A speech coding apparatus which allows a speech decoding apparatus to output a more familiar background noise. The speech coding apparatus includes a voice presence/absence discrimination section, a coding section, a unique word production section, and a data switching section which selectively outputs one of outputs of the coding section and the unique word production section as an output of the speech coding apparatus in response to a result of discrimination of the voice presence/absence discrimination section. The speech coding apparatus further includes an amplitude level discrimination section, a clip processing section and an input switching section. The input switching section selects, when the input speech signal includes voice, the input speech signal, but when the input speech signal includes no voice and a code for updating background noise is to be produced, the input switching section selects the input speech signal after clip processing.
    Type: Grant
    Filed: June 26, 1998
    Date of Patent: December 21, 1999
    Assignee: NEC Corporation
    Inventor: Toshihiro Hayata
  • Patent number: 5995925
    Abstract: A voice speed converter comprising a speech classifying unit for classifying an input speech signal into an unvoiced part and another part, a pitch frequency extracting unit for extracting a pitch frequency from the input speech signal and supplying it, a quasi-pitch frequency supplying unit for supplying a quasi-pitch frequency of fixed length, a voice speed converter for performing voice speed conversion processing on the input speech signal by the use of the pitch frequency or the quasi-pitch frequency, and a switch for controlling switching operations according to the classification result by the speech classifying unit, so as to send the quasi-pitch frequency to the voice speed converter when the input speech signal belongs to the unvoiced part, or so as to send the pitch frequency to the voice speed converter when the input speech signal belongs to another part.
    Type: Grant
    Filed: September 16, 1997
    Date of Patent: November 30, 1999
    Assignee: NEC Corporation
    Inventor: Tadashi Emori
  • Patent number: 5974374
    Abstract: In a voice coding section 1, a digital voice signal coded in a voice coder 10, a linear predictive coefficient used as a filter coefficient in a short-term predictive filter 102, a pitch period and a pitch predictive coefficient used, respectively, as a tap coefficient and a filter coefficient in a long-term predictive filter 103, and voice/no-voice status information of an input voice, are multiplexed in a multiplexer 12. Only when the voice/no-voice status information indicate the voice state is a cell assembled and transmitted. In a voice decoding section 2, the received cell is disassembled to provide multiplexed coded data. The voice signal is decoded by a short-term synthesis filter and a long term synthesis filter. The short term synthesis filter uses a linear predictive coefficient as a filter coefficient that is decoded from multiplexed coded data.
    Type: Grant
    Filed: January 20, 1998
    Date of Patent: October 26, 1999
    Assignee: NEC Corporation
    Inventor: Yasuhiro Wake
  • Patent number: 5960389
    Abstract: An improved method for generating comfort noise (CN) in a mobile terminal operating in a discontinuous transmission (DTX) mode. In one embodiment the invention provides an improved method for comfort noise generation, in which a random excitation is modified by a spectral control filter so that the frequency content of comfort noise and background noise become similar. In another embodiment the transmitter identifies speech coding parameters that are not representative of the actual background noise, and replaces the identified parameters with parameters having a median value. In this manner the non-representative parameters do not skew the result of an averaging operation.
    Type: Grant
    Filed: November 6, 1997
    Date of Patent: September 28, 1999
    Assignee: Nokia Mobile Phones Limited
    Inventors: Kari Jarvinen, Pekka Kapanen, Vesa Ruoppila, Jani Rotola-Pukkila
  • Patent number: 5943319
    Abstract: Improved telecommunication apparatus is realized with a structure that is tailored to interact with the telecommunication network strictly in packet format. The apparatus includes circuitry for creating either control or information packets, where each packet comprises a header portion and a payload portion. The apparatus also includes an identifier module that enables the apparatus to identify itself to the telecommunication network with an identifier signal that is unique to it. The apparatus further includes encoding and decoding circuitry to create, and decode, highly compressed digital representations of audio signals and, optionally, encryption and decryption features to enhance security of communication. Still further, the apparatus includes circuitry to enable users to interact with the network and partake of telephony service.
    Type: Grant
    Filed: April 4, 1996
    Date of Patent: August 24, 1999
    Assignee: Lucent Technologies Inc.
    Inventor: Alexander Gibson Fraser