Voiced Or Unvoiced Patents (Class 704/214)

Unified speech/audio codec (USAC) processing windows sequence based mode switching

Patent number: 12361957

Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.

Type: Grant

Filed: January 30, 2024

Date of Patent: July 15, 2025

Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventors: Seungkwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jeongil Seo, Jin Woo Hong, Chieteuk Ahn, Ho Chong Park, Young-cheol Park
Conversational service

Patent number: 12243546

Abstract: An apparatus including circuitry configured to: enable a conversational service between a first user of the apparatus and a second user of a remote apparatus wherein the conversational service is a duplex service including simultaneous voice communication from the first user to the second user and voice communication from the second user to the first user; and enable synchronization of a switch to using an active noise cancellation mode at the apparatus for the conversational service and at the remote apparatus for the conversational service, wherein the switch to using the noise cancellation mode is synchronized between the first and second users.

Type: Grant

Filed: December 6, 2022

Date of Patent: March 4, 2025

Assignee: Nokia Technologies Oy

Inventors: Lasse Juhani Laaksonen, Miikka Tapani Vilermo, Arto Juhani Lehtiniemi
Apparatus and method for combined intraband and interband multi-frame demosaicing

Patent number: 12148124

Abstract: A method includes obtaining multiple input image frames and determining how to warp at least one of the input image frames. The method also includes performing an intraband demosaic-warp operation to reconstruct image data in different color channels of the input image frames and warp the at least one input image frame to produce RGB input image frames. The method further includes blending the RGB input image frames to produce a blended RGB image frame, performing an interband denoising operation to produce a denoised RGB image frame, and performing an interband sharpening operation to produce a sharpened RGB image frame. In addition, the method includes performing an interband demosaic operation to substantially equalize high-frequency content in different color channels of the sharpened RGB image frame to produce an equalized sharpened RGB image frame and generating a final image of the scene based on the equalized sharpened RGB image frame.

Type: Grant

Filed: January 27, 2022

Date of Patent: November 19, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: John W. Glotzbach, Long N. Le, Zhen Tong, Hamid R. Sheikh, Yibo Xu
Media signature recognition with resource constrained devices

Patent number: 12126846

Abstract: The present invention recognizes media content using signatures generated by network devices with limited processing power. An audio signal is prepared for application of a discrete Fourier transform (DFT). Outputs from the DFT include real components and imaginary components that are used to calculate output magnitudes associated with frequency bins. The frequency-amplitude pairs include the output magnitudes and the associated frequency bins. A signature of the audio signal is generated by selecting a predetermined number of frequency-amplitude pairs having dominant output magnitudes. The network devices that generate the signatures may transmit the signatures to a server for analysis. The server may trigger actions in response to detecting known content based on the received signatures matching known signatures.

Type: Grant

Filed: May 27, 2022

Date of Patent: October 22, 2024

Assignee: Sling TV L.L.C.

Inventors: Eric Dyoniziak, Robert Drew Major
Signal processing device, signal processing method, and program

Patent number: 12123766

Abstract: This signal processing device comprises: an acquisition unit for acquiring an acoustic signal; a measurement unit for measuring an acoustic level of the acoustic signal for every one of first frequency bands, which are a plurality of frequency bands of a preset first bandwidth; a calculation unit that, on the basis of the plurality of acoustic levels of the first frequency bands, identifies an acoustic feature quantity indicating the separation degree from normal acoustic levels of second frequency bands, which are a plurality of frequency bands of a second bandwidth that is wider than the first bandwidth; a first determination unit for determining whether the acoustic levels measured for every one of the first frequency bands are a first threshold value or greater; and a second determination unit for determining whether the acoustic feature quantity is a second threshold value or greater.

Type: Grant

Filed: September 25, 2020

Date of Patent: October 22, 2024

Assignee: MITSUBISHI HEAVY INDUSTRIES, LTD.

Inventors: Motoshi Takasu, Toshifumi Kudo, Yoshiyuki Hamaji, Takanori Kouou
Pitch emphasis apparatus, method and program for the same

Patent number: 12106767

Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.

Type: Grant

Filed: July 7, 2023

Date of Patent: October 1, 2024

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
Method, apparatus, and system for processing audio data

Patent number: 12100406

Abstract: A method for processing an audio signal includes receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.

Type: Grant

Filed: June 29, 2023

Date of Patent: September 24, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventor: Zhe Wang
Noise filling without side information for CELP-like coders

Patent number: 12100409

Abstract: An audio decoder provides a decoded audio information on the basis of an encoded audio information including linear prediction coefficients (LPC) and includes a tilt adjuster to adjust a tilt of a noise using linear prediction coefficients of a current frame to acquire a tilt information and a noise inserter configured to add the noise to the current frame in dependence on the tilt information. Another audio decoder includes a noise level estimator to estimate a noise level for a current frame using a linear prediction coefficient of at least one previous frame to acquire a noise level information; and a noise inserter to add a noise to the current frame in dependence on the noise level information provided by the noise level estimator. Thus, side information about a background noise in the bit-stream may be omitted. Methods and computer programs serve a similar purpose.

Type: Grant

Filed: November 24, 2020

Date of Patent: September 24, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Guillaume Fuchs, Christian Helmrich, Manuel Jander, Benjamin Schubert, Yoshikazu Yokotani
Low power voice activity detector

Patent number: 12094488

Abstract: An apparatus and method for voice activity detection. A multiphase differential output rotating capacitive sampler achieves a frequency down conversion over as many specific frequency bands as are required for analysis. A chirp is created in the rotating sampler as the sum of arbitrary frequencies across the desired analysis band multiplied by a window function. The chirp is sampled at a rate of rotation synchronous with the last state of burst of the chirp, allowing a non-phase synchronous pattern in the coefficient values and allowing a high-Q and arbitrary decomposition of the signal. After the sample is taken, the next clock signal to the sampler is used to define the output voltage of the sampler by shorting the output, which is entirely capacitive, to ground. Processing occurs in the analog domain rather than digitally, avoiding the need for FFTs and allowing for greater speed and lower power consumption.

Type: Grant

Filed: October 23, 2022

Date of Patent: September 17, 2024

Assignee: SiliconIntervention Inc.

Inventor: A. Martin Mallinson
Apparatus and method for determining a predetermined characteristic related to an artificial bandwidth limitation processing of an audio signal

Patent number: 12067995

Abstract: An apparatus for determining a predetermined characteristic related to an artificial bandwidth limitation processing of an audio signal has a slope evaluator configured for evaluating a slope of a spectrum of the audio signal to obtain a slope evaluation result. The apparatus has a frequency evaluator configured for evaluating a cut-off frequency of the spectrum of the audio signal to obtain a frequency evaluation result, and has a processor for providing an information indicating that the audio signal has the predetermined characteristic dependent on an evaluation of the slope evaluation result and an evaluation of the frequency evaluation result.

Type: Grant

Filed: September 25, 2019

Date of Patent: August 20, 2024

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Patrick Gampp, Christian Uhle, Julia Havenstein, Oliver Hellmuth, Sascha Disch, Antonios Karampourniotis, Juergen Herre, Peter Prokein
System and method for managing exception request blocks in a blockchain network

Patent number: 12022016

Abstract: A system for resolving exceptions in requests determines that a request comprises an exception. The exception impeded processing the request to be granted or denied. The system determines a type of exception that indicates whether the exception is incomplete information, incorrect information, or previously-unknown information. The system generates a block in a blockchain network. Based on the stored details in the block, the type of exception is identified. Accordingly, the block is segregated for exception processing based on its exception type. The system stores the exception in the block. The system compares the request with user information previously provided by the user. The system determines a similarity score between the request and the user information. The system compares the similarity score with a threshold percentage. In response to determining that the similarity score exceeds the threshold percentage, the system determines that the exception can be resolved and resolves the exception.

Type: Grant

Filed: April 7, 2022

Date of Patent: June 25, 2024

Assignee: Bank of America Corporation

Inventors: Prashant Khare, Abhishek Trivedi, Gaurav Dadhich, Saurabh Dutta, Shruti Nandini Thakur, Parneet Kaur Gujral, Zeno Valerian Anthony
Method of processing audio data, electronic device and storage medium

Patent number: 11984134

Abstract: A method of processing audio data, an electronic device, and a storage medium, which relates to a field of artificial intelligence, in particular to a field of speech processing technology. The method includes: processing spectral data of the audio data to obtain a first feature information; obtaining a fundamental frequency indication information according to the first feature information, wherein the fundamental frequency indication information indicates valid audio data of the first feature information and invalid audio data of the first feature information; obtaining a fundamental frequency information and a spectral energy information according to the first feature information and the fundamental frequency indication information; and obtaining a harmonic structure information of the audio data according to the fundamental frequency information and the spectral energy information.

Type: Grant

Filed: November 29, 2022

Date of Patent: May 14, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Jiankang Hou, Zhipeng Nie, Liqiang Zhang, Tao Sun, Lei Jia
Method and apparatus with real-time translation

Patent number: 11955118

Abstract: A real-time processor-implemented translation method and apparatus is provided. The real-time translation method includes receiving a content, determining a delay time for real-time translation based on a silence interval of the received content and an utterance interval of the received content, generating a translation result by translating a language used in the received content, and synthesizing the translation result and the received content.

Type: Grant

Filed: April 17, 2020

Date of Patent: April 9, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Youngmin Kim, Hwidong Na, Min-joong Lee, Hodong Lee
Method to mute and unmute a microphone signal

Patent number: 11947868

Abstract: A method for muting and unmuting a microphone is provided. The method includes providing a processor, receiving an input microphone signal, measuring the input microphone signal for a loudness level at a sampling rate, calculating a mute threshold level, checking if the loudness level is higher than or equal to the mute threshold level, and resetting a mute delay timer upon determining that the loudness level is higher than or equal to the mute threshold level and obtaining the input microphone signal, or checking if the mute delay timer is running upon determining that the loudness level is not higher than or equal to the mute threshold level and attenuating the input microphone signal if the mute delay timer is not running or obtaining the input microphone signal if the mute delay timer is still running, and writing the input microphone signal or attenuated input microphone signal to an output buffer.

Type: Grant

Filed: December 28, 2021

Date of Patent: April 2, 2024

Assignee: CREATIVE TECHNOLOGY LTD.

Inventors: Kee Seng Tan, Luen Kai Chan, Ariel Arellano De Castro
Unified speech/audio codec (USAC) processing windows sequence based mode switching

Patent number: 11922962

Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.

Type: Grant

Filed: August 25, 2022

Date of Patent: March 5, 2024

Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventors: Seungkwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jeongil Seo, Jin Woo Hong, Chieteuk Ahn, Ho Chong Park, Young-cheol Park
System and method for remote application sharing

Patent number: 11831723

Abstract: Systems and methods are disclosed herein for remote application sharing. An exemplary method comprises detecting a user establishing a connection with a remote application server, authenticating the user based on login information associated with the user, determining that the user has requested execution of a shared application hosted on the remote application server, responsive to determining that the user has requested execution of the shared application, gathering information for accessing the shared application hosted on the remote application server, establishing a user session for executing the shared application, generating an application link comprising the information for accessing the shared application over the user session and publishing the application link for distribution to one or more third party users, wherein activation of the application link by the one or more third party users shares the user session with the one or more third party users.

Type: Grant

Filed: December 21, 2021

Date of Patent: November 28, 2023

Assignee: Parallels International GmbH

Inventors: Marco Borg, Daniel Farrugia, Nikolay Dobrovolskiy, Sergei Beloussov
Cross-lingual voice conversion system and method

Patent number: 11797782

Abstract: A cross-lingual voice conversion system and method comprises a voice feature extractor configured to receive a first voice audio segment in a first language and a second voice audio segment in a second language, and extract, respectively, audio features comprising first-voice, speaker-dependent acoustic features and second-voice, speaker-independent linguistic features. One or more generators are configured to receive extracted features, and produce therefrom a third voice candidate keeping the first-voice, speaker-dependent acoustic features and the second-voice, speaker-independent linguistic features, wherein the third voice candidate speaks the second language. One or more discriminators are configured to compare the third voice candidate with the ground truth data, and provide results of the comparison back to the generator for refining the third voice candidate.

Type: Grant

Filed: December 30, 2020

Date of Patent: October 24, 2023

Assignee: TMRW Foundation IP S. À R.L.

Inventor: Cevat Yerli
Pitch emphasis apparatus, method and program for the same

Patent number: 11749295

Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.

Type: Grant

Filed: August 31, 2022

Date of Patent: September 5, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
Method, apparatus, and system for processing audio data

Patent number: 11727946

Abstract: A method for processing an audio signal includes: receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.

Type: Grant

Filed: October 21, 2021

Date of Patent: August 15, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventor: Zhe Wang
Decoding apparatus, encoding apparatus, and methods and programs therefor

Patent number: 11715484

Abstract: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.

Type: Grant

Filed: July 1, 2022

Date of Patent: August 1, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
Speech model parameter estimation and quantization

Patent number: 11715477

Abstract: Quantizing speech model parameters includes, for each of multiple vectors of quantized excitation strength parameters, determining first and second errors between first and second elements of a vector of excitation strength parameters and, respectively, first and second elements of the vector of quantized excitation strength parameters, and determining a first energy and a second energy associated with, respectively, the first and second errors. First and second weights for, respectively, the first error and the second error, are determined and are used to produce first and second weighted errors, which are combined to produce a total error. The total errors of each of the multiple vectors of quantized excitation strength parameters are compared and the vector of quantized excitation strength parameters that produces the smallest total error is selected to represent the vector of excitation strength parameters.

Type: Grant

Filed: April 8, 2022

Date of Patent: August 1, 2023

Assignee: Digital Voice Systems, Inc.

Inventors: Daniel W. Griffin, John C. Hardwick
Rate converter

Patent number: 11677383

Abstract: Embodiments of the invention may be used to implement a rate converter that includes: 6 channels in forward (audio) path, each channel having a 24-bit signal path per channel, an End-to-end SNR of 110 dB, all within the 20 Hz to 20 KHz bandwidth. Embodiment may also be used to implement a rate converter having: 2 channels in a reverse path, such as for voice signals, 16-bit signal path per channel, an End-to-end SNR of 93 dB, all within 20 Hz to 20 KHz bandwidth. The rate converter may include sample rates such as 8, 11.025, 12, 16, 22.05, 24, 32 44.1, 48, and 96 KHz. Further, rate converters according to embodiments may include a gated clock in low-power mode to conserve power.

Type: Grant

Filed: March 8, 2019

Date of Patent: June 13, 2023

Assignee: AVNERA CORPORATION

Inventor: Xudong Zhao
Time domain spectral bandwidth replication

Patent number: 11670311

Abstract: A wireless audio system for encoding and decoding an audio signal using spectral bandwidth replication is provided. Bandwidth extension is performed in the time-domain, enabling low-latency audio coding.

Type: Grant

Filed: April 12, 2021

Date of Patent: June 6, 2023

Assignee: Shure Acquisition Holdings, Inc.

Inventors: Wenshun Tian, Michael Ryan Lester
Apparatus and method for voice event detection

Patent number: 11594244

Abstract: A voice event detection apparatus is disclosed. The apparatus comprises a vibration to digital converter and a computing unit. The vibration to digital converter is configured to convert an input audio signal into vibration data. The computing unit is configured to trigger a downstream module according to a sum of vibration counts of the vibration data for a number X of frames. In an embodiment, the voice event detection apparatus is capable of correctly distinguishing a wake phoneme from the input vibration data so as to trigger a downstream module of a computing system. Thus, the power consumption of the computing system is saved.

Type: Grant

Filed: May 11, 2020

Date of Patent: February 28, 2023

Assignee: BRITISH CAYMAN ISLANDS INTELLIGO TECHNOLOGY INC.

Inventors: Tsan-Jieh Chen, Hong-Ching Chen, Chien Hua Hsu, Tsung-Liang Chen
Oversampling in a combined transposer filter bank

Patent number: 11591657

Abstract: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described.

Type: Grant

Filed: March 15, 2021

Date of Patent: February 28, 2023

Assignee: Dolby International AB

Inventors: Lars Villemoes, Per Ekstrand
Pitch emphasis apparatus, method and program for the same

Patent number: 11468907

Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.

Type: Grant

Filed: April 23, 2019

Date of Patent: October 11, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
Learning singing from speech

Patent number: 11430431

Abstract: A method, computer program, and computer system is provided for converting a singing voice of a first person associated with a first speaker to a singing voice of a second person using a speaking voice of the second person associated with a second speaker. A context associated with one or more phonemes corresponding to the singing voice of a first person is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes, the target acoustic frames, and a sample of the speaking voice of the second person. A sample corresponding to the singing voice of a first person is converted to a sample corresponding to the second singing voice using the generated mel-spectrogram features.

Type: Grant

Filed: February 6, 2020

Date of Patent: August 30, 2022

Assignee: TENCENT AMERICA LLC

Inventors: Chengzhu Yu, Heng Lu, Chao Weng, Dong Yu
Methods and apparatus for a voice detector

Patent number: 11380321

Abstract: Various embodiments of the present technology may provide methods and apparatus for a voice detector. The voice detector may provide a microphone and an audio processor. The microphone may provide an active signal generator configured to generate an active signal. The active signal may indicate when the signal level of detected audio is above or below a threshold level with a first state and a second state. The active signal may prevent activity at the microphone I/O interface and may prevent activity at the audio processor's internal logic.

Type: Grant

Filed: October 23, 2019

Date of Patent: July 5, 2022

Assignee: SEMICONDUCTOR COMPONENTS INDUSTRIES, LLC

Inventor: Kenichi Kiyozaki
Signal processing system and a method of determining noise reduction and compensation thereof

Patent number: 11380346

Abstract: A method of determining noise reduction in a signal includes transforming the signal to generate a spectrogram; determining sharp change in a frequency spectrum for each frame in the spectrogram; and comparing a counted number of frames having sharp change with a predetermined value. The signal is determined to be subject to noise reduction if the counted number is greater than the predetermined value.

Type: Grant

Filed: May 22, 2020

Date of Patent: July 5, 2022

Assignee: Wistron Corporation

Inventors: Ching-An Cho, Yu-Yen Chen, Kuo-Ting Huang
Mitigating noise in audio signals

Patent number: 11114109

Abstract: A device implementing a system for mitigating noise includes at least one processor configured to receive a first audio signal corresponding to a first microphone, and determine whether wind noise is present based at least in part on the first audio signal. The processor is configured to select, based on the determining, a second audio signal from between second and third microphones. The second microphone is disposed at a location that experiences less echo coupling when the device is in a particular orientation with respect to a user. The third microphone is disposed at another location that experiences less wind noise. The processor is configured to determine voice and noise reference values based on the first and the selected second audio signals, and perform noise suppression with respect to at least one of the first or the selected second audio signal, based on the voice or the noise reference value.

Type: Grant

Filed: February 7, 2020

Date of Patent: September 7, 2021

Assignee: Apple Inc.

Inventors: Nicholas J. Bryan, Qing Yang, Vasu Iyengar
Speech processing method and terminal

Patent number: 10878833

Abstract: A speech processing method and a terminal are provided. The method includes: receiving signals from a plurality of microphones; performing, by using a same sampling rate, analog-to-digital conversion on the plurality of paths of signals received from the plurality of microphones, to obtain a plurality of paths of time-domain digital signals; performing time-to-frequency-domain conversion on the plurality of paths of time-domain digital signals to obtain a plurality of paths of frequency-domain signals; and determining a signal type of the primary frequency-domain signal based on at least one of a sound pressure difference between the primary frequency-domain signal and each of N paths of secondary frequency-domain signals in the M paths of secondary frequency-domain signals, a phase difference between the primary frequency-domain signal and each of the N paths of secondary frequency-domain signals, and a frequency distribution characteristic of the primary frequency-domain signal.

Type: Grant

Filed: October 12, 2018

Date of Patent: December 29, 2020

Assignee: Huawei Technologies Co., Ltd.

Inventors: Yanbin Du, Zhihai Zhu, Meng Liao, Weijun Zheng, Weibin Chen, Guangzhao Bao, Cunshou Qiu
Audio bandwidth selection

Patent number: 10777213

Abstract: A device includes a receiver configured to receive an audio frame of an audio stream. The audio frame includes information that indicates a coded bandwidth of the audio frame. The device also includes a decoder configured to generate first decoded speech associated with the audio frame and to determine an output mode of the decoder based at least in part on the information that indicates the coded bandwidth. A bandwidth mode indicated by the output mode of the decoder is different than a bandwidth mode indicated by the information that indicates the coded bandwidth. The decoder is further configured to output second decoded speech based on the first decoded speech. The second decoded speech is generated according to an output mode of the decoder.

Type: Grant

Filed: August 3, 2018

Date of Patent: September 15, 2020

Assignee: QUALCOMM Incorporated

Inventors: Venkatraman S. Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Vivek Rajendran
Method, information processing apparatus for processing speech, and non-transitory computer-readable storage medium

Patent number: 10636438

Abstract: A method for processing speech includes: executing a acquiring process that includes acquiring a speech signal; executing a detection process that includes detecting a first frequency spectrum from the speech signal; executing a calculation process that includes calculating a second spectrum based on an envelope of the first spectrum; executing a correction process that includes correcting the first spectrum based on comparison between a first amplitude of the first spectrum and a second amplitude of the second spectrum; executing a estimation process that includes estimating a pitch frequency of the speech signal in accordance with correlation between the corrected first frequency spectrum and periodic signals corresponding to frequencies in a certain band.

Type: Grant

Filed: August 27, 2018

Date of Patent: April 28, 2020

Assignee: FUJITSU LIMITED

Inventors: Sayuri Nakayama, Taro Togawa, Takeshi Otani
Processing of speech signal

Patent number: 10586529

Abstract: A computer-implemented method for processing a speech signal, includes: identifying speech segments in an input speech signal; calculating an upper variance and a lower variance, the upper variance being a variance of upper spectra larger than a criteria among speech spectra corresponding to frames in the speech segments, the lower variance being a variance of lower spectra smaller than a criteria among the speech spectra corresponding to the frames in the speech segments; determining whether the input speech signal is a special input speech signal using a difference between the upper variance and the lower variance; and performing speech recognition of the input speech signal which has been determined to be the special input speech signal, using a special acoustic model for the special input speech signal.

Type: Grant

Filed: September 14, 2017

Date of Patent: March 10, 2020

Assignee: International Business Machines Corporation

Inventors: Osamu Ichikawa, Takashi Fukuda, Gakuto Kurata, Bhuvana Ramabhadran
Virtual physical layer

Patent number: 10574356

Abstract: A virtual physical layer may be provided. When providing the virtual physical layer, a remote radio head may be used. The remote radio head may comprise a first interface device, a second interface device, a digital-to-analog converter, and an analog-to-digital converter. The first interface device may be connected to a virtual physical layer instance instantiated in a cloud-based environment. The second interface device may be connected to customer premises equipment. The digital-to-analog converter may be connected between the first interface device and the second interface device and the analog-to-digital converter may also be connected between the first interface device and the second interface device.

Type: Grant

Filed: January 26, 2018

Date of Patent: February 25, 2020

Assignee: Cisco Technology, Inc.

Inventors: John T. Chapman, Hang Jin, Alon Shlomo Bernstein
Method and a system for decomposition of acoustic signal into sound objects, a sound object and its use

Patent number: 10565970

Abstract: A method and a system for decomposition of acoustic signal into sound objects having the form of signals with slowly-varying amplitude and frequency, as well as sound objects and their use. The object is achieved by a method for decomposing an acoustic signal into digital sound objects, a digital sound object representing a component of the acoustic signal, the component having a waveform, comprising the steps of converting the analogue acoustic signal into a digital input signal (PIN); determining an instantaneous frequency component of the digital input signal, using a digital filter bank; determining an instantaneous amplitude of the instantaneous frequency component; determining an instantaneous phase of the digital input signal associated with the instantaneous frequency; creating at least one digital sound object, based on the determined instantaneous frequency, phase and amplitude; and storing the digital sound object in a sound object database.

Type: Grant

Filed: January 18, 2018

Date of Patent: February 18, 2020

Assignee: SOUND OBJECT TECHNOLOGIES S.A.

Inventor: Adam Pluta
Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates

Patent number: 10468045

Abstract: Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.

Type: Grant

Filed: November 16, 2017

Date of Patent: November 5, 2019

Assignee: VoiceAge EVS LLC

Inventors: Redwan Salami, Vaclav Eksler
Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates

Patent number: 10431233

Abstract: Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.

Type: Grant

Filed: November 15, 2017

Date of Patent: October 1, 2019

Assignee: VoiceAge EVS LLC

Inventors: Redwan Salami, Vaclav Eksler
System and method for the analysis and synthesis of periodic and non-periodic components of speech signals

Patent number: 10354671

Abstract: A voice coder configured to resolve periodic and aperiodic components of spectra is disclosed. The method of voice coding includes parsing the speech signal into a plurality of speech frames; for each of the plurality of speech frames: (a) generating the spectra for the speech frame, (b) parsing the spectra of the speech frame into a plurality of sub-bands, (c) transforming each of the plurality of sub-bands into a time-domain envelope signal, and (d) generating a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates the harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on the periodicity of one of said time-domain envelope signals associated with one of the plurality of sub-bands.

Type: Grant

Filed: February 21, 2018

Date of Patent: July 16, 2019

Assignee: OBEN, INC.

Inventors: Kantapon Kaewtip, Fernando Villavicencio, Mark Harvilla
Systems and methods for capturing noise for pattern recognition processing

Patent number: 10332543

Abstract: Example systems and methods capture a first plurality of portions of audio data by periodically capturing the audio data at first intervals. Embodiments detect speech onset in the audio data. Responsive to detection of the speech onset, systems and methods switch from periodically capturing the audio data to continuously capturing the audio data. Embodiments combine at least one captured portion of the first plurality of captured portions of the audio data with the continuously captured audio data to provide contiguous audio data.

Type: Grant

Filed: June 22, 2018

Date of Patent: June 25, 2019

Assignee: Cypress Semiconductor Corporation

Inventors: Robert Zopf, Victor Simileysky, Ashutosh Pandey, Patrick Cruise
Robust voice activity detector system for use with an earphone

Patent number: 10163453

Abstract: An electronic device or method for adjusting a gain on a voice operated control system can include one or more processors and a memory having computer instructions. The instructions, when executed by the one or more processors causes the one or more processors to perform the operations of receiving a first microphone signal, receiving a second microphone signal, updating a slow time weighted ratio of the filtered first and second signals, and updating a fast time weighted ratio of the filtered first and second signals. The one or more processors can further perform the operations of calculating an absolute difference between the fast time weighted ratio and the slow time weighted ratio, comparing the absolute difference with a threshold, and increasing the gain when the absolute difference is greater than the threshold. Other embodiments are disclosed.

Type: Grant

Filed: October 26, 2015

Date of Patent: December 25, 2018

Assignee: Staton Techiya, LLC

Inventor: John Usher
Automated detection and filtering of audio advertisements

Patent number: 10146868

Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter is measured in each of the data stream segments. It is determined whether the acoustic parameter satisfies a first predetermined condition. The first predetermined condition includes a number of variances, in which the acoustic parameter exceeds a predetermined variance threshold, exceeding a predetermined number threshold. An extraneous portion of the data stream is identified in which the first predetermined condition is satisfied. It is determined whether the extraneous portion satisfies a second predetermined condition in the data stream. The extraneous portion is deleted from the data stream to produce a filtered data stream in response to the second predetermined condition being satisfied.

Type: Grant

Filed: June 8, 2017

Date of Patent: December 4, 2018

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Yeon-Jun Kim, I. Dan Melamed, Bernard S. Renger, Steven Neil Tischer
Multi-structural, multi-level information formalization and structuring method, and associated apparatus

Patent number: 10140305

Abstract: Systems and methods for structuring information include determining information quantity (IQ) and information value (IV) in an original digital information file (ODIF). An initial manipulation process applied to the ODIF forms a first resulting DIF (FRDIF), and a subsequent manipulation process applied to the FRDIF forms a second resulting DIF, wherein each manipulation process removes at least one element of the processed DIF and/or represents an element combination with a representative element and a first indicia of an interrelationship between the representative element and one or more elements in the combination, to reduce the IQ of the processed DIF, while retaining the IV thereof within a threshold. Manipulation processes are successively applied to the previously resulting DIF until successive applications do not achieve a threshold reduction in IQ. The last resulting DIF has a primary structure with a reduced IQ and an IV within the threshold of the original IV.

Type: Grant

Filed: October 14, 2016

Date of Patent: November 27, 2018

Assignee: GENERAL HARMONICS INTERNATIONAL INC.

Inventors: Alexander Zhirkov, Alexey Oraevsky, Andrei Grichine, George Blondheim, Max Wandinger, Wade Attwood
Comfort noise generation apparatus and method

Patent number: 10079023

Abstract: A comfort noise generation apparatus constituted of: near and far end speech detectors arranged to detect speech activity in near-end and far-end signals and a comfort noise generator, wherein responsive to an indication from the near-end speech detector that speech activity is absent on the near-end signal and an indication from the far-end silence detector that speech activity is absent on the far-end signal, the comfort noise generator is arranged to initiate a determination of an estimation of near-end background noise, wherein responsive to an indication from the near-end speech detector that speech activity is present on the near-end signal or an indication from the far-end silence detector that speech activity is present on the far-end signal, the comfort noise generator is arranged to terminate the estimation determination of near-end background noise, and wherein the comfort noise generator is arranged to output a function of the near-end background noise estimation.

Type: Grant

Filed: September 22, 2016

Date of Patent: September 18, 2018

Assignee: Microsemi Semiconductor (U.S.) Inc.

Inventors: Tanmay Zargar, Dillon Reed Ritter, Rodolfo Silva
Band expander, reception device, band expanding method for expanding signal band

Patent number: 10068580

Abstract: An oversampling LPF unit receives a sound signal. A differentiator differentiates the sound signal. An overtone computation unit generates an overtone signal by multiplying a signal differentiated by the differentiator by the sound signal from the oversampling LPF unit. A HPF unit filters the overtone signal generated by the overtone computation unit. A combiner combines the overtone signal filtered by the HPF unit and the sound signal from the oversampling LPF unit.

Type: Grant

Filed: December 23, 2016

Date of Patent: September 4, 2018

Assignee: JVC KENWOOD Corporation

Inventor: Tatsuya Onoda
Method and device for spectral expansion for an audio signal

Patent number: 10043534

Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the original an input narrowband audio signal. Other embodiments are disclosed.

Type: Grant

Filed: December 22, 2014

Date of Patent: August 7, 2018

Assignee: Staton Techiya, LLC

Inventors: John Usher, Dan Ellis
High-band signal generation

Patent number: 9837089

Abstract: A device for signal processing includes a receiver and a high-band excitation signal generator. The receiver is configured to receive a parameter associated with a bandwidth-extended audio stream. The high-band excitation signal generator is configured to determine a value of the parameter. The high-band excitation signal generator is also configured to select, based on the value of the parameter, one of target gain information associated with the bandwidth-extended audio stream or filter information associated with the bandwidth-extended audio stream. The high-band excitation signal generator is further configured to generate a high-band excitation signal based on the one of the target gain information or the filter information.

Type: Grant

Filed: May 25, 2016

Date of Patent: December 5, 2017

Assignee: QUALCOMM Incorporated

Inventors: Venkatraman Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Method and apparatus for exemplary segment classification

Patent number: 9767791

Abstract: Method and apparatus for segmenting speech by detecting the pauses between the words and/or phrases, and to determine whether a particular time interval contains speech or non-speech, such as a pause.

Type: Grant

Filed: September 12, 2016

Date of Patent: September 19, 2017

Assignee: SPEECH MORPHING SYSTEMS, INC.

Inventors: Fathy Yassa, Ben Reaves, Nima Ferdosi
Encoding method, encoder, periodic feature amount determination method, periodic feature amount determination apparatus, program and recording medium

Patent number: 9711158

Abstract: An encoding technique encoding a sound signal at a low bit rate with reduced processing. The technique includes: an interval determination determining an interval T between samples corresponding to periodicity of an audio signal or an integer multiple of a fundamental frequency of the audio signal from a set S of candidates for the interval T; and a side information generating encoding the determined interval T to obtain side information. The interval determining determines the interval T from a set S of Y candidates (Y<Z) including Z2 candidates (Z2<Z) selected from among Z candidates for the interval T representable with the side information without depending on a candidate subjected to the interval determination in a previous frame a predetermined number of frames before the current frame and including a candidate subjected to the interval determination in the previous frame the predetermined number of frames before the current frame.

Type: Grant

Filed: January 18, 2012

Date of Patent: July 18, 2017

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Takehiro Moriya, Noboru Harada, Yusuke Hiwasaki, Yutaka Kamamoto
Automated detection and filtering of audio advertisements

Patent number: 9703865

Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is analyzed based on an acoustic parameter to determine extraneous portions in which a first predetermined condition is satisfied. When a first extraneous portion is separated from a second extraneous portion by a non-extraneous portion in which the first predetermined condition is not satisfied, it is determined whether the first extraneous portion being separated from the second extraneous portion by the non-extraneous portion satisfies a second predetermined condition. At least one of the first extraneous portion and the second extraneous portion is deleted from the data stream to produce a filtered data stream in response to determining the second predetermined condition is satisfied.

Type: Grant

Filed: September 25, 2015

Date of Patent: July 11, 2017

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Yeon-Jun Kim, I. Dan Melamed, Bernard S. Renger, Steven Neil Tischer

1 2 3 4 5 … next