Voiced Or Unvoiced Patents (Class 704/214)
-
Patent number: 12148124
Abstract: A method includes obtaining multiple input image frames and determining how to warp at least one of the input image frames. The method also includes performing an intraband demosaic-warp operation to reconstruct image data in different color channels of the input image frames and warp the at least one input image frame to produce RGB input image frames. The method further includes blending the RGB input image frames to produce a blended RGB image frame, performing an interband denoising operation to produce a denoised RGB image frame, and performing an interband sharpening operation to produce a sharpened RGB image frame. In addition, the method includes performing an interband demosaic operation to substantially equalize high-frequency content in different color channels of the sharpened RGB image frame to produce an equalized sharpened RGB image frame and generating a final image of the scene based on the equalized sharpened RGB image frame.
Type: Grant
Filed: January 27, 2022
Date of Patent: November 19, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: John W. Glotzbach, Long N. Le, Zhen Tong, Hamid R. Sheikh, Yibo Xu
-
Patent number: 12123766
Abstract: This signal processing device comprises: an acquisition unit for acquiring an acoustic signal; a measurement unit for measuring an acoustic level of the acoustic signal for every one of first frequency bands, which are a plurality of frequency bands of a preset first bandwidth; a calculation unit that, on the basis of the plurality of acoustic levels of the first frequency bands, identifies an acoustic feature quantity indicating the separation degree from normal acoustic levels of second frequency bands, which are a plurality of frequency bands of a second bandwidth that is wider than the first bandwidth; a first determination unit for determining whether the acoustic levels measured for every one of the first frequency bands are a first threshold value or greater; and a second determination unit for determining whether the acoustic feature quantity is a second threshold value or greater.
Type: Grant
Filed: September 25, 2020
Date of Patent: October 22, 2024
Assignee: MITSUBISHI HEAVY INDUSTRIES, LTD.
Inventors: Motoshi Takasu, Toshifumi Kudo, Yoshiyuki Hamaji, Takanori Kouou
-
Patent number: 12126846
Abstract: The present invention recognizes media content using signatures generated by network devices with limited processing power. An audio signal is prepared for application of a discrete Fourier transform (DFT). Outputs from the DFT include real components and imaginary components that are used to calculate output magnitudes associated with frequency bins. The frequency-amplitude pairs include the output magnitudes and the associated frequency bins. A signature of the audio signal is generated by selecting a predetermined number of frequency-amplitude pairs having dominant output magnitudes. The network devices that generate the signatures may transmit the signatures to a server for analysis. The server may trigger actions in response to detecting known content based on the received signatures matching known signatures.
Type: Grant
Filed: May 27, 2022
Date of Patent: October 22, 2024
Assignee: Sling TV L.L.C.
Inventors: Eric Dyoniziak, Robert Drew Major
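The signature step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names `dft_magnitudes` and `audio_signature`, the naive DFT, and the absence of any windowing or other signal preparation are all assumptions made for illustration. The sketch computes a magnitude per frequency bin from the real and imaginary DFT outputs, then keeps a predetermined number of frequency-amplitude pairs with the largest magnitudes.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Return the magnitude of each DFT bin from 0 to N/2 (naive O(N^2) DFT)."""
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(samples))
        # Magnitude computed from the real and imaginary components.
        magnitudes.append(math.hypot(acc.real, acc.imag))
    return magnitudes


def audio_signature(samples, num_pairs=3):
    """Select the num_pairs (bin, magnitude) pairs with dominant magnitudes.

    The result is ordered by bin index so that two signatures can be
    compared element by element (an illustrative choice).
    """
    magnitudes = dft_magnitudes(samples)
    dominant = sorted(enumerate(magnitudes), key=lambda p: p[1], reverse=True)
    return sorted(dominant[:num_pairs])
```

A device with limited processing power would likely use an FFT rather than this naive loop; the loop is used here only to keep the sketch dependency-free.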
-
Patent number: 12106767
Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.
Type: Grant
Filed: July 7, 2023
Date of Patent: October 1, 2024
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
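The arithmetic in this abstract can be read as a one-tap comb-like filter applied to segments whose spectral envelope was judged flat. The sketch below is one possible reading, not the patented apparatus: the function name `enhance_flat_segment`, the parameter names, and the choice to treat samples before the start of the signal as zero are assumptions. Each output sample is the current sample plus the sample one pitch period (T0 samples) earlier, scaled by the pitch gain, the constant B0, and a factor strictly between 0 and 1.

```python
def enhance_flat_segment(signal, start, end, pitch_period, pitch_gain, b0, alpha):
    """Pitch-enhance signal[start:end] for a segment judged spectrally flat.

    pitch_period is T0 in samples, pitch_gain is the segment's pitch gain,
    b0 is the predetermined constant B0, and alpha is the extra factor that
    must be greater than 0 and less than 1.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be greater than 0 and less than 1")
    output = []
    for n in range(start, end):
        # Sample one pitch period in the past; zero before the signal starts
        # (an assumption, the abstract does not cover this edge case).
        past = signal[n - pitch_period] if n - pitch_period >= 0 else 0.0
        output.append(signal[n] + b0 * pitch_gain * alpha * past)
    return output
```

The factor `alpha` weakens the emphasis in flat-envelope segments, which is consistent with the stated goal of reducing unnaturalness in consonant segments.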
-
Patent number: 12100406
Abstract: A method for processing an audio signal includes receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.
Type: Grant
Filed: June 29, 2023
Date of Patent: September 24, 2024
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhe Wang
-
Patent number: 12100409
Abstract: An audio decoder provides a decoded audio information on the basis of an encoded audio information including linear prediction coefficients (LPC) and includes a tilt adjuster to adjust a tilt of a noise using linear prediction coefficients of a current frame to acquire a tilt information and a noise inserter configured to add the noise to the current frame in dependence on the tilt information. Another audio decoder includes a noise level estimator to estimate a noise level for a current frame using a linear prediction coefficient of at least one previous frame to acquire a noise level information; and a noise inserter to add a noise to the current frame in dependence on the noise level information provided by the noise level estimator. Thus, side information about a background noise in the bit-stream may be omitted. Methods and computer programs serve a similar purpose.
Type: Grant
Filed: November 24, 2020
Date of Patent: September 24, 2024
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Guillaume Fuchs, Christian Helmrich, Manuel Jander, Benjamin Schubert, Yoshikazu Yokotani
-
Patent number: 12094488
Abstract: An apparatus and method for voice activity detection. A multiphase differential output rotating capacitive sampler achieves a frequency down conversion over as many specific frequency bands as are required for analysis. A chirp is created in the rotating sampler as the sum of arbitrary frequencies across the desired analysis band multiplied by a window function. The chirp is sampled at a rate of rotation synchronous with the last state of burst of the chirp, allowing a non-phase synchronous pattern in the coefficient values and allowing a high-Q and arbitrary decomposition of the signal. After the sample is taken, the next clock signal to the sampler is used to define the output voltage of the sampler by shorting the output, which is entirely capacitive, to ground. Processing occurs in the analog domain rather than digitally, avoiding the need for FFTs and allowing for greater speed and lower power consumption.
Type: Grant
Filed: October 23, 2022
Date of Patent: September 17, 2024
Assignee: SiliconIntervention Inc.
Inventor: A. Martin Mallinson
-
Patent number: 12067995
Abstract: An apparatus for determining a predetermined characteristic related to an artificial bandwidth limitation processing of an audio signal has a slope evaluator configured for evaluating a slope of a spectrum of the audio signal to obtain a slope evaluation result. The apparatus has a frequency evaluator configured for evaluating a cut-off frequency of the spectrum of the audio signal to obtain a frequency evaluation result, and has a processor for providing an information indicating that the audio signal has the predetermined characteristic dependent on an evaluation of the slope evaluation result and an evaluation of the frequency evaluation result.
Type: Grant
Filed: September 25, 2019
Date of Patent: August 20, 2024
Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Inventors: Patrick Gampp, Christian Uhle, Julia Havenstein, Oliver Hellmuth, Sascha Disch, Antonios Karampourniotis, Juergen Herre, Peter Prokein
-
Patent number: 12022016
Abstract: A system for resolving exceptions in requests determines that a request comprises an exception. The exception impeded processing the request to be granted or denied. The system determines a type of exception that indicates whether the exception is incomplete information, incorrect information, or previously-unknown information. The system generates a block in a blockchain network. Based on the stored details in the block, the type of exception is identified. Accordingly, the block is segregated for exception processing based on its exception type. The system stores the exception in the block. The system compares the request with user information previously provided by the user. The system determines a similarity score between the request and the user information. The system compares the similarity score with a threshold percentage. In response to determining that the similarity score exceeds the threshold percentage, the system determines that the exception can be resolved and resolves the exception.
Type: Grant
Filed: April 7, 2022
Date of Patent: June 25, 2024
Assignee: Bank of America Corporation
Inventors: Prashant Khare, Abhishek Trivedi, Gaurav Dadhich, Saurabh Dutta, Shruti Nandini Thakur, Parneet Kaur Gujral, Zeno Valerian Anthony
-
Patent number: 11984134
Abstract: A method of processing audio data, an electronic device, and a storage medium, which relates to a field of artificial intelligence, in particular to a field of speech processing technology. The method includes: processing spectral data of the audio data to obtain a first feature information; obtaining a fundamental frequency indication information according to the first feature information, wherein the fundamental frequency indication information indicates valid audio data of the first feature information and invalid audio data of the first feature information; obtaining a fundamental frequency information and a spectral energy information according to the first feature information and the fundamental frequency indication information; and obtaining a harmonic structure information of the audio data according to the fundamental frequency information and the spectral energy information.
Type: Grant
Filed: November 29, 2022
Date of Patent: May 14, 2024
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Inventors: Jiankang Hou, Zhipeng Nie, Liqiang Zhang, Tao Sun, Lei Jia
-
Patent number: 11955118
Abstract: A real-time processor-implemented translation method and apparatus is provided. The real-time translation method includes receiving a content, determining a delay time for real-time translation based on a silence interval of the received content and an utterance interval of the received content, generating a translation result by translating a language used in the received content, and synthesizing the translation result and the received content.
Type: Grant
Filed: April 17, 2020
Date of Patent: April 9, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: Youngmin Kim, Hwidong Na, Min-joong Lee, Hodong Lee
-
Patent number: 11947868
Abstract: A method for muting and unmuting a microphone is provided. The method includes providing a processor, receiving an input microphone signal, measuring the input microphone signal for a loudness level at a sampling rate, calculating a mute threshold level, checking if the loudness level is higher than or equal to the mute threshold level, and resetting a mute delay timer upon determining that the loudness level is higher than or equal to the mute threshold level and obtaining the input microphone signal, or checking if the mute delay timer is running upon determining that the loudness level is not higher than or equal to the mute threshold level and attenuating the input microphone signal if the mute delay timer is not running or obtaining the input microphone signal if the mute delay timer is still running, and writing the input microphone signal or attenuated input microphone signal to an output buffer.
Type: Grant
Filed: December 28, 2021
Date of Patent: April 2, 2024
Assignee: CREATIVE TECHNOLOGY LTD.
Inventors: Kee Seng Tan, Luen Kai Chan, Ariel Arellano De Castro
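The branching in this abstract (loud enough: reset the timer and pass the signal; quiet with the timer running: pass the signal; quiet with the timer expired: attenuate) can be sketched as a small frame-based gate. This is an illustration under stated assumptions, not the patented method: the function name `gate_frames`, the use of peak absolute amplitude as the loudness measure, a fixed threshold supplied by the caller instead of a calculated one, and a timer counted in frames are all hypothetical choices.

```python
def gate_frames(frames, mute_threshold, mute_delay_frames, attenuation=0.0):
    """Pass or attenuate microphone frames using a mute delay timer.

    frames is a list of sample lists. Loudness is taken as the peak absolute
    sample value of a frame (an assumption). The timer counts how many quiet
    frames may still pass unattenuated.
    """
    timer = 0
    output_buffer = []
    for frame in frames:
        loudness = max((abs(s) for s in frame), default=0.0)
        if loudness >= mute_threshold:
            timer = mute_delay_frames          # reset the mute delay timer
            output_buffer.append(list(frame))  # pass the input unchanged
        elif timer > 0:
            timer -= 1                         # timer still running
            output_buffer.append(list(frame))
        else:
            # Timer expired: write the attenuated signal instead.
            output_buffer.append([s * attenuation for s in frame])
    return output_buffer
```

The delay timer keeps short pauses between words from being cut off, at the cost of letting a brief tail of background noise through after speech ends.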
-
Patent number: 11922962
Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
Type: Grant
Filed: August 25, 2022
Date of Patent: March 5, 2024
Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventors: Seungkwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jeongil Seo, Jin Woo Hong, Chieteuk Ahn, Ho Chong Park, Young-cheol Park
-
Patent number: 11831723
Abstract: Systems and methods are disclosed herein for remote application sharing. An exemplary method comprises detecting a user establishing a connection with a remote application server, authenticating the user based on login information associated with the user, determining that the user has requested execution of a shared application hosted on the remote application server, responsive to determining that the user has requested execution of the shared application, gathering information for accessing the shared application hosted on the remote application server, establishing a user session for executing the shared application, generating an application link comprising the information for accessing the shared application over the user session and publishing the application link for distribution to one or more third party users, wherein activation of the application link by the one or more third party users shares the user session with the one or more third party users.
Type: Grant
Filed: December 21, 2021
Date of Patent: November 28, 2023
Assignee: Parallels International GmbH
Inventors: Marco Borg, Daniel Farrugia, Nikolay Dobrovolskiy, Sergei Beloussov
-
Patent number: 11797782
Abstract: A cross-lingual voice conversion system and method comprises a voice feature extractor configured to receive a first voice audio segment in a first language and a second voice audio segment in a second language, and extract, respectively, audio features comprising first-voice, speaker-dependent acoustic features and second-voice, speaker-independent linguistic features. One or more generators are configured to receive extracted features, and produce therefrom a third voice candidate keeping the first-voice, speaker-dependent acoustic features and the second-voice, speaker-independent linguistic features, wherein the third voice candidate speaks the second language. One or more discriminators are configured to compare the third voice candidate with the ground truth data, and provide results of the comparison back to the generator for refining the third voice candidate.
Type: Grant
Filed: December 30, 2020
Date of Patent: October 24, 2023
Assignee: TMRW Foundation IP S. À R.L.
Inventor: Cevat Yerli
-
Patent number: 11749295
Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.
Type: Grant
Filed: August 31, 2022
Date of Patent: September 5, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
-
Patent number: 11727946
Abstract: A method for processing an audio signal includes: receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.
Type: Grant
Filed: October 21, 2021
Date of Patent: August 15, 2023
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhe Wang
-
Patent number: 11715477
Abstract: Quantizing speech model parameters includes, for each of multiple vectors of quantized excitation strength parameters, determining first and second errors between first and second elements of a vector of excitation strength parameters and, respectively, first and second elements of the vector of quantized excitation strength parameters, and determining a first energy and a second energy associated with, respectively, the first and second errors. First and second weights for, respectively, the first error and the second error, are determined and are used to produce first and second weighted errors, which are combined to produce a total error. The total errors of each of the multiple vectors of quantized excitation strength parameters are compared and the vector of quantized excitation strength parameters that produces the smallest total error is selected to represent the vector of excitation strength parameters.
Type: Grant
Filed: April 8, 2022
Date of Patent: August 1, 2023
Assignee: Digital Voice Systems, Inc.
Inventors: Daniel W. Griffin, John C. Hardwick
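The selection rule in this abstract is a weighted nearest-neighbor search over a codebook. The sketch below is a generic illustration, not the patented quantizer: the function name `select_codebook_vector` is hypothetical, the weights are supplied by the caller because the abstract does not say how they are derived, and the weighted error is assumed to be the weight times the squared element error.

```python
def select_codebook_vector(target, codebook, weights):
    """Return (index, total_error) of the candidate with the smallest total error.

    target is the vector of unquantized parameters, codebook is a list of
    candidate quantized vectors, and weights holds one weight per element.
    """
    best_index, best_total = None, float("inf")
    for index, candidate in enumerate(codebook):
        total = 0.0
        for t, c, w in zip(target, candidate, weights):
            error = t - c
            total += w * error * error  # weighted squared error (assumed form)
        if total < best_total:
            best_index, best_total = index, total
    return best_index, best_total
```

Changing the weights changes which candidate wins: with equal weights the overall closest vector is chosen, while a zero weight on one element makes the search ignore that element entirely.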
-
Patent number: 11715484
Abstract: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.
Type: Grant
Filed: July 1, 2022
Date of Patent: August 1, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
-
Patent number: 11677383
Abstract: Embodiments of the invention may be used to implement a rate converter that includes: 6 channels in a forward (audio) path, each channel having a 24-bit signal path per channel, an end-to-end SNR of 110 dB, all within the 20 Hz to 20 KHz bandwidth. Embodiments may also be used to implement a rate converter having: 2 channels in a reverse path, such as for voice signals, a 16-bit signal path per channel, an end-to-end SNR of 93 dB, all within the 20 Hz to 20 KHz bandwidth. The rate converter may include sample rates such as 8, 11.025, 12, 16, 22.05, 24, 32, 44.1, 48, and 96 KHz. Further, rate converters according to embodiments may include a gated clock in low-power mode to conserve power.
Type: Grant
Filed: March 8, 2019
Date of Patent: June 13, 2023
Assignee: AVNERA CORPORATION
Inventor: Xudong Zhao
-
Patent number: 11670311
Abstract: A wireless audio system for encoding and decoding an audio signal using spectral bandwidth replication is provided. Bandwidth extension is performed in the time-domain, enabling low-latency audio coding.
Type: Grant
Filed: April 12, 2021
Date of Patent: June 6, 2023
Assignee: Shure Acquisition Holdings, Inc.
Inventors: Wenshun Tian, Michael Ryan Lester
-
Patent number: 11594244
Abstract: A voice event detection apparatus is disclosed. The apparatus comprises a vibration to digital converter and a computing unit. The vibration to digital converter is configured to convert an input audio signal into vibration data. The computing unit is configured to trigger a downstream module according to a sum of vibration counts of the vibration data for a number X of frames. In an embodiment, the voice event detection apparatus is capable of correctly distinguishing a wake phoneme from the input vibration data so as to trigger a downstream module of a computing system. Thus, the power consumption of the computing system is saved.
Type: Grant
Filed: May 11, 2020
Date of Patent: February 28, 2023
Assignee: BRITISH CAYMAN ISLANDS INTELLIGO TECHNOLOGY INC.
Inventors: Tsan-Jieh Chen, Hong-Ching Chen, Chien Hua Hsu, Tsung-Liang Chen
-
Patent number: 11591657
Abstract: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described.
Type: Grant
Filed: March 15, 2021
Date of Patent: February 28, 2023
Assignee: Dolby International AB
Inventors: Lars Villemoes, Per Ekstrand
-
Patent number: 11468907
Abstract: Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain ?0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.
Type: Grant
Filed: April 23, 2019
Date of Patent: October 11, 2022
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Yutaka Kamamoto, Ryosuke Sugiura, Takehiro Moriya
-
Patent number: 11430431
Abstract: A method, computer program, and computer system are provided for converting a singing voice of a first person associated with a first speaker to a singing voice of a second person using a speaking voice of the second person associated with a second speaker. A context associated with one or more phonemes corresponding to the singing voice of a first person is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes, the target acoustic frames, and a sample of the speaking voice of the second person. A sample corresponding to the singing voice of a first person is converted to a sample corresponding to the second singing voice using the generated mel-spectrogram features.
Type: Grant
Filed: February 6, 2020
Date of Patent: August 30, 2022
Assignee: TENCENT AMERICA LLC
Inventors: Chengzhu Yu, Heng Lu, Chao Weng, Dong Yu
-
Patent number: 11380321
Abstract: Various embodiments of the present technology may provide methods and apparatus for a voice detector. The voice detector may provide a microphone and an audio processor. The microphone may provide an active signal generator configured to generate an active signal. The active signal may indicate when the signal level of detected audio is above or below a threshold level with a first state and a second state. The active signal may prevent activity at the microphone I/O interface and may prevent activity at the audio processor's internal logic.
Type: Grant
Filed: October 23, 2019
Date of Patent: July 5, 2022
Assignee: SEMICONDUCTOR COMPONENTS INDUSTRIES, LLC
Inventor: Kenichi Kiyozaki
-
Patent number: 11380346
Abstract: A method of determining noise reduction in a signal includes transforming the signal to generate a spectrogram; determining sharp change in a frequency spectrum for each frame in the spectrogram; and comparing a counted number of frames having sharp change with a predetermined value. The signal is determined to be subject to noise reduction if the counted number is greater than the predetermined value.
Type: Grant
Filed: May 22, 2020
Date of Patent: July 5, 2022
Assignee: Wistron Corporation
Inventors: Ching-An Cho, Yu-Yen Chen, Kuo-Ting Huang
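The counting logic in this abstract can be sketched once a spectrogram is available. The code below is only an illustration under stated assumptions: the names `has_sharp_change` and `is_noise_reduced` are hypothetical, the spectrogram is taken as an already computed list of per-frame magnitudes in dB, and a "sharp change" is assumed to mean a jump between adjacent frequency bins of at least `jump_db`. The abstract does not define the criterion, so that definition and both default values are placeholders.

```python
def has_sharp_change(frame_db, jump_db):
    """True if any two adjacent frequency bins differ by at least jump_db."""
    return any(abs(b - a) >= jump_db for a, b in zip(frame_db, frame_db[1:]))


def is_noise_reduced(spectrogram_db, jump_db=30.0, min_frames=2):
    """Count frames with a sharp spectral change and compare with min_frames.

    Returns (decision, count). The signal is judged to have been subject to
    noise reduction when count is strictly greater than min_frames.
    """
    count = sum(1 for frame in spectrogram_db if has_sharp_change(frame, jump_db))
    return count > min_frames, count
```

The intuition is that aggressive noise suppression tends to carve abrupt edges into the spectrum, so many frames with such edges suggest the signal was already processed.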
-
Patent number: 11114109
Abstract: A device implementing a system for mitigating noise includes at least one processor configured to receive a first audio signal corresponding to a first microphone, and determine whether wind noise is present based at least in part on the first audio signal. The processor is configured to select, based on the determining, a second audio signal from between second and third microphones. The second microphone is disposed at a location that experiences less echo coupling when the device is in a particular orientation with respect to a user. The third microphone is disposed at another location that experiences less wind noise. The processor is configured to determine voice and noise reference values based on the first and the selected second audio signals, and perform noise suppression with respect to at least one of the first or the selected second audio signal, based on the voice or the noise reference value.
Type: Grant
Filed: February 7, 2020
Date of Patent: September 7, 2021
Assignee: Apple Inc.
Inventors: Nicholas J. Bryan, Qing Yang, Vasu Iyengar
-
Patent number: 10878833
Abstract: A speech processing method and a terminal are provided. The method includes: receiving signals from a plurality of microphones; performing, by using a same sampling rate, analog-to-digital conversion on the plurality of paths of signals received from the plurality of microphones, to obtain a plurality of paths of time-domain digital signals; performing time-to-frequency-domain conversion on the plurality of paths of time-domain digital signals to obtain a plurality of paths of frequency-domain signals; and determining a signal type of the primary frequency-domain signal based on at least one of a sound pressure difference between the primary frequency-domain signal and each of N paths of secondary frequency-domain signals in the M paths of secondary frequency-domain signals, a phase difference between the primary frequency-domain signal and each of the N paths of secondary frequency-domain signals, and a frequency distribution characteristic of the primary frequency-domain signal.
Type: Grant
Filed: October 12, 2018
Date of Patent: December 29, 2020
Assignee: Huawei Technologies Co., Ltd.
Inventors: Yanbin Du, Zhihai Zhu, Meng Liao, Weijun Zheng, Weibin Chen, Guangzhao Bao, Cunshou Qiu
-
Patent number: 10777213
Abstract: A device includes a receiver configured to receive an audio frame of an audio stream. The audio frame includes information that indicates a coded bandwidth of the audio frame. The device also includes a decoder configured to generate first decoded speech associated with the audio frame and to determine an output mode of the decoder based at least in part on the information that indicates the coded bandwidth. A bandwidth mode indicated by the output mode of the decoder is different than a bandwidth mode indicated by the information that indicates the coded bandwidth. The decoder is further configured to output second decoded speech based on the first decoded speech. The second decoded speech is generated according to an output mode of the decoder.
Type: Grant
Filed: August 3, 2018
Date of Patent: September 15, 2020
Assignee: QUALCOMM Incorporated
Inventors: Venkatraman S. Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Vivek Rajendran
-
Patent number: 10636438
Abstract: A method for processing speech includes: executing an acquiring process that includes acquiring a speech signal; executing a detection process that includes detecting a first frequency spectrum from the speech signal; executing a calculation process that includes calculating a second spectrum based on an envelope of the first spectrum; executing a correction process that includes correcting the first spectrum based on comparison between a first amplitude of the first spectrum and a second amplitude of the second spectrum; and executing an estimation process that includes estimating a pitch frequency of the speech signal in accordance with correlation between the corrected first frequency spectrum and periodic signals corresponding to frequencies in a certain band.
Type: Grant
Filed: August 27, 2018
Date of Patent: April 28, 2020
Assignee: FUJITSU LIMITED
Inventors: Sayuri Nakayama, Taro Togawa, Takeshi Otani
-
Patent number: 10586529
Abstract: A computer-implemented method for processing a speech signal includes: identifying speech segments in an input speech signal; calculating an upper variance and a lower variance, the upper variance being a variance of upper spectra larger than a criterion among speech spectra corresponding to frames in the speech segments, the lower variance being a variance of lower spectra smaller than a criterion among the speech spectra corresponding to the frames in the speech segments; determining whether the input speech signal is a special input speech signal using a difference between the upper variance and the lower variance; and performing speech recognition of the input speech signal which has been determined to be the special input speech signal, using a special acoustic model for the special input speech signal.
Type: Grant
Filed: September 14, 2017
Date of Patent: March 10, 2020
Assignee: International Business Machines Corporation
Inventors: Osamu Ichikawa, Takashi Fukuda, Gakuto Kurata, Bhuvana Ramabhadran
-
Patent number: 10574356
Abstract: A virtual physical layer may be provided. When providing the virtual physical layer, a remote radio head may be used. The remote radio head may comprise a first interface device, a second interface device, a digital-to-analog converter, and an analog-to-digital converter. The first interface device may be connected to a virtual physical layer instance instantiated in a cloud-based environment. The second interface device may be connected to customer premises equipment. The digital-to-analog converter may be connected between the first interface device and the second interface device and the analog-to-digital converter may also be connected between the first interface device and the second interface device.
Type: Grant
Filed: January 26, 2018
Date of Patent: February 25, 2020
Assignee: Cisco Technology, Inc.
Inventors: John T. Chapman, Hang Jin, Alon Shlomo Bernstein
-
Patent number: 10565970Abstract: A method and a system for decomposition of acoustic signal into sound objects having the form of signals with slowly-varying amplitude and frequency, as well as sound objects and their use. The object is achieved by a method for decomposing an acoustic signal into digital sound objects, a digital sound object representing a component of the acoustic signal, the component having a waveform, comprising the steps of converting the analogue acoustic signal into a digital input signal (PIN); determining an instantaneous frequency component of the digital input signal, using a digital filter bank; determining an instantaneous amplitude of the instantaneous frequency component; determining an instantaneous phase of the digital input signal associated with the instantaneous frequency; creating at least one digital sound object, based on the determined instantaneous frequency, phase and amplitude; and storing the digital sound object in a sound object database.Type: GrantFiled: January 18, 2018Date of Patent: February 18, 2020Assignee: SOUND OBJECT TECHNOLOGIES S.A.Inventor: Adam Pluta
-
Patent number: 10468045Abstract: Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.Type: GrantFiled: November 16, 2017Date of Patent: November 5, 2019Assignee: VoiceAge EVS LLCInventors: Redwan Salami, Vaclav Eksler
-
Patent number: 10431233Abstract: Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.Type: GrantFiled: November 15, 2017Date of Patent: October 1, 2019Assignee: VoiceAge EVS LLCInventors: Redwan Salami, Vaclav Eksler
-
Patent number: 10354671Abstract: A voice coder configured to resolve periodic and aperiodic components of spectra is disclosed. The method of voice coding includes parsing the speech signal into a plurality of speech frames; for each of the plurality of speech frames: (a) generating the spectra for the speech frame, (b) parsing the spectra of the speech frame into a plurality of sub-bands, (c) transforming each of the plurality of sub-bands into a time-domain envelope signal, and (d) generating a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates the harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on the periodicity of one of said time-domain envelope signals associated with one of the plurality of sub-bands.Type: GrantFiled: February 21, 2018Date of Patent: July 16, 2019Assignee: OBEN, INC.Inventors: Kantapon Kaewtip, Fernando Villavicencio, Mark Harvilla
-
Patent number: 10332543Abstract: Example systems and methods capture a first plurality of portions of audio data by periodically capturing the audio data at first intervals. Embodiments detect speech onset in the audio data. Responsive to detection of the speech onset, systems and methods switch from periodically capturing the audio data to continuously capturing the audio data. Embodiments combine at least one captured portion of the first plurality of captured portions of the audio data with the continuously captured audio data to provide contiguous audio data.Type: GrantFiled: June 22, 2018Date of Patent: June 25, 2019Assignee: Cypress Semiconductor CorporationInventors: Robert Zopf, Victor Simileysky, Ashutosh Pandey, Patrick Cruise
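The switch from periodic to continuous capture described in this abstract can be sketched as follows. This is a minimal illustration only, not the patented implementation: the function name `capture_with_onset_switch`, the frame-based representation, the `interval` parameter, and the mean-energy onset test are all assumptions introduced here.

```python
def capture_with_onset_switch(frames, interval=4, onset_threshold=0.5):
    """Return the contiguous frames kept after speech onset is detected.

    Hypothetical sketch: `frames` is a list of sample lists. Before onset,
    only every `interval`-th frame is captured (periodic capture). Once a
    captured frame's mean energy exceeds `onset_threshold` (an assumed
    onset detector), every later frame is captured (continuous capture).
    """
    contiguous = []
    continuous = False
    for index, frame in enumerate(frames):
        if continuous:
            # Continuous mode: keep every frame.
            contiguous.append(frame)
            continue
        if index % interval != 0:
            # Periodic mode: frames between capture instants are skipped.
            continue
        energy = sum(s * s for s in frame) / max(len(frame), 1)
        if energy > onset_threshold:
            # Combine the periodic portion that held the onset with the
            # continuously captured audio that follows it.
            contiguous.append(frame)
            continuous = True
    return contiguous
```

With `interval=2`, speech that begins on an odd-numbered frame is first seen one frame late, which is the trade-off of capturing only periodically before onset.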
-
Patent number: 10163453Abstract: An electronic device or method for adjusting a gain on a voice operated control system can include one or more processors and a memory having computer instructions. The instructions, when executed by the one or more processors, cause the one or more processors to perform the operations of receiving a first microphone signal, receiving a second microphone signal, updating a slow time weighted ratio of the filtered first and second signals, and updating a fast time weighted ratio of the filtered first and second signals. The one or more processors can further perform the operations of calculating an absolute difference between the fast time weighted ratio and the slow time weighted ratio, comparing the absolute difference with a threshold, and increasing the gain when the absolute difference is greater than the threshold. Other embodiments are disclosed.Type: GrantFiled: October 26, 2015Date of Patent: December 25, 2018Assignee: Staton Techiya, LLCInventor: John Usher
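The fast and slow ratio comparison in this abstract can be sketched as below. This is a rough illustration under stated assumptions, not the patented method: the class `RatioTracker`, the function `update_gain`, the exponential smoothing used for the time weighting, and every numeric default are hypothetical, and the inputs are assumed to be already-filtered signal levels.

```python
from dataclasses import dataclass


@dataclass
class RatioTracker:
    """Hypothetical state: two smoothed level ratios and the current gain."""
    fast: float = 1.0
    slow: float = 1.0
    gain: float = 1.0


def update_gain(tracker, level_a, level_b, fast_alpha=0.5, slow_alpha=0.05,
                threshold=0.5, gain_step=0.1, eps=1e-9):
    """Update both ratios from one pair of filtered microphone levels."""
    ratio = level_a / (level_b + eps)  # eps guards against division by zero
    # Assumed time weighting: exponential smoothing with two time constants.
    tracker.fast += fast_alpha * (ratio - tracker.fast)
    tracker.slow += slow_alpha * (ratio - tracker.slow)
    # Increase the gain when the fast ratio departs from the slow one.
    if abs(tracker.fast - tracker.slow) > threshold:
        tracker.gain += gain_step
    return tracker.gain
```

A sudden change in the level ratio moves the fast average well before the slow one, so the absolute difference briefly exceeds the threshold and the gain is raised.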
-
Patent number: 10146868Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter is measured in each of the data stream segments. It is determined whether the acoustic parameter satisfies a first predetermined condition. The first predetermined condition includes a number of variances, in which the acoustic parameter exceeds a predetermined variance threshold, exceeding a predetermined number threshold. An extraneous portion of the data stream is identified in which the first predetermined condition is satisfied. It is determined whether the extraneous portion satisfies a second predetermined condition in the data stream. The extraneous portion is deleted from the data stream to produce a filtered data stream in response to the second predetermined condition being satisfied.Type: GrantFiled: June 8, 2017Date of Patent: December 4, 2018Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.Inventors: Yeon-Jun Kim, I. Dan Melamed, Bernard S. Renger, Steven Neil Tischer
-
Patent number: 10140305Abstract: Systems and methods for structuring information include determining information quantity (IQ) and information value (IV) in an original digital information file (ODIF). An initial manipulation process applied to the ODIF forms a first resulting DIF (FRDIF), and a subsequent manipulation process applied to the FRDIF forms a second resulting DIF, wherein each manipulation process removes at least one element of the processed DIF and/or represents an element combination with a representative element and a first indicia of an interrelationship between the representative element and one or more elements in the combination, to reduce the IQ of the processed DIF, while retaining the IV thereof within a threshold. Manipulation processes are successively applied to the previously resulting DIF until successive applications do not achieve a threshold reduction in IQ. The last resulting DIF has a primary structure with a reduced IQ and an IV within the threshold of the original IV.Type: GrantFiled: October 14, 2016Date of Patent: November 27, 2018Assignee: GENERAL HARMONICS INTERNATIONAL INC.Inventors: Alexander Zhirkov, Alexey Oraevsky, Andrei Grichine, George Blondheim, Max Wandinger, Wade Attwood
-
Patent number: 10079023Abstract: A comfort noise generation apparatus constituted of: near and far end speech detectors arranged to detect speech activity in near-end and far-end signals and a comfort noise generator, wherein responsive to an indication from the near-end speech detector that speech activity is absent on the near-end signal and an indication from the far-end silence detector that speech activity is absent on the far-end signal, the comfort noise generator is arranged to initiate a determination of an estimation of near-end background noise, wherein responsive to an indication from the near-end speech detector that speech activity is present on the near-end signal or an indication from the far-end silence detector that speech activity is present on the far-end signal, the comfort noise generator is arranged to terminate the estimation determination of near-end background noise, and wherein the comfort noise generator is arranged to output a function of the near-end background noise estimation.Type: GrantFiled: September 22, 2016Date of Patent: September 18, 2018Assignee: Microsemi Semiconductor (U.S.) Inc.Inventors: Tanmay Zargar, Dillon Reed Ritter, Rodolfo Silva
-
Patent number: 10068580Abstract: An oversampling LPF unit receives a sound signal. A differentiator differentiates the sound signal. An overtone computation unit generates an overtone signal by multiplying a signal differentiated by the differentiator by the sound signal from the oversampling LPF unit. A HPF unit filters the overtone signal generated by the overtone computation unit. A combiner combines the overtone signal filtered by the HPF unit and the sound signal from the oversampling LPF unit.Type: GrantFiled: December 23, 2016Date of Patent: September 4, 2018Assignee: JVC KENWOOD CorporationInventor: Tatsuya Onoda
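The overtone path in this abstract (differentiate, multiply by the signal, high-pass filter, combine) can be sketched as follows. This is an illustrative approximation only: the function name `add_overtones`, the first-difference differentiator, the one-pole high-pass filter, and the `hpf_alpha` and `mix` parameters are assumptions, and the oversampling LPF stage is omitted for brevity.

```python
def add_overtones(signal, hpf_alpha=0.9, mix=1.0):
    """Return `signal` with a generated overtone component added.

    Hypothetical sketch; the input is assumed to be already band-limited.
    """
    # Differentiator, approximated by a first difference.
    diff = [0.0] + [signal[n] - signal[n - 1] for n in range(1, len(signal))]
    # Overtone computation: multiply the derivative by the signal itself.
    overtone = [d * x for d, x in zip(diff, signal)]
    # One-pole high-pass filter: y[n] = a * (y[n-1] + o[n] - o[n-1]).
    filtered = []
    prev_y = 0.0
    prev_o = 0.0
    for o in overtone:
        y = hpf_alpha * (prev_y + o - prev_o)
        filtered.append(y)
        prev_y, prev_o = y, o
    # Combiner: add the filtered overtone back to the input signal.
    return [x + mix * y for x, y in zip(signal, filtered)]
```

For a sine input, the product of the signal and its derivative is proportional to sin(2ωt), so the generated component sits at twice the input frequency.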
-
Patent number: 10043534Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the original input narrowband audio signal. Other embodiments are disclosed.Type: GrantFiled: December 22, 2014Date of Patent: August 7, 2018Assignee: Staton Techiya, LLCInventors: John Usher, Dan Ellis
-
Patent number: 9837089Abstract: A device for signal processing includes a receiver and a high-band excitation signal generator. The receiver is configured to receive a parameter associated with a bandwidth-extended audio stream. The high-band excitation signal generator is configured to determine a value of the parameter. The high-band excitation signal generator is also configured to select, based on the value of the parameter, one of target gain information associated with the bandwidth-extended audio stream or filter information associated with the bandwidth-extended audio stream. The high-band excitation signal generator is further configured to generate a high-band excitation signal based on the one of the target gain information or the filter information.Type: GrantFiled: May 25, 2016Date of Patent: December 5, 2017Assignee: QUALCOMM IncorporatedInventors: Venkatraman Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam
-
Patent number: 9767791Abstract: Method and apparatus for segmenting speech by detecting the pauses between the words and/or phrases, and to determine whether a particular time interval contains speech or non-speech, such as a pause.Type: GrantFiled: September 12, 2016Date of Patent: September 19, 2017Assignee: SPEECH MORPHING SYSTEMS, INC.Inventors: Fathy Yassa, Ben Reaves, Nima Ferdosi
-
Patent number: 9711158Abstract: An encoding technique for encoding a sound signal at a low bit rate with reduced processing. The technique includes: an interval determination step that determines an interval T between samples corresponding to periodicity of an audio signal or an integer multiple of a fundamental frequency of the audio signal from a set S of candidates for the interval T; and a side information generation step that encodes the determined interval T to obtain side information. The interval determination step determines the interval T from a set S of Y candidates (Y<Z) including Z2 candidates (Z2<Z) selected from among Z candidates for the interval T representable with the side information without depending on a candidate subjected to the interval determination in a previous frame a predetermined number of frames before the current frame and including a candidate subjected to the interval determination in the previous frame the predetermined number of frames before the current frame.Type: GrantFiled: January 18, 2012Date of Patent: July 18, 2017Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Takehiro Moriya, Noboru Harada, Yusuke Hiwasaki, Yutaka Kamamoto
-
Patent number: 9703865Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is analyzed based on an acoustic parameter to determine extraneous portions in which a first predetermined condition is satisfied. When a first extraneous portion is separated from a second extraneous portion by a non-extraneous portion in which the first predetermined condition is not satisfied, it is determined whether the first extraneous portion being separated from the second extraneous portion by the non-extraneous portion satisfies a second predetermined condition. At least one of the first extraneous portion and the second extraneous portion is deleted from the data stream to produce a filtered data stream in response to determining the second predetermined condition is satisfied.Type: GrantFiled: September 25, 2015Date of Patent: July 11, 2017Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.Inventors: Yeon-Jun Kim, I. Dan Melamed, Bernard S. Renger, Steven Neil Tischer
-
Patent number: 9679577Abstract: A voice switching device includes a learning unit configured to learn a background noise model expressing background noise contained in a first voice signal, based on the first voice signal, while the first voice signal having a first frequency band is received; a pseudo noise generation unit configured to generate pseudo noise expressing noise in a pseudo manner, based on the background noise model, after a first time point when the first voice signal is last received in a case where a received voice signal is switched from the first voice signal to a second voice signal having a second frequency band narrower than the first frequency band; and a superimposing unit configured to superimpose the pseudo noise on the second voice signal after the first time point.Type: GrantFiled: July 15, 2015Date of Patent: June 13, 2017Assignee: FUJITSU LIMITEDInventor: Kaori Endo
-
Patent number: 9583114Abstract: The invention provides an audio decoder being configured for decoding a bitstream so as to produce therefrom an audio output signal, the bitstream including at least an active phase followed by at least an inactive phase, wherein the bitstream has encoded therein at least a silence insertion descriptor frame which describes a spectrum of a background noise, the audio decoder including: a silence insertion descriptor decoder configured to decode the silence insertion descriptor frame; a decoding device configured to reconstruct the audio output signal from the bitstream during the active phase; a spectral converter configured to determine a spectrum of the audio output signal; a noise estimator device configured to determine a first spectrum of the noise of the audio output signal; a resolution converter configured to establish a second spectrum of the noise of the audio output signal; a comfort noise spectrum estimation device; and a comfort noise generator.Type: GrantFiled: June 19, 2015Date of Patent: February 28, 2017Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Anthony Lombard, Martin Dietz, Stephan Wilde, Emmanuel Ravelli, Panji Setiawan, Markus Multrus