For Storage Or Transmission Patents (Class 704/201)
  • Patent number: 10672404
    Abstract: An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients.
    Type: Grant
    Filed: May 2, 2018
    Date of Patent: June 2, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Goran Markovic, Ralph Sperschneider, Jérémie Lecomte, Christian Helmrich
  • Patent number: 10659787
    Abstract: Techniques are generally described for enhanced compression of video data. In various examples, the techniques may include receiving first video data representing a scene in an environment. The techniques may further include generating illumination map data representing illumination of the scene in the first video data. The techniques may further comprise generating reflectance map data representing a reflectance of at least one object in the first video data. In some examples, the techniques may include sending, to a second computing device, the illumination map data and the reflectance map data. The techniques may further include receiving second video data representing the scene. The techniques may include determining a first illumination difference between the second video data and the first video data. The techniques may comprise sending, to the second computing device, the first illumination difference.
    Type: Grant
    Filed: September 20, 2018
    Date of Patent: May 19, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ilya Vladimirovich Brailovskiy, Raghavan Manmatha
  • Patent number: 10658005
    Abstract: Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: May 19, 2020
    Assignee: Neon Evolution Inc.
    Inventors: Carl Davis Bogan, III, Kenneth Michael Lande, Jacob Myles Laser, Brian Sung Lee, Cody Gustave Berlin
  • Patent number: 10657948
    Abstract: A system, method and program product for generating sound masking in an open-plan space. A method is disclosed that includes: establishing a set of acoustic criteria for the space that specifies minimum output levels at a set of specified frequencies; mixing sound samples to create an audio output stream for use as sound masking when broadcast over at least one speaker using a power amplifier; and processing the audio output stream, wherein the processing includes: analyzing the audio output stream with a spectrum analyzer to determine if the minimum output levels at the set of specified frequencies are met; and level adjusting the audio output stream with an equalizer to ensure that minimum output levels at the set of specified frequencies are met but do not exceed a given threshold above the minimum level.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: May 19, 2020
    Assignee: Rensselaer Polytechnic Institute
    Inventors: Jonas Braasch, Alana Gloria DeLoach, Jeffrey Parkman Carter
  • Patent number: 10651827
    Abstract: Aspects of the disclosure include an apparatus that has a first clock generator and a second clock generator. The first clock generator is configured to drive a first circuit, causing the first circuit to (i) receive a signal corresponding to an audio input, and (ii) determine whether an energy level of the signal exceeds a predetermined threshold. The second clock generator is activated when the first circuit determines that the energy level of the signal exceeds the predetermined threshold. The second clock generator is configured to drive a second circuit, causing the second circuit to determine whether the signal matches a predetermined pattern. A third circuit is activated when the second circuit determines that the signal matches the predetermined pattern.
    Type: Grant
    Filed: November 30, 2016
    Date of Patent: May 12, 2020
    Assignee: MARVELL ASIA PTE, LTD.
    Inventors: Yafei Hu, Liu Han, Kapil Jain, Jin Xie
  • Patent number: 10629207
    Abstract: A cache associated with a voice recognition engine may be configured to receive an audio file corresponding to a voice query. The voice query may be received by the voice recognition engine from a user device in response to receipt of the voice query at the user device. The cache may be configured to determine an audio fingerprint based on the audio file and to determine whether the audio fingerprint corresponds to one of a plurality of audio transcriptions stored in the cache. The audio transcriptions may correspond to popular voice queries received at the voice recognition engine and processed by a server capable of performing automated speech recognition. In response to determining that the audio fingerprint corresponds to a given one of the stored audio transcriptions, the audio file may be processed based on the stored audio transcription.
    Type: Grant
    Filed: July 13, 2017
    Date of Patent: April 21, 2020
    Assignee: COMCAST CABLE COMMUNICATIONS, LLC
    Inventors: Rui Min, Hongcheng Wang
  • Patent number: 10586549
    Abstract: A method of determining a distribution of bits for coding a transition frame, said method being implemented in a coder/decoder for coding/decoding a digital signal, the transition frame being preceded by a predictive coded preceding frame, coding the transition frame comprising transform coding and predictive coding a single sub-frame of the transition frame, the method comprising the following steps: assigning a bit rate for predictive coding the transition sub-frame, said bit rate being equal to the minimum between the bit rate for transform coding the transition frame and a first predetermined bit rate value; determining a first number of bits allocated for predictive coding the transition sub-frame for said bit rate; and calculating a second number of bits allocated for transform coding the transition frame from the first number of bits and a number of bits available for coding the transition frame.
    Type: Grant
    Filed: July 27, 2015
    Date of Patent: March 10, 2020
    Assignee: ORANGE
    Inventors: Stephane Ragot, Julien Faure
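
The three steps in the abstract above (cap the predictive bit rate, derive the predictive bit count, give the remainder to transform coding) can be sketched as follows. This is only an illustration under stated assumptions: the function name, the 5 ms sub-frame duration, and the "bits = rate x duration" mapping are hypothetical and are not taken from the patent, which may use a lookup table or another rule for the second step.

```python
def allocate_transition_bits(transform_rate_bps, max_predictive_rate_bps,
                             total_frame_bits, subframe_ms=5):
    """Split the bits of a transition frame between predictive and transform coding.

    All parameter names and the sub-frame duration are hypothetical.
    """
    # Step 1: the predictive bit rate is the minimum of the transform-coding
    # bit rate and a first predetermined bit rate value.
    predictive_rate_bps = min(transform_rate_bps, max_predictive_rate_bps)

    # Step 2 (assumed mapping): bits spent on the single predictive sub-frame
    # are taken as rate times sub-frame duration.
    predictive_bits = predictive_rate_bps * subframe_ms // 1000

    # Step 3: transform coding receives whatever remains of the frame budget.
    transform_bits = total_frame_bits - predictive_bits
    if transform_bits < 0:
        raise ValueError("frame budget is smaller than the predictive allocation")
    return predictive_rate_bps, predictive_bits, transform_bits


if __name__ == "__main__":
    print(allocate_transition_bits(24400, 12800, 488))
```

With a transform rate above the cap, the cap decides the predictive allocation; with a transform rate below it, the predictive coder simply inherits the transform rate.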
  • Patent number: 10541746
    Abstract: A system for testing line attenuation defects includes a data transmission line configured to transmit a forward signal in a first direction, at least one reflection point at a first location along the data transmission line, a test probe configured to (i) electrically contact a center conductor at a second location along the data transmission line, (ii) introduce a broadband data signal onto the data transmission line, and (iii) measure, at the second location, a return signal from the reflection point, and a spectrum capturing device in operable contact with the test probe. The spectrum capturing device is configured to (i) collect and arrange frequency data measured by the test probe for the test signal, the return signal, and a standing wave created by the sum of the broadband data signal and the return signal, (ii) determine the voltage standing wave ratio (VSWR) of the standing wave, and (iii) calculate a line loss from the VSWR.
    Type: Grant
    Filed: June 27, 2017
    Date of Patent: January 21, 2020
    Assignee: Cable Television Laboratories, Inc
    Inventors: Thomas H. Williams, Ryan Richard Vail
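
The last step of the abstract above, calculating a line loss from the VSWR, can be illustrated with the standard transmission-line relations. This is a sketch, not the patented procedure: it assumes the reflection point reflects all incident power (for example an open or short), so the return loss seen at the probe equals twice the one-way loss between the probe and the reflection point. The function name and arguments are hypothetical.

```python
import math


def line_loss_from_vswr(v_max, v_min):
    """Estimate one-way line loss in dB from standing-wave voltage extremes.

    Assumes total reflection at the reflection point (an assumption of this
    sketch, not a statement from the patent abstract).
    """
    if v_min <= 0 or v_max < v_min:
        raise ValueError("need v_max >= v_min > 0")

    vswr = v_max / v_min                     # voltage standing wave ratio
    gamma = (vswr - 1.0) / (vswr + 1.0)      # reflection coefficient magnitude
    if gamma == 0.0:
        return vswr, math.inf                # no standing wave: nothing returned

    return_loss_db = -20.0 * math.log10(gamma)   # round-trip attenuation
    one_way_loss_db = return_loss_db / 2.0       # probe to reflection point
    return vswr, one_way_loss_db


if __name__ == "__main__":
    vswr, loss = line_loss_from_vswr(3.0, 1.0)
    print(f"VSWR = {vswr:.2f}, one-way loss = {loss:.2f} dB")
```

Applying the same calculation per frequency bin of the captured spectrum would give loss as a function of frequency, which is where attenuation defects would show up.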
  • Patent number: 10535358
    Abstract: An apparatus and a method to encode and decode a speech signal using an encoding mode are provided. An encoding apparatus may select an encoding mode of a frame included in an input speech signal, and encode a frame having an unvoiced mode for an unvoiced speech as the selected encoding mode.
    Type: Grant
    Filed: February 8, 2018
    Date of Patent: January 14, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho Sang Sung, Ki Hyun Choo, Jung Hoe Kim, Eun Mi Oh
  • Patent number: 10522170
    Abstract: A method for acquiring the number of modified frames for active sound, and a method and apparatus for voice activity detection are disclosed. Firstly, a first voice activity detection decision result and a second voice activity detection decision result are obtained (501), the number of hangover frames for active sound is obtained (502), and the number of background noise updates is obtained (503), and then the number of modified frames for active sound is calculated according to the first voice activity detection decision result, the number of background noise updates and the number of hangover frames for active sound (504), and finally, a voice activity detection decision result of a current frame is calculated according to the number of modified frames for active sound and the second voice activity detection decision result (505).
    Type: Grant
    Filed: November 5, 2015
    Date of Patent: December 31, 2019
    Assignee: ZTE CORPORATION
    Inventors: Changbao Zhu, Hao Yuan
  • Patent number: 10510351
    Abstract: A method of encoding samples in a digital signal is provided that includes receiving a frame of N samples of the digital signal, determining L possible distinct data values in the N samples, determining a reference data value in the L possible distinct data values and a coding order of L-1 remaining possible distinct data values, wherein each of the L-1 remaining possible distinct data values is mapped to a position in the coding order, decomposing the N samples into L-1 coding vectors based on the coding order, wherein each coding vector identifies the locations of one of the L-1 remaining possible distinct data values in the N samples, and encoding the L-1 coding vectors.
    Type: Grant
    Filed: February 8, 2016
    Date of Patent: December 17, 2019
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Lorin Paul Netsch, Jacek Piotr Stachurski
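
A minimal sketch of the decomposition described in the abstract above: one reference value, a coding order for the other distinct values, and one location vector per remaining value. The choices made here are assumptions for illustration only: the most frequent value is taken as the reference, the coding order is by descending frequency, and each coding vector is a full-length 0/1 indicator list. The final step, encoding the coding vectors, is not shown.

```python
from collections import Counter


def decompose(samples):
    """Split a frame into a reference value and location vectors."""
    counts = Counter(samples)
    ordered = [value for value, _ in counts.most_common()]
    reference, coding_order = ordered[0], ordered[1:]
    # One indicator vector per remaining value: 1 marks where it occurs.
    vectors = [[1 if s == value else 0 for s in samples]
               for value in coding_order]
    return reference, coding_order, vectors


def reconstruct(n, reference, coding_order, vectors):
    """Invert decompose(): start from the reference and fill in each value."""
    out = [reference] * n
    for value, vector in zip(coding_order, vectors):
        for i, bit in enumerate(vector):
            if bit:
                out[i] = value
    return out


if __name__ == "__main__":
    frame = [0, 0, 1, 0, 2, 0, 1, 0]
    ref, order, vecs = decompose(frame)
    print(ref, order, vecs)
    print(reconstruct(len(frame), ref, order, vecs) == frame)
```

Positions that belong to the reference value never need to be signalled: they are whatever is left after all other vectors have been applied.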
  • Patent number: 10504540
    Abstract: The present invention relates to audio encoding and, more particularly, to a signal classifying method and device, and an audio encoding method and device using the same, which can reduce a delay caused by an encoding mode switching while improving the quality of reconstructed sound. The signal classifying method may comprise the operations of: classifying a current frame into one of a speech signal and a music signal; determining, on the basis of a characteristic parameter obtained from multiple frames, whether a result of the classifying of the current frame includes an error; and correcting the result of the classifying of the current frame in accordance with a result of the determination. By correcting an initial classification result of an audio signal on the basis of a correction parameter, the present invention can determine an optimum coding mode for the characteristic of an audio signal and can prevent frequent coding mode switching between frames.
    Type: Grant
    Filed: October 1, 2018
    Date of Patent: December 10, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Anton Viktorovich Porov, Konstantin Sergeevich Osipov
  • Patent number: 10489636
    Abstract: The application discloses a lip movement capturing method and device and a storage medium. The method includes: acquiring a real-time image shot by a photographic device and extracting a real-time facial image from the real-time image; inputting the real-time facial image into a pretrained lip average model and recognizing t lip feature points representative of positions of lips in the real-time facial image; and calculating a movement direction and movement distance of the lips in the real-time facial image according to x and y coordinates of the t lip feature points in the real-time facial image. According to the application, movement information of the lips in the real-time facial image is calculated according to the coordinates of the lip feature points to implement real-time capturing of movements of the lips.
    Type: Grant
    Filed: October 31, 2017
    Date of Patent: November 26, 2019
    Assignee: Ping An Technology (Shenzhen) Co., Ltd.
    Inventors: Lin Chen, Guohui Zhang
  • Patent number: 10484218
    Abstract: A receiver for demodulating a pulse width modulated (“PWM”) signal, comprises: a voltage level shifter for shifting the PWM signal to predefined transistor voltage levels; a half-rate PWM decoder for receiving the shifted PWM signal; and a 2-bit-to-N-bit deserializer. The half-rate PWM decoder comprises a first decoder core, a second decoder core, a controller, and a sampler and retiming circuit. The first decoder core and the second decoder core are configured to decode alternating periods of the shifted PWM signal. The controller is coupled to the first decoder core, the second decoder core, the sampler and retiming circuit. The retiming circuit is configured to receive clock signals from the controller and to output half-rate even data from the first decoder core and half-rate odd data from the second decoder core. Outputs of the retiming circuit and an output of the controller are coupled to inputs of the deserializer.
    Type: Grant
    Filed: February 23, 2018
    Date of Patent: November 19, 2019
    Assignee: Invecas, Inc.
    Inventors: Siva Kumar Rapina, Saravana Kumar Durairaj
  • Patent number: 10482888
    Abstract: An apparatus for encoding one or more audio objects to obtain an encoded signal is provided. The apparatus includes a downmixer for downmixing the one or more audio objects to obtain one or more unprocessed downmix signals. Moreover, the apparatus includes a processing module and a signal calculator. The signal calculator is configured to calculate each of one or more additional signals based on a difference between one of one or more processed downmix signals and one of the one or more unprocessed downmix signals. Moreover, the apparatus includes an object information generator. Furthermore, the apparatus includes an output interface for outputting the encoded signal. Moreover, a corresponding apparatus for decoding is provided.
    Type: Grant
    Filed: July 14, 2015
    Date of Patent: November 19, 2019
    Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
    Inventors: Thorsten Kastner, Juergen Herre, Falko Ridderbusch, Cornelia Falch
  • Patent number: 10468046
    Abstract: Provided are a method and an apparatus for determining an encoding mode for improving the quality of a reconstructed audio signal. A method of determining an encoding mode includes determining one from among a plurality of encoding modes including a first encoding mode and a second encoding mode as an initial encoding mode in correspondence to characteristics of an audio signal, and if there is an error in the determination of the initial encoding mode, generating a modified encoding mode by modifying the initial encoding mode to a third encoding mode.
    Type: Grant
    Filed: July 18, 2018
    Date of Patent: November 5, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Anton Victorovich Porov, Konstantin Sergeevich Osipov, Nam-suk Lee
  • Patent number: 10460733
    Abstract: According to one embodiment, a signal processing apparatus includes a memory and a processor electrically coupled to the memory. The processor separates a plurality of signals by a separation filter, and outputs a plurality of separate signals. The plurality of signals includes signals which are received at different positions and come from different directions. The processor estimates incoming directions of the plurality of separate signals, respectively, associates the plurality of separate signals with transmission sources of the signals, and presents the association between the plurality of separate signals and the transmission sources of the signals.
    Type: Grant
    Filed: September 12, 2017
    Date of Patent: October 29, 2019
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Makoto Hirohata, Toru Taniguchi, Taro Masuda
  • Patent number: 10431199
    Abstract: An electronic device including an earphone device is provided. The earphone device includes a shell, a speaker, a first microphone device, a memory circuit and a controller. The memory circuit stores multiple parameter sets. The first microphone device receives a first sound. The first microphone device generates first data based on the first sound. The controller compares the first data with the parameter sets of the memory circuit and determines which one of the parameter sets corresponds to the first data based on the frequency parameters and the volume parameters. The controller generates second data based on the adjustment parameters of the one of the parameter sets, and the speaker generates a second sound based on the second data. The first sound generates a third sound in the shell, and the phase of the second sound is substantially opposite to the phase of the third sound.
    Type: Grant
    Filed: April 13, 2018
    Date of Patent: October 1, 2019
    Assignee: FORTEMEDIA, INC.
    Inventors: Tsung-Lung Yang, Hung-Chih Teng
  • Patent number: 10425184
    Abstract: A device for sound wave communication, including: a hardware correction table for setting a correction frequency band; a sound wave transmission unit for generating data frequencies at a predetermined base decibel level, generating separate reception filter frequencies at the base decibel level for receiving data carried by a sound wave transmitted from the nearest location when sound waves are received, and generating the correction reference frequency at the base decibel level for correcting hardware transmission; and a sound wave reception unit for receiving a sound wave signal transmitted from the sound wave transmission unit, extracting decibel levels at each of the data frequencies to form an array of decibel levels, correcting the array by shifting the array by a correction value extracted using the hardware correction table, and reconstructing data.
    Type: Grant
    Filed: March 29, 2016
    Date of Patent: September 24, 2019
    Assignee: DANSOL PLUS CO., LTD.
    Inventor: Chang Soon Yim
  • Patent number: 10410634
    Abstract: An embodiment of the invention provides a wireless ear-borne audio device that may be configured in a variety of ways, including, but in no way limited to a device for recording audio information and storing the audio information for later replay, and/or forwarding the audio information to another device. The audio information may be analyzed to perform further functions. An embodiment of the invention also provides a method for compressing audio data and transmitting the audio information to the ear-borne audio device in a manner that reduces the power consumption of the ear-borne audio device in receiving data via a Bluetooth® connection.
    Type: Grant
    Filed: May 17, 2018
    Date of Patent: September 10, 2019
    Assignee: SmartEar, Inc.
    Inventors: Kinuko Masaki, Richard Ling
  • Patent number: 10405288
    Abstract: Provided is a communication method for supporting various bandwidths. The device detects a synchronization signal transmitted by a wide-band cell and receives a downlink (DL) signal from the wide-band cell on the basis of the synchronization signal. A support bandwidth supported by the device is narrower than a system bandwidth supported by the wide-band cell.
    Type: Grant
    Filed: February 24, 2017
    Date of Patent: September 3, 2019
    Assignee: LG ELECTRONICS INC.
    Inventors: Joonkui Ahn, Kijun Kim, Eunsun Kim, Yunjung Yi
  • Patent number: 10397377
    Abstract: Aspects of the present disclosure provide techniques for compressing data packets for cellular internet of things (CIoT) communications. An example method generally includes establishing at least one prefill buffer common to one or more UEs, wherein the prefill buffer includes a plurality of common strings, generating a compressed packet by finding matches to the common strings in at least one of a header portion or payload portion of the packet and associating identifiers with the common strings, and transmitting the packet.
    Type: Grant
    Filed: March 24, 2017
    Date of Patent: August 27, 2019
    Assignee: QUALCOMM Incorporated
    Inventors: Leena Zacharias, Srinivasan Balasubramanian
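
The idea of a prefill buffer shared by sender and receiver can be approximated with a DEFLATE preset dictionary, which Python's standard `zlib` module exposes through the `zdict` argument. This is a swapped-in technique for illustration, not the scheme claimed in the patent: DEFLATE replaces matched strings with back-references into the dictionary rather than with explicit identifiers. The dictionary contents and the sample packet are made up.

```python
import zlib

# Hypothetical prefill buffer: strings expected to recur across packets.
PREFILL = b'"deviceId":"temperature":"humidity":"unit":"celsius"'


def compress_packet(packet, prefill=PREFILL):
    """Compress a packet against the shared prefill buffer."""
    compressor = zlib.compressobj(level=9, zdict=prefill)
    return compressor.compress(packet) + compressor.flush()


def decompress_packet(blob, prefill=PREFILL):
    """Restore a packet; the receiver must hold the same prefill buffer."""
    decompressor = zlib.decompressobj(zdict=prefill)
    return decompressor.decompress(blob) + decompressor.flush()


if __name__ == "__main__":
    packet = b'{"deviceId":"s17","temperature":21.5,"unit":"celsius"}'
    with_prefill = compress_packet(packet)
    without_prefill = zlib.compress(packet, 9)
    print(len(packet), len(without_prefill), len(with_prefill))
    print(decompress_packet(with_prefill) == packet)
```

Small packets gain the most from this arrangement, because they are too short to build up useful history on their own.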
  • Patent number: 10387110
    Abstract: A first example playback device includes one or more processors and a non-transitory computer-readable medium storing instructions that, when executed by the one or more processors, cause the first playback device to perform functions. The functions include generating a mixed audio signal comprising one or more components of a first audio stream and one or more components of a second audio stream; and playing the mixed audio signal, where the one or more components of the second audio stream of the mixed audio signal played by the first playback device are played in synchrony with a second playback device configured to play the second audio stream. An example non-transitory computer-readable medium and an example method, both related to the first example playback device, are also disclosed herein.
    Type: Grant
    Filed: March 26, 2018
    Date of Patent: August 20, 2019
    Assignee: Sonos, Inc.
    Inventors: Ted M. Lin, Luis Vega
  • Patent number: 10360920
    Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, decoder, and upmixer. The upmixer is configured to operate either in a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.
    Type: Grant
    Filed: December 17, 2018
    Date of Patent: July 23, 2019
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Patent number: 10354673
    Abstract: The disclosure provides a noise reduction method and an electronic device. In an embodiment of the disclosure, when determining that a plurality of first applications occupy a plurality of first audio channels connected with a microphone and a second application occupies a second audio channel connected with a speaker, the electronic device resamples the audio data of the second audio channel according to the sampling rates corresponding to the plurality of first audio channels and then performs the noise reduction processing on the audio data of each of the plurality of first audio channels respectively according to the audio data obtained by resampling.
    Type: Grant
    Filed: October 10, 2018
    Date of Patent: July 16, 2019
    Assignees: HISENSE MOBILE COMMUNICATIONS TECHNOLOGY CO., LTD., HISENSE INTERNATIONAL CO., LTD., HISENSE USA CORPORATION
    Inventors: Weibo Zheng, Bingyu Geng
  • Patent number: 10354664
    Abstract: A method and device are provided for determining an optimized scale factor to be applied to an excitation signal or a filter during a process for frequency band extension of an audio frequency signal. The band extension process includes decoding or extracting, in a first frequency band, an excitation signal and parameters of the first frequency band including coefficients of a linear prediction filter, generating an excitation signal extending over at least one second frequency band, and filtering using a linear prediction filter for the second frequency band. The determination method includes determining an additional linear prediction filter, of a lower order than that of the linear prediction filter of the first frequency band, the coefficients of the additional filter being obtained from the parameters decoded or extracted from the first frequency band, and calculating the optimized scale factor as a function of at least the coefficients of the additional filter.
    Type: Grant
    Filed: September 26, 2017
    Date of Patent: July 16, 2019
    Assignee: KONINKLIJKE PHILIPS N.V.
    Inventors: Magdalena Kaniewska, Stephane Ragot
  • Patent number: 10347267
    Abstract: An audio encoding method and an apparatus are provided. The method includes: determining sparseness of distribution, on spectrums, of energy of N input audio frames (101), where the N audio frames include a current audio frame, and N is a positive integer; and determining, according to the sparseness of distribution, on the spectrums, of the energy of the N audio frames, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is an encoding method that is based on time-frequency transform and transform coefficient quantization and that is not based on linear prediction, and the second encoding method is a linear-prediction-based encoding method. The method can reduce encoding complexity and ensure that encoding is of relatively high accuracy.
    Type: Grant
    Filed: August 21, 2017
    Date of Patent: July 9, 2019
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 10339962
    Abstract: In described examples, a method for detecting voice activity includes: receiving a first input signal containing noise; sampling the first input signal to form noise samples; determining a first value corresponding to the noise samples; subsequently receiving a second input signal; sampling the second input signal to form second signal samples; determining a second value corresponding to the second signal samples; forming a ratio of the second value to the first value; comparing the ratio to a predetermined threshold value; and responsive to the comparing, indicating whether voice activity is detected in the second input signal.
    Type: Grant
    Filed: April 11, 2017
    Date of Patent: July 2, 2019
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Ganesan Thiagarajan, Tarkesh Pande, David Patrick Magee
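
The ratio test in the abstract above can be sketched in a few lines. The abstract does not say what the "first value" and "second value" are; this sketch assumes mean signal energy for both, and the threshold of 4.0 is an arbitrary placeholder. Function names are hypothetical.

```python
def mean_energy(samples):
    """Average of squared sample values (assumed choice of 'value')."""
    return sum(s * s for s in samples) / len(samples)


def detect_voice(noise_samples, signal_samples, threshold=4.0):
    """Return True when the signal-to-noise energy ratio exceeds the threshold."""
    noise_value = mean_energy(noise_samples)      # first value: noise floor
    signal_value = mean_energy(signal_samples)    # second value: later input
    if noise_value == 0:
        # Silent reference: any non-zero energy counts as activity.
        return signal_value > 0
    ratio = signal_value / noise_value
    return ratio > threshold


if __name__ == "__main__":
    noise = [0.1, -0.1, 0.1, -0.1]
    speech = [0.5, -0.5, 0.5, -0.5]
    print(detect_voice(noise, speech))   # energy ratio is 25
    print(detect_voice(noise, noise))    # energy ratio is 1
```

Because the decision depends only on a ratio, it is insensitive to the absolute gain of the microphone path, as long as the noise reference and the later signal pass through the same gain.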
  • Patent number: 10339939
    Abstract: Concealing a lost audio frame of a received audio signal by performing a sinusoidal analysis of a part of a previously received or reconstructed audio signal, wherein the sinusoidal analysis involves identifying frequencies of sinusoidal components of the audio signal, applying a sinusoidal model on a segment of the previously received or reconstructed audio signal, wherein said segment is used as a prototype frame in order to create a substitution frame for a lost audio frame, and creating the substitution frame for the lost audio frame by time-evolving sinusoidal components of the prototype frame, up to the time instance of the lost audio frame, in response to the corresponding identified frequencies.
    Type: Grant
    Filed: November 10, 2017
    Date of Patent: July 2, 2019
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Stefan Bruhn
  • Patent number: 10333900
    Abstract: Enveloping techniques using incoherent wavefront multiplexing (WF muxing or K-muxing) will enhance privacy protection on data communications. The disclosure relates to methods and architectures of packing or enveloping data using WF muxing, or K-muxing, for information transport via multiple communication links such as concurrently via multiple satellites, airborne platforms, wireless terrestrial links, and/or other wireless links. The multi-link communications may include the use of cloud transport of multiple WF-muxed data packages. The focus is on the appearance of a digital envelope and the reliability of the enclosed data. The K-muxing on information digital streams before modulation in a transmitter shall provide enhanced data privacy and better availability. The WF multiplexed (WF muxed or K-muxed) information data streams will be individually and concurrently sent to the multiple links accordingly for data transport.
    Type: Grant
    Filed: January 12, 2016
    Date of Patent: June 25, 2019
    Assignee: SPATIAL DIGITAL SYSTEMS, INC.
    Inventors: Donald C. D. Chang, Juo-Yu Lee, Steve K. Chen
  • Patent number: 10330700
    Abstract: A signal acquisition probe stores compressed or compressed and filtered time domain data samples representing at least one of an impulse response or step response characterizing the signal acquisition probe. The compressed or compressed and filtered time domain data samples of the impulse response or the step response are provided to a signal measurement instrument for compensating the signal measurement instrument for the impulse or step response of the signal measurement instrument.
    Type: Grant
    Filed: August 18, 2015
    Date of Patent: June 25, 2019
    Assignee: Tektronix, Inc.
    Inventors: Richard A. Booman, John J. Pickerd
  • Patent number: 10325604
    Abstract: Provided are a frame error concealment method and apparatus and an error concealment scheme construction method and apparatus. The frame error concealment method includes generating a new signal by synthesizing a plurality of previous signals that are similar to a signal of an error frame and reconstructing the signal of the error frame using the generated signal.
    Type: Grant
    Filed: December 29, 2017
    Date of Patent: June 18, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho-sang Sung, Ki-hyun Choo, Jung-hoe Kim, Eun-mi Oh, Chang-yong Son, Kang-eun Lee
  • Patent number: 10319367
    Abstract: A speech signal processing method of a user terminal includes: receiving a speech signal, detecting a personalized information section including personal information in the speech signal, performing data processing on the personalized information section of the speech signal by using a personalized model generated based on the personal information, and receiving, from a server, a result of the data processing performed by the server on a general information section of the speech signal that is different than the personalized information section of the speech signal.
    Type: Grant
    Filed: November 9, 2015
    Date of Patent: June 11, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Tae-yoon Kim, Sang-ha Kim, Sung-soo Kim, Jin-sik Lee, Chang-woo Han, Eun-kyoung Kim, Jae-won Lee
  • Patent number: 10311879
    Abstract: An audio signal coding apparatus includes a time-frequency transformer that outputs sub-band spectra from an input signal; a sub-band energy quantizer; a tonality calculator that analyzes tonality of the sub-band spectra; a bit allocator that selects a second sub-band on which quantization is performed by a second quantizer on the basis of the analysis result of the tonality and quantized sub-band energy, and determines a first number of bits to be allocated to a first sub-band on which quantization is performed by a first quantizer; the first quantizer that performs first coding using the first number of bits; the second quantizer that performs coding using a second coding method; and a multiplexer.
    Type: Grant
    Filed: November 17, 2016
    Date of Patent: June 4, 2019
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Takuya Kawashima, Hiroyuki Ehara
  • Patent number: 10304473
    Abstract: Certain example embodiments relate to speech privacy systems and/or associated methods. The techniques described herein disrupt the intelligibility of the perceived speech by, for example, superimposing onto an original speech signal a masking replica of the original speech signal in which portions of it are smeared by a time delay and/or amplitude adjustment, with the time delays and/or amplitude adjustments oscillating over time. In certain example embodiments, smearing of the original signal may be generated in frequency ranges corresponding to formants, consonant sounds, phonemes, and/or other related or non-related information-carrying building blocks of speech. Additionally, or in the alternative, annoying reverberations particular to a room or area in low frequency ranges may be “cut out” of the replica signal, without increasing or substantially increasing perceived loudness.
    Type: Grant
    Filed: March 15, 2017
    Date of Patent: May 28, 2019
    Assignee: Guardian Glass, LLC
    Inventor: Alexey Krasnov
  • Patent number: 10297165
    Abstract: To provide an education support system that allows educators to conduct adequate assessments considering handicaps or characteristics of individual students and to offer effective and efficient learning.
    Type: Grant
    Filed: March 16, 2016
    Date of Patent: May 21, 2019
    Assignee: SHINANO KENSHI KABUSHIKI KAISHA
    Inventors: Tatsuo Nishizawa, Yusuke Tashiro
  • Patent number: 10283126
    Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
    Type: Grant
    Filed: December 20, 2017
    Date of Patent: May 7, 2019
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Patent number: 10283117
    Abstract: In one aspect, a first device includes at least one processor, a microphone accessible to the at least one processor, a communication interface accessible to the at least one processor, and storage accessible to the at least one processor. The storage bears instructions executable by the at least one processor to await first voice input comprising a response cue and receive, from the microphone, the first voice input comprising the response cue. The instructions are also executable by the at least one processor to, responsive to receipt of the first voice input comprising the response cue, transmit a wake up command to a second device different from the first device. The wake up command is transmitted via the communication interface using a peripheral device communication protocol.
    Type: Grant
    Filed: June 19, 2017
    Date of Patent: May 7, 2019
    Assignee: Lenovo (Singapore) Pte. Ltd.
    Inventors: John Weldon Nicholson, Daryl Cromer
  • Patent number: 10276177
    Abstract: Technologies for privately processing voice data include a compute device configured to continually or periodically capture voice data of a user by the compute device. The captured voice data is processed to remove or reduce the user's privacy-sensitive information. For example, the compute device fragments the captured voice data to generate a set of voice data fragments and further scrambles the voice data fragments to generate scrambled voice data fragments having a sequential order different from the plurality of voice data fragments. To scramble the voice data fragments, the compute device may reorder the voice data fragments such that each fragment is repositioned from its corresponding original sequential position in the voice data by a particular number of words, syllables, or phrases.
    Type: Grant
    Filed: October 1, 2016
    Date of Patent: April 30, 2019
    Assignee: Intel Corporation
    Inventors: Sefi Kraemer, Ofer Gueta
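The fragment-and-scramble step described in the abstract of patent 10276177 can be pictured with a minimal sketch. It is not the patented implementation. It assumes the voice data has already been reduced to word-level units, that fragments have a fixed size, and that "repositioned by a particular number" means a cyclic shift. The names `fragment` and `scramble` are hypothetical.

```python
from typing import List, Sequence


def fragment(units: Sequence[str], size: int) -> List[List[str]]:
    """Split a sequence of units (assumed here to be words) into consecutive fragments."""
    if size <= 0:
        raise ValueError("size must be positive")
    return [list(units[i:i + size]) for i in range(0, len(units), size)]


def scramble(fragments: List[List[str]], shift: int) -> List[List[str]]:
    """Move every fragment `shift` slots away from its original position.

    Assumption: positions wrap around the end of the sequence (cyclic shift).
    """
    n = len(fragments)
    if n == 0:
        return []
    out: List[List[str]] = [[] for _ in range(n)]
    for i, frag in enumerate(fragments):
        out[(i + shift) % n] = frag
    return out


words = "please call me back at five tonight".split()
frags = fragment(words, 2)      # four fragments, the last one holds a single word
scrambled = scramble(frags, 1)  # each fragment moved one slot later, with wrap-around
print(scrambled)
```

A cyclic shift is only the simplest reordering that moves every fragment by the same amount. The abstract leaves the actual offset rule open.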
  • Patent number: 10268679
    Abstract: A processing unit can operate an end-to-end recurrent neural network (RNN) with limited contextual dialog memory that can be jointly trained by supervised signals: user slot tagging, intent prediction and/or system action prediction. The end-to-end RNN, or joint model, has shown advantages over separate models for natural language understanding (NLU) and dialog management and can capture expressive feature representations beyond conventional aggregation of slot tags and intents, to mitigate effects of noisy output from NLU. The joint model can apply a supervised signal from system actions to refine the NLU model. By back-propagating errors associated with system action prediction to the NLU model, the joint model can use machine learning to predict user intent by a binary classification obtained by both forward and backward output, and perform slot tagging, and make system action predictions based on user input, e.g., utterances across a number of domains.
    Type: Grant
    Filed: December 2, 2016
    Date of Patent: April 23, 2019
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Xiujun Li, Paul Anthony Crook, Li Deng, Jianfeng Gao, Yun-Nung Chen, Xuesong Yang
  • Patent number: 10249301
    Abstract: Embodiments of the present application relate to a method for speech recognition processing, a system for speech recognition processing, and a computer program product for speech recognition processing. A method for speech recognition processing is provided. The method includes receiving a speech recognition request, the speech recognition request including a pre-assigned speech ID that uniquely identifies speech information and a speech recognition command to convert speech into text, locating speech information based on the speech ID, automatically converting the speech information to obtain text information corresponding to the speech information in response to the speech recognition command, and sending the obtained text information as a speech recognition result to a user that caused the speech recognition request to be issued.
    Type: Grant
    Filed: April 27, 2018
    Date of Patent: April 2, 2019
    Assignee: Alibaba Group Holding Limited
    Inventor: Lufeng Zhai
  • Patent number: 10229171
    Abstract: A method of automatic discovery of analysis scripts for a dataset, the method including: utilizing at least one processor to execute computer code that performs the steps of: receiving, at a script searching tool, an input dataset; searching, in a script repository, a plurality of datasets having analysis scripts associated therewith; the searching comprising finding, based on a feature of the input dataset, one or more datasets of the plurality of datasets having the feature; identifying, based on the one or more datasets of the plurality of datasets having the feature, one or more associated analysis scripts; and returning, via a user interface, a result listing of the one or more associated analysis scripts. Other aspects are described and claimed.
    Type: Grant
    Filed: January 11, 2016
    Date of Patent: March 12, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Vijay Ekambaram, Pratyush Kumar, Ashok Pon Kumar Sree Prakash
  • Patent number: 10224040
    Abstract: The present application relates to a packet loss concealment apparatus and method, and an audio processing system. According to an embodiment, the packet loss concealment apparatus is provided for concealing packet losses in a stream of audio packets, each audio packet comprising at least one audio frame in transmission format comprising at least one monaural component and at least one spatial component. The packet loss concealment apparatus may comprise a first concealment unit for creating the at least one monaural component for a lost frame in a lost packet and a second concealment unit for creating the at least one spatial component for the lost frame. According to the embodiment, spatial artifacts such as incorrect angle and diffuseness may be avoided as far as possible in PLC for multi-channel spatial or sound field encoded audio signals.
    Type: Grant
    Filed: July 2, 2014
    Date of Patent: March 5, 2019
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Shen Huang, Xuejing Sun, Heiko Purnhagen
  • Patent number: 10206207
    Abstract: The MAC frame in a wireless communication system includes a terminal ID allocated to each of multiple terminals. At least one connection ID is allocated to each terminal having the terminal ID, and sub-carrier allocation information is allocated to each connection having the connection ID. The sub-carrier allocation information includes a sub-carrier allocation status for each sub-carrier, and the number of allocated information bits for each sub-carrier. The sub-carrier allocation status and the number of allocated information bits for each sub-carrier can be allocated, by sub-carriers, to the sub-carrier allocation information using a same number of bits; or the information on the sub-carrier allocation status is first allocated to the sub-carrier allocation information and the number of allocated information bits for each sub-carrier is allocated.
    Type: Grant
    Filed: March 6, 2017
    Date of Patent: February 12, 2019
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Hyoung-Soo Lim, Woo-Yong Choi, Yong-Sun Kim, Jae-Young Ahn
  • Patent number: 10194200
    Abstract: A method for removing transitional noise in an audio data stream is performed by a computing device. After obtaining first and second audio frames in a time sequence of the audio data stream, the computing device determines a frame segmentation position in the audio frames based on predefined closeness criteria that require a predefined distance between the sampling point values at the frame segmentation position in the first and second audio frames to satisfy a predefined closeness condition. Next, the computing device obtains respective sampling point values preceding the frame segmentation position in the second audio frame and ones following the frame segmentation position in the first audio frame. The computing device then sequentially stitches the respective sampling point values obtained from the first and second audio frame to generate a third audio frame and inserts the third audio frame between the first audio frame and the second audio frame.
    Type: Grant
    Filed: April 11, 2018
    Date of Patent: January 29, 2019
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Jiang Yang
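The stitching procedure in the abstract of patent 10194200 can be sketched as follows. This is an illustration under stated assumptions, not the patented method. The closeness condition is assumed to be "smallest absolute difference between the two frames' samples at the same index", the sample at the segmentation position is assumed to come from the first frame, and the names `segmentation_position` and `stitch` are hypothetical.

```python
from typing import List, Sequence


def segmentation_position(first: Sequence[float], second: Sequence[float]) -> int:
    """Return the index at which the two frames' sample values are closest.

    Assumption: 'closeness' is the minimum absolute difference at the same index.
    """
    if len(first) != len(second) or len(first) == 0:
        raise ValueError("frames must be non-empty and equally long")
    return min(range(len(first)), key=lambda i: abs(first[i] - second[i]))


def stitch(first: Sequence[float], second: Sequence[float]) -> List[float]:
    """Build the third frame as worded in the abstract: samples preceding the
    position taken from the second frame, then samples from the position
    onward taken from the first frame."""
    pos = segmentation_position(first, second)
    return list(second[:pos]) + list(first[pos:])


first_frame = [0, 40, 80, 40]
second_frame = [90, 50, 10, -30]
third_frame = stitch(first_frame, second_frame)

# The third frame is inserted between the first and the second frame.
output = first_frame + third_frame + second_frame
print(third_frame)
```

Switching sources at the index where the two waveforms nearly coincide keeps the jump in sample value at the splice small, which is the property the closeness criteria are meant to guarantee.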
  • Patent number: 10192564
    Abstract: A sampler module may divide an audio signal into a series of sequential samples. A signal quality detector module may identify a consistent brick wall frequency of the audio signal spanning a plurality of the sequential samples at an outset of the audio signal and determine a signal treatment indication proportional to the brick wall frequency. A signal enhancer module may sequentially receive and analyze one or more sample components of the audio signal to identify lost parts of the audio signal in the one or more sample components of respective sequential samples, and generate, in accordance with the signal quality indication, a corresponding signal treatment for each of the one or more sample components of respective sequential samples having a corresponding identified lost part.
    Type: Grant
    Filed: January 6, 2015
    Date of Patent: January 29, 2019
    Assignee: Harman International Industries, Incorporated
    Inventors: Kevin Eric Heber, Gilbert Arthur Joseph Soulodre
  • Patent number: 10148391
    Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for removing jitter introduced by a packet switched network. Each received audio frame comprises a primary portion and a redundancy portion. The redundancy portion comprises a partial redundant copy of a previous frame that is offset by k frames. If a frame n is lost, a frame n+k that comprises the partial redundant copy of the lost frame n is located in a jitter buffer. Based on the frame n+k, a substitute frame n′ substituting the lost frame n is created and a substitution indicator of the substitute frame n′ is set to indicate that the redundancy portion of the substitute frame n′ should be used in decoding.
    Type: Grant
    Filed: September 15, 2016
    Date of Patent: December 4, 2018
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Stefan Bruhn, Tomas Frankkila, Karl Hellwig
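The recovery path in the abstract of patent 10148391 can be illustrated with a small sketch. It assumes the jitter buffer is a dictionary keyed by frame sequence number and that frame payloads are opaque byte strings. The `Frame` class, its field names, and `substitute_lost_frame` are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Frame:
    seq: int
    primary: Optional[bytes]      # primary portion of this frame
    redundancy: Optional[bytes]   # partial redundant copy of frame seq - k
    use_redundancy: bool = False  # substitution indicator


def substitute_lost_frame(
    jitter_buffer: Dict[int, Frame], n: int, k: int
) -> Optional[Frame]:
    """Create a substitute for lost frame n from frame n+k, if it is buffered."""
    carrier = jitter_buffer.get(n + k)
    if carrier is None or carrier.redundancy is None:
        return None  # nothing to recover from; fall back to other concealment
    # The substitute carries only the redundant copy, and the indicator tells
    # the decoder to decode the redundancy portion instead of a primary portion.
    return Frame(seq=n, primary=None, redundancy=carrier.redundancy,
                 use_redundancy=True)


k = 2
buffer = {
    4: Frame(4, b"p4", b"r2"),
    6: Frame(6, b"p6", b"r4"),
    7: Frame(7, b"p7", b"r5"),  # carries the partial copy of lost frame 5
}
sub = substitute_lost_frame(buffer, n=5, k=k)
if sub is not None:
    buffer[sub.seq] = sub
print(sorted(buffer))
```

Recovery only works while frame n+k is already in the buffer when frame n is due for playout, so the buffer depth has to cover at least k frames.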
  • Patent number: 10147432
    Abstract: The invention provides a decoder being configured for processing an encoded audio bitstream, wherein the decoder includes: a bitstream decoder configured to derive a decoded audio signal from the bitstream, wherein the decoded audio signal includes at least one decoded frame; a noise estimation device configured to produce a noise estimation signal containing an estimation of the level and/or the spectral shape of a noise in the decoded audio signal; a comfort noise generating device configured to derive a comfort noise signal from the noise estimation signal; and a combiner configured to combine the decoded frame of the decoded audio signal and the comfort noise signal in order to obtain an audio output signal.
    Type: Grant
    Filed: June 19, 2015
    Date of Patent: December 4, 2018
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Anthony Lombard, Emmanuel Ravelli, Stefan Doehla, Jeremie Lecomte, Martin Dietz
  • Patent number: 10141004
    Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of the speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.
    Type: Grant
    Filed: August 27, 2014
    Date of Patent: November 27, 2018
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jeroen Koppens, Hannes Muesch
  • Patent number: RE47536
    Abstract: An apparatus for processing an audio signal and a method thereof are disclosed. The present invention includes receiving, by an audio processing apparatus, an audio signal including first data of a first block encoded with a rectangular coding scheme and second data of a second block encoded with a non-rectangular coding scheme; receiving a compensation signal corresponding to the second block; estimating a prediction of an aliasing part using the first data; and obtaining a reconstructed signal for the second block based on the second data, the compensation signal and the prediction of the aliasing part.
    Type: Grant
    Filed: December 15, 2017
    Date of Patent: July 23, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Hyen-O Oh, Chang Heon Lee, Hong Goo Kang, Jeungook Song