Detect Speech In Noise Patents (Class 704/233)
  • Patent number: 11069352
    Abstract: Described herein is a system for media presence detection in audio. The system analyzes audio data to recognize whether a given audio segment contains sounds from a media source as a way of differentiating recorded media source sounds from other live sounds. In exemplary embodiments, the system includes a hierarchical model architecture for processing audio data segments, where individual audio data segments are processed by a trained machine learning model operating locally, and another trained machine learning model provides historical and contextual information to determine a score indicating the likelihood that the audio data segment contains sounds from a media source.
    Type: Grant
    Filed: February 18, 2019
    Date of Patent: July 20, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Qingming Tang, Ming Sun, Chieh-Chi Kao, Chao Wang, Viktor Rozgic
  • Patent number: 11069353
    Abstract: A system and method performs multilingual wakeword detection by determining a language corresponding to the wakeword. A first wakeword-detection component, which may execute using a digital-signal processor, determines that audio data includes a representation of the wakeword and determines a language corresponding to the wakeword. A second, more accurate wakeword-detection component may then process the audio data using the language to confirm that it includes the representation of the wakeword. The audio data may then be sent to a remote system for further processing.
    Type: Grant
    Filed: May 6, 2019
    Date of Patent: July 20, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Yixin Gao, Ming Sun, Jason Krone, Shiv Naga Prasad Vitaladevuni, Yuzong Liu
  • Patent number: 11062708
    Abstract: A method and an apparatus for dialoguing based on a mood of a user, where the method includes: collecting first audio data from the user, determining the mood of the user according to a feature of the first audio data, and dialoguing with the user using second audio data corresponding to the mood of the user. The method and the apparatus for dialoguing based on the mood of the user provided by the present disclosure may make different responses according to the mood of the user when dialoguing with the user. Therefore, it further enriches response that the electronic device may make according to voice data of the user, and further improves the user experience during dialoguing with the electronic device.
    Type: Grant
    Filed: July 12, 2019
    Date of Patent: July 13, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Li Xu, Yingchao Li, Xiaoxin Ma
  • Patent number: 11036305
    Abstract: There is provided a signal processing apparatus that includes a control unit that executes, on a basis of a waveform signal generated in accordance with a motion of an attachment portion of a sensor attached to a tool or a body, effect processing for the waveform signal or another waveform signal, the waveform signal being output from the sensor.
    Type: Grant
    Filed: June 4, 2020
    Date of Patent: June 15, 2021
    Assignee: SONY CORPORATION
    Inventors: Heesoon Kim, Masaharu Yoshino, Tatsushi Nashida, Masahiko Inami, Kouta Minamizawa, Yuta Sugiura, Yusuke Mizushina
  • Patent number: 11031008
    Abstract: A terminal device is provided. The terminal device includes a communication interface, and a processor configured to receive performance information of one or more other terminal devices from each of the one or more other terminal devices, identify an edge device to perform voice recognition based on the performance information received from each of the one or more other terminal devices, based on the terminal device being identified as the edge device, receive information associated with reception quality from one or more other terminal devices which receive a sound wave including a triggering word, determine a terminal device to acquire the sound wave for voice recognition from based on the received information associated with the reception quality, and transmit, to the determined terminal device, a command to transmit the sound wave acquired for voice recognition to an external voice recognition device.
    Type: Grant
    Filed: April 10, 2019
    Date of Patent: June 8, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Minseok Kim
  • Patent number: 11024274
    Abstract: Systems, devices, and methods for segmenting musical compositions are described. Discrete, musically-coherent segments (such as intro, verse, chorus, bridge, solo, and the like) of a musical composition are identified. Distance measures are used to evaluate whether each bar of a musical composition is more like the bars that directly precede it or more like the bars that directly succeed it, and each respective series of musically similar bars is assigned to the same respective segment. Large changes in the distance measure(s) between adjacent bars may be used to identify boundaries between abutting musical segments. Computer systems and computer program products for implementing segmentation are also described. The results of segmentation may advantageously be applied in computer-based composition of music and musical variations, as well as in other applications involving labelling, characterizing, or otherwise processing music.
    Type: Grant
    Filed: January 28, 2020
    Date of Patent: June 1, 2021
    Assignee: Obeebo Labs Ltd.
    Inventor: Colin P. Williams
  • Patent number: 11024331
    Abstract: Systems and methods for optimizing voice detection via a network microphone device are disclosed herein. In one example, individual microphones of a network microphone device detect sound. The sound data is captured in a first buffer and analyzed to detect a trigger event. Metadata associated with the sound data is captured in a second buffer and provided to at least one network device to determine at least one characteristic of the detected sound based on the metadata. The network device provides a response that includes an instruction, based on the determined characteristic, to modify at least one performance parameter of the NMD. The NMD then modifies the at least one performance parameter based on the instruction.
    Type: Grant
    Filed: September 21, 2018
    Date of Patent: June 1, 2021
    Assignee: Sonos, Inc.
    Inventors: Connor Kristopher Smith, Kurt Thomas Soto, Charles Conor Sleith
  • Patent number: 10997967
    Abstract: A method for initializing a device for performing acoustic speech recognition (ASR) using an ASR model, by a computer system including at least one processor and a system memory element. The method includes obtaining a plurality of voice data articulations of predetermined phrases, by the at least one processor via a user interface. The plurality of voice data articulations includes a first quantity of audio samples of actual articulated voice data, and each of the plurality of voice data articulations includes one of the audio samples including acoustic frequency components. The method further includes performing a plurality of augmentations to the plurality of voice data articulations of predetermined phrases, to generate a corpus audio data set that includes the first quantity of audio samples and a second quantity of audio samples including augmented versions of the first quantity of audio samples.
    Type: Grant
    Filed: April 18, 2019
    Date of Patent: May 4, 2021
    Assignee: HONEYWELL INTERNATIONAL INC.
    Inventors: Luning Wang, Wei Yang, Zhiyong Dai
  • Patent number: 10972834
    Abstract: This disclosure describes techniques for detecting voice commands from a user of an ear-based device. The ear-based device may include an in-ear facing microphone to capture sound emitted in an ear of the user, and an exterior facing microphone to capture sound emitted in an exterior environment of the user. The in-ear microphone may generate an inner audio signal representing the sound emitted in the ear, and the exterior microphone may generate an outer audio signal representing sound from the exterior environment. The ear-based device may compute a ratio of a power of the inner audio signal to the outer audio signal and may compare this ratio to a threshold. If the ratio is larger than the threshold, the ear-based device may detect the voice of the user. Further, the ear-based device may set a value of the threshold based on a level of acoustic seal of the ear-based device.
    Type: Grant
    Filed: February 11, 2020
    Date of Patent: April 6, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Kuan-Chieh Yen, Daniel Wayne Harris, Carlo Murgia, Taro Kimura
  • Patent number: 10956734
    Abstract: An electronic device and a method of operating the electronic device are provided. The electronic device includes a proximity detector; an iris recognition module; a memory; and a processor electrically connected to the proximity detector, the iris recognition module, and the memory, wherein the processor is configured to execute an iris recognition operation based on the iris recognition module; determine proximity of an object based on the proximity detector while the iris recognition operation is performed; and, if the proximity of the object includes within a set reference range, stop the iris recognition operation.
    Type: Grant
    Filed: July 10, 2017
    Date of Patent: March 23, 2021
    Inventors: Hyung-Woo Shin, Hyemi Lee, Hyung Min Lee
  • Patent number: 10958468
    Abstract: A portable acoustic unit is adapted for insertion into an electrical receptacle. The portable acoustic unit has an integrated microphone and a wireless network interface to an automation controller. The portable acoustic unit detects spoken voice commands from users in the vicinity of the electrical receptacle. The portable acoustic unit merely plugs into a conventional electrical outlet to provide an extremely simple means of voice control through a home or business.
    Type: Grant
    Filed: August 29, 2018
    Date of Patent: March 23, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L. P.
    Inventors: Nafiz Haider, Ross Newman, Kristin Patterson, Thomas Risley, Curtis Stephenson, David Vaught
  • Patent number: 10951996
    Abstract: A binaural hearing system includes a first hearing device and a second hearing device, each of which comprising: an input transducer; a transducer audio signal processor configured to provide a processed input transducer audio signal; an ear canal microphone; an ear canal audio signal processor configured to provide a processed ear canal audio signal; a first signal combiner configured to combine the processed input transducer audio signal with the processed ear canal audio signal to obtain an output transducer audio signal; a signal level detector configured to determine a signal level of (1) the output transducer audio signal or (2) an audio signal included in formation of the output transducer audio signal; and an output transducer; wherein the binaural hearing system further comprises a binaural excessive level detector connected to the first hearing device's signal level detector and the second hearing device's signal level detector.
    Type: Grant
    Filed: June 17, 2019
    Date of Patent: March 16, 2021
    Assignee: GN Hearing A/S
    Inventors: Søren Christian Voigt Pedersen, Jonathan Boley, James Robert Anderson
  • Patent number: 10950243
    Abstract: A system and method for improving T-matrix training for speaker recognition are provided. The method includes receiving an audio input, divisible into a plurality of audio frames, wherein at least a first audio frame includes an audio sample of a human speaker, the sample having a length above a first threshold; generating for each audio frame a feature vector; generating for a first plurality of feature vectors centered statistics of at least a zero order and a first order; generating a first i-vector, the first i-vector representing the human speaker; generating an optimized T-matrix training sequence computation, based on the first i-vector, an initialized T-matrix, the centered statistics, and a Gaussian mixture model (GMM) of a trained universal background model (UBM).
    Type: Grant
    Filed: March 1, 2019
    Date of Patent: March 16, 2021
    Assignee: ILLUMA Labs Inc.
    Inventor: Milind Borkar
  • Patent number: 10943598
    Abstract: Methods and systems for determining periods of excessive noise for smart speaker voice commands. An electronic timeline of volume levels of currently playing content is made available to a smart speaker. From this timeline, periods of high content volume are determined, and the smart speaker alerts users during periods of high volume, requesting that they wait until the high-volume period has passed before issuing voice commands. In this manner, the smart speaker helps prevent voice commands that may not be detected, or may be detected inaccurately, due to the noise of the content currently being played.
    Type: Grant
    Filed: March 18, 2019
    Date of Patent: March 9, 2021
    Assignee: ROVI GUIDES, INC.
    Inventors: Gyanveer Singh, Sukanya Agarwal, Vikram Makam Gupta
  • Patent number: 10937448
    Abstract: A voice activity detection method and an apparatus are provided by embodiments of the present application. The method includes: performing framing processing on a voice to be detected to obtain a plurality of audio frames to be detected; obtaining an acoustic feature of each of the audio frames to be detected, and sequentially inputting the acoustic feature of the each of the audio frames to be detected to a VAD model, wherein the VAD model is configured to classify a first N voice frame in the voice to be detected as a noise frame, classify frames from an (N+1)-th voice frame to a last voice frame as voice frames, and classify a M noise frame after the last voice frame as a voice frame, where N and M are integers; and determining, according to a classification result output by the VAD model.
    Type: Grant
    Filed: December 27, 2018
    Date of Patent: March 2, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chao Li, Weixin Zhu
  • Patent number: 10917717
    Abstract: Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time-frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.
    Type: Grant
    Filed: May 30, 2019
    Date of Patent: February 9, 2021
    Assignee: Nuance Communications, Inc.
    Inventors: Timo Matheja, Markus Buck
  • Patent number: 10903863
    Abstract: A first set of signal data is received. Generative machine learning models are trained based on the first set of signal data. The generative machine learning models include at least a first model trained to identify a first signal component and a second model trained to identify a second signal component. An incoming mixed signal data stream is dynamically separated into a clean signal component and a noise signal component by running the generative machine learning models.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: January 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Francois Pierre Luus, Etienne Eben Vos, Komminist Weldemariam
  • Patent number: 10885902
    Abstract: Techniques are described for using stenography to protect sensitive information within conversational audio data by generating a pseudo-language representation of conversational audio data. In some implementations, audio data corresponding to an utterance is received. The audio data is classified as likely sensitive audio data. A particular set of sentiments associated with the audio data is determined. Data indicating the particular set of sentiments associated with the audio data is provided to a model. The model is trained to output, for each of different sets of sentiments, desensitized, pseudo-language audio data that exhibits the set of sentiments, and is not classified as likely sensitive audio data. A particular desensitized, pseudo-language audio data is received from the model. The audio data is replaced with the particular desensitized, pseudo-language audio data and stored within an audio data repository.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: January 5, 2021
    Assignee: X Development LLC
    Inventors: Antonio Raymond Papania-Davis, Bin Ni, Shelby Lin
  • Patent number: 10879827
    Abstract: An apparatus for reducing belt slip of a vehicle, in which the vehicle includes an engine as a vibration source and an electric motor connected to the engine through a belt to transmit torque of the engine, includes: a signal generator configured to generate a reference signal with a frequency corresponding to vibration of the engine; an adaptive filter configured to calculate a filter coefficient to remove an error value between a rotational speed of the engine and a rotational speed of the electric motor and apply the filter coefficient to the reference signal to generate a reference torque signal; and a torque compensator configured to generate a belt slip compensation torque signal by changing an amplitude of the reference torque signal, and apply the belt slip compensation torque signal to determine a final torque command of the electric motor.
    Type: Grant
    Filed: November 26, 2018
    Date of Patent: December 29, 2020
    Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
    Inventors: Tae Young Chung, Jun Mo An, Byung Hoon Yang
  • Patent number: 10867613
    Abstract: An apparatus for decoding an audio signal is provided, having a receiving interface, configured to receive a first frame having a first audio signal portion of the audio signal, and configured to receive a second frame having a second audio signal portion of the audio signal; a noise level tracing unit, wherein the noise level tracing unit is configured to determine noise level information depending on at least one of the first audio signal portion and the second audio signal portion; a first reconstruction unit for reconstructing, in a first reconstruction domain, a third audio signal portion of the audio signal depending on the noise level information; a transform unit for transforming the noise level information to a second reconstruction domain; and a second reconstruction unit for reconstructing, in the second reconstruction domain, a fourth audio signal portion of the audio signal depending on the noise level information.
    Type: Grant
    Filed: May 15, 2018
    Date of Patent: December 15, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Markovic Goran, Ralph Sperschneider, Jérémie Lecomte, Christian Helmrich
  • Patent number: 10861459
    Abstract: An apparatus for determining a reliability of a recommendation based on an environment of a vehicle may include: an input device configured to detect a sound in the vehicle; a storage configured to store information associated with the sound; an output device configured to output information associated with a recommendation action; and a controller electrically connected with the input device, the storage, and the output device.
    Type: Grant
    Filed: November 16, 2018
    Date of Patent: December 8, 2020
    Assignees: Hyundai Motor Company, Kia Motors Corporation
    Inventors: Kyung Chul Lee, Sung Soo Park
  • Patent number: 10853808
    Abstract: The present invention is a method and apparatus for a first user to purchase a controlled product. The first user creates an order and specifies the set of conditions that the second user must meet to satisfy the order. The second user satisfies those conditions then can access an authorization from a third user prior to submitting a bid to supply the order. The first user can accept the bid then authorize payment. The first user and second user can confirm the product was delivered to the first user.
    Type: Grant
    Filed: December 18, 2016
    Date of Patent: December 1, 2020
    Inventor: Mark Lawrence
  • Patent number: 10854199
    Abstract: In one example of the disclosure, microphone data indicative of a user spoken phrase is captured utilizing a microphone. The microphone data is analyzed to detect a trigger phrase. A wakeup event is performed responsive to the detection of the trigger phrase. At least a portion of the microphone data is sent to the computing device for the computing device to send to a server system.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: December 1, 2020
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: David H. Hanes
  • Patent number: 10853653
    Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
    Type: Grant
    Filed: August 30, 2019
    Date of Patent: December 1, 2020
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Dimitrios Dimitriadis, Donald J. Bowen, Lusheng Ji, Horst J. Schroeter
  • Patent number: 10846263
    Abstract: Systems and methods that enable implementation of content aware file management labeling. Techniques disclosed enable real-time analysis of a data file so that associations between the data file and applicable file management label(s) can be automatically suggested and/or made while a user is working in the data file. A user may deploy an application on a client device to edit a data file. While the user is actively editing the data file, the application may transmit modification data to a content analysis service which analyzes the modification data to determine whether the modifications result in a file management label becoming applicable to the data file. Ultimately, the content analysis service may transmit a verdict to the client device to cause the application to display a labeling suggestion to the user and/or to automatically apply a label to the data file while it is being worked on by the user.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: November 24, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Kannan Dhanasekaran, Nakul Garg, Krishna Kumar Parthasarathy, Dheepak Ramaswamy, Yu Li, Martin Gagne, Yonghua Yang, Jinhao Li, Liming Chen, Mingquan Xue
  • Patent number: 10847173
    Abstract: Apparatuses and methods for selecting, for communicating, between signals provided by various pickups on the basis of a calculated signal to noise ratio (SNR) are disclosed. The various pickups may include a microphone and a vibration sensor. Signals from the microphone may be compared with signals from the vibration sensor by extracting a root-mean-square (RMS) profile for each, and comparing the RMS profiles to determine an SNR estimate for the microphone signal. The microphone signal may be selected if the SNR estimate is below a predetermined threshold, otherwise the vibration sensor signal may be selected. The vibration sensor signal may be subject to further processing if selected to approximate the microphone signal.
    Type: Grant
    Filed: April 12, 2018
    Date of Patent: November 24, 2020
    Assignee: Intel Corporation
    Inventors: Hector Alfonso Cordourier Maruri, Jonathan Huang, Paulo Lopez Meyer, Willem Beltman
  • Patent number: 10831440
    Abstract: One embodiment provides a method, including: receiving, at a first device, a user input; activating, from the first device, a second device; and thereafter coordinating, using the processor of the first device, with the second device to process the user input. Other aspects are described and claimed.
    Type: Grant
    Filed: September 26, 2016
    Date of Patent: November 10, 2020
    Assignee: Lenovo (Singapore) Pte. Ltd.
    Inventors: John Carl Mese, Nathan J. Peterson, Arnold S. Weksler, Russell Speight VanBlon
  • Patent number: 10832683
    Abstract: A system and method for efficient universal background model (UBM) training for speaker recognition, including: receiving an audio input, divisible into a plurality of audio frames, wherein at least a first audio frame of the plurality of audio frames includes an audio sample having a length above a first threshold extracting at least one identifying feature from the first audio frame and generating a feature vector based on the at least one identifying feature; generating an optimized training sequence computation based on the feature vector and a Gaussian Mixture Model (GMM), wherein the GMM is associated with a plurality of components, wherein each of the plurality of components is defined by a covariance matrix, a mean vector, and a weight vector; and updating any of the associated components of the GMM based on the generated optimized training sequence computation.
    Type: Grant
    Filed: November 28, 2018
    Date of Patent: November 10, 2020
    Assignee: ILLUMA Labs LLC.
    Inventor: Milind Borkar
  • Patent number: 10818309
    Abstract: An embodiment of the present invention provides an apparatus for noise canceling that includes: an input unit configured to receive an input voice signal; and one or more processors configured to perform a first noise cancellation using as input the received input voice signal to generate a first voice signal by cancelling noise from the input voice signal using a noise canceling model which is trained using a plurality of reference voice signals, perform a second noise cancellation using as input the first voice signal generated by the noise canceling model to generate a second voice signal in which residual noise is canceled from the first voice signal using statistical analysis, and generate an output voice signal comprising an encoding of the second voice signal.
    Type: Grant
    Filed: August 9, 2019
    Date of Patent: October 27, 2020
    Assignee: LG ELECTRONICS INC.
    Inventors: Bongki Lee, Youngman Kim, Keunsang Lee, Jaewoong Jeong, Seunghyun Hwang
  • Patent number: 10819858
    Abstract: A method for improving an echo cancellation effect and a system thereof are disclosed. The method comprises includes: performing a non-linear compensation for a non-linear response portion of an excitation signal using an NLC algorithm to obtain a compensated excitation signal; outputting the compensated excitation signal to an echo cancellation system; and performing echo cancellation for the compensated excitation signal by the echo cancellation system. According to the present disclosure, using the NLC algorithm, non-linear compensation is performed for the non-linear portion of the excitation signal, non-linear outputs generated due to non-linear characteristics of the system are pre-compensated when being input to the echo cancellation system, such that the echo signal output by the echo cancellation system is minimized and the echo cancellation effect is improved.
    Type: Grant
    Filed: August 6, 2019
    Date of Patent: October 27, 2020
    Assignee: AAC Technologies Pte. Ltd.
    Inventor: Henglizi Zhang
  • Patent number: 10819301
    Abstract: The present disclosure relates to a method and system for controlling loudness of an audio based on signal analysis and deep learning. The method includes analyzing an audio characteristic in a frame level based on signal analysis, analyzing the audio characteristic in the frame level based on learning, and controlling loudness of the audio in the frame level, by combining the analysis results. Accordingly, reliability of audio characteristic analysis can be enhanced and audio loudness can be optimally controlled.
    Type: Grant
    Filed: October 18, 2018
    Date of Patent: October 27, 2020
    Assignee: KOREA ELECTRONICS TECHNOLOGY INSTITUTE
    Inventors: Choong Sang Cho, Young Han Lee
  • Patent number: 10811024
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: October 20, 2020
    Assignee: Dolby International AB
    Inventors: Barbara Resch, Kristofer Kjörling, Lars Villemoes
  • Patent number: 10811030
    Abstract: A system may perform speech enhancement of audio data in real-time by suppressing noise components that are present in the audio data while preserving speech components. The system may include an in-ear module and a separate signal processing module that is wirelessly communicatively coupled to the in-ear module. The system may include non-negative matrix factorization (NMF) dictionaries capable of identifying frequency band components associated with speech and frequency band components associated with noise. The NMF dictionaries may be trained using voice samples and noise samples. The NMF dictionaries may be applied to noisy speech data to produce an NMF representation of the speech data which may then be applied using a dynamic mask to the noisy speech data in order to suppress the noise components of the noisy speech data and produce speech enhanced data.
    Type: Grant
    Filed: September 12, 2018
    Date of Patent: October 20, 2020
    Assignee: BOARD OF TRUSTEES OF MICHIGAN STATE UNIVERSITY
    Inventors: Mi Zhang, Kai Cao, Xiao Zeng, Haochen Sun
  • Patent number: 10796805
    Abstract: Described embodiments include apparatus that includes a network interface (28) and a processor (30). The processor is configured to receive, via the network interface, speech of a subject (22) who suffers from a pulmonary condition related to accumulation of excess fluid, to identify, by analyzing the speech, one or more speech-related parameters of the speech, to assess, in response to the speech-related parameters, a status of the pulmonary condition, and to generate, in response thereto, an output indicative of the status of the pulmonary condition. Other embodiments are also described.
    Type: Grant
    Filed: October 5, 2016
    Date of Patent: October 6, 2020
    Assignee: CORDIO MEDICAL LTD.
    Inventors: Chaim Lotan, Sigal Kremer-Tal, Aviv Lotan, Zeev Schlik, Avinoam Gemer, Yehuda Snir, Yonatan Sasson, Margarita Sheinkerman
  • Patent number: 10795929
    Abstract: Embodiments of the invention are directed to apparatuses, methods, and computer program products for generating a rating score based on color. In some embodiments, the apparatuses, methods, and computer program products are configured to: receive at least one input, wherein the at least one input comprises at least one of a first numerical value, an emoticon, a color, or at least one keyword; process the at least one input; and generate a second numerical value based on processing the at least one input.
    Type: Grant
    Filed: September 1, 2018
    Date of Patent: October 6, 2020
    Inventor: Trent R McKenzie
  • Patent number: 10789947
    Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. Feature vectors of the audio segments may be determined. The feature vectors may be processed through a classifier. The classifier may output scores on whether the audio segments contain voice. One or more of the audio segments may be identified as containing voice based on the scores and a two-step hysteresis thresholding. Storage of the identification of the one or more of the audio segments as containing voice in one or more storage media may be effectuated.
    Type: Grant
    Filed: December 19, 2019
    Date of Patent: September 29, 2020
    Assignee: GoPro, Inc.
    Inventor: Gabriel Lema
  • Patent number: 10789942
    Abstract: A computer-implemented method, computer program product, and computer processing system are provided for word embedding. The method includes receiving, by a processor device, a word embedding matrix. The method further includes generating, by a processor device, an average pooling vector and a max pooling vector, based on the word embedding matrix. The method also includes generating, by the processor device, a prediction by applying a Multi-Layer Perceptron (MLP) to the average pooling vector and the max pooling vector.
    Type: Grant
    Filed: October 18, 2018
    Date of Patent: September 29, 2020
    Assignee: NEC Corporation
    Inventors: Renqiang Min, Dinghan Shen
  • Patent number: 10783899
    Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user's speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
    Type: Grant
    Filed: November 18, 2016
    Date of Patent: September 22, 2020
    Assignee: Cerence Operating Company
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Patent number: 10770090
    Abstract: A method of audio source separation includes steps of applying a demixing matrix on a plurality of received signals to generate a plurality of separated results; performing a recognition operation on the plurality of separated results to generate a plurality of recognition scores; generating a constraint according to the plurality of recognition scores; and adjusting the demixing matrix according to the constraint; where the adjusted demixing matrix is applied to the plurality of received signals to generate a plurality of updated separated results from the plurality of received signals.
    Type: Grant
    Filed: June 2, 2017
    Date of Patent: September 8, 2020
    Assignee: Realtek Semiconductor Corp.
    Inventors: Ming-Tang Lee, Chung-Shih Chu
  • Patent number: 10761522
    Abstract: A method includes obtaining closed-loop data associated with operation of an industrial process controller, where the industrial process controller is configured to control at least part of an industrial process using at least one model. The method also includes generating at least one noise model associated with the industrial process controller using at least some of the closed-loop data. The method further includes filtering the closed-loop data based on the at least one noise model. In addition, the method includes generating one or more model parameters for the industrial process controller using the filtered closed-loop data.
    Type: Grant
    Filed: June 28, 2017
    Date of Patent: September 1, 2020
    Assignee: Honeywell Limited
    Inventors: Qiugang Lu, Lee D. Rippon, R. Bhushan Gopaluni, Michael G. Forbes, Philip D. Loewen, Johan U. Backstrom, Guy A. Dumont
  • Patent number: 10759447
    Abstract: A driving support method for a vehicle includes: acquiring a sound of a sound source placed outside the vehicle; and displaying, on a display portion, driving support information corresponding to an utterance content of the sound, the utterance content of the sound being recognized by a sound recognition process, in a display mode suggesting a relative position of the sound source from the vehicle, the relative position being specified based on the sound.
    Type: Grant
    Filed: February 11, 2019
    Date of Patent: September 1, 2020
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventor: Chihiro Inaba
  • Patent number: 10762905
    Abstract: A method for use in speaker verification, comprises: receiving a sample of a person's speech; estimating a noise measure in the received sample of the person's speech; if the estimated noise measure does not meet a predetermined criterion, rejecting the received sample of the person's speech; and if the estimated noise measure meets the predetermined criterion: adding noise to the received sample to form a sample with increased noise; extracting features of the sample with increased noise; and forming a model of the person's speech from the extracted features.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: September 1, 2020
    Assignee: Cirrus Logic, Inc.
    Inventors: Carlos Vaquero Avilés-Casco, David Martínez González
  • Patent number: 10764676
    Abstract: A system configured to improve spatial coverage of output audio and a corresponding user experience by performing upmixing and loudspeaker beamforming to stereo input signals. The system can perform upmixing to the stereo (e.g., two channel) input signal to extract a center channel and generate three-channel audio data. The system may then perform loudspeaker beamforming to the three-channel audio data to enable two loudspeakers to generate output audio having three distinct beams. The user may interpret the three distinct beams as originating from three separate locations, resulting in the user perceiving a wide virtual sound stage despite the loudspeakers being spaced close together on the device.
    Type: Grant
    Filed: September 17, 2019
    Date of Patent: September 1, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Yuancheng Luo, Wontak Kim, Mihir Dhananjay Shetye
  • Patent number: 10755698
    Abstract: Various examples are provided related to speech recognition. In one example, a method includes converting an auditory signal into a pulse train, segmenting the pulse train into a series of frames having a predefined duration, and identifying a portion of the auditory signal by applying at least a portion of the series of frames segmented from the pulse train to a kernel adaptive autoregressive-moving-average (KAARMA) network. In another example, a speech recognition system includes processing circuitry configured to convert an auditory signal into a pulse train, segment the pulse train into a secured of frames, and identifying a portion of the auditory signal by applying at least a portion of the series of frames segmented from the pulse train to a KAARMA network. The series of frames segmented from the pulse train can be applied to a KAARMA chain including a plurality of KAARMA networks for identification.
    Type: Grant
    Filed: December 7, 2016
    Date of Patent: August 25, 2020
    Assignee: UNIVERSITY OF FLORIDA RESEARCH FOUNDATION, INC.
    Inventors: Jose C. Principe, Kan Li
  • Patent number: 10757503
    Abstract: Active noise control (ANC), including active and adaptive noise cancellation (ANC) with non-voice-coil transducers having highly linear transfer functions, such as planar transducers, planar magnetic transducers, electro-static transducers, and piezo-electric transducers. This active and adaptive noise cancellation (ANC) may be used with: planar transducer headphones and earphones; open-backed and closed-back headphones and earphones; in-ear earphones, and phase plugs.
    Type: Grant
    Filed: February 15, 2018
    Date of Patent: August 25, 2020
    Assignee: Audeze, LLC
    Inventor: Dragoslav Colich
  • Patent number: 10735861
    Abstract: An apparatus for reducing cross-talk between transmitted audio signals and received audio in a headset. The headset includes one or more of a set of earphones, a headset frame, a microphone boom with an array of MEMS microphone configured to isolate the earphone audio from the microphone audio, a VOX circuit, low crosstalk cable(s), and/or other components. Sets of microphones may be enabled and/or disabled to reduce cross-talk between received audio signals and transmitted audio signals. The VOX circuit is configured to reduce cross-talk between received audio signals and transmitted audio signals.
    Type: Grant
    Filed: February 6, 2019
    Date of Patent: August 4, 2020
    Assignee: HM Electronics, Inc.
    Inventors: Charles Butten, Karl Knoblock, Robert Snyder
  • Patent number: 10725523
    Abstract: Examples disclosed herein provide the ability for a computing device to determine a noise threshold to wake on ambient noises. In one example method, the computing device tracks sound, detected by a microphone of the computing device, over a period of time and, based on the sound tracked over the period of time, determines a noise threshold. The computing device tunes a sensitivity of the microphone to wake the computing device when ambient noise, detected by the microphone, is to have a signal strength equal to or exceeding the noise threshold.
    Type: Grant
    Filed: April 11, 2016
    Date of Patent: July 28, 2020
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Alexander Wayne Clark, Kent E Biggs, Richard E Hodges
  • Patent number: 10705620
    Abstract: There is provided a signal processing apparatus including: a control unit that executes, on a basis of a waveform signal generated in accordance with a motion of an attachment portion of a sensor attached to a tool or a body, effect processing for the waveform signal or another waveform signal, the waveform signal being output from the sensor. The signal processing apparatus performs presentation so that a body motion itself can be aurally felt.
    Type: Grant
    Filed: October 6, 2016
    Date of Patent: July 7, 2020
    Assignee: SONY CCORPORATION
    Inventors: Heesoon Kim, Masaharu Yoshino, Masahiko Inami, Kouta Minamizawa, Yuta Sugiura, Yusuke Mizushina, Tatsushi Nashida
  • Patent number: 10681450
    Abstract: A wireless earpiece includes a wireless earpiece housing, at least one microphone for detecting ambient environment sound, and a processor disposed within the wireless earpiece housing, the processor configured to distinguish between two or more sources of sound within the ambient environment sound. The wireless earpiece further includes a user interface operatively connected to the processor. The processor is configured to receive user input through the user interface to select one of the sources of sound within the ambient environment sound and wherein the processor is configured to process the ambient environment sound to emphasize portions of the ambient environment sound generated by the one of the sources of the ambient environment sound selected by the user to produce a modified sound. The earpiece may further include a speaker operatively connected to the processor to reproduce the modified sound.
    Type: Grant
    Filed: July 30, 2018
    Date of Patent: June 9, 2020
    Assignee: BRAGI GmbH
    Inventors: Peter Vincent Boesen, Darko Dragicevic
  • Patent number: 10666791
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for evaluating the quality of a communication session. One of the methods includes identifying, by a communication system, a communication session between one or more users of the communication system, wherein, during the communication session, session data is routed between a first communications device of a first user of the communication system and one or more other communications devices along a communication path; obtaining, from each of a plurality of communication nodes along the communication path, quality data relating to a quality of the communication session at the communication node; generating, using the quality data, a model input to a quality score machine learning model; and providing the model input as input to the quality score machine learning model to generate the estimated quality score for at least the portion of the communication session.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: May 26, 2020
    Assignee: RingCentral, Inc.
    Inventors: Kira Makagon, Helen Prask, Yuri Ardulov, Igor Rusinov, Ivan Gennadevich Anisimov