Detect Speech In Noise Patents (Class 704/233)

Apparatus for reducing belt slip

Patent number: 10879827

Abstract: An apparatus for reducing belt slip of a vehicle, in which the vehicle includes an engine as a vibration source and an electric motor connected to the engine through a belt to transmit torque of the engine, includes: a signal generator configured to generate a reference signal with a frequency corresponding to vibration of the engine; an adaptive filter configured to calculate a filter coefficient to remove an error value between a rotational speed of the engine and a rotational speed of the electric motor and apply the filter coefficient to the reference signal to generate a reference torque signal; and a torque compensator configured to generate a belt slip compensation torque signal by changing an amplitude of the reference torque signal, and apply the belt slip compensation torque signal to determine a final torque command of the electric motor.

Type: Grant

Filed: November 26, 2018

Date of Patent: December 29, 2020

Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION

Inventors: Tae Young Chung, Jun Mo An, Byung Hoon Yang
Apparatus and method for improved signal fade out in different domains during error concealment

Patent number: 10867613

Abstract: An apparatus for decoding an audio signal is provided, having a receiving interface, configured to receive a first frame having a first audio signal portion of the audio signal, and configured to receive a second frame having a second audio signal portion of the audio signal; a noise level tracing unit, wherein the noise level tracing unit is configured to determine noise level information depending on at least one of the first audio signal portion and the second audio signal portion; a first reconstruction unit for reconstructing, in a first reconstruction domain, a third audio signal portion of the audio signal depending on the noise level information; a transform unit for transforming the noise level information to a second reconstruction domain; and a second reconstruction unit for reconstructing, in the second reconstruction domain, a fourth audio signal portion of the audio signal depending on the noise level information.

Type: Grant

Filed: May 15, 2018

Date of Patent: December 15, 2020

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Michael Schnabel, Markovic Goran, Ralph Sperschneider, Jérémie Lecomte, Christian Helmrich
Apparatus and method for determining reliability of recommendation based on environment of vehicle

Patent number: 10861459

Abstract: An apparatus for determining a reliability of a recommendation based on an environment of a vehicle may include: an input device configured to detect a sound in the vehicle; a storage configured to store information associated with the sound; an output device configured to output information associated with a recommendation action; and a controller electrically connected with the input device, the storage, and the output device.

Type: Grant

Filed: November 16, 2018

Date of Patent: December 8, 2020

Assignees: Hyundai Motor Company, Kia Motors Corporation

Inventors: Kyung Chul Lee, Sung Soo Park
Exploiting visual information for enhancing audio signals via source separation and beamforming

Patent number: 10853653

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

Type: Grant

Filed: August 30, 2019

Date of Patent: December 1, 2020

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Dimitrios Dimitriadis, Donald J. Bowen, Lusheng Ji, Horst J. Schroeter
Communications with trigger phrases

Patent number: 10854199

Abstract: In one example of the disclosure, microphone data indicative of a user spoken phrase is captured utilizing a microphone. The microphone data is analyzed to detect a trigger phrase. A wakeup event is performed responsive to the detection of the trigger phrase. At least a portion of the microphone data is sent to the computing device for the computing device to send to a server system.

Type: Grant

Filed: April 22, 2016

Date of Patent: December 1, 2020

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: David H. Hanes
Method and apparatus for controlled products

Patent number: 10853808

Abstract: The present invention is a method and apparatus for a first user to purchase a controlled product. The first user creates an order and specifies the set of conditions that the second user must meet to satisfy the order. The second user satisfies those conditions then can access an authorization from a third user prior to submitting a bid to supply the order. The first user can accept the bid then authorize payment. The first user and second user can confirm the product was delivered to the first user.

Type: Grant

Filed: December 18, 2016

Date of Patent: December 1, 2020

Inventor: Mark Lawrence
Systems and methods for implementing content aware file management labeling

Patent number: 10846263

Abstract: Systems and methods that enable implementation of content aware file management labeling. Techniques disclosed enable real-time analysis of a data file so that associations between the data file and applicable file management label(s) can be automatically suggested and/or made while a user is working in the data file. A user may deploy an application on a client device to edit a data file. While the user is actively editing the data file, the application may transmit modification data to a content analysis service which analyzes the modification data to determine whether the modifications result in a file management label becoming applicable to the data file. Ultimately, the content analysis service may transmit a verdict to the client device to cause the application to display a labeling suggestion to the user and/or to automatically apply a label to the data file while it is being worked on by the user.

Type: Grant

Filed: December 22, 2017

Date of Patent: November 24, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kannan Dhanasekaran, Nakul Garg, Krishna Kumar Parthasarathy, Dheepak Ramaswamy, Yu Li, Martin Gagne, Yonghua Yang, Jinhao Li, Liming Chen, Mingquan Xue
Selection between signal sources based upon calculated signal to noise ratio

Patent number: 10847173

Abstract: Apparatuses and methods for selecting, for communicating, between signals provided by various pickups on the basis of a calculated signal to noise ratio (SNR) are disclosed. The various pickups may include a microphone and a vibration sensor. Signals from the microphone may be compared with signals from the vibration sensor by extracting a root-mean-square (RMS) profile for each, and comparing the RMS profiles to determine an SNR estimate for the microphone signal. The microphone signal may be selected if the SNR estimate is below a predetermined threshold, otherwise the vibration sensor signal may be selected. The vibration sensor signal may be subject to further processing if selected to approximate the microphone signal.

Type: Grant

Filed: April 12, 2018

Date of Patent: November 24, 2020

Assignee: Intel Corporation

Inventors: Hector Alfonso Cordourier Maruri, Jonathan Huang, Paulo Lopez Meyer, Willem Beltman
System and method for efficient processing of universal background models for speaker recognition

Patent number: 10832683

Abstract: A system and method for efficient universal background model (UBM) training for speaker recognition, including: receiving an audio input, divisible into a plurality of audio frames, wherein at least a first audio frame of the plurality of audio frames includes an audio sample having a length above a first threshold extracting at least one identifying feature from the first audio frame and generating a feature vector based on the at least one identifying feature; generating an optimized training sequence computation based on the feature vector and a Gaussian Mixture Model (GMM), wherein the GMM is associated with a plurality of components, wherein each of the plurality of components is defined by a covariance matrix, a mean vector, and a weight vector; and updating any of the associated components of the GMM based on the generated optimized training sequence computation.

Type: Grant

Filed: November 28, 2018

Date of Patent: November 10, 2020

Assignee: ILLUMA Labs LLC.

Inventor: Milind Borkar
Coordinating input on multiple local devices

Patent number: 10831440

Abstract: One embodiment provides a method, including: receiving, at a first device, a user input; activating, from the first device, a second device; and thereafter coordinating, using the processor of the first device, with the second device to process the user input. Other aspects are described and claimed.

Type: Grant

Filed: September 26, 2016

Date of Patent: November 10, 2020

Assignee: Lenovo (Singapore) Pte. Ltd.

Inventors: John Carl Mese, Nathan J. Peterson, Arnold S. Weksler, Russell Speight VanBlon
Apparatus for noise canceling and method for the same

Patent number: 10818309

Abstract: An embodiment of the present invention provides an apparatus for noise canceling that includes: an input unit configured to receive an input voice signal; and one or more processors configured to perform a first noise cancellation using as input the received input voice signal to generate a first voice signal by cancelling noise from the input voice signal using a noise canceling model which is trained using a plurality of reference voice signals, perform a second noise cancellation using as input the first voice signal generated by the noise canceling model to generate a second voice signal in which residual noise is canceled from the first voice signal using statistical analysis, and generate an output voice signal comprising an encoding of the second voice signal.

Type: Grant

Filed: August 9, 2019

Date of Patent: October 27, 2020

Assignee: LG ELECTRONICS INC.

Inventors: Bongki Lee, Youngman Kim, Keunsang Lee, Jaewoong Jeong, Seunghyun Hwang
Audio loudness control method and system based on signal analysis and deep learning

Patent number: 10819301

Abstract: The present disclosure relates to a method and system for controlling loudness of an audio based on signal analysis and deep learning. The method includes analyzing an audio characteristic in a frame level based on signal analysis, analyzing the audio characteristic in the frame level based on learning, and controlling loudness of the audio in the frame level, by combining the analysis results. Accordingly, reliability of audio characteristic analysis can be enhanced and audio loudness can be optimally controlled.

Type: Grant

Filed: October 18, 2018

Date of Patent: October 27, 2020

Assignee: KOREA ELECTRONICS TECHNOLOGY INSTITUTE

Inventors: Choong Sang Cho, Young Han Lee
Method for improving echo cancellation effect and system thereof

Patent number: 10819858

Abstract: A method for improving an echo cancellation effect and a system thereof are disclosed. The method comprises includes: performing a non-linear compensation for a non-linear response portion of an excitation signal using an NLC algorithm to obtain a compensated excitation signal; outputting the compensated excitation signal to an echo cancellation system; and performing echo cancellation for the compensated excitation signal by the echo cancellation system. According to the present disclosure, using the NLC algorithm, non-linear compensation is performed for the non-linear portion of the excitation signal, non-linear outputs generated due to non-linear characteristics of the system are pre-compensated when being input to the echo cancellation system, such that the echo signal output by the echo cancellation system is minimized and the echo cancellation effect is improved.

Type: Grant

Filed: August 6, 2019

Date of Patent: October 27, 2020

Assignee: AAC Technologies Pte. Ltd.

Inventor: Henglizi Zhang
Post filter for audio signals

Patent number: 10811024

Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.

Type: Grant

Filed: March 12, 2019

Date of Patent: October 20, 2020

Assignee: Dolby International AB

Inventors: Barbara Resch, Kristofer Kjörling, Lars Villemoes
System and apparatus for real-time speech enhancement in noisy environments

Patent number: 10811030

Abstract: A system may perform speech enhancement of audio data in real-time by suppressing noise components that are present in the audio data while preserving speech components. The system may include an in-ear module and a separate signal processing module that is wirelessly communicatively coupled to the in-ear module. The system may include non-negative matrix factorization (NMF) dictionaries capable of identifying frequency band components associated with speech and frequency band components associated with noise. The NMF dictionaries may be trained using voice samples and noise samples. The NMF dictionaries may be applied to noisy speech data to produce an NMF representation of the speech data which may then be applied using a dynamic mask to the noisy speech data in order to suppress the noise components of the noisy speech data and produce speech enhanced data.

Type: Grant

Filed: September 12, 2018

Date of Patent: October 20, 2020

Assignee: BOARD OF TRUSTEES OF MICHIGAN STATE UNIVERSITY

Inventors: Mi Zhang, Kai Cao, Xiao Zeng, Haochen Sun
Interactive music feedback system

Patent number: 10795929

Abstract: Embodiments of the invention are directed to apparatuses, methods, and computer program products for generating a rating score based on color. In some embodiments, the apparatuses, methods, and computer program products are configured to: receive at least one input, wherein the at least one input comprises at least one of a first numerical value, an emoticon, a color, or at least one keyword; process the at least one input; and generate a second numerical value based on processing the at least one input.

Type: Grant

Filed: September 1, 2018

Date of Patent: October 6, 2020

Inventor: Trent R McKenzie
Assessment of a pulmonary condition by speech analysis

Patent number: 10796805

Abstract: Described embodiments include apparatus that includes a network interface (28) and a processor (30). The processor is configured to receive, via the network interface, speech of a subject (22) who suffers from a pulmonary condition related to accumulation of excess fluid, to identify, by analyzing the speech, one or more speech-related parameters of the speech, to assess, in response to the speech-related parameters, a status of the pulmonary condition, and to generate, in response thereto, an output indicative of the status of the pulmonary condition. Other embodiments are also described.

Type: Grant

Filed: October 5, 2016

Date of Patent: October 6, 2020

Assignee: CORDIO MEDICAL LTD.

Inventors: Chaim Lotan, Sigal Kremer-Tal, Aviv Lotan, Zeev Schlik, Avinoam Gemer, Yehuda Snir, Yonatan Sasson, Margarita Sheinkerman
Word embedding system

Patent number: 10789942

Abstract: A computer-implemented method, computer program product, and computer processing system are provided for word embedding. The method includes receiving, by a processor device, a word embedding matrix. The method further includes generating, by a processor device, an average pooling vector and a max pooling vector, based on the word embedding matrix. The method also includes generating, by the processor device, a prediction by applying a Multi-Layer Perceptron (MLP) to the average pooling vector and the max pooling vector.

Type: Grant

Filed: October 18, 2018

Date of Patent: September 29, 2020

Assignee: NEC Corporation

Inventors: Renqiang Min, Dinghan Shen
Systems and methods for identifying voice

Patent number: 10789947

Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. Feature vectors of the audio segments may be determined. The feature vectors may be processed through a classifier. The classifier may output scores on whether the audio segments contain voice. One or more of the audio segments may be identified as containing voice based on the scores and a two-step hysteresis thresholding. Storage of the identification of the one or more of the audio segments as containing voice in one or more storage media may be effectuated.

Type: Grant

Filed: December 19, 2019

Date of Patent: September 29, 2020

Assignee: GoPro, Inc.

Inventor: Gabriel Lema
Babble noise suppression

Patent number: 10783899

Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user's speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.

Type: Grant

Filed: November 18, 2016

Date of Patent: September 22, 2020

Assignee: Cerence Operating Company

Inventors: Simon Graf, Tobias Herbig, Markus Buck
Method and device of audio source separation

Patent number: 10770090

Abstract: A method of audio source separation includes steps of applying a demixing matrix on a plurality of received signals to generate a plurality of separated results; performing a recognition operation on the plurality of separated results to generate a plurality of recognition scores; generating a constraint according to the plurality of recognition scores; and adjusting the demixing matrix according to the constraint; where the adjusted demixing matrix is applied to the plurality of received signals to generate a plurality of updated separated results from the plurality of received signals.

Type: Grant

Filed: June 2, 2017

Date of Patent: September 8, 2020

Assignee: Realtek Semiconductor Corp.

Inventors: Ming-Tang Lee, Chung-Shih Chu
Driving support method, vehicle, and driving support system

Patent number: 10759447

Abstract: A driving support method for a vehicle includes: acquiring a sound of a sound source placed outside the vehicle; and displaying, on a display portion, driving support information corresponding to an utterance content of the sound, the utterance content of the sound being recognized by a sound recognition process, in a display mode suggesting a relative position of the sound source from the vehicle, the relative position being specified based on the sound.

Type: Grant

Filed: February 11, 2019

Date of Patent: September 1, 2020

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventor: Chihiro Inaba
Speaker verification

Patent number: 10762905

Abstract: A method for use in speaker verification, comprises: receiving a sample of a person's speech; estimating a noise measure in the received sample of the person's speech; if the estimated noise measure does not meet a predetermined criterion, rejecting the received sample of the person's speech; and if the estimated noise measure meets the predetermined criterion: adding noise to the received sample to form a sample with increased noise; extracting features of the sample with increased noise; and forming a model of the person's speech from the extracted features.

Type: Grant

Filed: July 31, 2018

Date of Patent: September 1, 2020

Assignee: Cirrus Logic, Inc.

Inventors: Carlos Vaquero Avilés-Casco, David Martínez González
Closed-loop model parameter identification techniques for industrial model-based process controllers

Patent number: 10761522

Abstract: A method includes obtaining closed-loop data associated with operation of an industrial process controller, where the industrial process controller is configured to control at least part of an industrial process using at least one model. The method also includes generating at least one noise model associated with the industrial process controller using at least some of the closed-loop data. The method further includes filtering the closed-loop data based on the at least one noise model. In addition, the method includes generating one or more model parameters for the industrial process controller using the filtered closed-loop data.

Type: Grant

Filed: June 28, 2017

Date of Patent: September 1, 2020

Assignee: Honeywell Limited

Inventors: Qiugang Lu, Lee D. Rippon, R. Bhushan Gopaluni, Michael G. Forbes, Philip D. Loewen, Johan U. Backstrom, Guy A. Dumont
Loudspeaker beamforming for improved spatial coverage

Patent number: 10764676

Abstract: A system configured to improve spatial coverage of output audio and a corresponding user experience by performing upmixing and loudspeaker beamforming to stereo input signals. The system can perform upmixing to the stereo (e.g., two channel) input signal to extract a center channel and generate three-channel audio data. The system may then perform loudspeaker beamforming to the three-channel audio data to enable two loudspeakers to generate output audio having three distinct beams. The user may interpret the three distinct beams as originating from three separate locations, resulting in the user perceiving a wide virtual sound stage despite the loudspeakers being spaced close together on the device.

Type: Grant

Filed: September 17, 2019

Date of Patent: September 1, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Yuancheng Luo, Wontak Kim, Mihir Dhananjay Shetye
Pulse-based automatic speech recognition

Patent number: 10755698

Abstract: Various examples are provided related to speech recognition. In one example, a method includes converting an auditory signal into a pulse train, segmenting the pulse train into a series of frames having a predefined duration, and identifying a portion of the auditory signal by applying at least a portion of the series of frames segmented from the pulse train to a kernel adaptive autoregressive-moving-average (KAARMA) network. In another example, a speech recognition system includes processing circuitry configured to convert an auditory signal into a pulse train, segment the pulse train into a secured of frames, and identifying a portion of the auditory signal by applying at least a portion of the series of frames segmented from the pulse train to a KAARMA network. The series of frames segmented from the pulse train can be applied to a KAARMA chain including a plurality of KAARMA networks for identification.

Type: Grant

Filed: December 7, 2016

Date of Patent: August 25, 2020

Assignee: UNIVERSITY OF FLORIDA RESEARCH FOUNDATION, INC.

Inventors: Jose C. Principe, Kan Li
Active noise control with planar transducers

Patent number: 10757503

Abstract: Active noise control (ANC), including active and adaptive noise cancellation (ANC) with non-voice-coil transducers having highly linear transfer functions, such as planar transducers, planar magnetic transducers, electro-static transducers, and piezo-electric transducers. This active and adaptive noise cancellation (ANC) may be used with: planar transducer headphones and earphones; open-backed and closed-back headphones and earphones; in-ear earphones, and phase plugs.

Type: Grant

Filed: February 15, 2018

Date of Patent: August 25, 2020

Assignee: Audeze, LLC

Inventor: Dragoslav Colich
Spatial low-crosstalk headset

Patent number: 10735861

Abstract: An apparatus for reducing cross-talk between transmitted audio signals and received audio in a headset. The headset includes one or more of a set of earphones, a headset frame, a microphone boom with an array of MEMS microphone configured to isolate the earphone audio from the microphone audio, a VOX circuit, low crosstalk cable(s), and/or other components. Sets of microphones may be enabled and/or disabled to reduce cross-talk between received audio signals and transmitted audio signals. The VOX circuit is configured to reduce cross-talk between received audio signals and transmitted audio signals.

Type: Grant

Filed: February 6, 2019

Date of Patent: August 4, 2020

Assignee: HM Electronics, Inc.

Inventors: Charles Butten, Karl Knoblock, Robert Snyder
Waking computing devices based on ambient noise

Patent number: 10725523

Abstract: Examples disclosed herein provide the ability for a computing device to determine a noise threshold to wake on ambient noises. In one example method, the computing device tracks sound, detected by a microphone of the computing device, over a period of time and, based on the sound tracked over the period of time, determines a noise threshold. The computing device tunes a sensitivity of the microphone to wake the computing device when ambient noise, detected by the microphone, is to have a signal strength equal to or exceeding the noise threshold.

Type: Grant

Filed: April 11, 2016

Date of Patent: July 28, 2020

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Alexander Wayne Clark, Kent E Biggs, Richard E Hodges
Signal processing apparatus and signal processing method

Patent number: 10705620

Abstract: There is provided a signal processing apparatus including: a control unit that executes, on a basis of a waveform signal generated in accordance with a motion of an attachment portion of a sensor attached to a tool or a body, effect processing for the waveform signal or another waveform signal, the waveform signal being output from the sensor. The signal processing apparatus performs presentation so that a body motion itself can be aurally felt.

Type: Grant

Filed: October 6, 2016

Date of Patent: July 7, 2020

Assignee: SONY CCORPORATION

Inventors: Heesoon Kim, Masaharu Yoshino, Masahiko Inami, Kouta Minamizawa, Yuta Sugiura, Yusuke Mizushina, Tatsushi Nashida
Earpiece with source selection within ambient environment

Patent number: 10681450

Abstract: A wireless earpiece includes a wireless earpiece housing, at least one microphone for detecting ambient environment sound, and a processor disposed within the wireless earpiece housing, the processor configured to distinguish between two or more sources of sound within the ambient environment sound. The wireless earpiece further includes a user interface operatively connected to the processor. The processor is configured to receive user input through the user interface to select one of the sources of sound within the ambient environment sound and wherein the processor is configured to process the ambient environment sound to emphasize portions of the ambient environment sound generated by the one of the sources of the ambient environment sound selected by the user to produce a modified sound. The earpiece may further include a speaker operatively connected to the processor to reproduce the modified sound.

Type: Grant

Filed: July 30, 2018

Date of Patent: June 9, 2020

Assignee: BRAGI GmbH

Inventors: Peter Vincent Boesen, Darko Dragicevic
System and method for evaluating the quality of a communication session

Patent number: 10666791

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for evaluating the quality of a communication session. One of the methods includes identifying, by a communication system, a communication session between one or more users of the communication system, wherein, during the communication session, session data is routed between a first communications device of a first user of the communication system and one or more other communications devices along a communication path; obtaining, from each of a plurality of communication nodes along the communication path, quality data relating to a quality of the communication session at the communication node; generating, using the quality data, a model input to a quality score machine learning model; and providing the model input as input to the quality score machine learning model to generate the estimated quality score for at least the portion of the communication session.

Type: Grant

Filed: March 12, 2019

Date of Patent: May 26, 2020

Assignee: RingCentral, Inc.

Inventors: Kira Makagon, Helen Prask, Yuri Ardulov, Igor Rusinov, Ivan Gennadevich Anisimov
Interactive system, terminal, method of controlling dialog, and program for causing computer to function as interactive system

Patent number: 10657960

Abstract: A dialog content is generated using information that is unique to a user and information that is not unique. The processing executed by a dialog system includes a step of identifying a person based on a dialog with a user, a step of acquiring personal information, a step of analyzing the dialog, a step of extracting an event, a step of searching for a local episode and a global episode based on the personal information and the event, a step of generating dialog data using the search result, a step of outputting a dialog, and a step of accepting user evaluation.

Type: Grant

Filed: July 22, 2016

Date of Patent: May 19, 2020

Assignee: SHARP KABUSHIKI KAISHA

Inventors: Rei Tokunaga, Toru Ueda
Promoting voice actions to hotwords

Patent number: 10643614

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

Type: Grant

Filed: December 10, 2018

Date of Patent: May 5, 2020

Assignee: Google LLC

Inventor: Matthew Sharifi
Execution of voice commands in a multi-device system

Patent number: 10621980

Abstract: Performing speech recognition in a multi-device system includes receiving a first audio signal that is generated by a first microphone in response to a verbal utterance, and a second audio signal that is generated by a second microphone in response to the verbal utterance; dividing the first audio signal into a first sequence of temporal segments; dividing the second audio signal into a second sequence of temporal segments; comparing a sound energy level associated with a first temporal segment of the first sequence to a sound energy level associated with a first temporal segment of the second sequence; based on the comparing, selecting, as a first temporal segment of a speech recognition audio signal, one of the first temporal segment of the first sequence and the first temporal segment of the second sequence; and performing speech recognition on the speech recognition audio signal.

Type: Grant

Filed: March 21, 2017

Date of Patent: April 14, 2020

Assignee: Harman International Industries, Inc.

Inventor: Seon Man Kim
System and method for mobile automatic speech recognition

Patent number: 10607600

Abstract: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.

Type: Grant

Filed: February 12, 2018

Date of Patent: March 31, 2020

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Sarangarajan Parthasarathy, Richard Cameron Rose
Speech signal recognition system and method

Patent number: 10607597

Abstract: A speech signal recognition method, apparatus, and system. The speech signal recognition method may include obtaining by or from a terminal an output of a personalization layer, with respect to a speech signal provided by a user of the terminal, having been implemented by input of the speech signal to the personalization layer, the personalization layer being previously trained based on speech features of the user, implementing a global model by providing the obtained output of the personalization layer to the global model, the global model being configured to output a phonemic signal indicating a phoneme included in the speech signal through the global model being previously trained based on speech features common to a plurality of users, and re-training the personalization layer based on the phonemic signal output from the global model, where the personalization layer and the global model collectively represent an acoustic model.

Type: Grant

Filed: March 9, 2018

Date of Patent: March 31, 2020

Assignee: Samsung Electronics Co., Ltd.

Inventors: Minyoung Mun, SangHyun Yoo, Young Sang Choi, Ki Soo Kwon, Hodong Lee
Method, system and device for detecting a silence period status in a user equipment

Patent number: 10602387

Abstract: A second device that is in communication with a first device receives transmissions of the first device and detects a SILENCE period status of the first device, which corresponds to a status wherein the first device has no speech samples to be transmitted towards the second apparatus. The second device determines the type of the received transmissions, counts the number of received transmissions of a first type, and times a time interval between the last received transmission of the first type and the last received transmission of a second determined type. At reception of a transmission of the first type, the second device detects whether the first device is in the SILENCE period status on an evaluation of the counted number of transmissions of the first type and the time interval of the last received transmission of the first type and the last received transmission of the second type.

Type: Grant

Filed: July 12, 2018

Date of Patent: March 24, 2020

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Carola Faronius, Saad Naveed Ahmed, Don Corry
Reducing audibility of sensor noise floor in a road noise cancellation system

Patent number: 10593317

Abstract: A road noise cancellation (RNC) system may include a controller and attenuator for reducing the audibility of the noise floor caused by the system's vibration sensors. A level of anti-noise at a location in a passenger cabin that may be attributed to the sensor noise floor may be estimated. An actual sound level in the passenger cabin may be measured or estimated, with the sensor noise floor component algorithmically removed. The difference in levels may be compared to a predetermined threshold to determine an amount of attenuation, if any, to be applied to an anti-noise signal to reduce audibility.

Type: Grant

Filed: December 20, 2018

Date of Patent: March 17, 2020

Assignee: HARMAN INTERNATIONAL INDUSTRIES, INCORPORATED

Inventors: Kevin J. Bastyr, James May
Voice activity detector for audio signals

Patent number: 10586557

Abstract: According to one aspect, a method for determining voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having a sample rate, and spitting the audio signal into a plurality of subbands, the plurality of subbands including at least a lowest subband and a highest subband. The method further comprises filtering the lowest subband to reduce an energy of the lowest subband, estimating a noise level for at least some of the plurality of subbands, and computing a signal-to-noise ratio for at least some of the plurality of subbands. The method also includes determining a speech activity level based at least in part on the computed signal-to-noise ratios and an average of an energy of at least some of the plurality of subbands.

Type: Grant

Filed: July 19, 2019

Date of Patent: March 10, 2020

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Hannes Muesch
Health monitoring system and appliance

Patent number: 10573314

Abstract: Systems and methods are disclosed. A digitized human vocal expression of a user and digital images are received over a network from a remote device. The digitized human vocal expression is processed to determine characteristics of the human vocal expression, including: pitch, volume, rapidity, a magnitude spectrum identify, and/or pauses in speech. Digital images are received and processed to detect characteristics of the user face, including detecting if one or more of the following is present: a sagging lip, a crooked smile, uneven eyebrows, and/or facial droop. Based at least on part on the human vocal expression characteristics and face characteristics, a determination is made as to what action is to be taken. A cepstrum pitch may be determined using an inverse Fourier transform of a logarithm of a spectrum of a human vocal expression signal. The volume may be determined using peak heights in a power spectrum of the human vocal expression.

Type: Grant

Filed: February 27, 2019

Date of Patent: February 25, 2020

Inventor: Karen Elaine Khaleghi
Speech recognition system and method using an adaptive incremental learning approach

Patent number: 10573304

Abstract: The present disclosure relates to speech recognition systems and methods using an adaptive incremental learning approach. More specifically, the present disclosure relates to adaptive incremental learning in a self-taught vocal user interface.

Type: Grant

Filed: November 4, 2015

Date of Patent: February 25, 2020

Assignee: KATHOLIEKE UNIVERSITEIT LEUVEN

Inventors: Jort Gemmeke, Bart Ons, Hugo Van Hamme
Direction based end-pointing for speech recognition

Patent number: 10566012

Abstract: A speech recognition system utilizing automatic speech recognition techniques such as end-pointing techniques in conjunction with beamforming and/or signal processing to isolate speech from one or more speaking users from multiple received audio signals and to detect the beginning and/or end of the speech based at least in part on the isolation. Audio capture devices such as microphones may be arranged in a beamforming array to receive the multiple audio signals. Multiple audio sources including speech may be identified in different beams and processed.

Type: Grant

Filed: October 12, 2018

Date of Patent: February 18, 2020

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Kenneth John Basye, Jeffrey Penrod Adams
User voice activity detection methods, devices, assemblies, and components

Patent number: 10564925

Abstract: Many headsets include automatic noise cancellation (ANC) which dramatically reduces perceived background noise and improves user listening experience. Unfortunately, the voice microphones in these devices often capture ambient noise that the headsets output during phone calls or other communication sessions to other users. In response, many headsets and communication devices provide manual muting circuitry, but users frequently forget to turn the muting on and/or off, creating further problems as they communicate. To address this, the present inventors devised, among other things, an exemplary headset that detects the absence or presence of user speech, automatically muting and unmuting the voice microphone without user intervention. Some embodiments leverage relationships between feedback and feedforward signals in ANC circuitry to detect user speech, avoiding the addition of extra hardware to the headset.

Type: Grant

Filed: September 21, 2017

Date of Patent: February 18, 2020

Inventors: Jiajin An, Michael Jon Wurtz, David Wurtz, Manpreet Khaira, Amit Kumar, Shawn O'Connor, Shankar Rathoud, James Scanlan, Eric Sorensen
Systems and methods for locating mobile devices within a vehicle

Patent number: 10555133

Abstract: A method includes receiving, by sensors inside an enclosure of a vehicle, signals generated by signal generators in the enclosure of the vehicle. One of the sensors or signal generators may be part of a mobile device inside the enclosure. The method also includes determining a location and orientation of the mobile device from the signals. The method further includes determining, based on the location and orientation of the mobile device, an object in the enclosure that the mobile device is pointing to. The mobile device further includes transmitting a message to the mobile device in response to determining that the mobile device is pointing to the object, so as to cause the mobile device to display a user interface to allow the mobile device to control the object.

Type: Grant

Filed: September 14, 2017

Date of Patent: February 4, 2020

Assignee: Apple Inc.

Inventors: Sawyer I. Cohen, Jack J. Wanderman, Romain A. Teil, Scott M. Herz
Voice operating system, server device, on-vehicle device, and voice operating method

Patent number: 10540969

Abstract: A purpose of the present invention is to provide a technique for easily performing accurate voice recognition.

Type: Grant

Filed: July 21, 2016

Date of Patent: January 21, 2020

Assignee: Clarion Co., Ltd.

Inventors: Takashi Yamaguchi, Yasushi Nagai
Systems and methods for identifying voice

Patent number: 10535340

Abstract: Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. Feature vectors of the audio segments may be determined. The feature vectors may be processed through a classifier. The classifier may output scores on whether the audio segments contain voice. One or more of the audio segments may be identified as containing voice based on the scores and a two-step hysteresis thresholding. Storage of the identification of the one or more of the audio segments as containing voice in one or more storage media may be effectuated.

Type: Grant

Filed: August 15, 2019

Date of Patent: January 14, 2020

Assignee: GoPro, Inc.

Inventor: Gabriel Lema
Apparatus and method for generating an enhanced signal using independent noise-filling identified by an identification vector

Patent number: 10529348

Abstract: An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.

Type: Grant

Filed: January 24, 2017

Date of Patent: January 7, 2020

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Ralf Geiger, Andreas Niedermeier, Matthias Neusinger, Konstantin Schmidt, Stephan Wilde, Benjamin Schubert, Christian Neukam
Method and system for reducing background sounds in a noisy environment

Patent number: 10529358

Abstract: A method for reducing noise to a user to enable a conversation-of-interest to be heard, the noise originating from a noise source, the method comprising the steps of: operating at least one first device located at a first distance from the noise source, the user having noise-cancellation earphones connected to a second mobile device, the second mobile device located at a second distance from the noise source, the first distance less than the second distance; prehearing noise from the noise source using the at least one first device; analyzing the preheard noise to yield a respective analyzed noise signal; and processing the respective analyzed noise signal to effect noise cancellation for the noise-cancellation earphones.

Type: Grant

Filed: February 23, 2018

Date of Patent: January 7, 2020

Inventor: Shmuel Ur
Devices and methods for fall detection based on phase segmentation

Patent number: 10506990

Abstract: Aspects of the subject matter described in this disclosure can be implemented in a fall detection device and method. One or more motion sensors can access a user's acceleration data. The acceleration data can be segmented using a segmentation algorithm to identify a potential fall event. The segmentation algorithm can determine a cumulative sum of the acceleration data, where the cumulative sum is based on acceleration values being greater than or less than an acceleration threshold value, and a potential fall event can be identified where the cumulative sum is greater than a cumulative sum threshold value. Statistical features can be extracted from the segmented acceleration data and aggregated, and a determination can be made as to whether the potential fall event is a fall event based at least in part on the statistical features.

Type: Grant

Filed: September 9, 2016

Date of Patent: December 17, 2019

Assignee: QUALCOMM Incorporated

Inventors: Jin Won Lee, Xinzhou Wu, Rashid Ahmed Akbar Attar, Feng Han

prev 1 2 3 4 5 6 7 … next