Patents Examined by Sonia L Gay
  • Patent number: 10650809
    Abstract: The present disclosure provides a speech recognition method and device. The method includes: receiving a speech signal; decoding the speech signal according to an acoustic model, a language model and a decoding network established in advance, and dynamically adding a blank unit in a decoding process to obtain an optimum decoding path with the added blank unit, in which the acoustic model is obtained based on connectionist temporal classification training, the acoustic model includes basic pronunciation units and the blank unit, and the decoding network includes a plurality of decoding paths consisting of the basic pronunciation units; and outputting the optimum decoding path as a recognition result of the speech signal.
    Type: Grant
    Filed: July 26, 2016
    Date of Patent: May 12, 2020
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Sheng Qian, Fuping Pan
  • Patent number: 10645514
    Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
    Type: Grant
    Filed: September 10, 2018
    Date of Patent: May 5, 2020
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yong Ju Lee, Jeong Il Seo, Seung Kwon Beack, Kyeong Ok Kang, Jin Woong Kim, Jae Hyoun Yoo
  • Patent number: 10642573
    Abstract: Embodiments of the disclosure include an improved content streaming system that is configured to simplify and streamline the process of streaming media content from one or more content providers to one or more electronic devices. In some embodiments, the interaction of a user with one or more components in a content distribution system is used to initiate the streaming of media content to one or more content players from either a first content server or a second content server.
    Type: Grant
    Filed: March 18, 2019
    Date of Patent: May 5, 2020
    Assignee: LOGITECH EUROPE S.A.
    Inventors: James L. Thurman, John Dittlinger, William Johnson, Xiao Li, Yezhou Wang
  • Patent number: 10643639
    Abstract: The present disclosure describes a system and method for determining cardiac parameters and physiological conditions of a user by analysing speech samples of said user. A user device of the user may record specifics of speech and use these specifics of speech as a speech sample of user's utterance. The user device may transmit the speech samples to a backend system. The system may isolate phonation segments from the speech samples. The system may filter one or more phonation segments. The system may isolate uttered speech segments from one or more phonation segments. The system may perform an acoustic-phonetic analysis of the uttered speech segments. The acoustic-phonetic analysis may use plurality of features for the analysis. The IPA phonemes may be used to derive speech markers that correspond to specific cardiac parameters and physiological conditions. The system may generate a resulting report after analysis which is transmitted to the user.
    Type: Grant
    Filed: June 7, 2018
    Date of Patent: May 5, 2020
    Inventors: Ajit Arun Zadgaonkar, Arun Shrihari Zadgaonkar
  • Patent number: 10643638
    Abstract: A technique determination device according to one embodiment of the present invention comprises an input sound acquisition unit acquiring an input sound, a pitch detection unit detecting a pitch on a time-series basis based on the input sound, a sound-volume detection unit detecting a sound volume on the time series basis based on the input sound, a first starting-point detection unit determining whether variation of the sound volume is equal to or larger than a predetermined threshold for each predetermined period and detecting a starting point of a period in which the variation of the sound volume is equal to or larger than the threshold as a first starting point, and a technique determination unit determining a technique of the input sound based on a change of the sound volume after the first starting point and variation of the pitch after the first starting point.
    Type: Grant
    Filed: May 25, 2018
    Date of Patent: May 5, 2020
    Assignee: Yamaha Corporation
    Inventors: Ryuichi Nariyama, Shuichi Matsumoto
  • Patent number: 10636420
    Abstract: Disclosed is an electronic device including: at least one processor; and a memory electrically connected to the at least one processor, wherein the memory stores instructions to recognize a received first voice, recognizes a first speaker based on the recognized first voice, and determines a response corresponding to the first voice based on a result of the recognition of the first speaker. Other embodiments are possible.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: April 28, 2020
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Shin-Jae Kang, Jeong-Hyun Ha, Seok-Yeong Jung
  • Patent number: 10638246
    Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: October 16, 2017
    Date of Patent: April 28, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
  • Patent number: 10631116
    Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. Gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. Gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at a media consumption site.
    Type: Grant
    Filed: June 25, 2018
    Date of Patent: April 21, 2020
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Nicolas R. Tsingos, David S. McGrath, Freddie Sanchez, Antonio Mateos Sole
  • Patent number: 10629199
    Abstract: This disclosure describes, in part, techniques for implementing voice-enabled devices in vehicle environments to facilitate voice interaction with vehicle computing devices. Due to the differing communication capabilities of existing vehicle computing devices, the techniques described herein describe different communication topologies for facilitating voice interaction with the vehicle computing devices. In some examples, the voice-enabled device may be communicatively coupled to a user device, which may communicate with a remote speech-processing system to determine and perform operations responsive to the voice commands, such as conducting phone calls using loudspeakers of the vehicle computing device, streaming music to the vehicle computing device, and so forth.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: April 21, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Rangaprabhu Parthasarathy, Snehal G. Joshi, Arvind Mandhani, Dhananjay Motwani, Dibyendu Nandy, Hans Edward Birch-Jensen, Ambika Pajjuri
  • Patent number: 10627979
    Abstract: Method, device, and storage medium for generating and self-adaptively rotating a voice recognition interface are provided. The method for generating and self-adaptively rotating a voice recognition interface includes: identifying a direction of an interface layout of a current interface when a voice recognition is triggered; generating the voice recognition interface in a same layout direction as the direction of the current interface layout, the voice recognition interface covering a portion of a screen; monitoring in real time whether the interface layout of the current interface rotates; and rotating the generated voice recognition interface self-adaptively along with the current interface layout, in response to a rotation of the interface layout of the current interface.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: April 21, 2020
    Assignee: GUANGZHOU SHENMA MOBILE INFORMATION TECHNOLOGY CO., LTD.
    Inventor: Ying Su
  • Patent number: 10621982
    Abstract: A mobile construction machine detects a speech processing trigger. It then performs speech processing (such as speech recognition and natural language understanding, speech synthesis, etc.) based on the detected speech processing trigger, to generate a speech processing result. A control signal generator generates control signals based on the speech processing result. The control signals can be used to control the mobile construction machine, to control another mobile construction machine, to provide information to a remote server location, or to aggregate information from multiple remote server locations.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: April 14, 2020
    Assignee: Deere & Company
    Inventors: Mark J. Cherney, Keith N. Chaston, Michael G. Kean, Douglas K. Wink, Sean P. West, John M. Hageman, Scott S. Hendron
  • Patent number: 10621971
    Abstract: Embodiments of the present disclosure provide a method and a device for extracting a speech feature based on artificial intelligence. The method includes performing a spectrum analysis on a speech to be recognized, to obtain a spectrum program of the speech; and extracting features of the spectrum program by using an Inception convolution structure of an image recognition algorithm, to obtain the speech feature of the speech. In embodiments, by performing the spectrum analysis on the speech to be recognized, the consecutive speech to be recognized is converted into the spectrum diagram. As the Inception convolution structure is an effective image recognition manner being able to accurately recognize features of an image, the spectrum program is recognized with the Inception convolution structure to extract the relative accurate speech feature from the speech to be recognized. Thus, the accuracy rate of the speech recognition is improved.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: April 14, 2020
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chao Li, Xiangang Li
  • Patent number: 10616691
    Abstract: A microphone circuit having an amplifier with an input operably coupled to a microphone motor also includes a low pass filter operably coupled to the output of the amplifier and a positive feedback network that operably couples to an output of the low-pass filter and to the amplifier input. For many useful application settings the aforementioned amplifier has unity gain while the positive feedback network has a fractional gain less than unity.
    Type: Grant
    Filed: November 11, 2016
    Date of Patent: April 7, 2020
    Assignee: Knowles Electronics, LLC
    Inventors: Dean Badillo, Michael Jennings
  • Patent number: 10606551
    Abstract: Embodiments of the disclosure include an improved content streaming system that is configured to simplify and streamline the process of streaming media content from one or more content providers to one or more electronic devices. In some embodiments, the interaction of a user with one or more components in a content distribution system is used to initiate the streaming of media content to one or more content players from either a first content server or a second content server.
    Type: Grant
    Filed: May 23, 2019
    Date of Patent: March 31, 2020
    Assignee: LOGITECH EUROPE S.A.
    Inventors: James L. Thurman, John Dittlinger, William Johnson, Xiao Daphne Li, Yezhou Wang
  • Patent number: 10609473
    Abstract: This disclosure relates to speakers and more specifically to an array speaker for distributing music uniformly across a room. A number of audio drivers can be radially distributed within a speaker housing so that an output of the drivers is distributed evenly throughout the room. In some embodiments, the exit geometry of the audio drivers can be configured to bounce off a surface supporting the array speaker to improve the distribution of music throughout the room. The array speaker can include a number of vibration isolation elements distributed within a housing of the array speaker. The vibration isolation elements can be configured to reduce the strength of forces generated by a subwoofer of the array speaker.
    Type: Grant
    Filed: June 2, 2017
    Date of Patent: March 31, 2020
    Assignee: Apple Inc.
    Inventors: Craig M. Stanley, Simon K. Porter, John H. Sheerin, Glenn K. Trainer, Jason C. Della Rosa, Ethan L. Huwe, Sean T. McIntosh, Erik L. Wang, Christopher J. Stringer, Molly J. Anderson
  • Patent number: 10601373
    Abstract: A dynamic boost audio system includes a booster circuit having a dynamically sliding power supply unit (PSU) capable of outputting power among a plurality of different power levels. The booster circuit is configured to identify a real-time audio level of an audio signal, and automatically adjust the power to the power level such that the audio signal is output in response to the real-time audio level.
    Type: Grant
    Filed: March 31, 2017
    Date of Patent: March 24, 2020
    Assignee: TYMPHANY HK LIMITED
    Inventor: Sebastiaan de Vries
  • Patent number: 10589984
    Abstract: The application relates to MEMS microphone transducers having a vent structure provided in the membrane layer and an opening formed at the vent structure for tuning the frequency response of the microphone.
    Type: Grant
    Filed: March 19, 2018
    Date of Patent: March 17, 2020
    Assignee: Cirrus Logic, Inc.
    Inventor: Marek Sebastian Piechocinski
  • Patent number: 10593340
    Abstract: There are provided decoding and encoding methods for encoding and decoding of multichannel audio content for playback on a speaker configuration with N channels. The decoding method comprises decoding, in a first decoding module, M input audio signals into M mid signals which are suitable for playback on a speaker configuration with M channels; and for each of the N channels in excess of M channels, receiving an additional input audio signal corresponding to one of the M mid signals and decoding the input audio signal and its corresponding mid signal so as to generate a stereo signal including a first and a second audio signal which are suitable for playback on two of the N channels of the speaker configuration.
    Type: Grant
    Filed: May 9, 2019
    Date of Patent: March 17, 2020
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Harald Mundt, Kristofer Kjoerling
  • Patent number: 10582324
    Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
    Type: Grant
    Filed: September 10, 2018
    Date of Patent: March 3, 2020
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yong Ju Lee, Jeong Il Seo, Seung Kwon Beack, Kyeong Ok Kang, Jin Woong Kim, Jae Hyoun Yoo
  • Patent number: 10582288
    Abstract: One or more embodiments set forth an audio processing system for a personal listening device that includes a set of microphones, a noise reduction module, an audio ducker, and a mixer. The set of microphones is configured to receive a first set of audio signals from an environment. The noise reduction module is configured to detect when a signal of interest is present in the first plurality of audio signals, and, upon detecting a signal of interest, transmit a ducking control signal. The audio ducker is configured to receive the ducking control signal, and receive a second plurality of audio signals via a playback device. The audio ducker is further configured to reduce an amplitude of a second plurality of audio signals relative to the signal of interest based on the ducking control signal. The mixer combines the first plurality of audio signals and second plurality of audio signals.
    Type: Grant
    Filed: June 26, 2015
    Date of Patent: March 3, 2020
    Assignee: HARMAN INTERNATIONAL INDUSTRIES, INCORPORATED
    Inventors: James M. Kirsch, Jeffrey Hutchings