Pseudo Stereophonic Patents (Class 381/17)
  • Patent number: 10993063
    Abstract: A method for processing a 3D audio effect and related products are provided. The method includes the following. In a VR scenario, eight first positions of eight monaural sound sources and eight pieces of monaural data corresponding to the eight monaural sound sources are obtained, where the eight monaural sound sources correspond to a target object, and each of the eight monaural sound sources is located at a position with unique 3D coordinates. A second position of the target object in the VR scenario is obtained, where the second position has 3D coordinates, and the eight first positions and the second position are based on the same coordinate origin. A distance between each of the eight first positions and the second position is determined to obtain eight distances. The eight pieces of monaural data are adjusted according to the eight distances. The adjusted eight pieces of monaural data are synthesized into 7.1-channel data.
    Type: Grant
    Filed: September 12, 2019
    Date of Patent: April 27, 2021
    Assignee: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS CORP., LTD.
    Inventor: Fenggui Yan
  • Patent number: 10993064
    Abstract: An apparatus comprising means configured to: receive first audio content from a remote apparatus, the audio content comprising at least one audio stream associated with respective location information; present the audio stream as spatial audio to be perceived as originating from a respective perceived-direction based on the location information; determine a user input direction to identify one of the audio streams; transmit a message to the remote apparatus identifying the identified audio stream; receive second audio content comprising an enhanced version of the at least one identified audio stream; present the enhanced version of the identified audio stream as spatial audio, wherein the perceived-direction of the identified audio stream is offset from the second perceived-direction of the enhanced version of the identified audio stream.
    Type: Grant
    Filed: November 19, 2019
    Date of Patent: April 27, 2021
    Assignee: Nokia Technologies Oy
    Inventors: Sujeet Shyamsundar Mate, Arto Lehtiniemi, Miikka Tapani Vilermo
  • Patent number: 10993065
    Abstract: Systems and methods to calibrate listening devices are disclosed herein. In some embodiments, a method to calibrate earphones includes determining Head Related Transfer Functions (HRTFs) corresponding to different parts of a user's anatomy (e.g., one or both of a listener's pinnae). The resulting HRTFs are combined to form a composite HRTF. In some embodiments, a first and a second HRTF are respectively determined for a first and second part of the user's anatomy. A composite HRTF of the user is generated by combining portions of the first and second HRTFs.
    Type: Grant
    Filed: November 9, 2018
    Date of Patent: April 27, 2021
    Assignee: Harman International Industries, Incorporated
    Inventors: Jason Riggs, Joy Lyons, Nicholas Millias, Aderito Beltran
  • Patent number: 10979806
    Abstract: An audio system having a depth capturing device and a microphone array to respectively detect a point cloud and a local sound field, is described. The point cloud includes points corresponding to objects in a source space, and the local sound field includes sounds received in respective directions from the source space. The audio system includes one or more processors to generate a global sound field based on distances to the points and directions of the sounds. The global sound field includes virtual sound sources emitting respective sounds at respective points. A speaker can render the virtual sound sources to a user in a virtual space that corresponds to the source space.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: April 13, 2021
    Assignee: Apple Inc.
    Inventors: Martin E. Johnson, Jonathan D. Sheaffer
  • Patent number: 10978057
    Abstract: A computing device may be configured to receive a content asset and to determine whether the content asset comprises one or more triggers. The trigger may be a word, phrase, or passcode that alerts a voice activated device to the presence of a voice command and may serve as an instruction to the voice activated device to cause execution of the voice command. In response to determining that the content asset comprises one or more triggers, the computing device may be configured to insert one or more signal markers into the content asset at a location corresponding to the one or more triggers, and to cause transmission and/or presentation of the content asset with the one or more signal markers. The signal markers may cause a voice activated device to ignore a voice command in the content asset, despite the presence of one or more triggers.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: April 13, 2021
    Assignee: Comcast Cable Communications, LLC
    Inventors: Shawn O'Malley, Michael Rosenwald, Jeremy Zaucha
  • Patent number: 10972850
    Abstract: A head mounted display (HMD) determines a distance between eyes of a user wearing the HMD, and head-related transfer functions (HRTFs) are selected based on the eye distance. The HMD displays a virtual image to the user and processes sound with the HRTFs into binaural sound that externally localizes at the virtual image. The HMD that determines the distance between the eyes of the user is also the HMD that displays the virtual image and processes the sound with the HRTFs.
    Type: Grant
    Filed: May 3, 2020
    Date of Patent: April 6, 2021
    Inventors: Glen A. Norris, Philip Scott Lyren
  • Patent number: 10971167
    Abstract: A personal audio assistant includes a first microphone for capturing audio, a logic circuit coupled to the first microphone for analyzing the captured audio, a communication module coupled to the logic circuit, and a user interaction element. The interaction element is configured to control at least one function or operation among invoking an operation of an electronic device communicatively coupled to the personal audio assistant via the communication module, invoking access to information on a network, invoking delivery of audio content, invoking delivery of text to speech, invoking speech to text processing of email, invoking delivery of text to speech of stock market information, invoking execution of a commercial transaction, invoking delivery of calendaring information, invoking delivery of medication reminders, or invoking access to location information using a GPS system. Other embodiments are disclosed.
    Type: Grant
    Filed: January 7, 2014
    Date of Patent: April 6, 2021
    Assignee: Staton Techiya, LLC
    Inventor: Steven Wayne Goldstein
  • Patent number: 10970092
    Abstract: The present invention relates to a terminal and a method of operating the terminal. The terminal can execute at least one task, can cause a display to display a soft key and information related to the executed at least one task at a first region of the display, and can cause the display to display at least one function indicator related to the information at a second region of the display.
    Type: Grant
    Filed: December 9, 2019
    Date of Patent: April 6, 2021
    Assignee: LG Electronics Inc.
    Inventors: Eunkyung Choi, Sunghye Yoon, Hyungtae Jang, Goeun Joo, Yoojin Choi
  • Patent number: 10972856
    Abstract: An audio processing apparatus has a setting processor that sets a size of a virtual sound source; and a signal processor that generates an audio signal by imparting to an audio signal a plurality of head-related transfer characteristics. The plurality of head-related transfer characteristics corresponds to respective points within a range that accords with the size set by the setting processor from among a plurality of points, with each point having a different position relative to a listening point.
    Type: Grant
    Filed: July 7, 2020
    Date of Patent: April 6, 2021
    Assignee: YAMAHA CORPORATION
    Inventors: Tsukasa Suenaga, Futoshi Shirakihara
  • Patent number: 10972835
    Abstract: A conference system with transmitting and receiving sides. The transmitting side has a microphone array unit with microphone capsules, and a processing unit. The processing unit is configured to receive output signals of the microphone capsules and to execute audio beamforming based on the received output signals for acquiring sound coming from an audio source in a first direction. The processing unit has a direction-recognition unit that computes from the output signals of said microphone capsules a score for each of multiple search grid spatial positions and uses a search grid spatial position having a higher score to identify said first direction. The receiving side has an audio reproduction system that reproduces an audio signal detected by the microphone array with directional information of the first direction. The detected audio signal and the directional information regarding the first direction are transmitted from the transmitting side to the receiving side.
    Type: Grant
    Filed: August 15, 2019
    Date of Patent: April 6, 2021
    Assignee: Sennheiser electronic GmbH & Co. KG
    Inventor: J. Douglas Rollow, IV
  • Patent number: 10972849
    Abstract: An electronic apparatus includes a memory configured to store instructions; and a processor configured to execute the instructions to: separate an input audio signal into a plurality of channel signals; identify a gain corresponding to a feature difference between a first channel signal, from among the plurality of channel signals, and a second channel signal, from among the plurality of channel signals; and adjust relative ratios among a plurality of output signals according to the identified gain to generate an output audio signal in which a sound image is varied.
    Type: Grant
    Filed: July 11, 2018
    Date of Patent: April 6, 2021
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Yoon-jae Lee, Woo-jung Lee, Young-suk Song, Hae-kwang Park
  • Patent number: 10966044
    Abstract: A system, computer-implemented method, and tangible computer-readable media for media playback. The method includes receiving a request from a user to play a media asset, detecting capabilities of the playback device, detecting playback environment acoustic characteristics, optimizing media playback settings for the playback device and the playback environment characteristics, preparing the media asset for playback on the playback device, and transferring the prepared media asset to the playback device for playback. The method can further transfer optimized playback settings to the playback device. Media asset preparation can be based at least in part on the optimized playback settings. A playback profile can store environment characteristics and playback device capabilities. The method continuously detects changes in the playback environment characteristics, and optimizes media playback settings when detected changes exceed a threshold.
    Type: Grant
    Filed: November 14, 2017
    Date of Patent: March 30, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Andrea Basso, Zhu Liu, Bernard S. Renger, Behzad Shahraray
  • Patent number: 10959033
    Abstract: Embodiments are described for a system of rendering object-based audio content through a system that includes individually addressable drivers, including at least one driver that is configured to project sound waves toward one or more surfaces within a listening environment for reflection to a listening area within the listening environment; a renderer configured to receive and process audio streams and one or more metadata sets associated with each of the audio streams and specifying a playback location of a respective audio stream; and a playback system coupled to the renderer and configured to render the audio streams to a plurality of audio feeds corresponding to the array of audio drivers in accordance with the one or more metadata sets.
    Type: Grant
    Filed: July 22, 2019
    Date of Patent: March 23, 2021
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Sripal S Mehta, Brett G Crockett, S. Spencer Hooks, Alan Seefeldt, Christophe Chabanne, C. Phillip Brown, Joshua B Lando, Brad Basler, Stewart Murrie
  • Patent number: 10952008
    Abstract: Systems and methods for audio communication are disclosed. The system includes a plurality of transducer units and a plurality of three dimensional sensor modules (TDSMs) adapted to be located in a plurality of sites/spaces to which service should be provided by the system. The plurality of transducer units are capable of emitting/directing and focusing ultra-sonic signals to respective coverage zones in the sites, such that a localized (confined) sound field can be formed at a selected spatial position in the coverage zones by utilizing a sound-from-ultrasound technique. The TDSMs are associated with respective sensing volumes in the sites and are operable to obtain sensory data indicative of the 3D arrangement of elements within the sites.
    Type: Grant
    Filed: January 15, 2017
    Date of Patent: March 16, 2021
    Assignee: NOVETO SYSTEMS LTD.
    Inventors: Silviu Zilberman, Tomer Shani, Noam Babayoff
  • Patent number: 10945067
    Abstract: An area reproduction method includes converting a sound pressure distribution at each frequency of a reproduced sound from a sound pressure distribution in a frequency domain into a sound pressure distribution in a spatial frequency domain. The reproduced sound is realized on a control line, including a reproduction line in which sound waves emitted from a speaker array including a plurality of speakers arranged intensify with each other, and a non-reproduction line in which the sound waves weaken with each other. The method includes determining a spatial frequency for use in adjustment of the reproduced sound, in the sound pressure distribution in the spatial frequency domain, based on a positional relationship between the speaker array and the control line, and adjusting a sound pressure of the reproduced sound, which is to be output by each of the plurality of speakers using the determined spatial frequency.
    Type: Grant
    Filed: December 12, 2018
    Date of Patent: March 9, 2021
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Atsushi Sakaguchi
  • Patent number: 10943599
    Abstract: An audio cancellation system includes a voice enabled computing system that is connected to an audio output device using a wired or wireless communication network. The voice enabled computing device can provide media content to a user and receive a voice command from the user. The connection between the voice enabled computing system and the audio output device introduces a time delay between the media content being generated at the voice enabled computing device and the media content being reproduced at the audio output device. The system operates to determine a calibration value adapted for the voice enabled computing system and the audio output device. The system uses the calibration value to filter the user's voice command from a recording of ambient sound including the media content, without requiring significant use of memory and computing resources.
    Type: Grant
    Filed: October 4, 2019
    Date of Patent: March 9, 2021
    Assignee: Spotify AB
    Inventors: Richard Mitic, Robert Swain, Daniel Bromand, Wagar Sheikh, James Robert Stansfield
  • Patent number: 10939222
    Abstract: A three-dimensional audio playing method and playing apparatus are disclosed. The three-dimensional audio playing method according to the present invention comprises: a decoding step of decoding a received audio signal and outputting the decoded audio signal and metadata; a room impulse response (RIR) decoding step of decoding RIR data when the RIR data is included in the received audio signal; a head-related impulse response (HRIR) generation step of generating HRIR data by using user head information when the RIR data is included in the received audio signal; a binaural room impulse response (BRIR) synthesis step of generating BRIR data by synthesizing the decoded RIR data and modeled HRIR data; and a binaural rendering step of outputting a binaural rendered audio signal by applying the generated BRIR data to the decoded audio signal. In addition, the three-dimensional audio playing method and playing apparatus, according to the present invention, support a 3DoF environment and a 6DoF environment.
    Type: Grant
    Filed: November 14, 2017
    Date of Patent: March 2, 2021
    Assignee: LG ELECTRONICS INC.
    Inventors: Tung Chin Lee, Sejin Oh
  • Patent number: 10939219
    Abstract: Audio perception in local proximity to visual cues is provided. A device includes a video display, a first row of audio transducers, and a second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighting outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.
    Type: Grant
    Filed: November 19, 2019
    Date of Patent: March 2, 2021
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Christophe Chabanne, Nicolas R. Tsingos, Charles Q. Robinson
  • Patent number: 10923142
    Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.
    Type: Grant
    Filed: January 8, 2019
    Date of Patent: February 16, 2021
    Assignee: Spotify AB
    Inventors: Andreas Simon Thore Jansson, Angus William Sackfield, Ching Chuan Sung
  • Patent number: 10924849
    Abstract: Provided is a sound source separation device that includes a combining unit that combines a first sound source separation signal of a predetermined sound source, the first sound source separation signal being separated from a mixed sound signal by a first sound source separation system, with a second sound source separation signal of the sound source, the second sound source separation signal being separated from the mixed sound signal by a second sound source separation system that differs in separation performance from the first sound source separation system in predetermined units of time, and that outputs a sound source separation signal obtained by the combination.
    Type: Grant
    Filed: August 25, 2017
    Date of Patent: February 16, 2021
    Assignee: Sony Corporation
    Inventors: Naoya Takahashi, Yuhki Mitsufuji, Stefan Uhlich, Michael Enenkl, Thomas Kemp
  • Patent number: 10924877
    Abstract: An audio signal processing method includes: acquiring 5.1-channel audio signals; acquiring head related transfer function (HRTF) data corresponding to each virtual speaker box in 5.1-channel virtual speaker boxes based on coordinates of the 5.1-channel virtual speaker boxes in a virtual environment; obtaining processed 5.1-channel audio signals by processing corresponding channel audio signals in the 5.1-channel audio signals based on the HRTF data corresponding to each virtual speaker box; and synthesizing the processed 5.1-channel audio signals into a stereo audio signal.
    Type: Grant
    Filed: November 30, 2018
    Date of Patent: February 16, 2021
    Assignee: GUANGZHOU KUGOU COMPUTER TECHNOLOGY CO., LTD
    Inventor: Jiaze Liu
  • Patent number: 10917737
    Abstract: A handheld portable electronic device (HPED) defines a zone that extends from a floor and around a user wearing a wearable electronic device (WED). The HPED designates a location in the zone for where binaural sound originates to the user. The WED tracks a location of the HPED, and sound is processed to originate as the binaural sound in the zone. The WED displays a virtual image at the location of the binaural sound in the zone.
    Type: Grant
    Filed: August 31, 2019
    Date of Patent: February 9, 2021
    Inventors: Philip Scott Lyren, Glen A. Norris
  • Patent number: 10911862
    Abstract: A display apparatus includes: a display module including a display panel configured to display an image, a vibration plate on a rear surface of the display module, a connection member between the rear surface of the display module and the vibration plate, at least one vibration module on the vibration plate, and an enclosure between the display module and the vibration plate, the enclosure being spaced apart from the vibration module, the enclosure surrounding the vibration module.
    Type: Grant
    Filed: August 7, 2019
    Date of Patent: February 2, 2021
    Assignee: LG Display Co., Ltd.
    Inventors: JaeMin Shin, ChanHeon Lee
  • Patent number: 10904690
    Abstract: An audio processing apparatus includes an interface, a control processor, an adjustment processor, channel modifiers, and a channel combiner. The interface is configured to receive audio channels including respective audio signals. The control processor is configured to generate a control signal from the audio signals. The adjustment processor is configured to calculate, based on the control signal, an adjusting parameter to an amplitude of at least one of the audio signals. The channel modifiers are configured to, using the adjusting parameter, adjust the audio signals in the respective audio channels. The channel combiner is configured to sum the audio channels after at least one channel has been adjusted, and output the summed audio channel to a user.
    Type: Grant
    Filed: December 15, 2019
    Date of Patent: January 26, 2021
    Assignee: NUVOTON TECHNOLOGY CORPORATION
    Inventor: Ittai Barkai
  • Patent number: 10904689
    Abstract: An audio metadata providing apparatus and method and a multichannel audio data playback apparatus and method to support a dynamic format conversion are provided. Dynamic format conversion information may include information about a plurality of format conversion schemes that are used to convert a first format set by a writer of multichannel audio data into a second format that is based on a playback environment of the multichannel audio data and that are set for each of playback periods of the multichannel audio data. The audio metadata providing apparatus may provide audio metadata including the dynamic format conversion information. The multichannel audio data playback apparatus may identify the dynamic format conversion information from the audio metadata, may convert the first format of the multichannel audio data into the second format based on the identified dynamic format conversion information, and may play back the multichannel audio data with the second format.
    Type: Grant
    Filed: February 21, 2020
    Date of Patent: January 26, 2021
    Assignees: Electronics and Telecommunications Research Institute, Kyonggi University Industry & Academia Cooperation Foundation
    Inventors: Jae Hyoun Yoo, Tae Jin Lee, Seok Jin Lee
  • Patent number: 10897570
    Abstract: A system generates an output audio signal for an object or virtual object using image data of a room to select a room impulse response from a database. A headset may include a depth camera assembly (DCA) and processing circuitry. The DCA generates depth image data of a room. The processing circuitry determines room parameters such as the dimensions of the room based on the depth image data. A room impulse response for the room is determined based on referencing a database of room impulse responses using the room parameters. An output audio signal is generated by convolving a source audio signal of an object with the room impulse response.
    Type: Grant
    Filed: January 28, 2019
    Date of Patent: January 19, 2021
    Assignee: Facebook Technologies, LLC
    Inventors: Philip Robinson, Scott Colburn, William Owen Brimijoin, II, Vamsi Krishna Ithapu, Peter Harty Dodds, Sebastiá Vicenç Amengual Gari, Carl Schissler
  • Patent number: 10893374
    Abstract: An electronic device according to various embodiments of the present invention may comprise: a gyro sensor for sensing rotation of an electronic device; a memory for storing a plurality of head related transfer functions corresponding to a plurality of directions; and a processor configured to generate a stereo signal by applying, to an audio signal, a first head related transfer function corresponding to a first direction, generate a stereo signal by applying, to the audio signal, a second head related transfer function corresponding to a third direction symmetric to a second direction with reference to the first direction when the electronic device rotates from the first direction to the second direction, and generate a stereo signal by applying the first head related transfer function to the audio signal when a pre-designated time elapses after the electronic device rotates from the first direction to the second direction.
    Type: Grant
    Filed: June 16, 2017
    Date of Patent: January 12, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jae Mo Yang, Min Ho Bae, Gang Youl Kim, Beak Kwon Son, Chul Min Choi, Ga Hee Kim, Ho Chul Hwang
  • Patent number: 10887721
    Abstract: A communication system, a communication device and related method, the communication device comprising a processor, a source interface comprising a first source interface for a first external source, and an output interface, wherein the processor is configured to obtain, e.g. receive, a first input signal via the first source interface, obtain a first complementing signal, combine the first input signal and the first complementing signal to a first combined signal, apply a first filter function to the first combined signal for provision of a first left output signal and a first right output signal, the first filter function comprising one or more head related transfer functions, and output a left output signal and a right output signal via the output interface, wherein the left output signal is based on the first left output signal and the right output signal is based on the first right output signal.
    Type: Grant
    Filed: June 7, 2019
    Date of Patent: January 5, 2021
    Assignee: GN Hearing A/S
    Inventors: Søren Christian Pedersen, Casper Silbo Raft
  • Patent number: 10885927
    Abstract: A personal audio assistant includes a first microphone for capturing audio, a logic circuit coupled to the first microphone, a communication module coupled to the logic circuit, a memory storage unit communicatively coupled to the logic circuit, and a user interaction element. The interaction element is configured to perform at least one among accessing selection of audio content via an audio input received at the first microphone or determining a user preference of audio content by tracking genre of content selected, artist of content selected, and at least one among the day of the week or time of day. Other embodiments are disclosed.
    Type: Grant
    Filed: January 7, 2014
    Date of Patent: January 5, 2021
    Assignee: Staton Techiya, LLC
    Inventor: Steven Wayne Goldstein
  • Patent number: 10880669
    Abstract: A first microphone on a first earpiece of a personal audio delivery device measures a first head related transfer function (HRTF) associated with a sound source. A second microphone on an earpiece of a personal audio delivery device measures a second HRTF associated with the sound source. An interaural time difference (ITD) is determined based on the first HRTF and second HRTF. A determination is made that the sound source is located in a first region based on the interaural time difference. A determination is made that the sound source is located in a second region within the first region based on the ITD of the first HRTF and the second HRTF and an ITD associated with third HRTFs for the second region. A location of the sound source is determined within the second region, which in some examples, is improved with a Kalman filter model.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: December 29, 2020
    Assignee: EmbodyVR, Inc.
    Inventors: Kaushik Sunder, Kapil Jain, Yuxiang Wang
  • Patent number: 10880638
    Abstract: Provided are a sound field forming apparatus and a method that enhance the reproducibility of a wavefront at a listener position. The sound field forming apparatus has a position acquisition unit to acquire position information indicative of a position of a listener or a position of a sound source to be formed, a control point specification unit to specify a control point with a distance from a speaker array of the listener or the sound source on the basis of the position information, and a filter unit to generate a speaker drive signal for forming a predetermined sound field by the speaker array by convolving a filter coefficient corresponding to the specified control point with a sound source signal.
    Type: Grant
    Filed: June 21, 2017
    Date of Patent: December 29, 2020
    Assignee: SONY CORPORATION
    Inventors: Yu Maeno, Yuhki Mitsufuji, Masafumi Takahashi
  • Patent number: 10872612
    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number ($\beta_e$) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to $\beta_e = \lceil \log_2(\lceil \log_2(\sqrt{K_{MAX}} \cdot O) \rceil + 1) \rceil$.
    Type: Grant
    Filed: January 23, 2019
    Date of Patent: December 22, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 10872602
    Abstract: Computer-implemented methods for training an acoustic model for a far-field utterance processing system are provided. The acoustic model may be configured to map an input audio signal into linguistic or paralinguistic units. The training may involve imparting far-field acoustic characteristics upon near-field training vectors that include a plurality of near-microphone utterance signals. Imparting the far-field acoustic characteristics may involve generating a plurality of simulated room impulse responses, convolving one or more of the simulated room impulse responses with the near-field training vectors, to produce a plurality of simulated far-field utterance signals and saving the results of the training in one or more non-transitory memory devices corresponding with the acoustic model. Generating simulated room impulse responses may involve simulating room reverberation times but not simulating early reflections from room surfaces.
    Type: Grant
    Filed: May 2, 2019
    Date of Patent: December 22, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, David S. McGrath, Richard J. Cartwright
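The augmentation idea in the abstract above (simulate a room impulse response from a reverberation time only, then convolve it with a near-microphone signal) might be sketched as follows. This is not the patented method: the exponentially decaying Gaussian-noise model is a common textbook stand-in chosen here as an assumption, and all names (`simulate_late_rir`, `convolve`, `rt60_s`) are hypothetical.

```python
import math
import random


def simulate_late_rir(rt60_s: float, sample_rate: int, length_s: float, seed: int = 0) -> list:
    """Noise-based impulse response whose envelope drops 60 dB after rt60_s seconds.

    No early reflections are modelled; only the diffuse decay is simulated.
    """
    rng = random.Random(seed)
    num_samples = round(length_s * sample_rate)
    # A 60 dB amplitude drop is a factor of 1000, hence ln(1000) over the RT60 span.
    decay_per_sample = math.log(1000.0) / (rt60_s * sample_rate)
    return [rng.gauss(0.0, 1.0) * math.exp(-decay_per_sample * n) for n in range(num_samples)]


def convolve(signal: list, impulse_response: list) -> list:
    """Direct-form linear convolution (quadratic cost, adequate for a short demo)."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


# Usage: turn a short near-field snippet into a simulated far-field one.
near_field = [0.0, 1.0, 0.5, -0.25]
rir = simulate_late_rir(rt60_s=0.5, sample_rate=8000, length_s=0.125)
far_field = convolve(near_field, rir)
```

A real training pipeline would typically draw many reverberation times at random and use FFT-based convolution; the direct loop is kept here only for readability.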
  • Patent number: 10869124
    Abstract: Provided is an information processing apparatus that is used by being adhered to the body of a user. The information processing apparatus includes a sound pickup sensor that has a sound pickup function, a communicating part that wirelessly transmits audio data picked up by the sound pickup sensor to the outside, a control part that controls the sound pickup sensor and the communicating part, a power source part that supplies a power source to at least one of the sound pickup sensor, the communicating part, or the control part, a housing part that accommodates therein at least one of the sound pickup sensor, the communicating part, the control part, or the power source part, and an adhering part that fixes the housing part to the user. The sound pickup part picks up a sound of the user using, for example, flesh conduction or bone conduction.
    Type: Grant
    Filed: March 23, 2018
    Date of Patent: December 15, 2020
    Assignee: SONY CORPORATION
    Inventors: Go Igarashi, Kazuma Yoshii, Junya Suzuki
  • Patent number: 10863032
    Abstract: Methods and apparatus assist listeners in distinguishing between electronically generated binaural sound and physical environment sound while the listener wears a wearable electronic device that provides the binaural sound to the listener. The wearable electronic device generates a visual alert or audio alert when the electronically generated binaural sound occurs.
    Type: Grant
    Filed: August 31, 2019
    Date of Patent: December 8, 2020
    Inventors: Glen A. Norris, Philip Scott Lyren
  • Patent number: 10854213
    Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
    Type: Grant
    Filed: December 28, 2018
    Date of Patent: December 1, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Simone Fueg, Jan Plogsties, Sascha Dick, Johannes Hilpert, Julien Robilliard, Achim Kuntz, Andreas Hoelzer
  • Patent number: 10848900
    Abstract: A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation comprises generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.
    Type: Grant
    Filed: March 15, 2018
    Date of Patent: November 24, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Simone Neukam, Jan Plogsties
  • Patent number: 10848890
    Abstract: Disclosed is an audio signal processing device for processing an audio signal. The audio signal processing device includes a processor. The processor obtains an input audio signal including an object audio signal, selects at least one of a plurality of rendering methods based on an azimuth of a sound object with respect to a listener, corresponding to the object audio signal in a virtual space simulated by an output audio signal, renders the object audio signal using a selected rendering method, and outputs the output audio signal including the rendered object audio signal.
    Type: Grant
    Filed: January 6, 2019
    Date of Patent: November 24, 2020
    Assignee: GAUDI AUDIO LAB, INC.
    Inventors: Hyunjoo Chung, Hyunoh Oh, Sangbae Chon
  • Patent number: 10834501
    Abstract: An information processing method includes obtaining an audio signal that represents sound collected by a sound collecting device; calculating, using the audio signal obtained, a volume of the sound collected by the sound collecting device; identifying, using the audio signal obtained, a type of a sound source of the sound collected by the sound collecting device; estimating a distance between the sound collecting device and the sound source of the sound collected by the sound collecting device based on a standard volume and the volume calculated, the standard volume being (i) stored in a database in which the type of the sound source and the standard volume are preassociated with each other, (ii) the volume of the sound from the sound source at a predetermined distance, and (iii) associated with the type of the sound source identified; and outputting an estimation result.
    Type: Grant
    Filed: August 15, 2019
    Date of Patent: November 10, 2020
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Ko Mizuno
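The distance estimate described in the abstract above can be illustrated with a small sketch. The abstract does not give the formula; the version below assumes free-field spherical spreading (about 6 dB of level drop per doubling of distance), and the table contents, names, and values (`STANDARD_VOLUME_DB`, `estimate_distance_m`, the 1 m reference distance) are hypothetical.

```python
# Hypothetical database: source type -> standard level in dB at the reference distance.
STANDARD_VOLUME_DB = {
    "speech": 60.0,
    "dog_bark": 90.0,
}

REFERENCE_DISTANCE_M = 1.0  # assumed "predetermined distance" for the table above


def estimate_distance_m(source_type: str, measured_db: float) -> float:
    """Estimate source distance from the gap between standard and measured level.

    Assumes the level falls by 20*log10(d / d_ref) dB, which holds only for
    a point source in a free field.
    """
    standard_db = STANDARD_VOLUME_DB[source_type]
    return REFERENCE_DISTANCE_M * 10.0 ** ((standard_db - measured_db) / 20.0)


# A bark measured 20 dB below its standard level maps to ten times the reference distance.
print(estimate_distance_m("dog_bark", 70.0))
```

Indoors, reflections make the level fall off more slowly than this model predicts, so such an estimate is best treated as a coarse indication.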
  • Patent number: 10827293
    Abstract: A sound reproducing method used in a sound reproducing apparatus that includes the steps outlined below is provided. A sound signal is generated with a 3D sound generating process according to listener data and sound data. Pre-recorded sound data is retrieved to further generate a target distance function corresponding to the sound distance. A fixed head-related transfer function corresponding to a fixed distance is retrieved. A target head-related transfer function corresponding to the sound distance is generated by adapting the target distance function to the fixed head-related transfer function. The sound signal is reproduced by multiplying the sound signal by the target head-related transfer function.
    Type: Grant
    Filed: October 15, 2018
    Date of Patent: November 3, 2020
    Assignee: HTC Corporation
    Inventors: Chun-Min Liao, Yan-Min Kuo, Li-Yen Lin, Chi-Tang Ho, Tien-Ming Wang, Tsung-Yu Tsai, Yen-Chieh Wang, Shuo-Yen Lin
  • Patent number: 10820132
    Abstract: A voice providing method includes obtaining position information from a voice reproducing device, and performing processing, in a case in which a first position of obtained position information and a second position of a virtual talker are in a predetermined relationship, to cause the voice reproducing device to reproduce voice for the virtual talker to have a dialogue with a user with content according to the user.
    Type: Grant
    Filed: October 9, 2019
    Date of Patent: October 27, 2020
    Assignees: Yamaha Corporation, Fujitsu Limited
    Inventors: Miho Kyoya, Mitsuki Arita, Ikuro Ohshima, Hiroki Uchida, Hiroaki Sugimura, Yukio Tada
  • Patent number: 10818300
    Abstract: An apparatus including: an input configured to receive from at least two microphones at least two audio signals; at least two processor instances configured to generate separate output audio signal tracks from the at least two audio signals from the at least two microphones; a file processor configured to link the at least two output audio signal tracks within a file structure.
    Type: Grant
    Filed: October 24, 2018
    Date of Patent: October 27, 2020
    Assignee: Nokia Technologies Oy
    Inventors: Marko Tapani Yliaho, Ari Juhani Koski
  • Patent number: 10812907
    Abstract: An expandable wireless speaker system and method is disclosed. The speaker system comprises a center module and at least two or more speakers. The center module has a unique identification number and has mechanical connections on at least one side. The center module is configured to receive an external device, wirelessly or directly connected and has a transceiver configured to scan for an unused channel and transmit audio. The speakers each have a mechanical connection on either side to connect to the center module or the other speakers. The two or more speakers are configured to scan the communication spectrum for the center module's unique identification and when the two or more speakers locate the center module's unique identification, the two or more speakers pair with the center module to receive audio. The communication spectrum can be a wireless channel such as 5.8 GHz or the FM radio spectrum.
    Type: Grant
    Filed: January 9, 2019
    Date of Patent: October 20, 2020
    Assignee: DGL Group Ltd.
    Inventors: Marc Liniado, Andrew Kohlsmith
  • Patent number: 10812925
    Abstract: An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.
    Type: Grant
    Filed: May 26, 2020
    Date of Patent: October 20, 2020
    Assignee: SONY CORPORATION
    Inventors: Minoru Tsuji, Toru Chinen
  • Patent number: 10811032
    Abstract: A method and apparatus to determine a direction of arrival (DOA) of a talker in the presence of a source of spatially-coherent noise. A time sequence of audio samples that include the spatially-coherent noise is received and buffered. Aided by previously known data, a trigger point is detected in the time sequence of audio samples when the talker begins to talk. The buffered time sequence of audio samples is separated into a noise segment and a signal-plus-noise segment based on the trigger point. For each direction of a plurality of distinct directions: an energy difference is computed for the direction between the noise segment and the signal-plus-noise segment, and the DOA of the talker is selected as the direction of the plurality of distinct directions having a largest of the computed energy differences.
    Type: Grant
    Filed: December 19, 2018
    Date of Patent: October 20, 2020
    Assignee: Cirrus Logic, Inc.
    Inventors: Ghassan Maalouli, Seth Suppappola
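The selection rule in the abstract above (split the buffer at the trigger point, compare energies per direction, pick the largest difference) might look like the sketch below. How a per-direction signal is obtained is not stated in the abstract; here it is assumed that some beamformer has already produced one steered sample sequence per candidate direction, and all names (`select_doa`, `steered_signals`, `trigger_index`) are hypothetical.

```python
def mean_energy(samples: list) -> float:
    """Average squared amplitude of a non-empty sample list."""
    return sum(s * s for s in samples) / len(samples)


def select_doa(steered_signals: dict, trigger_index: int):
    """Return the direction whose energy rises most after the trigger point.

    steered_signals maps a direction label (e.g. degrees) to the buffered
    samples steered toward that direction (an assumed preprocessing step).
    """
    best_direction = None
    best_difference = float("-inf")
    for direction, samples in steered_signals.items():
        if not 0 < trigger_index < len(samples):
            raise ValueError("trigger_index must split the buffer into two non-empty parts")
        noise_segment = samples[:trigger_index]
        signal_plus_noise_segment = samples[trigger_index:]
        difference = mean_energy(signal_plus_noise_segment) - mean_energy(noise_segment)
        if difference > best_difference:
            best_direction, best_difference = direction, difference
    return best_direction


# Usage: the coherent noise is strongest toward 180, but the talker shows up at 90.
buffers = {
    0: [0.1] * 4 + [0.1] * 4,
    90: [0.1] * 4 + [1.0] * 4,
    180: [0.5] * 4 + [0.6] * 4,
}
print(select_doa(buffers, trigger_index=4))
```

Comparing differences rather than absolute energies is what keeps a loud, steady noise source from being mistaken for the talker.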
  • Patent number: 10805729
    Abstract: A system of crosstalk cancelled zone creation in audio playback comprising: main transducers emitting stereo soundwaves of an audio playback; a local system comprising at least two close-proximity-transducers (CPTs), each arranged proximal to one of left and right-side ear canals of a listener. Each of the CPTs comprises: a position tracking device for tracking the relative positions of the main transducers to the CPT and the other CPTs; a control unit for receiving the relative position data from the position tracking device and generating a control signal according to the relative position data for the generation of crosstalk cancellation (XTC) soundwaves. Each of the CPTs is configured to generate XTC soundwaves corresponding to the stereo soundwaves arriving at the corresponding ear of the listener. The generated XTC soundwaves are synchronized with the audio playback and with respect to the relative positions.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: October 13, 2020
    Assignees: Scuola universitaria professionale della Svizzera italiana (SUPSI)
    Inventors: Wai-Shan Lam, Daniel Weiss, Tiziano Leidi, Alberto Vancheri
  • Patent number: 10805730
    Abstract: A sound input/output device for a vehicle includes: microphones that are provided within a vehicle cabin; a plurality of directional speakers that are provided within the vehicle cabin, and that output sound to respective seats; an output position specifying section that specifies a seat that is an output destination from a voice collected by one or more of the microphones or from input from an utterer of a voice; and an output control section that outputs sound from one or more of the speakers to a seat specified by the output position specifying section.
    Type: Grant
    Filed: September 12, 2019
    Date of Patent: October 13, 2020
    Assignee: Toyota Jidosha Kabushiki Kaisha
    Inventors: Hideyuki Sakurai, Masashi Mori, Katsuya Shimazu
  • Patent number: 10798513
    Abstract: Methods, systems, and devices for head-related transfer function generation are described. A device may receive a digital representation of a first audio signal associated with a location relative to a subject, and select from a database a first reference head-related transfer function measurement pair corresponding to the location of the first audio signal. The device may then obtain a second head-related transfer function measurement pair by performing a style transfer operation on the selected reference head-related transfer function measurement pair based on a set of head-related transfer function measurement pairs specific to the subject. As a result, the device may output a second audio signal based on the digital representation of the first audio signal and the second head-related transfer function measurement pair.
    Type: Grant
    Filed: November 30, 2018
    Date of Patent: October 6, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Lae-Hoon Kim, Dongmei Wang, Erik Visser
  • Patent number: 10798511
    Abstract: Processing input audio channels for generating spatial audio can include receiving a plurality of microphone signals that capture a sound field. Each microphone signal can be transformed into a frequency domain signal. From each frequency domain signal, a direct component and a diffuse component can be extracted. The direct component can be processed with a parametric renderer. The diffuse component can be processed with a linear renderer. The components can be combined, resulting in a spatial audio output. The levels of the components can be adjusted to match a direct to diffuse ratio (DDR) of the output with the DDR of the captured sound field. Other aspects are also described and claimed.
    Type: Grant
    Filed: April 8, 2019
    Date of Patent: October 6, 2020
    Assignee: APPLE INC.
    Inventors: Jonathan D. Sheaffer, Juha O. Merimaa, Jason Wung, Martin E. Johnson, Peter A. Raffensperger, Joshua D. Atkins, Symeon Delikaris Manias, Mehrez Souden
  • Patent number: 10798510
    Abstract: A method expedites playing sound of a talking emoji from a first person with a first portable electronic device (PED) to a second person with a second PED. The second PED receives the talking emoji in mono sound and convolves the mono sound into binaural sound before receiving a request to play the sound to the second user. The second PED then plays the sound of the talking emoji in binaural sound after receiving the request from the second user.
    Type: Grant
    Filed: April 18, 2018
    Date of Patent: October 6, 2020
    Inventor: Philip Scott Lyren