For Storage Or Transmission Patents (Class 704/201)
  • Patent number: 11302317
    Abstract: The aim is to produce voice utterances that attract a target's interest more effectively. There is provided an information processing apparatus that includes an utterance control unit that controls output of voice utterance. The utterance control unit determines a target on the basis of an analyzed context, and controls an output device to output an attracting utterance that attracts an interest of the target. Furthermore, there is provided an information processing method that includes executing, by a processor, output control of voice utterance. The execution of the output control further includes determining a target on the basis of an analyzed context and controlling an output device to output an attracting utterance that attracts an interest of the target.
    Type: Grant
    Filed: December 26, 2017
    Date of Patent: April 12, 2022
    Assignee: SONY CORPORATION
    Inventors: Mari Saito, Hiro Iwase, Shinichi Kawano
  • Patent number: 11272249
    Abstract: A disclosed example apparatus includes means for storing a logged media impression for a media identifier representative of media accessed via the Internet, means for communicating to send a third-party device identifier or a user identifier corresponding to the user to a database proprietor when a user has not elected to not participate in third-party tracking corresponding to online activities, and receive user information from the database proprietor based on the third-party device identifier or the user identifier, and means for processing to log a demographic impression based on the media impression and the user information, and generate an impression report corresponding to the media based on the demographic impression.
    Type: Grant
    Filed: November 2, 2020
    Date of Patent: March 8, 2022
    Assignee: THE NIELSEN COMPANY (US), LLC
    Inventors: Steven J. Splaine, Adrian Swift
  • Patent number: 11263198
    Abstract: Systems and methods are provided for systematically finding and fixing automatic speech recognition (ASR) mistranscriptions and natural language understanding (NLU) misinterpretations and labeling data for machine learning. High similarity of non-identical consecutive queries indicates ASR mistranscriptions. Consecutive queries with close vectors in a semantic embedding space indicate NLU misinterpretations. Key phrases and barge-in also indicate errors. Only queries within a short amount of time are considered.
    Type: Grant
    Filed: September 5, 2019
    Date of Patent: March 1, 2022
    Assignee: SOUNDHOUND, INC.
    Inventors: Olivia Bettaglio, Pranav Singh
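The ASR heuristic in the abstract of patent 11263198 (consecutive, non-identical, highly similar queries issued within a short time) can be sketched as follows. This is only an illustration: the similarity measure (`difflib.SequenceMatcher`), the function name, and the `max_gap_s` and `min_similarity` thresholds are assumptions, not details taken from the patent.

```python
from difflib import SequenceMatcher


def flag_possible_mistranscriptions(queries, max_gap_s=30.0, min_similarity=0.8):
    """Return index pairs of consecutive queries that look like a retry.

    queries: list of (timestamp_seconds, text) tuples sorted by time.
    The thresholds are hypothetical values chosen for illustration.
    """
    flagged = []
    for i in range(len(queries) - 1):
        (t0, q0), (t1, q1) = queries[i], queries[i + 1]
        if t1 - t0 > max_gap_s:
            continue  # only queries close together in time are considered
        if q0 == q1:
            continue  # identical queries are not treated as corrections
        if SequenceMatcher(None, q0, q1).ratio() >= min_similarity:
            flagged.append((i, i + 1))
    return flagged


log = [
    (0.0, "play the beetles"),
    (5.0, "play the beatles"),
    (200.0, "weather in paris"),
    (203.0, "weather in paris"),
]
flagged_pairs = flag_possible_mistranscriptions(log)
```

Here only the first pair is flagged: the second pair is too far from the first in time, and the last two queries are identical.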
  • Patent number: 11249718
    Abstract: What is described is a system for outputting audio signals having a first output device for outputting audio signals having predeterminable parameter values of a first set of settable parameters, a data storage for storing parameter values, and an administration device. Thus, the administration device accesses the data storage and searches for stored parameter values for a set of parameters which is equal or similar to the first set of parameters. In addition, in case of having found stored parameter values, the administration device transfers the parameter values and/or parameter values determined therefrom to the first output device.
    Type: Grant
    Filed: May 31, 2018
    Date of Patent: February 15, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventor: Jens Ekkehart Appell
  • Patent number: 11250870
    Abstract: In accordance with an aspect of the disclosure, an electronic device comprises a communication circuitry configured to establish a voice call with an external electronic device; a microphone; a memory configured to store a first sound quality enhancement parameter; and a processor, wherein the processor is configured to: obtain an audio signal associated with speech through the microphone, during the voice call; transmit, to a server, voice data based on the audio signal when the ratio is within a first range; transmit, to the server, noise data based on the audio signal, when the ratio is within a second range; receive an updated sound quality enhancement parameter from the server with the communication circuit during the voice call; and adjust the first sound quality enhancement parameter stored in the memory, based on the updated sound quality enhancement parameter received from the server.
    Type: Grant
    Filed: December 5, 2019
    Date of Patent: February 15, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Gunwoo Lee, Hangil Moon, Soonho Baek, Beakkwon Son, Jaemo Yang
  • Patent number: 11238876
    Abstract: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilising high frequency reconstruction (HFR). It utilises a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR input.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: February 1, 2022
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 11221608
    Abstract: A device includes: a first acquiring unit to acquire context information corresponding to running operation among pieces of context information; a second acquiring unit to acquire detection information output from a detecting unit detecting a physical quantity of a target device; an extracting unit to extract, from the detection information, feature information indicating a feature of the detection information in an interval including a specific operation interval of the target device; a selecting unit to select reference feature information used as reference based on the feature information, and sequentially select pieces of target feature information; a calculating unit to calculate a likelihood of a process interval based on a comparison between the reference feature information and each piece of target feature information; a determining unit to determine whether the target feature information corresponding to the likelihood is included in the process interval based on the likelihood; and an estimating uni
    Type: Grant
    Filed: March 16, 2018
    Date of Patent: January 11, 2022
    Assignees: RICOH COMPANY, LTD., FANUC CORPORATION
    Inventors: Satoru Goto, Junichi Takami, Yasunobu Shirata, Tomoki Umezawa, Yohsuke Muramoto, Takafumi Horio, Yu Teshima
  • Patent number: 11170797
    Abstract: An audio decoder for providing a decoded audio information on the basis of an encoded audio information is disclosed. The audio decoder includes a linear-prediction-domain decoder configured to provide a first decoded audio information on the basis of an audio frame encoded in a linear prediction domain, a frequency domain decoder configured to provide a second decoded audio information on the basis of an audio frame encoded in a frequency domain, and a transition processor. The transition processor is configured to obtain a zero-input-response of a linear predictive filtering, wherein an initial state of the linear predictive filtering is defined depending on the first decoded audio information and the second decoded audio information, and modify the second decoded audio information depending on the zero-input-response, to obtain a smooth transition between the first and the modified second decoded audio information.
    Type: Grant
    Filed: May 31, 2019
    Date of Patent: November 9, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Guillaume Fuchs, Sascha Disch, Markus Multrus, Grzegorz Pietrzyk, Benjamin Schubert
  • Patent number: 11132177
    Abstract: CMOS-compatible high-speed and low power random number generator and techniques for use thereof are provided. In one aspect, a random number generator includes: a noise amplification unit configured to generate an amplified noise signal, wherein the noise amplification unit includes noise amplification unit transistors having a threshold voltage (Vt,amp) of about 0; and a computing unit configured to process the amplified noise signal from the noise amplification unit to generate a stream of random numbers, wherein the computing unit comprises computing unit transistors having absolute values of a Vt,compute that are larger than the Vt,amp of the noise amplification unit transistors in the noise amplification unit. For digital implementations, an analog-to-digital converter configured to digitize the amplified noise signal can be employed. For analog implementations, a sample and hold circuit configured to sample the amplified noise signal can be employed.
    Type: Grant
    Filed: May 14, 2019
    Date of Patent: September 28, 2021
    Assignee: International Business Machines Corporation
    Inventors: Bahman Hekmatshoartabari, Ghavam Shahidi
  • Patent number: 11127411
    Abstract: An audio decoder for providing a decoded audio information on the basis of an encoded audio information is disclosed. The audio decoder includes a linear-prediction-domain decoder configured to provide a first decoded audio information on the basis of an audio frame encoded in a linear prediction domain, a frequency domain decoder configured to provide a second decoded audio information on the basis of an audio frame encoded in a frequency domain, and a transition processor. The transition processor is configured to obtain a zero-input-response of a linear predictive filtering, wherein an initial state of the linear predictive filtering is defined depending on the first decoded audio information and the second decoded audio information, and modify the second decoded audio information depending on the zero-input-response, to obtain a smooth transition between the first and the modified second decoded audio information.
    Type: Grant
    Filed: May 31, 2019
    Date of Patent: September 21, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Guillaume Fuchs, Sascha Disch, Markus Multrus, Grzegorz Pietrzyk, Benjamin Schubert
  • Patent number: 11069373
    Abstract: A speech processing method for estimating a pitch frequency includes: executing a conversion process that includes acquiring an input spectrum from an input signal by converting the input signal from a time domain to a frequency domain; executing a feature amount acquisition process that includes acquiring a feature amount of speech likeness for each band included in a target band based on the input spectrum; executing a selection process that includes selecting a selection band selected from the target band based on the feature amount of speech likeness for each band; and executing a detection process that includes detecting a pitch frequency based on the input spectrum and the selection band.
    Type: Grant
    Filed: September 20, 2018
    Date of Patent: July 20, 2021
    Assignee: FUJITSU LIMITED
    Inventors: Sayuri Nakayama, Taro Togawa, Takeshi Otani
  • Patent number: 11030995
    Abstract: A method, computer system, and a computer program product for adaptively selecting an acoustic feature extractor in an Artificial Intelligence system is provided. The present invention may include acquiring a frame of an acoustic signal. The present invention may include checking a status of a flag to be used to indicate a proper acoustic feature extractor to be selected. The present invention may include processing the frame of the acoustic signal by the selected acoustic feature extractor indicated by the checked status. The present invention may include determining, based on data generated in the processing of the frame of the acoustic signal, an actual status of the frame of the acoustic signal. The present invention may include updating the status of the flag according to the actual status.
    Type: Grant
    Filed: June 18, 2019
    Date of Patent: June 8, 2021
    Assignee: International Business Machines Corporation
    Inventors: Xiao Xing Liang, Ning Zhang, Yu Ling Zheng, Yu Chen Zhou
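The loop described in the abstract of patent 11030995 (check a flag, process the frame with the extractor the flag selects, determine the frame's actual status from the processing data, update the flag) might look roughly like the sketch below. The two extractors, the `"quiet"`/`"active"` status names, and the energy threshold are hypothetical stand-ins; the abstract does not specify them.

```python
def energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in frame) / len(frame)


def cheap_extractor(frame):
    # Hypothetical lightweight extractor: energy only.
    return {"energy": energy(frame)}


def rich_extractor(frame):
    # Hypothetical heavier extractor: energy plus zero-crossing count.
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return {"energy": energy(frame), "zero_crossings": crossings}


EXTRACTORS = {"quiet": cheap_extractor, "active": rich_extractor}


def process_stream(frames, threshold=0.01, flag="quiet"):
    """Return the name of the extractor used for each frame."""
    used = []
    for frame in frames:
        extractor = EXTRACTORS[flag]   # check the flag, select the extractor
        features = extractor(frame)    # process the frame
        used.append(extractor.__name__)
        # Determine the actual status from the processing data, update the flag.
        flag = "active" if features["energy"] >= threshold else "quiet"
    return used


silence = [0.0, 0.0, 0.0, 0.0]
speech = [0.5, -0.5, 0.5, -0.5]
used = process_stream([silence, speech, speech, silence, silence])
```

Because the flag is updated only after a frame has been processed, the extractor choice lags the signal by one frame in this sketch.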
  • Patent number: 11019445
    Abstract: A parametric stereo upmix method for generating a left signal and a right signal from a mono downmix signal based on spatial parameters includes predicting a difference signal comprising a difference between the left signal and the right signal based on the mono downmix signal scaled with a prediction coefficient. The prediction coefficient is derived from the spatial parameters. The method further includes deriving the left signal and the right signal based on a sum and a difference of the mono downmix signal and said difference signal.
    Type: Grant
    Filed: October 22, 2018
    Date of Patent: May 25, 2021
    Assignee: Koninklijke Philips N.V.
    Inventor: Erik Gosuinus Petrus Schuijers
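The upmix in the abstract of patent 11019445 can be illustrated with a few lines of arithmetic: the difference signal is predicted as the mono downmix scaled by a prediction coefficient, and left and right are the sum and difference of the two. The sketch below assumes the convention left = mono + diff and right = mono - diff, and takes the coefficient `alpha` as a given input; how it is derived from the spatial parameters is not stated in the abstract.

```python
def upmix(mono, alpha):
    """Predict the difference signal from the mono downmix and derive L/R.

    mono:  list of downmix samples
    alpha: prediction coefficient (assumed to be already derived from the
           spatial parameters; the derivation is not shown here)
    """
    diff = [alpha * m for m in mono]                  # predicted difference signal
    left = [m + d for m, d in zip(mono, diff)]        # sum of downmix and difference
    right = [m - d for m, d in zip(mono, diff)]       # difference of the two
    return left, right


left, right = upmix([1.0, -2.0], 0.5)
```

With `alpha = 0` the two channels are identical copies of the downmix; a positive `alpha` shifts energy toward the left channel under the sign convention assumed here.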
  • Patent number: 11017769
    Abstract: Systems, methods, and computer-readable media that may be used to modify a voice action system to include voice actions provided by advertisers or users are provided. One method includes receiving electronic voice action bids from advertisers to modify the voice action system to include a specific voice action (e.g., a triggering phrase and an action). One or more bids may be selected. The method includes, for each of the selected bids, modifying data associated with the voice action system to include the voice action associated with the bid, such that the action associated with the respective voice action is performed when voice input from a user is received that the voice action system determines to correspond to the triggering phrase associated with the respective voice action.
    Type: Grant
    Filed: April 3, 2019
    Date of Patent: May 25, 2021
    Assignee: GOOGLE LLC
    Inventor: Pedro J. Moreno Mengibar
  • Patent number: 10991362
    Abstract: Provided is a target speech signal extraction method for robust speech recognition including: receiving information on a direction of arrival of the target speech source with respect to the microphones; generating a nullformer by using the information on the direction of arrival of the target speech source to remove the target speech signal from the input signals and to estimate noise; setting a real output of the target speech source using an adaptive vector as a first channel and setting a dummy output by the nullformer as a remaining channel; setting a cost function for minimizing dependency between the real output of the target speech source and the dummy output using the nullformer by performing independent component analysis (ICA) or independent vector analysis (IVA); setting an auxiliary function to the cost function; and estimating the target speech signal by using the cost function and the auxiliary function.
    Type: Grant
    Filed: April 15, 2020
    Date of Patent: April 27, 2021
    Assignee: INDUSTRY-UNIVERSITY COOPERATION FOUNDATION SOGANG UNIVERSITY
    Inventors: Hyung Min Park, Seoyoung Lee, Seung-Yun Kim, Byung Joon Cho, Uihyeop Shin
  • Patent number: 10984799
    Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
    Type: Grant
    Filed: March 23, 2018
    Date of Patent: April 20, 2021
    Inventors: Stanislaw Ignacy Pasko, Michal Papierski, Maciej Makowski, Marcin Fuszara
  • Patent number: 10971156
    Abstract: A method, an apparatus, and a system for speech recognition are provided. A third-party application corresponding to a speech signal of a user can be determined according to the speech signal and by means of semantic analysis; and third-party application registry information is searched for and a third-party program is started, so that the user does not need to tap the third-party application to start the corresponding program, thereby providing more intelligent service for the user and facilitating use for the user.
    Type: Grant
    Filed: January 24, 2019
    Date of Patent: April 6, 2021
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Shanfu Li, Mingjie Dong
  • Patent number: 10891964
    Abstract: A User Equipment (UE) is operative to generate CN (Comfort Noise) control parameters, e.g., as part of audio-decoding processing by the UE. A buffer of a predetermined size implemented in the UE is configured to store CN parameters for SID (Silence Insertion Descriptor) frames and active hangover frames. Processing circuitry of the UE is configured to determine a CN parameter subset relevant for SID frames based on the age of the stored CN parameters and on residual energies, and use the determined CN parameter subset to determine CN control parameters for a first SID frame following an active signal frame.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: January 12, 2021
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventor: Tomas Jansson Toftgård
  • Patent number: 10878830
    Abstract: Described are an encoder for coding speech-like content and/or general audio content, wherein the encoder is configured to embed, at least in some frames, parameters in a bitstream, which parameters enhance a concealment in case an original frame is lost, corrupted or delayed, and a decoder for decoding speech-like content and/or general audio content, wherein the decoder is configured to use parameters which are sent later in time to enhance a concealment in case an original frame is lost, corrupted or delayed, as well as a method for encoding and a method for decoding.
    Type: Grant
    Filed: February 27, 2017
    Date of Patent: December 29, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Jérémie Lecomte, Benjamin Schubert, Michael Schnabel, Martin Dietz
  • Patent number: 10861210
    Abstract: Embodiments of the present disclosure can provide systems, methods, and computer-readable medium for providing audio and/or video effects based at least in part on facial features and/or voice feature characteristics of the user. For example, video and/or an audio signal of the user may be recorded by a device. Voice audio features and facial feature characteristics may be extracted from the voice audio signal and the video, respectively. The facial features of the user may be used to modify features of a virtual avatar to emulate the facial feature characteristics of the user. The extracted voice audio features may be modified to generate an adjusted audio signal or an audio signal may be composed from the voice audio features. The adjusted/composed audio signal may simulate the voice of the virtual avatar. A preview of the modified video/audio may be provided at the user's device.
    Type: Grant
    Filed: July 11, 2018
    Date of Patent: December 8, 2020
    Assignee: Apple Inc.
    Inventors: Carlos M. Avendano, Sean A. Ramprashad
  • Patent number: 10863270
    Abstract: A wearable computer is configured to use beamforming techniques to isolate a user's speech from extraneous audio signals occurring within a physical environment. A microphone array of the wearable computer may generate audio signal data from an utterance from a user's mouth. A motion sensor(s) of the wearable computer may generate motion data from movement of the wearable computer. This motion data may be used to determine a direction vector pointing from the wearable computer to the user's mouth, and a beampattern may be defined that has a beampattern direction in substantial alignment with the determined direction vector to focus the microphone array on the user's mouth for speech isolation.
    Type: Grant
    Filed: March 22, 2019
    Date of Patent: December 8, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Jeffrey Cornelius O'Neill, Amit Singh Chhetri, Philip Ryan Hilmes, Ramya Gopalan
  • Patent number: 10847184
    Abstract: An apparatus, system, and method allow a user to initiate a live stream that includes retrospectively captured images. Initiating the live stream includes providing provisional data that includes retrospectively captured images to a wireless radio.
    Type: Grant
    Filed: August 29, 2017
    Date of Patent: November 24, 2020
    Assignee: KNAPP INVESTMENT COMPANY LIMITED
    Inventor: Christopher A. Wiklof
  • Patent number: 10839819
    Abstract: Provided is an apparatus and method for encoding/decoding audio based on a block. A method of encoding an audio signal may include dividing each of the frames of an input signal that constitute an audio signal into a plurality of subframes; transforming the subframes to a frequency domain; determining a two-dimensional (2D) intra block using the subframes transformed to the frequency domain; and encoding the 2D intra block. The 2D intra block may be a block that two-dimensionally displays frequency coefficients of the subframes transformed to the frequency domain using a time and a frequency.
    Type: Grant
    Filed: March 21, 2017
    Date of Patent: November 17, 2020
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Jongmo Sung, Mi Suk Lee, Dae Young Jang, Jin Soo Choi
  • Patent number: 10834146
    Abstract: A UE includes an EUTRA-CMR reception unit that receives a codec mode request (EUTRA-CMR) including a codec mode that is determined by an eNB in accordance with a radio condition of the UE, a mode switching notification unit that notifies an encoder of switching to the codec mode included in the received codec mode request; and a mode switching acknowledgement unit that transmits a response message to the eNB when confirming that the encoder switches the codec mode.
    Type: Grant
    Filed: September 7, 2018
    Date of Patent: November 10, 2020
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Takako Hori, Prateek Basu Mallick, Hidetoshi Suzuki, Ayako Horiuchi, Joachim Loehr
  • Patent number: 10803241
    Abstract: Systems and methods for text normalization in a plurality of noisy channels receive a text entry and channel origin data of the text entry; determine whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry; if the text entry is determined to have a matching IV entry, output the matching IV entry, and if the text entry is determined to be an OOV entry, implement a channel-specific error-type adapter framework based on the channel origin data, wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated; normalize the text entry using the channel-specific error-type adapter framework; and output one or more candidate normalized forms of the text entry.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: October 13, 2020
    Assignee: NICE LTD.
    Inventors: Hila Weisman, Peter Izsak, Inna Achlow, Victor Shafran
  • Patent number: 10779769
    Abstract: There is described a method for evaluating a level of noise in a biosignal, the method comprising: receiving a time signal representative of a biological activity, the time signal comprising a biological activity component and a noise component; determining a modulation spectrum for the time signal, the modulation spectrum representing a signal frequency as a function of a modulation frequency; from the modulation spectrum determining a first amount of modulation energy corresponding to the biological activity component and a second amount of modulation energy corresponding to the noise component; determining an indication of the level of noise using the first and second amounts of modulation energy; and outputting the indication of the level of noise.
    Type: Grant
    Filed: February 17, 2015
    Date of Patent: September 22, 2020
    Assignee: INSTITUT NATIONAL DE LA RECHERCHE SCIENTIFIQUE
    Inventors: Tiago Henrique Falk, Diana Patricia Tobon Vallejo, Martin Maier
  • Patent number: 10783630
    Abstract: The present invention relates to a collision force analysis (CFA) method for obtaining in vivo estimates of contact force and pressure in the vocal cords directly from laryngeal videoendoscopies. The method comprises the steps of: providing at least a high-speed laryngeal videoendoscopy (also called High Speed Videoendoscopy or HSV) to generate at least one image and videos of the vocal cords; pre-processing the image in a processing unit, to define a region of interest (ROI) of the location of the glottis; detecting in the processing unit, the edges of the vocal folds in the images obtained by means of the HSV; recording in the processing unit, the points of the edges detected by means of a sequence of images; estimating in the processing unit, the path of the vocal cord edge during collision throughout time; and estimating the values of contact and impact of the vocal cords by means of a collision model.
    Type: Grant
    Filed: January 14, 2019
    Date of Patent: September 22, 2020
    Assignee: UNIVERSIDAD TECNICA FEDERICO SANTA MARIA
    Inventors: Matías Zañartu Salas, Manuel Esteban Diaz Cadiz
  • Patent number: 10755704
    Abstract: An information processing apparatus identifies, by using an audio signal acquired by collecting a user's voice, evaluation target time that includes at least either time not including the user's voice or time during which the user is producing a meaningless utterance and produces an output appropriate to the identified evaluation target time.
    Type: Grant
    Filed: September 8, 2016
    Date of Patent: August 25, 2020
    Assignee: SONY INTERACTIVE ENTERTAINMENT INC.
    Inventors: Shinichi Honda, Hiroyuki Segawa, Shinichi Kariya
  • Patent number: 10741195
    Abstract: A first signal weighting processor outputs a weighted signal obtained by performing a weighting on part of an input signal representing a feature of a target signal included in the input signal. A neural network processor outputs an enhancement signal for the target signal by using a coupling coefficient. An inverse filter cancels the weighting on the feature representation of the target signal in the enhancement signal. A second signal weighting processor outputs a weighted signal obtained by performing a weighting on part of a supervisory signal representing a feature of a target signal. An error evaluator outputs a coupling coefficient to have a value indicating that a learning error between the weighted signal output from the second signal weighting processor and the output signal of the neural network processor is less than or equal to a set value.
    Type: Grant
    Filed: February 15, 2016
    Date of Patent: August 11, 2020
    Assignee: MITSUBISHI ELECTRIC CORPORATION
    Inventor: Satoru Furuta
  • Patent number: 10742475
    Abstract: Method, apparatus and systems for object tracking are disclosed. In one example, a system for tracking a plurality of objects in a venue is disclosed. The system comprises a transmitter configured for transmitting a series of probe signals in a broadcasting manner through a wireless multipath channel, wherein the wireless multipath channel is impacted by a movement of at least one of the plurality of objects in the venue; and a plurality of heterogeneous target wireless receivers each of which is associated with an object of the plurality of objects in the venue.
    Type: Grant
    Filed: June 19, 2019
    Date of Patent: August 11, 2020
    Assignee: ORIGIN WIRELESS, INC.
    Inventors: Hung-Quoc Duc Lai, David N. Claffey, Chen Chen, Feng Zhang, Chenshu Wu, Chau-Wai Wong, Beibei Wang, Oscar Chi-Lim Au, K. J. Ray Liu
  • Patent number: 10734007
    Abstract: A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.
    Type: Grant
    Filed: January 17, 2018
    Date of Patent: August 4, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Martin Dietz, Eleni Fotopoulou, Jérémie Lecomte, Markus Multrus, Benjamin Schubert
  • Patent number: 10732834
    Abstract: A computer-implemented method for management of voicemail messages, performed at a portable electronic device with a touch screen display, includes: displaying a list of voicemail messages; detecting selection by a user of a respective voicemail message in the list; responding to the user selection of the respective voicemail message by initiating playback of the user-selected voicemail message; displaying a progress bar for the user-selected voicemail message, wherein the progress bar indicates the portion of the user-selected voicemail message that has been played; detecting movement of a finger of the user from a first position on the progress bar to a second position on the progress bar; and responding to the detection of the finger movement by restarting playback of the user-selected voicemail message at a position within the user-selected voicemail message corresponding substantially to the second position on the progress bar.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: August 4, 2020
    Assignee: Apple Inc.
    Inventors: Freddy Allen Anzures, Gregory N. Christie, Scott Forstall, Gregory Novick, Steven P. Jobs, Imran Chaudhri, Stephen O. Lemay, Patrick L. Coffman, Elizabeth Caroline Cranfill
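The last step in the abstract of patent 10732834, restarting playback at the message position that corresponds to the finger's second position on the progress bar, amounts to a linear mapping from a touch coordinate to a time offset. The sketch below is a hypothetical illustration; the function and parameter names are not from the patent, and clamping to the ends of the bar is an added assumption.

```python
def seek_time(touch_x, bar_x, bar_width, duration_s):
    """Map a horizontal touch position on a progress bar to a playback time.

    touch_x:    x coordinate where the finger ends up
    bar_x:      x coordinate of the bar's left edge
    bar_width:  width of the bar in the same units
    duration_s: length of the voicemail message in seconds
    """
    fraction = (touch_x - bar_x) / bar_width
    fraction = max(0.0, min(1.0, fraction))  # clamp drags past either end
    return fraction * duration_s


restart_at = seek_time(touch_x=150, bar_x=100, bar_width=200, duration_s=40.0)
```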
  • Patent number: 10726828
    Abstract: A method, computer system, and a computer program product for generating a plurality of voice data having a particular speaking style is provided. The present invention may include preparing a plurality of original voice data corresponding to at least one word or at least one phrase. The present invention may also include attenuating a low frequency component and a high frequency component in the prepared plurality of original voice data. The present invention may then include reducing power at a beginning and an end of the prepared plurality of original voice data. The present invention may further include storing a plurality of resultant voice data obtained after the attenuating and the reducing.
    Type: Grant
    Filed: May 31, 2017
    Date of Patent: July 28, 2020
    Assignee: International Business Machines Corporation
    Inventors: Takashi Fukuda, Osamu Ichikawa, Gakuto Kurata, Masayuki Suzuki
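The two signal operations named in the abstract of patent 10726828 (attenuating low and high frequency components, then reducing power at the beginning and end) can be sketched with a crude band-limiting filter and a linear edge taper. The one-pole filters, the smoothing constants, and the taper length below are assumptions chosen for brevity; the abstract does not say which filters or envelopes are used.

```python
def lowpass(x, a):
    """One-pole low-pass filter; a in (0, 1], smaller a means a lower cutoff."""
    y, prev = [], 0.0
    for s in x:
        prev += a * (s - prev)
        y.append(prev)
    return y


def band_limit(x, a_low_cut=0.05, a_high_cut=0.5):
    """Attenuate low frequencies (subtract a slow average), then high ones."""
    slow = lowpass(x, a_low_cut)
    no_lows = [s - l for s, l in zip(x, slow)]
    return lowpass(no_lows, a_high_cut)


def taper_edges(x, n_edge):
    """Reduce power at the beginning and end with linear ramps of n_edge samples."""
    n = len(x)
    return [s * min(1.0, i / n_edge, (n - 1 - i) / n_edge) for i, s in enumerate(x)]


def restyle(x, n_edge=50):
    return taper_edges(band_limit(x), n_edge)


# A constant (0 Hz) input is the extreme low-frequency case.
styled = restyle([1.0] * 400)
```

In this sketch the constant input is almost entirely removed by the low-cut stage, and the first and last samples are forced to zero by the taper.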
  • Patent number: 10715522
    Abstract: An online system authenticates a user through a voiceprint biometric verification process. When a user needs to be authenticated, the online system generates and provides a random phrase to the user. The online system receives an audio recording of the randomly generated phrase and retrieves a previously trained voiceprint model for the user. The online system analyzes the audio recording by applying the voiceprint model to determine whether the audio recording satisfies a first criterion of whether the voice in the audio recording belongs to the user and a second criterion of whether the audio recording includes a vocalization of the randomly generated phrase. If the audio recording satisfies both criteria, the online system authenticates the user. Therefore, the user can be provided access to a new communication session in response to being authenticated.
    Type: Grant
    Filed: January 31, 2018
    Date of Patent: July 14, 2020
    Assignee: salesforce.com
    Inventor: Eugene Lew
  • Patent number: 10694262
    Abstract: An apparatus includes a capture device and a processor. The capture device may be configured to generate video frames of an environment near a vehicle. The processor may be configured to receive video frames from the capture device, perform video operations to detect objects in the video frames, determine advertisement locations in the video frames based on the objects detected in the video frames, insert advertisements at the advertisement locations and generate video data for a display. The video data may comprise the video frames with the inserted advertisements.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: June 23, 2020
    Assignee: Ambarella International LP
    Inventors: Greg Hedman, Shimon Pertsel
  • Patent number: 10692068
    Abstract: Tokens are assigned to data to be processed. Each token is associated with a unique sound waveform defined by at least two contemporaneous tones of different frequency. The waveforms can be emitted as sound at computing devices having speakers. Other computing devices have microphones and capture emitted sound to extract tokens from the waveforms. Extracted tokens can be taken as confirmation that the data is to be processed. Data can represent transactions.
    Type: Grant
    Filed: August 16, 2016
    Date of Patent: June 23, 2020
    Assignee: SOUNDPAYS INC.
    Inventors: Peter Misek, Jason Squire
  • Patent number: 10672404
    Abstract: An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients.
    Type: Grant
    Filed: May 2, 2018
    Date of Patent: June 2, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Goran Markovic, Ralph Sperschneider, Jérémie Lecomte, Christian Helmrich
  • Patent number: 10657948
    Abstract: A system, method and program product for generating sound masking in an open-plan space. A method is disclosed that includes: establishing a set of acoustic criteria for the space that specifies minimum output levels at a set of specified frequencies; mixing sound samples to create an audio output stream for use as sound masking when broadcast over at least one speaker using a power amplifier; and processing the audio output stream, wherein the processing includes: analyzing the audio output stream with a spectrum analyzer to determine if the minimum output levels at the set of specified frequencies are met; and level adjusting the audio output stream with an equalizer to ensure that minimum output levels at the set of specified frequencies are met but do not exceed a given threshold above the minimum level.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: May 19, 2020
    Assignee: Rensselaer Polytechnic Institute
    Inventors: Jonas Braasch, Alana Gloria DeLoach, Jeffrey Parkman Carter
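The level-adjusting step in the abstract above can be pictured with a minimal sketch. It assumes levels are expressed in dB per frequency; the function name `masking_gains_db` and its parameters are hypothetical and are not taken from the patent.

```python
def masking_gains_db(measured_db, minimum_db, headroom_db):
    """Return a per-frequency gain in dB.

    After applying the gain, each level lies between the required
    minimum and the minimum plus the allowed headroom.
    """
    gains = {}
    for freq, minimum in minimum_db.items():
        level = measured_db[freq]
        ceiling = minimum + headroom_db
        if level < minimum:
            gains[freq] = minimum - level   # boost up to the minimum
        elif level > ceiling:
            gains[freq] = ceiling - level   # cut down to the ceiling
        else:
            gains[freq] = 0.0               # already inside the window
    return gains


# Hypothetical measurements at three frequencies (Hz -> dB)
measured = {250: 40.0, 1000: 47.0, 4000: 55.0}
minimum = {250: 45.0, 1000: 45.0, 4000: 45.0}
gains = masking_gains_db(measured, minimum, headroom_db=5.0)
```

In this sketch the spectrum analyzer is represented only by the `measured` dictionary; how the levels are actually measured is outside its scope.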
  • Patent number: 10659787
    Abstract: Techniques are generally described for enhanced compression of video data. In various examples, the techniques may include receiving first video data representing a scene in an environment. The techniques may further include generating illumination map data representing illumination of the scene in the first video data. The techniques may further comprise generating reflectance map data representing a reflectance of at least one object in the first video data. In some examples, the techniques may include sending, to a second computing device, the illumination map data and the reflectance map data. The techniques may further include receiving second video data representing the scene. The techniques may include determining a first illumination difference between the second video data and the first video data. The techniques may comprise sending, to the second computing device, the first illumination difference.
    Type: Grant
    Filed: September 20, 2018
    Date of Patent: May 19, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ilya Vladimirovich Brailovskiy, Raghavan Manmatha
  • Patent number: 10658005
    Abstract: Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: May 19, 2020
    Assignee: Neon Evolution Inc.
    Inventors: Carl Davis Bogan, III, Kenneth Michael Lande, Jacob Myles Laser, Brian Sung Lee, Cody Gustave Berlin
  • Patent number: 10651827
    Abstract: Aspects of the disclosure include an apparatus that has a first clock generator and a second clock generator. The first clock generator is configured to drive a first circuit, causing the first circuit to (i) receive a signal corresponding to an audio input, and (ii) determine whether an energy level of the signal exceeds a predetermined threshold. The second clock generator is activated when the first circuit determines that the energy level of the signal exceeds the predetermined threshold. The second clock generator is configured to drive a second circuit, causing the second circuit to determine whether the signal matches a predetermined pattern. A third circuit is activated when the second circuit determines that the signal matches the predetermined pattern.
    Type: Grant
    Filed: November 30, 2016
    Date of Patent: May 12, 2020
    Assignee: MARVELL ASIA PTE, LTD.
    Inventors: Yafei Hu, Liu Han, Kapil Jain, Jin Xie
  • Patent number: 10629207
    Abstract: A cache associated with a voice recognition engine may be configured to receive an audio file corresponding to a voice query. The voice query may be received by the voice recognition engine from a user device in response to receipt of the voice query at the user device. The cache may be configured to determine an audio fingerprint based on the audio file and to determine whether the audio fingerprint corresponds to one of a plurality of audio transcriptions stored in the cache. The audio transcriptions may correspond to popular voice queries received at the voice recognition engine and processed by a server capable of performing automated speech recognition. In response to determining that the audio fingerprint corresponds to a given one of the stored audio transcriptions, the audio file may be processed based on the stored audio transcription.
    Type: Grant
    Filed: July 13, 2017
    Date of Patent: April 21, 2020
    Assignee: COMCAST CABLE COMMUNICATIONS, LLC
    Inventors: Rui Min, Hongcheng Wang
  • Patent number: 10586549
    Abstract: A method of determining a distribution of bits for coding a transition frame, said method being implemented in a coder/decoder for coding/decoding a digital signal, the transition frame being preceded by a predictive coded preceding frame, coding the transition frame comprising transform coding and predictive coding a single sub-frame of the transition frame, the method comprising the following steps: assigning a bit rate for predictive coding the transition sub-frame, said bit rate being equal to the minimum between the bit rate for transform coding the transition frame and a first predetermined bit rate value; determining a first number of bits allocated for predictive coding the transition sub-frame for said bit rate; and calculating a second number of bits allocated for transform coding the transition frame from the first number of bits and a number of bits available for coding the transition frame.
    Type: Grant
    Filed: July 27, 2015
    Date of Patent: March 10, 2020
    Assignee: ORANGE
    Inventors: Stephane Ragot, Julien Faure
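The three steps of the bit distribution described above can be sketched as follows. This is an illustration only: the abstract does not say how a bit rate maps to a number of bits, so the sketch assumes bits = rate × sub-frame duration, and the function name and all numeric values are hypothetical.

```python
def allocate_transition_bits(transform_rate_bps, max_predictive_rate_bps,
                             subframe_ms, bits_available):
    """Split the bits of a transition frame between the two coders."""
    # Step 1: predictive rate is the minimum of the transform rate
    # and a predetermined ceiling.
    predictive_rate = min(transform_rate_bps, max_predictive_rate_bps)
    # Step 2: bits for the predictively coded sub-frame at that rate
    # (assumed mapping: rate times duration, in integer arithmetic).
    predictive_bits = predictive_rate * subframe_ms // 1000
    # Step 3: the transform coder gets whatever is left in the frame.
    transform_bits = bits_available - predictive_bits
    if transform_bits < 0:
        raise ValueError("sub-frame needs more bits than the frame has")
    return predictive_rate, predictive_bits, transform_bits


# Hypothetical figures: a 328-bit frame with a 5 ms transition sub-frame
result = allocate_transition_bits(16400, 12800, 5, 328)
```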
  • Patent number: 10541746
    Abstract: A system for testing line attenuation defects includes a data transmission line configured to transmit a forward signal in a first direction, at least one reflection point at a first location along the data transmission line, a test probe configured to (i) electrically contact a center conductor at a second location along the data transmission line, (ii) introduce a broadband data signal onto the data transmission line, and (iii) measure, at the second location, a return signal from the reflection point, and a spectrum capturing device in operable contact with the test probe. The spectrum capturing device is configured to (i) collect and arrange frequency data measured by the test probe for the test signal, the return signal, and a standing wave created by the sum of the broadband data signal and the return signal, (ii) determine the voltage standing wave ratio (VSWR) of the standing wave, and (iii) calculate a line loss from the VSWR.
    Type: Grant
    Filed: June 27, 2017
    Date of Patent: January 21, 2020
    Assignee: Cable Television Laboratories, Inc.
    Inventors: Thomas H. Williams, Ryan Richard Vail
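One way to read the final step of the abstract above (line loss from VSWR) is through the standard relation between VSWR and the reflection coefficient, |Γ| = (VSWR − 1)/(VSWR + 1). The sketch below rests on assumptions that are not stated in the abstract: the reflection point reflects all incident power (an open or short), and the only attenuation is the line itself, traversed twice. The function name is hypothetical.

```python
import math


def one_way_line_loss_db(vswr):
    """Estimate one-way line loss in dB from a measured VSWR.

    Assumes a fully reflective termination, so the measured return
    loss equals twice the one-way loss of the line.
    """
    if vswr <= 1.0:
        raise ValueError("VSWR must exceed 1 for a measurable reflection")
    gamma = (vswr - 1.0) / (vswr + 1.0)        # reflection coefficient magnitude
    return_loss_db = -20.0 * math.log10(gamma)  # round-trip attenuation
    return return_loss_db / 2.0                 # forward path only


loss = one_way_line_loss_db(3.0)  # VSWR 3 gives |gamma| = 0.5
```

A VSWR of 3 corresponds to a return loss of about 6.02 dB, so the sketch reports roughly 3.01 dB of one-way loss under these assumptions.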
  • Patent number: 10535358
    Abstract: An apparatus and a method to encode and decode a speech signal using an encoding mode are provided. An encoding apparatus may select an encoding mode of a frame included in an input speech signal, and encode a frame having an unvoiced mode for an unvoiced speech as the selected encoding mode.
    Type: Grant
    Filed: February 8, 2018
    Date of Patent: January 14, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho Sang Sung, Ki Hyun Choo, Jung Hoe Kim, Eun Mi Oh
  • Patent number: 10522170
    Abstract: A method for acquiring the number of modified frames for active sound, and a method and apparatus for voice activity detection are disclosed. Firstly, a first voice activity detection decision result and a second voice activity detection decision result are obtained (501), the number of hangover frames for active sound is obtained (502), and the number of background noise updates is obtained (503), and then the number of modified frames for active sound is calculated according to the first voice activity detection decision result, the number of background noise updates and the number of hangover frames for active sound (504), and finally, a voice activity detection decision result of a current frame is calculated according to the number of modified frames for active sound and the second voice activity detection decision result (505).
    Type: Grant
    Filed: November 5, 2015
    Date of Patent: December 31, 2019
    Assignee: ZTE CORPORATION
    Inventors: Changbao Zhu, Hao Yuan
  • Patent number: 10510351
    Abstract: A method of encoding samples in a digital signal is provided that includes receiving a frame of N samples of the digital signal, determining L possible distinct data values in the N samples, determining a reference data value in the L possible distinct data values and a coding order of L−1 remaining possible distinct data values, wherein each of the L−1 remaining possible distinct data values is mapped to a position in the coding order, decomposing the N samples into L−1 coding vectors based on the coding order, wherein each coding vector identifies the locations of one of the L−1 remaining possible distinct data values in the N samples, and encoding the L−1 coding vectors.
    Type: Grant
    Filed: February 8, 2016
    Date of Patent: December 17, 2019
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Lorin Paul Netsch, Jacek Piotr Stachurski
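The decomposition described above can be illustrated with a small sketch. Several choices here are assumptions rather than details from the abstract: the reference value is taken to be the most frequent value in the frame, the coding order is descending frequency, and each coding vector is a full-length binary indicator. The final entropy coding of the vectors is not shown, and the function names are hypothetical.

```python
from collections import Counter


def decompose(samples):
    """Split a frame into a reference value and one indicator vector
    per remaining distinct value."""
    counts = Counter(samples)
    # Most frequent value first; ties broken by value for determinism.
    ordered = [v for v, _ in sorted(counts.items(),
                                    key=lambda kv: (-kv[1], kv[0]))]
    reference, coding_order = ordered[0], ordered[1:]
    # Each vector marks where one non-reference value occurs.
    vectors = [[1 if s == v else 0 for s in samples] for v in coding_order]
    return reference, coding_order, vectors


def reconstruct(reference, coding_order, vectors, n):
    """Rebuild the frame: start from the reference value everywhere,
    then overwrite the marked positions."""
    out = [reference] * n
    for value, vec in zip(coding_order, vectors):
        for i, bit in enumerate(vec):
            if bit:
                out[i] = value
    return out


frame = [0, 0, 1, 0, 2, 1, 0, 0]
reference, order, vectors = decompose(frame)
restored = reconstruct(reference, order, vectors, len(frame))
```

Because the reference value needs no vector of its own, a frame dominated by one value yields sparse indicator vectors, which is what makes the later encoding step worthwhile.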
  • Patent number: 10504540
    Abstract: The present invention relates to audio encoding and, more particularly, to a signal classifying method and device, and an audio encoding method and device using the same, which can reduce a delay caused by an encoding mode switching while improving the quality of reconstructed sound. The signal classifying method may comprise the operations of: classifying a current frame into one of a speech signal and a music signal; determining, on the basis of a characteristic parameter obtained from multiple frames, whether a result of the classifying of the current frame includes an error; and correcting the result of the classifying of the current frame in accordance with a result of the determination. By correcting an initial classification result of an audio signal on the basis of a correction parameter, the present invention can determine an optimum coding mode for the characteristic of an audio signal and can prevent frequent coding mode switching between frames.
    Type: Grant
    Filed: October 1, 2018
    Date of Patent: December 10, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Anton Viktorovich Porov, Konstantin Sergeevich Osipov
  • Patent number: 10489636
    Abstract: The application discloses a lip movement capturing method and device and a storage medium. The method includes: acquiring a real-time image shot by a photographic device and extracting a real-time facial image from the real-time image; inputting the real-time facial image into a pretrained lip average model and recognizing t lip feature points representative of positions of lips in the real-time facial image; and calculating a movement direction and movement distance of the lips in the real-time facial image according to x and y coordinates of the t lip feature points in the real-time facial image. According to the application, movement information of the lips in the real-time facial image is calculated according to the coordinates of the lip feature points to implement real-time capturing of movements of the lips.
    Type: Grant
    Filed: October 31, 2017
    Date of Patent: November 26, 2019
    Assignee: Ping An Technology (Shenzhen) Co., Ltd.
    Inventors: Lin Chen, Guohui Zhang
  • Patent number: 10484218
    Abstract: A receiver for demodulating a pulse width modulated (“PWM”) signal comprises: a voltage level shifter for shifting the PWM signal to predefined transistor voltage levels; a half-rate PWM decoder for receiving the shifted PWM signal; and a 2-bit-to-N-bit deserializer. The half-rate PWM decoder comprises a first decoder core, a second decoder core, a controller, and a sampler and retiming circuit. The first decoder core and the second decoder core are configured to decode alternating periods of the shifted PWM signal. The controller is coupled to the first decoder core, the second decoder core, and the sampler and retiming circuit. The retiming circuit is configured to receive clock signals from the controller and to output half-rate even data from the first decoder core and half-rate odd data from the second decoder core. Outputs of the retiming circuit and an output of the controller are coupled to inputs of the deserializer.
    Type: Grant
    Filed: February 23, 2018
    Date of Patent: November 19, 2019
    Assignee: Invecas, Inc.
    Inventors: Siva Kumar Rapina, Saravana Kumar Durairaj