Patents Examined by Michael N. Opsasnick
-
Patent number: 11437050
Abstract: Techniques are described for coding audio signals. For example, using a neural network, a residual signal is generated for a sample of an audio signal based on inputs to the neural network. The residual signal is configured to excite a long-term prediction filter and/or a short-term prediction filter. Using the long-term prediction filter and/or the short-term prediction filter, a sample of a reconstructed audio signal is determined. The sample of the reconstructed audio signal is determined based on the residual signal generated using the neural network for the sample of the audio signal.
Type: Grant
Filed: December 10, 2019
Date of Patent: September 6, 2022
Assignee: QUALCOMM Incorporated
Inventors: Zisis Iason Skordilis, Vivek Rajendran, Guillaume Konrad Sautière, Daniel Jared Sinder
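As a rough illustration of how a residual signal can excite a short-term prediction filter, the sketch below runs a plain all-pole (LPC-style) synthesis recursion. It is not the patented method: the residual is supplied as a list rather than produced by a neural network, the long-term filter is omitted, and the names `synthesize` and `lpc_coeffs` are hypothetical.

```python
def synthesize(residual, lpc_coeffs):
    """Reconstruct samples with s[n] = e[n] + sum_k a[k] * s[n - k].

    residual   -- excitation samples e[n] (assumed here to be given;
                  in the abstract they come from a neural network)
    lpc_coeffs -- short-term predictor coefficients a[1..p] (hypothetical)
    """
    out = []
    for n, e in enumerate(residual):
        # Predict the current sample from previously reconstructed samples.
        prediction = sum(
            a * out[n - k]
            for k, a in enumerate(lpc_coeffs, start=1)
            if n - k >= 0
        )
        out.append(e + prediction)
    return out


if __name__ == "__main__":
    # A unit impulse through a one-tap predictor decays geometrically.
    print(synthesize([1.0, 0.0, 0.0, 0.0], [0.5]))
```

Each output sample depends only on the current residual value and earlier outputs, which is why the reconstruction can proceed one sample at a time.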
-
Patent number: 11437047
Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for controlling a concealment method for a lost audio frame of a received audio signal. A method for a decoder of concealing a lost audio frame comprises detecting in a property of the previously received and reconstructed audio signal, or in a statistical property of observed frame losses, a condition for which the substitution of a lost frame provides relatively reduced quality. In case such a condition is detected, the concealment method is modified by selectively adjusting a phase or a spectrum magnitude of a substitution frame spectrum.
Type: Grant
Filed: December 19, 2019
Date of Patent: September 6, 2022
Assignee: Telefonaktiebolaget L M Ericsson (publ)
Inventors: Stefan Bruhn, Jonas Svedberg
-
Patent number: 11437049
Abstract: A device for signal processing includes a memory and a processor. The memory is configured to store a parameter associated with a bandwidth-extended audio stream. The processor is configured to select a plurality of non-linear processing functions based at least in part on a value of the parameter. The processor is also configured to generate a high-band excitation signal based on the plurality of non-linear processing functions.
Type: Grant
Filed: October 28, 2020
Date of Patent: September 6, 2022
Assignee: QUALCOMM Incorporated
Inventors: Venkatraman Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam
-
Patent number: 11423886
Abstract: The intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact.
Type: Grant
Filed: May 20, 2020
Date of Patent: August 23, 2022
Assignee: Apple Inc.
Inventors: Thomas Robert Gruber, Adam John Cheyer, Dag Kittlaus, Didier Rene Guzzoni, Christopher Dean Brigham, Richard Donald Giuli, Marcello Bastea-Forte, Harry Joseph Saddler
-
Patent number: 11417338
Abstract: An electronic apparatus and method of controlling the electronic apparatus are provided. The electronic apparatus includes a communicator, a storage storing information on places wherein Internet of Things (IoT) devices are located, and a processor configured to, based on receiving a control signal for controlling an IoT device located in a specific place through the communicator, control the IoT device located in the specific place based on information on the place stored in the storage. The processor is further configured to receive motion information generated based on a motion of a wearable device from the wearable device, identify a place corresponding to the motion information, and store the identified place as information on a place of an IoT device located within a predetermined distance from the wearable device, in the storage.
Type: Grant
Filed: July 26, 2019
Date of Patent: August 16, 2022
Assignee: Samsung Electronics Co., Ltd.
Inventors: Seongil Hahm, Taejun Kwon, Venkatraman Iyer, Daesung An
-
Patent number: 11380333
Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
Type: Grant
Filed: December 4, 2019
Date of Patent: July 5, 2022
Assignee: Verint Systems Inc.
Inventors: Omer Ziv, Ran Achituv, Ido Shapira, Jeremie Dreyfuss
-
Patent number: 11373639
Abstract: A speech recognition system successively processes each encoder state of encoded acoustic features with frame-synchronous decoder (FSD) and label-synchronous decoder (LSD) modules. Upon identifying an encoder state carrying information about new transcription output, the system expands a current list of FSD prefixes with the FSD module, evaluates the FSD prefixes with the LSD module, and prunes the FSD prefixes according to joint FSD and LSD scores. The FSD and LSD modules are synchronized by having the LSD module process the portion of the encoder states including new transcription output identified by the FSD module and produce LSD scores for the FSD prefixes determined by the FSD module.
Type: Grant
Filed: December 12, 2019
Date of Patent: June 28, 2022
Assignee: Mitsubishi Electric Research Laboratories, Inc.
Inventors: Niko Moritz, Takaaki Hori, Jonathan Le Roux
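The pruning step in the abstract above can be pictured with a small beam-search sketch. The abstract does not say how the joint score is formed, so the linear interpolation of log scores used here is an assumption, and `prune_prefixes`, `weight`, and `beam` are hypothetical names.

```python
def prune_prefixes(prefixes, fsd_scores, lsd_scores, weight=0.5, beam=2):
    """Keep the `beam` best prefixes under a joint score.

    Assumption: joint = weight * FSD log score + (1 - weight) * LSD log score.
    The abstract only states that pruning uses joint FSD and LSD scores.
    """
    joint = {
        p: weight * fsd_scores[p] + (1.0 - weight) * lsd_scores[p]
        for p in prefixes
    }
    # Higher (less negative) log scores are better.
    return sorted(prefixes, key=lambda p: joint[p], reverse=True)[:beam]


if __name__ == "__main__":
    prefixes = ["a", "ab", "b"]
    fsd = {"a": -1.0, "ab": -2.0, "b": -3.0}
    lsd = {"a": -3.0, "ab": -1.0, "b": -3.0}
    print(prune_prefixes(prefixes, fsd, lsd))
```

In this toy input the prefix "ab" ranks first only once the LSD score is taken into account, which is the kind of re-ranking a joint score allows.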
-
Patent number: 11367450
Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.
Type: Grant
Filed: December 4, 2019
Date of Patent: June 21, 2022
Assignee: Verint Systems Inc.
Inventors: Omer Ziv, Ran Achituv, Ido Shapira, Jeremie Dreyfuss
-
Patent number: 11355134
Abstract: A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.
Type: Grant
Filed: October 2, 2020
Date of Patent: June 7, 2022
Assignee: AUDIOSHAKE, INC.
Inventor: Luke Miner
-
Patent number: 11341972
Abstract: In one aspect, a method comprises accessing audio data generated by a computing device based on audio input from a user, the audio data encoding one or more user utterances. The method further comprises generating a first transcription of the utterances by performing speech recognition on the audio data using a first speech recognizer that employs a language model based on user-specific data. The method further comprises generating a second transcription of the utterances by performing speech recognition on the audio data using a second speech recognizer that employs a language model independent of user-specific data. The method further comprises determining that the second transcription of the utterances includes a term from a predefined set of one or more terms. The method further comprises, based on determining that the second transcription of the utterance includes the term, providing an output of the first transcription of the utterance.
Type: Grant
Filed: October 22, 2020
Date of Patent: May 24, 2022
Assignee: Google LLC
Inventors: Alexander H. Gruenstein, Petar Aleksic
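The selection logic in the abstract above can be sketched in a few lines. Both transcriptions are passed in as plain strings, so no speech recognizer is involved; whitespace tokenization is an assumption, and `choose_output` and `trigger_terms` are hypothetical names. The abstract does not say what is output when no term matches, so this sketch returns `None` in that case.

```python
def choose_output(first_transcription, second_transcription, trigger_terms):
    """Return the first (user-specific) transcription when the second
    (user-independent) transcription contains a predefined term.

    Assumption: terms are single words matched case-insensitively after
    splitting on whitespace. The no-match behavior is not specified in
    the abstract, so None is returned.
    """
    words = set(second_transcription.lower().split())
    terms = {t.lower() for t in trigger_terms}
    if words & terms:
        return first_transcription
    return None


if __name__ == "__main__":
    print(choose_output("call Zsófia mobile", "call sofia mobile", {"call"}))
```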
-
Patent number: 11322154
Abstract: Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. At least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcribed customer service interaction.
Type: Grant
Filed: December 4, 2019
Date of Patent: May 3, 2022
Assignee: Verint Systems Inc.
Inventors: Omer Ziv, Ran Achituv, Ido Shapira, Jeremie Dreyfuss
-
Patent number: 11315554
Abstract: Methods, systems, and media for connecting an IoT device to a call are provided. In some embodiments, a method is provided, the method comprising: establishing, at a first end-point device, a telecommunication channel with a second end-point device; subsequent to establishing the telecommunication channel, and prior to a termination of the telecommunication channel, detecting, using the first end-point device, a voice command that includes a keyword; and in response to detecting the voice command, causing information associated with an IoT device that corresponds to the keyword to be transmitted to the second end-point device.
Type: Grant
Filed: March 15, 2018
Date of Patent: April 26, 2022
Assignee: Google LLC
Inventors: Saptarshi Bhattacharya, Shreedhar Madhavapeddi
-
Patent number: 11314481
Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from the device action data set based on content of the audio signal. The data processing system can then identify, from the device action data set, a command associated with the device action and send the command to the electronic device for execution.
Type: Grant
Filed: May 7, 2018
Date of Patent: April 26, 2022
Assignee: GOOGLE LLC
Inventors: Bo Wang, Venkat Kotla, Chad Yoshikawa, Chris Ramsdale, Pravir Gupta, Alfonso Gomez-Jordana, Kevin Yeun, Jae Won Seo, Lantian Zheng, Sang Soo Sung
-
Patent number: 11308969
Abstract: A method performed in an audio decoder for decoding M encoded audio channels representing N audio channels is disclosed. The method includes receiving a bitstream containing the M encoded audio channels and a set of spatial parameters, decoding the M encoded audio channels, and extracting the set of spatial parameters from the bitstream. The method also includes analyzing the M audio channels to detect a location of a transient, decorrelating the M audio channels, and deriving N audio channels from the M audio channels and the set of spatial parameters. A first decorrelation technique is applied to a first subset of each audio channel and a second decorrelation technique is applied to a second subset of each audio channel. The first decorrelation technique represents a first mode of operation of a decorrelator, and the second decorrelation technique represents a second mode of operation of the decorrelator.
Type: Grant
Filed: October 5, 2020
Date of Patent: April 19, 2022
Assignee: Dolby Laboratories Licensing Corporation
Inventor: Mark F. Davis
-
Patent number: 11302309
Abstract: A technique for aligning spike timing of models is disclosed. A first model having a first architecture trained with a set of training samples is generated. Each training sample includes an input sequence of observations and an output sequence of symbols having different length from the input sequence. Then, one or more second models are trained with the trained first model by minimizing a guide loss jointly with a normal loss for each second model and a sequence recognition task is performed using the one or more second models. The guide loss evaluates dissimilarity in spike timing between the trained first model and each second model being trained.
Type: Grant
Filed: September 13, 2019
Date of Patent: April 12, 2022
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Gakuto Kurata, Kartik Audhkhasi
-
Patent number: 11301642
Abstract: One general aspect includes a system to translate language exhibited on a publicly viewable sign, the system including: a memory configured to include one or more executable instructions and a processor configured to execute the executable instructions, where the executable instructions enable the processor to carry out the steps of: reviewing the sign; translating relevant information conveyed on the sign from a first language to a second language; and producing an output in an interior of a vehicle, the output based on the second language of the relevant information.
Type: Grant
Filed: April 17, 2019
Date of Patent: April 12, 2022
Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
Inventors: Brunno L. Moretti, Esther Anderson, Luis Goncalves
-
Patent number: 11282531
Abstract: A method includes receiving multiple samples of time-domain data that includes noise, computing a first two-dimensional (2D) time-frequency representation of the time domain data, and processing the first time-frequency representation using a time-frequency noise reduction mask to generate a second, noise-reduced time-frequency representation of the time domain data. The method also includes generating a time domain output based on the noise-reduced time-frequency representation.
Type: Grant
Filed: February 3, 2020
Date of Patent: March 22, 2022
Assignee: Bose Corporation
Inventors: Ankita D. Jain, Cristian Marius Hera, Elie Bou Daher
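The pipeline in the abstract above (time domain, to time-frequency representation, to masked representation, back to time domain) can be illustrated with a toy example. This is a sketch under simplifying assumptions, not the patented method: frames do not overlap, no window function is applied, the mask is a hard magnitude threshold, and `dft`, `idft`, `denoise`, `frame_len`, and `threshold` are hypothetical names.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
             for k in range(n)) / n).real
        for t in range(n)
    ]


def denoise(samples, frame_len=8, threshold=1.0):
    """Mask low-magnitude time-frequency bins and resynthesize.

    Assumption: a binary mask that zeroes bins whose magnitude is below
    `threshold`. Trailing samples that do not fill a frame are dropped.
    """
    out = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        spectrum = dft(samples[start:start + frame_len])       # one column of the 2D representation
        mask = [1.0 if abs(c) >= threshold else 0.0 for c in spectrum]
        out.extend(idft([m * c for m, c in zip(mask, spectrum)]))
    return out


if __name__ == "__main__":
    clean = [math.cos(2 * math.pi * t / 8) for t in range(16)]
    noisy = [c + 0.05 * (-1) ** t for t, c in enumerate(clean)]
    restored = denoise(noisy)
    print(max(abs(r - c) for r, c in zip(restored, clean)))
```

In the demo, the tone occupies two strong bins per frame while the alternating-sign noise sits in a single weak bin, so the threshold mask removes the noise and leaves the tone intact.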
-
Patent number: 11282524
Abstract: A device may receive a set of audio data files corresponding to a set of calls, wherein the set of audio data files includes digital representations of one or more segments of respective calls of the set of calls, and wherein the set of calls includes audio data relating to a particular industry. The device may receive a set of transcripts corresponding to the set of audio data files. The device may determine a plurality of text-audio pairs within the set of calls, wherein a text-audio pair, of the plurality of text-audio pairs, comprises: a digital representation of a segment of a call of the set of calls, and a corresponding excerpt of text from the set of transcripts. The device may train, using a machine learning process, an industry-specific text-to-speech model, tailored for the particular industry, based on the plurality of text-audio pairs.
Type: Grant
Filed: September 25, 2020
Date of Patent: March 22, 2022
Assignee: Capital One Services, LLC
Inventor: Abhishek Dube
-
Patent number: 11276414
Abstract: An electronic device includes an audio input module, an audio output module, and a processor. The processor is configured to provide a first signal and a second signal into which a first audio signal is processed, output the first audio signal through the audio output module, acquire an external audio signal comprising the first audio signal of the electronic device, acquire a first output value through a first input channel of an audio filter, acquire a second output value through a second input channel of the audio filter, and provide a second audio signal, based at least on a first difference value between the magnitude value corresponding to the first frequency of the external audio signal and the first output value and a second difference value between the magnitude value corresponding to the second frequency of the external audio signal and the second output value.
Type: Grant
Filed: August 30, 2018
Date of Patent: March 15, 2022
Assignee: Samsung Electronics Co., Ltd.
Inventors: Jaemo Yang, Hangil Moon, Soonho Baek, Beak-Kwon Son, Kiho Cho, Chulmin Choi
-
Patent number: 11262975
Abstract: A soft decision audio decoding system for preserving audio continuity in a digital wireless audio receiver is provided that deduces the likelihood of errors in a received digital signal, based on generated hard bits and soft bits. The soft bits may be utilized by a soft audio decoder to determine whether the digital signal should be decoded or muted. The soft bits may be generated based on a degree of closeness of a detected phase trajectory to known legal phase trajectories determined from running the phase trajectory through a soft-output Viterbi algorithm. The value of the soft bits may indicate confidence in the strength of the hard bit generation. The soft decision audio decoding system may infer errors and decode perceptually acceptable audio without requiring error detection, as in conventional systems, as well as have low latency and improved granularity.
Type: Grant
Filed: June 5, 2020
Date of Patent: March 1, 2022
Assignee: Shure Acquisition Holdings, Inc.
Inventor: Robert Mamola