Specialized Equations Or Comparisons Patents (Class 704/236)
  • Patent number: 11023520
    Abstract: Implementations relate to techniques for providing context-dependent search results. The techniques can include receiving a query and background audio. The techniques can also include identifying the background audio, establishing concepts related to the background audio and obtaining terms related to the concepts related to the background audio. The techniques can also include obtaining search results based on the query and on at least one of the terms. The techniques can also include providing the search results.
    Type: Grant
    Filed: January 10, 2019
    Date of Patent: June 1, 2021
    Assignee: GOOGLE LLC
    Inventors: Jason Sanders, John J. Lee, Gabriel Taubman
  • Patent number: 11024311
    Abstract: The various implementations described herein include methods and systems for determining device leadership among voice interface devices. In one aspect, a method is performed at a first electronic device of a plurality of electronic devices, each having microphones, a speaker, processors, and memory storing programs for execution by the processors. The first device detects a voice input. It determines a device state and a relevance of the voice input. It identifies a subset of electronic devices from the plurality to which the voice input is relevant. In accordance with a determination that the subset includes the first device, the first device determines a first score of a criterion associated with the voice input and receives second scores of the criterion from other devices in the subset. In accordance with a determination that the first score is higher than the second scores, the first device responds to the detected input.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: June 1, 2021
    Assignee: GOOGLE LLC
    Inventors: Kenneth Mixter, Diego Melendo Casado, Alexander Houston Gruenstein, Terry Tai, Christopher Thaddeus Hughes, Matthew Nirvan Sharifi
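    Sketch: the arbitration step in the abstract above (respond only if this device is in the relevant subset and its score beats every peer score) could look roughly like the following. The function name, the device identifiers, and the use of a plain dictionary for received scores are assumptions for illustration, not details from the patent.

```python
def should_respond(own_id, own_score, relevant_ids, peer_scores):
    """Decide whether this device answers a detected voice input.

    own_id / relevant_ids / peer_scores are hypothetical names: the
    abstract only says a subset of devices is identified and that
    scores of one criterion are exchanged among that subset.
    """
    # A device outside the relevant subset never responds.
    if own_id not in relevant_ids:
        return False
    # Consider only scores reported by other devices in the subset.
    others = [score for dev, score in peer_scores.items()
              if dev in relevant_ids and dev != own_id]
    # Respond only when the local score is strictly the highest.
    return all(own_score > score for score in others)


if __name__ == "__main__":
    peers = {"kitchen": 0.42, "hall": 0.77, "garage": 0.99}
    # "garage" is not in the relevant subset, so its score is ignored.
    print(should_respond("den", 0.81, {"den", "kitchen", "hall"}, peers))
```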
  • Patent number: 11002789
    Abstract: An analog circuit fault feature extraction method based on a parameter random distribution neighbor embedding winner-take-all method, comprising the following steps: (1) collecting a time-domain response signal of an analog circuit under test, wherein the input of the analog circuit under test is excited by using a pulse signal, a voltage signal is sampled at an output end, and the collected time-domain response signal is an output voltage signal of the analog circuit; (2) applying a discrete wavelet packet transform for the collected time-domain response signal to acquire each wavelet node signal; (3) calculating energy values and kurtosis values of the acquired wavelet node signals to form an initial fault feature data set of the analog circuit; and (4) analyzing the initial fault feature data by the parameter random distribution neighbor embedding winner-take-all method, to acquire optimum low-dimensional feature data.
    Type: Grant
    Filed: October 20, 2018
    Date of Patent: May 11, 2021
    Assignee: WUHAN UNIVERSITY
    Inventors: Yigang He, Wei He, Hui Zhang, Liulu He, Baiqiang Yin, Bing Li
  • Patent number: 10997277
    Abstract: An integrated circuit device such as a neural network accelerator can be programmed to select a numerical value based on a multinomial distribution. In various examples, the integrated circuit device can include an execution engine that includes multiple separate execution units. The multiple execution units can operate in parallel on different streams of data. For example, to make a selection based on a multinomial distribution, the execution units can be configured to perform cumulative sums on sets of numerical values, where the numerical values represent probabilities. In this example, to then obtain cumulative sums across the sets of numerical values, the largest values from the sets can be accumulated, and then added, in parallel to the sets. The resulting cumulative sum across all the numerical values can then be used to randomly select a specific index, which can provide a particular numerical value as the selected value.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: May 4, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Yu Zhou, Vignesh Vivekraja, Ron Diamant
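    Sketch: the chunked cumulative-sum idea in the abstract above can be imitated in ordinary Python. Each "execution unit" is simulated by a list chunk; the chunk size, the function names, and the optional fixed draw `u` are assumptions made for illustration and say nothing about the actual hardware. The input is assumed to be a non-empty list of non-negative weights.

```python
import random
from itertools import accumulate


def chunked_cumsum(probs, num_units):
    """Cumulative sum built chunk by chunk, as parallel units might do it."""
    size = -(-len(probs) // num_units)  # ceiling division
    chunks = [probs[i:i + size] for i in range(0, len(probs), size)]
    # Step 1: every unit computes a cumulative sum over its own chunk.
    local = [list(accumulate(chunk)) for chunk in chunks]
    # Step 2: the largest (last) value of each chunk is accumulated to
    # give the offset that applies to all following chunks.
    offsets = [0.0] + list(accumulate(sums[-1] for sums in local))[:-1]
    # Step 3: every unit adds its offset to its local sums.
    return [v + off for sums, off in zip(local, offsets) for v in sums]


def sample_index(probs, num_units=4, u=None):
    """Pick an index with probability proportional to probs[index]."""
    cdf = chunked_cumsum(probs, num_units)
    u = random.random() if u is None else u
    target = u * cdf[-1]  # scale by the total so probs need not sum to 1
    for i, c in enumerate(cdf):
        if target < c:
            return i
    return len(cdf) - 1


if __name__ == "__main__":
    print(chunked_cumsum([1, 2, 3, 4, 5], num_units=2))
    print(sample_index([0.1, 0.2, 0.3, 0.4], num_units=2))
```

    Passing `u` explicitly makes the selection deterministic, which is convenient for checking the cumulative sums against a straightforward single-pass computation.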
  • Patent number: 10997964
    Abstract: A system, method and computer-readable storage devices are for normalizing text for ASR and TTS in a language-neutral way. The system described herein divides Unicode text into meaningful chunks called “atomic tokens.” The atomic tokens strongly correlate to their actual pronunciation, and not to their meaning. The system combines the tokenization with a data-driven classification scheme, followed by class-determined actions to convert text to normalized form. The classification labels are based on pronunciation, unlike alternative approaches that typically employ Named Entity-based categories. Thus, this approach is relatively simple to adapt to new languages. Non-experts can easily annotate training data because the tokens are based on pronunciation alone.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: May 4, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Ladan Golipour, Alistair D. Conkie
  • Patent number: 10971135
    Abstract: Systems, methods, and computer-readable storage devices for crowd-sourced data labeling. The system requests a respective response from each of a set of entities. The set of entities includes crowd workers. Next, the system incrementally receives a number of responses from the set of entities until one of an accuracy threshold is reached and m responses are received, wherein the accuracy threshold is based on characteristics of the number of responses. Finally, the system generates an output response based on the number of responses.
    Type: Grant
    Filed: July 11, 2019
    Date of Patent: April 6, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Jason Williams, Tirso Alonso, Barbara B. Hollister, Ilya Dan Melamed
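    Sketch: one way to read the incremental collection loop above is "keep taking responses until they agree well enough or m have arrived". The abstract does not say how the accuracy threshold is computed, so the agreement ratio, the `min_responses` guard, and all names below are assumptions for illustration only.

```python
from collections import Counter


def collect_label(responses, m, agreement_threshold=0.8, min_responses=3):
    """Consume responses one at a time; stop early on strong agreement.

    Returns (majority_label, number_of_responses_used).
    """
    counts = Counter()
    received = 0
    for response in responses:
        counts[response] += 1
        received += 1
        _, top = counts.most_common(1)[0]
        # Assumed accuracy estimate: share of responses that agree
        # with the current majority answer.
        if received >= min_responses and top / received >= agreement_threshold:
            break
        # Hard cap from the abstract: stop once m responses are in.
        if received >= m:
            break
    if not counts:
        return None, 0
    label, _ = counts.most_common(1)[0]
    return label, received


if __name__ == "__main__":
    print(collect_label(iter(["cat", "cat", "cat", "dog"]), m=5))
```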
  • Patent number: 10963679
    Abstract: Methods and systems for recognizing emotions in video are disclosed. One example method includes the steps of receiving a video including images, detecting a face of the individual in the images, mapping the detected face to a model including at least two separated points in space corresponding to detectable emotions, each of the at least two separated points in space representing a plurality of example faces corresponding to one of the detectable emotions, and determining the emotion of the individual from the detectable emotions based on a proximity of the detected face to the at least two separated points in space.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: March 30, 2021
    Assignee: Snap Inc.
    Inventors: Victor Shaburov, Yurii Monastyrshyn
  • Patent number: 10963510
    Abstract: A natural language processing system that includes an artificial intelligence (AI) engine and a tagging engine. The AI engine is configured to receive a set of audio files and to identify concepts within the set of audio files. The AI engine is further configured to determine a usage frequency for each of the identified concepts and to generate an AI-defined tag for concepts with a usage frequency that is greater than a usage frequency threshold. The tagging engine is configured to receive an audio file and to identify observed concepts within the audio file. The tagging engine is further configured to compare the observed concepts to the first set of concepts, to determine one or more observed concepts matches concepts linked with AI-defined tags, and to modify metadata for the audio file to include AI-defined tags.
    Type: Grant
    Filed: August 9, 2018
    Date of Patent: March 30, 2021
    Assignee: Bank of America Corporation
    Inventors: James McCormack, Sean M. Gutman, Manu J. Kurian, Sasidhar Purushothaman, Suki Ramasamy, William P. Jacobson
  • Patent number: 10957339
    Abstract: The present disclosure provides a speaker recognition method and apparatus, a computer device and a computer-readable medium. The method comprises: receiving target speech data of a to-be-recognized user in a target group; according to the target speech data, a pre-collected speech database and a pre-trained speaker recognition model, obtaining speech output features corresponding to the target speech data and speech output features corresponding to each of said speech data in the speech database; the speaker recognition model employs a convolution neural network model; recognizing the user corresponding to the target speech data according to the speech output features corresponding to the target speech data and the speech output features corresponding to each of said speech data in the speech database.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: March 23, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Ying Cao, Xiao Liu, Peng Hu, Jie Zhou, Shilei Wen
  • Patent number: 10949736
    Abstract: Systems, apparatus and methods are described including operations for a flexible neural network accelerator.
    Type: Grant
    Filed: November 3, 2016
    Date of Patent: March 16, 2021
    Assignee: Intel Corporation
    Inventors: Michael E Deisher, Ohad Falik
  • Patent number: 10924611
    Abstract: An evaluation criterion for a call performed between an operator and a customer is set without taking time and effort. A voice recognition system includes a call recording unit that records a call performed between a customer and an operator, a voice recognition unit that recognizes the call recorded by the call recording unit and a value of non-verbal information indicating a feature of a calling party in the call and accumulates a recognized result in a storage unit, and a voice recognition result managing unit that sets a reference value for evaluating the calling party on the basis of the value of the non-verbal information included in the recognized result.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: February 16, 2021
    Assignee: HITACHI INFORMATION & TELECOMMUNICATION ENGINEERING, LTD.
    Inventors: Yuko Kanetsuki, Takashi Sugiyama, Terumi Saito
  • Patent number: 10902105
    Abstract: Systems and methods for call detail record (CDR) analysis to determine a risk score for a call and identify fraudulent activity and for fraud detection in Interactive Voice Response (IVR) systems. An example method may store information extracted from received calls. Queries of the stored information may be performed to select data using keys, wherein each key relates to one of the received calls, and wherein the queries are parallelized. The selected data may be transformed into feature vectors, wherein each feature vector relates to one of the received calls and includes a velocity feature and at least one of a behavior feature or a reputation feature. A risk score for the call may be generated during the call based on the feature vectors.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: January 26, 2021
    Assignee: Pindrop Security, Inc.
    Inventors: Scott Strong, Kailash Patil, David Dewey, Raj Bandyopadhyay, Telvis Calhoun, Vijay Balasubramaniyan
  • Patent number: 10896682
    Abstract: A speaker recognition algorithm is trained (one or more of its models are tuned) with samples of a microphone signal produced by an inside microphone of a headphone, while the headphone is worn by a speaker. The trained speaker recognition algorithm then tests other samples of the inside microphone signal and produces multiple speaker identification scores for its given models, or a single speaker verification likelihood score for a single given model. Other embodiments are also described and claimed.
    Type: Grant
    Filed: October 23, 2018
    Date of Patent: January 19, 2021
    Assignee: APPLE INC.
    Inventor: Sorin V. Dusan
  • Patent number: 10884420
    Abstract: A cleaning robot and a shortest path planning method based on a cleaning robot are disclosed, a plurality of cleaning lines are formed by controlling the cleaning robot to perform cleaning in an area according to a zigzag-shaped path; association information of midpoints of at least a part of the cleaning lines is recorded to form a node skeleton tree in which midpoints are represented by nodes, the association information of each midpoint includes: position information of a node corresponding to the midpoint, position information of a parent node, and information of the number of child nodes of the parent node; in the process of traversing upwardly from a current node or traversing upwardly from both a current node and a target node in the node skeleton tree, the node skeleton tree is compressed, so as to determine the shortest planned path from the current node to the target node.
    Type: Grant
    Filed: December 12, 2018
    Date of Patent: January 5, 2021
    Assignee: SHENZHEN SILVER STAR INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Xuyi Deng, Yuxi Liu
  • Patent number: 10878837
    Abstract: An acoustic environment identification system is disclosed that can use neural networks to accurately identify environments. The acoustic environment identification system can use one or more convolutional neural networks to generate audio feature data. A recursive neural network can process the audio feature data to generate characterization data. The characterization data can be modified using a weighting system that weights signature data items. Classification neural networks can be used to generate a classification of an environment.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: December 29, 2020
    Assignee: Snap Inc.
    Inventors: Jinxi Guo, Jia Li, Ning Xu
  • Patent number: 10878807
    Abstract: The present disclosure relates to speech recognition systems and methods that enable personalized vocal user interfaces. More specifically, the present disclosure relates to combining a self-learning speech recognition system based on semantics with a speech-to-text system optionally integrated with a natural language processing system. The combined system has the advantage of automatically and continually training the semantics-based speech recognition system and increasing recognition accuracy.
    Type: Grant
    Filed: December 1, 2015
    Date of Patent: December 29, 2020
    Assignee: FLuent.AI Inc.
    Inventors: Vikrant Tomar, Mathieu Desruisseaux, Helge Seetzen
  • Patent number: 10861464
    Abstract: The present disclosure provides an electronic apparatus having an incremental enrollment unit and a method thereof. The electronic apparatus at least includes a microphone, a storage device, and a processor. The storage device stores a first screening rule, an enrollment database, and a first temporary storage library. The processor receives a command voice transmitted by the microphone, and compares the command voice with enrolled voices in the enrollment database. If determining that a similarity is larger than a threshold value, the processor stores the command voice as a first temporarily stored voice in the first temporary storage library. When a quantity of the first temporarily stored voices in the first temporary storage library is larger than a first predetermined value, the processor screens out a part of the first temporarily stored voices according to the first screening rule, so as to perform incremental enrollment.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: December 8, 2020
    Assignee: ASUSTEK COMPUTER INC.
    Inventor: Hai-Hsing Lin
  • Patent number: 10832684
    Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for providing personalized experiences to a computing device based on user input such as voice, text and gesture input are provided. Acoustic patterns associated with voice input, speech patterns, language patterns and natural language processing may be used to identify a specific user providing input from a plurality of users, identify user background characteristics and traits for the specific user, and topically categorize user input in a tiered hierarchical index. Topically categorized user input may be supplemented with user data and world knowledge and personalized responses and feedback for an identified specific user may be provided reactively and proactively.
    Type: Grant
    Filed: August 31, 2016
    Date of Patent: November 10, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Ruhi Sarikaya
  • Patent number: 10818288
    Abstract: Systems and processes for operating a virtual assistant to provide natural assistant interaction are provided. In accordance with one or more examples, a method includes, at an electronic device with one or more processors and memory: receiving a first audio stream including one or more utterances; determining whether the first audio stream includes a lexical trigger; generating one or more candidate text representations of the one or more utterances; determining whether at least one candidate text representation of the one or more candidate text representations is to be disregarded by the virtual assistant. If at least one candidate text representation is to be disregarded, one or more candidate intents are generated based on candidate text representations of the one or more candidate text representations other than the to be disregarded at least one candidate text representation.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: October 27, 2020
    Assignee: Apple Inc.
    Inventors: Juan Carlos Garcia, Paul S. McCarthy, Kurt Piersol
  • Patent number: 10803879
    Abstract: Apparatus and methods for audio classifying and processing are disclosed. In one embodiment, an audio processing apparatus includes an audio classifier for classifying an audio signal into at least one audio type in real time; an audio improving device for improving experience of audience; and an adjusting unit for adjusting at least one parameter of the audio improving device in a continuous manner based on the confidence value of the at least one audio type.
    Type: Grant
    Filed: November 9, 2017
    Date of Patent: October 13, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lie Lu, Alan J. Seefeldt, Jun Wang
  • Patent number: 10789962
    Abstract: A system and method are presented for the correction of packet loss in audio in automatic speech recognition (ASR) systems. Packet loss correction, as presented herein, occurs at the recognition stage without modifying any of the acoustic models generated during training. The behavior of the ASR engine in the absence of packet loss is thus not altered. To accomplish this, the actual input signal may be rectified, the recognition scores may be normalized to account for signal errors, and a best-estimate method using information from previous frames and acoustic models may be used to replace the noisy signal.
    Type: Grant
    Filed: November 12, 2018
    Date of Patent: September 29, 2020
    Inventors: Srinath Cheluvaraja, Ananth Nagaraja Iyer, Aravind Ganapathiraju, Felix Immanuel Wyss
  • Patent number: 10777189
    Abstract: Techniques for using a dynamic wakeword detection threshold are described. A device detects a wakeword in audio data using a first wakeword detection threshold value. Thereafter, the device receives audio including speech. If the device receives the audio within a predetermined duration of time after detecting the previous wakeword, the device attempts to detect a wakeword in second audio data, corresponding to the audio including the speech, using a second, lower wakeword detection threshold value.
    Type: Grant
    Filed: December 5, 2017
    Date of Patent: September 15, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Gengshen Fu, Shiv Naga Prasad Vitaladevuni, Paul McIntyre, Shuang Wu
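    Sketch: the two-threshold behaviour described above can be modelled with a small stateful gate. The threshold values, the 10-second window, and the class and method names are illustrative assumptions; the abstract only states that a second, lower threshold applies within a predetermined time after a detection.

```python
class WakewordGate:
    """Applies a relaxed threshold shortly after a successful detection."""

    def __init__(self, base=0.8, relaxed=0.6, window_s=10.0):
        self.base = base          # first (normal) detection threshold
        self.relaxed = relaxed    # second, lower threshold
        self.window_s = window_s  # how long the lower threshold applies
        self.last_detection = None

    def threshold_at(self, t):
        """Return the threshold in force at time t (seconds)."""
        recent = (self.last_detection is not None
                  and t - self.last_detection <= self.window_s)
        return self.relaxed if recent else self.base

    def detect(self, score, t):
        """Return True if a detector score at time t counts as a wakeword."""
        hit = score >= self.threshold_at(t)
        if hit:
            self.last_detection = t
        return hit


if __name__ == "__main__":
    gate = WakewordGate()
    for score, t in [(0.7, 0.0), (0.85, 1.0), (0.7, 5.0), (0.7, 30.0)]:
        print(t, gate.detect(score, t))
```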
  • Patent number: 10776137
    Abstract: An approach is provided for decluttering a device desktop. Using a classification technique, a subject of a current task of a user using a device is determined. Based on a determination that the subject matches a category of first desktop object(s), the first desktop object(s) are identified as being related to the current task. Based on a determination that the subject does not match one or more categories of second desktop object(s), the second desktop object(s) are identified as being not related to the current task. Based on the second desktop object(s) being not related to the current task, the second desktop object(s) are hidden from being viewed on the desktop.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: September 15, 2020
    Assignee: International Business Machines Corporation
    Inventors: Sarbajit K. Rakshit, James E. Bostick, Martin G. Keen, John M. Ganci, Jr.
  • Patent number: 10770076
    Abstract: A method of detecting a replay attack on a voice biometrics system comprises: receiving an audio signal representing speech; detecting a magnetic field; determining if there is a correlation between the audio signal and the magnetic field; and if there is a correlation between the audio signal and the magnetic field, determining that the audio signal may result from a replay attack.
    Type: Grant
    Filed: June 27, 2018
    Date of Patent: September 8, 2020
    Assignee: Cirrus Logic, Inc.
    Inventors: César Alonso, John Paul Lesso
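    Sketch: the abstract above does not specify how the correlation is measured. One simple stand-in is a Pearson correlation between an audio envelope and magnetometer samples taken at the same instants; the 0.7 threshold and the function names are assumptions for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length, non-empty sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant signal carries no correlation information
    return cov / math.sqrt(var_x * var_y)


def may_be_replay(audio_envelope, magnetic_samples, threshold=0.7):
    """Flag a possible replay when the two signals move together.

    The idea: a loudspeaker produces a magnetic field that tracks the
    audio it plays, whereas live speech does not.
    """
    return abs(pearson(audio_envelope, magnetic_samples)) >= threshold


if __name__ == "__main__":
    audio = [0, 1, 0, 1, 0, 1]
    print(may_be_replay(audio, [0.1, 0.9, 0.1, 0.9, 0.1, 0.9]))
    print(may_be_replay(audio, [0.5] * 6))
```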
  • Patent number: 10755696
    Abstract: A speech service control apparatus and a method thereof are provided. Speech data is obtained, and a keyword in the speech data is recognized to determine a confidence value corresponding to the keyword, which is a match level of the keyword relative to a wakeup keyword to request for speech services. When the confidence value is inferior to a recognized threshold, a number of cumulative failures is determined. The speech services are requested because the confidence value is greater than the recognized threshold, and the number of cumulative failures is a cumulative number accumulated when the speech data and previous speech data are inferior to the recognized threshold within a time period. The recognized threshold is modified according to the number of cumulative failures, a calculation relationship of confidence values of the speech data and the previous speech data, to enable the speech services successfully.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: August 25, 2020
    Assignee: Wistron Corporation
    Inventor: Chin-Lung Lee
  • Patent number: 10741185
    Abstract: The intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact.
    Type: Grant
    Filed: March 13, 2019
    Date of Patent: August 11, 2020
    Assignee: Apple Inc.
    Inventors: Thomas R. Gruber, Adam J. Cheyer, Daniel Keen
  • Patent number: 10733375
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process receives natural language input and determines a first and a second parsing result for the natural language input. The first and the second parsing results include respective mappings of one or more properties of a domain corresponding to the natural language input to one or more words of the natural language input. The process determines whether the second parsing result corresponds to a data item in a knowledge base, and in accordance with determining that the second parsing result corresponds to the data item in the knowledge base, the process ranks the second parsing result higher than the first parsing result. Based on the ranking, the process generates a task flow using the second parsing result and executes the task flow to provide an output based on the data item.
    Type: Grant
    Filed: June 19, 2018
    Date of Patent: August 4, 2020
    Assignee: Apple Inc.
    Inventors: Lin Li, Deepak Muralidharan, Xiao Yang, Justine Kao, Lavanya Colinjivadi Viswanathan, Mubarak Ali Seyed Ibrahim, Ashish Garg
  • Patent number: 10726022
    Abstract: In one embodiment, a method includes receiving a search query inputted by a first user, wherein the search query comprises one or more n-grams; calculating a needle-confidence score for the search query that is calculated by a needle-intent classifier based on at least the n-grams of the search query and a language model analysis of the n-grams, and wherein the needle-confidence score represents a probability that the search query was intended as a needle search; classifying the search query as a needle search if the calculated needle-confidence score is above a threshold confidence score; and generating a plurality of search-result modules, each search-result module comprising one or more search results matching the search query, wherein one of the search-result modules is a social module, and wherein the number of search results in the social module is based on the classification of the search query as a needle search.
    Type: Grant
    Filed: August 26, 2016
    Date of Patent: July 28, 2020
    Assignee: Facebook, Inc.
    Inventors: Shiun-Zu Kuo, Veselin S. Stoyanov, Rose Marie Philip, Melissa Rose Winstanley
  • Patent number: 10718629
    Abstract: A navigation device includes a touch screen configured to receive a search term and to display at least one Point of Interest (POI) information corresponding to the search term, a computer-readable memory configured to store POI data by region, and a controller configured to split the search term into at least two parts including a front portion and a second portion next to the front portion, determine a target region based on the second portion, find matched POI data among POI data related to the target region based on the whole of the search term, and output a search result as the at least one POI information on the touch screen.
    Type: Grant
    Filed: October 11, 2017
    Date of Patent: July 21, 2020
    Assignees: Hyundai Motor Company, Kia Motors Corporation
    Inventor: Won Seok Yang
  • Patent number: 10720152
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing dynamic, stroke-based alignment of touch displays. In one aspect, a method includes obtaining a candidate transcription that an automated speech recognizer generates for an utterance, determining a particular context associated with the utterance, determining that a particular n-gram that is included in the candidate transcription is included among a set of undesirable n-grams that is associated with the context, adjusting a speech recognition confidence score associated with the transcription based on determining that the particular n-gram that is included in the candidate transcription is included among the set of undesirable n-grams that is associated with the context, and determining whether to provide the candidate transcription for output based at least on the adjusted speech recognition confidence score.
    Type: Grant
    Filed: May 8, 2019
    Date of Patent: July 21, 2020
    Assignee: Google LLC
    Inventors: Pedro J. Moreno Mengibar, Petar Aleksic
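    Sketch: the confidence adjustment in the abstract above amounts to checking a candidate transcription against a context-specific set of undesirable n-grams. The multiplicative penalty, the whitespace tokenizer, the example contexts, and all names are assumptions for illustration; the abstract does not say how the score is adjusted.

```python
def adjust_confidence(transcription, confidence, context, undesirable,
                      penalty=0.5, n_max=3):
    """Lower the confidence if the transcription contains a blocked n-gram.

    undesirable maps a context name to a set of lower-case n-gram strings.
    """
    tokens = transcription.lower().split()
    blocked = undesirable.get(context, set())
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            if " ".join(tokens[i:i + n]) in blocked:
                return confidence * penalty
    return confidence


def should_output(transcription, confidence, context, undesirable,
                  min_confidence=0.6):
    """Provide the candidate only if the adjusted score is high enough."""
    adjusted = adjust_confidence(transcription, confidence, context, undesirable)
    return adjusted >= min_confidence


if __name__ == "__main__":
    rules = {"navigation": {"call mom"}}
    print(should_output("Call mom now", 0.9, "navigation", rules))
    print(should_output("Call mom now", 0.9, "phone", rules))
```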
  • Patent number: 10699169
    Abstract: Provided is a machine learning-based object detection method performed by an object detection apparatus. The method comprises constructing an object detection model by performing machine learning on a training image set, wherein the object detection model is a model for detecting a target object in an input image based on the result of comparing a confidence score for the target object with a threshold value, obtaining an input image given a detection result for the target object, wherein the obtained input image is an image not included in the training image set, predicting one or more object regions, in which the target object exists, in the obtained input image by using the object detection model, classifying a region not matching the detection result among the predicted object regions as a false detection region, and adjusting the threshold value of the object detection model based on a confidence score of the false detection region.
    Type: Grant
    Filed: September 18, 2018
    Date of Patent: June 30, 2020
    Assignee: SAMSUNG SDS CO., LTD.
    Inventors: Sun Ah Kang, Sang Hak Lee, Bo Youn Kim, Seong Jong Ha
  • Patent number: 10657966
    Abstract: Systems and processes for operating a virtual assistant programmed to refer to shared domain concepts using concept nodes are provided. An example process includes receiving a user speech input, determining a primary domain corresponding to a textual representation of the user speech input, identifying, from the textual representation, a first substring that corresponds to a first concept of the primary domain, parsing the first substring to determine a secondary domain of the plurality of domains, and based on the secondary domain, obtaining a data item corresponding to the first substring. In accordance with determining that the data item is valid for resolving the first concept of the primary domain, extracting, from the data item, a parameter value for the first concept of the primary domain and invoking a service based on the primary domain to produce a result using the parameter value for the first concept.
    Type: Grant
    Filed: August 27, 2018
    Date of Patent: May 19, 2020
    Assignee: APPLE INC.
    Inventors: Richard D. Giuli, Nicholas K. Treadgold
  • Patent number: 10650828
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data corresponding to an utterance, determining that the audio data corresponds to a hotword, generating a hotword audio fingerprint of the audio data that is determined to correspond to the hotword, comparing the hotword audio fingerprint to one or more stored audio fingerprints of audio data that was previously determined to correspond to the hotword, detecting whether the hotword audio fingerprint matches a stored audio fingerprint of audio data that was previously determined to correspond to the hotword based on whether the comparison indicates a similarity between the hotword audio fingerprint and one of the one or more stored audio fingerprints that satisfies a predetermined threshold, and in response to detecting that the hotword audio fingerprint matches a stored audio fingerprint, disabling access to a computing device into which the utterance was spoken.
    Type: Grant
    Filed: March 25, 2019
    Date of Patent: May 12, 2020
    Assignee: Google LLC
    Inventors: Matthew Sharifi, Jakob Nicolaus Foerster
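    Sketch: the comparison against stored hotword fingerprints above can be illustrated with toy bit-vector fingerprints. Real audio fingerprints are far richer; the tuple representation, the fraction-of-matching-positions similarity, the 0.95 threshold, and the class name are all assumptions for illustration. Fingerprints are assumed to be non-empty.

```python
def similarity(fp_a, fp_b):
    """Fraction of positions at which two fingerprints agree."""
    same = sum(1 for a, b in zip(fp_a, fp_b) if a == b)
    return same / max(len(fp_a), len(fp_b))


class ReplayGuard:
    """Remembers hotword fingerprints and flags near-identical repeats."""

    def __init__(self, threshold=0.95):
        self.threshold = threshold
        self.stored = []

    def check(self, fingerprint):
        """Return True if access should be disabled for this utterance.

        A near-exact match with an earlier hotword suggests a recording
        is being replayed, since live speech rarely repeats exactly.
        """
        matched = any(similarity(fingerprint, old) >= self.threshold
                      for old in self.stored)
        self.stored.append(fingerprint)
        return matched


if __name__ == "__main__":
    guard = ReplayGuard()
    print(guard.check((1, 0, 1, 1)))  # first occurrence
    print(guard.check((1, 0, 1, 1)))  # identical repeat
```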
  • Patent number: 10636414
    Abstract: There is provided a speech processing apparatus to improve the flexibility of processing regarding speech recognition, the speech processing apparatus including: a determination unit configured to determine how to deal with a recognition result of speech data obtained by a first speech recognizer on a basis of a comparison between a certainty factor of the recognition result of the speech data obtained by the first speech recognizer and a threshold; and a threshold setting unit configured to set dynamically the threshold. The method further comprises using three operation modes where third, second and first modes comprise three, two and one recognizers respectively, and the threshold in the third mode is lower than the second mode and higher than the first mode.
    Type: Grant
    Filed: November 28, 2016
    Date of Patent: April 28, 2020
    Assignee: SONY CORPORATION
    Inventors: Emiru Tsunoo, Toshiyuki Kumakura
  • Patent number: 10629204
    Abstract: Utterance-based user interfaces can include activation trigger processing techniques for detecting activation triggers and causing execution of certain commands associated with particular command pattern activation triggers without waiting for output from a separate speech processing engine. The activation trigger processing techniques can also detect speech analysis patterns and selectively activate a speech processing engine.
    Type: Grant
    Filed: October 3, 2018
    Date of Patent: April 21, 2020
    Assignee: SPOTIFY AB
    Inventor: Richard Mitic
  • Patent number: 10628741
    Abstract: Techniques are described for machine-trained analysis for multimodal machine learning. A computing device captures a plurality of information channels, wherein the plurality of information channels includes contemporaneous audio information and video information from an individual. A multilayered convolutional computing system learns trained weights using the audio information and the video information from the plurality of information channels, wherein the trained weights cover both the audio information and the video information and are trained simultaneously, and wherein the learning facilitates emotional analysis of the audio information and the video information. A second computing device captures further information and analyzes the further information using trained weights to provide an emotion metric based on the further information.
    Type: Grant
    Filed: September 11, 2018
    Date of Patent: April 21, 2020
    Assignee: Affectiva, Inc.
    Inventors: Rana el Kaliouby, Seyedmohammad Mavadati, Taniya Mishra, Timothy Peacock, Panu James Turcot
  • Patent number: 10629192
    Abstract: The present disclosure provides a voice recognition system configured to generate a custom phoneme mapping for a user. The voice recognition system can analyze a user speech sample of a grammar training set in order to generate the custom phoneme mapping. The custom phoneme mapping can be used for subsequent recognition of the user's voice within an application.
    Type: Grant
    Filed: January 9, 2018
    Date of Patent: April 21, 2020
    Assignee: ELECTRONIC ARTS INC.
    Inventor: David Gershon Streat
  • Patent number: 10621991
    Abstract: A speaker recognition system includes a previously-trained joint neural network. An enrollment machine of the speaker recognition system is configured to operate the previously-trained joint neural network to enroll a new speaker based on audiovisual data featuring the newly enrolled speaker. A recognition machine of the speaker recognition system is configured to operate the previously-trained joint neural network to recognize a previously-enrolled speaker based on audiovisual data featuring the previously-enrolled speaker.
    Type: Grant
    Filed: June 28, 2018
    Date of Patent: April 14, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Shixiong Zhang, Eyal Krupka
  • Patent number: 10621993
    Abstract: An apparatus for generating an error concealment signal includes: an LPC representation generator for generating a replacement LPC representation; an LPC synthesizer for filtering a codebook information using the replacement LPC representation; and a noise estimator for estimating a noise estimate during a reception of good audio frames, wherein the noise estimate depends on the good audio frames, and wherein the LPC representation generator is configured to use the noise estimate estimated by the noise estimator in generating the replacement LPC representation.
    Type: Grant
    Filed: November 1, 2018
    Date of Patent: April 14, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
  • Patent number: 10614818
    Abstract: An apparatus for generating an error concealment signal includes an LPC (linear prediction coding) representation generator for generating a first replacement LPC representation and a different second replacement LPC representation; an LPC synthesizer for filtering a first codebook information using the first replacement LPC representation to obtain a first replacement signal and for filtering a different second codebook information using the second replacement LPC representation to obtain a second replacement signal; and a replacement signal combiner for combining the first replacement signal and the second replacement signal to obtain the error concealment signal.
    Type: Grant
    Filed: November 1, 2018
    Date of Patent: April 7, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
  • Patent number: 10592609
    Abstract: Disclosed herein are system, method, and computer program product embodiments for recognizing a human emotion in a message. An embodiment operates by receiving a message from a user. The embodiment labels each word of the message with a part of speech (POS) thereby creating a POS set. The embodiment creates a bag of words (BOW) for the message. The embodiment determines an incongruity score for a combination of words in the POS set using a knowledgebase. The embodiment determines a preliminary emotion detection score for an emotion for the message based on the POS set and the BOW. Finally, the embodiment calculates a final emotion detection score for the emotion for the message based on the preliminary emotion detection score and the incongruity score.
    Type: Grant
    Filed: April 26, 2019
    Date of Patent: March 17, 2020
    Assignee: Tucknologies Holdings, Inc.
    Inventors: Craig Tucker, Bryan Novak
  • Patent number: 10558926
    Abstract: An apparatus for extracting selected information from a set of symbols includes an alignment module configured to retrieve test patterns from a symbol input and to attempt alignment of the test patterns with a canonical pattern. Successful alignment between a particular test pattern and said canonical pattern indicates the existence of information of interest in a particular candidate pattern. Upon detection of a successful alignment, the alignment module passes information concerning the test pattern to a user. Additionally, in response to detecting an unsuccessful attempt to align the first test pattern and the canonical pattern, said alignment module passes, to said user, information concerning the first test pattern.
    Type: Grant
    Filed: November 20, 2015
    Date of Patent: February 11, 2020
    Assignee: ACADEMIA SINICA
    Inventor: Wen-lian Hsu
  • Patent number: 10554908
    Abstract: Exemplary embodiments relate to the application of media effects, such as visual overlays, sound effects, etc. to a video conversation. A media effect may be applied as a reaction to an occurrence in the conversation, such as in response to an emotional reaction detected by emotion analysis of information associated with the video. Effect application may be controlled through gestures, such as applying different effects with different gestures, or canceling automatic effect application using a gesture. Effects may also be applied in group settings, and may affect multiple users. A real-time data channel may synchronize effect application across multiple participants. When broadcasting a video stream that includes effects, the three channels may be sent to an intermediate server, which stitches the three channels together into a single video stream; the single video stream may then be sent to a broadcast server for distribution to the broadcast recipients.
    Type: Grant
    Filed: December 5, 2016
    Date of Patent: February 4, 2020
    Assignee: FACEBOOK, INC.
    Inventors: Stephane Taine, Brendan Benjamin Aronoff, Jason Clark
  • Patent number: 10553237
    Abstract: A transmission controller monitors a sound pressure determination signal and a distance determination signal. The transmission controller controls a transmission voice processor to start an operation of generating a transmission voice signal, when the distance determination signal indicates that a distance is equal to or less than a first distance. The transmission controller controls to start an operation of determining a sound pressure of a voice signal, when the distance determination signal indicates that the distance is equal to or less than a second distance shorter than the first distance. The transmission controller supplies a transmission control signal to a transmission circuit so that the transmission circuit transmits the transmission voice signal as a radio wave, when the sound pressure determination signal indicates that the sound pressure is equal to or greater than a predetermined threshold value.
    Type: Grant
    Filed: February 7, 2018
    Date of Patent: February 4, 2020
    Assignee: JVC KENWOOD CORPORATION
    Inventor: Manabu Nakano
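    Illustrative sketch (not taken from the patent): the staged control described in the abstract above can be read as three conditions evaluated against two distance limits and one sound pressure threshold. The numeric defaults, units, and names below are hypothetical. Gating transmission on the sound pressure determination having started is also an assumption of this sketch.

```python
from dataclasses import dataclass


@dataclass
class ControllerState:
    generate_voice_signal: bool     # transmission voice processor running
    determine_sound_pressure: bool  # sound pressure determination running
    transmit: bool                  # transmission circuit sends the signal


def control(distance, sound_pressure,
            first_distance=0.30, second_distance=0.10,
            pressure_threshold=60.0):
    """Evaluate the three stages for one distance and pressure reading."""
    if second_distance >= first_distance:
        raise ValueError("second_distance must be shorter than first_distance")
    generate = distance <= first_distance
    determine = distance <= second_distance
    transmit = determine and sound_pressure >= pressure_threshold
    return ControllerState(generate, determine, transmit)
```

    With these assumed defaults, a reading inside the first distance but outside the second starts voice signal generation only; transmission requires both the shorter distance and sufficient sound pressure.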
  • Patent number: 10491690
    Abstract: Disclosed are apparatuses, methods, and computer readable media for improved intelligent personal assistant (IPA) software agents that are configured to interact with various people, service providers, and/or smart devices across multiple connection protocols, communications formats, and communication protocols in a seamless and more accurate fashion. More particularly, but not by way of limitation, this disclosure relates to apparatuses, methods, and computer readable media for an improved Message Understanding Service (MUS) that is able to match generic user commands and queries (i.e., commands and queries that are not explicitly directed to a particular service endpoint or smart device) with the service endpoint(s) that have the greatest confidence level of being able to handle the generic command or query.
    Type: Grant
    Filed: December 31, 2016
    Date of Patent: November 26, 2019
    Assignee: Entefy Inc.
    Inventors: Alston Ghafourifar, Brienne Ghafourifar, Mehdi Ghafourifar
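    Illustrative sketch (not taken from the patent): matching a generic command to the service endpoint(s) with the greatest confidence level reduces, at its simplest, to selecting the maximum from a set of per-endpoint confidence values. How those confidences are produced is not covered here; the endpoint names and the `select_endpoints` function are hypothetical.

```python
def select_endpoints(confidences):
    """Return the endpoint name(s) with the greatest confidence level.

    `confidences` maps an endpoint name to a confidence value. Ties are all
    returned, sorted by name, since more than one endpoint may qualify.
    """
    if not confidences:
        return []
    best = max(confidences.values())
    return sorted(name for name, c in confidences.items() if c == best)
```

    For example, `select_endpoints({"thermostat": 0.9, "music": 0.4})` would pick the thermostat endpoint in this sketch.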
  • Patent number: 10409797
    Abstract: A system and method are provided for providing searchable customer call indexes. Consistent with disclosed embodiments, a system may receive call information associated with telephone conversations between callers and a vendor, the call information including an audio recording or transcript for each telephone conversation. The system may also identify one or more keywords from the audio recordings or transcripts and index the call information into one or more indexes based on the identified keywords. Finally, the system may determine search results responsive to a search query based on the indexing. In some embodiments, changes to customer service may be identified based on the search results.
    Type: Grant
    Filed: January 25, 2018
    Date of Patent: September 10, 2019
    Assignee: Capital One Services, LLC
    Inventor: Nikhil Murgai
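    Illustrative sketch (not taken from the patent): keyword-based indexing of call transcripts, followed by a query against the index, can be pictured as a small inverted index. The whitespace tokenization, the requirement that every query term match, and the names `build_index` and `search` are assumptions made for illustration.

```python
from collections import defaultdict


def build_index(transcripts, keywords):
    """Map each keyword to the set of call IDs whose transcript contains it."""
    index = defaultdict(set)
    wanted = {k.lower() for k in keywords}
    for call_id, text in transcripts.items():
        words = {w.strip(".,!?;:").lower() for w in text.split()}
        for keyword in wanted & words:
            index[keyword].add(call_id)
    return index


def search(index, query):
    """Return call IDs whose indexed keywords include every query term."""
    terms = [t.lower() for t in query.split()]
    if not terms:
        return set()
    results = None
    for term in terms:
        ids = index.get(term, set())
        results = set(ids) if results is None else results & ids
    return results
```

    A query term that was never indexed as a keyword yields an empty result in this sketch, rather than falling back to a full-text scan.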
  • Patent number: 10410637
    Abstract: Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a plurality of speech inputs, each of the speech inputs associated with a same user of the electronic device; providing each of the plurality of speech inputs to a user-independent acoustic model, the user-independent acoustic model providing a plurality of speech results based on the plurality of speech inputs; initiating a user-specific acoustic model on the electronic device; and adjusting the user-specific acoustic model based on the plurality of speech inputs and the plurality of speech results.
    Type: Grant
    Filed: September 22, 2017
    Date of Patent: September 10, 2019
    Assignee: Apple Inc.
    Inventors: Matthias Paulik, Henry G. Mason, Jason A. Skinder
  • Patent number: 10402827
    Abstract: Embodiments of the invention are directed to systems and methods for biometrics transaction processing. A location of a device associated with a user may be determined. A reference to a biometric data model associated with the user stored within a database may be retrieved, based at least in part on the location. Biometric data may be received from the user. Using the reference, the biometric data may be compared to the biometric data model stored within the database. A determination may be made whether the user is authenticated for the transaction based on the comparing step.
    Type: Grant
    Filed: October 5, 2018
    Date of Patent: September 3, 2019
    Assignee: Visa International Service Association
    Inventors: John F. Sheets, Kim R. Wagner, Mark A. Nelsen
  • Patent number: 10382626
    Abstract: A call handling platform receives a call placed by a caller to a calling number. The call handling platform computes an experience score for the caller using measurements of a subset of data points based on an interaction of the caller with an interactive voice response (IVR) module during the call. The experience score reflects a numerical measure of a level of satisfaction of the caller in interacting with the IVR module. The call handling platform compares the experience score to a predetermined threshold that indicates a minimum level of caller satisfaction, and determines that the experience score indicates that the caller has a lower level of satisfaction than the minimum level of satisfaction. Conditioned on this determination, the call handling platform routes the call to a human agent at a call center, along with enabling the agent to perceive a representation of the experience score.
    Type: Grant
    Filed: January 8, 2018
    Date of Patent: August 13, 2019
    Inventors: Praphul Kumar, Aaron Wellman, Ahmed Tewfik Bouzid
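    Illustrative sketch (not taken from the patent): the threshold comparison and conditional routing in the abstract above can be shown with a toy score. The weighted-average formula, the data point names, and the returned dictionary shape are hypothetical; the abstract does not say how the experience score is computed.

```python
def experience_score(measurements, weights):
    """Weighted average of the measured data points (assumed formula)."""
    total_weight = sum(weights[name] for name in measurements)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    weighted = sum(measurements[name] * weights[name] for name in measurements)
    return weighted / total_weight


def route_call(score, minimum_score):
    """Route to a human agent when the score is below the minimum level."""
    if score < minimum_score:
        # The agent is given the score so it can be displayed to them.
        return {"destination": "human_agent", "experience_score": score}
    return {"destination": "ivr", "experience_score": None}
```

    A caller whose score falls below the minimum is handed to an agent together with the score; otherwise the call stays with the IVR module in this sketch.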
  • Patent number: 10379507
    Abstract: A voice control type bath system and an operating method thereof are disclosed. The voice control type bath system is utilized for a massage bath equipment and includes at least one attached device for actuating the massage bath equipment, a voice receiving unit for receiving at least one voice signal, a voice analyzing module for analyzing the at least one voice signal to generate at least one controlling command; and a main control device for controlling the at least one attached device to actuate the massage bath equipment according to the at least one controlling command and/or for controlling an actuation of the at least one attached device according to the at least one controlling command. The voice control type bath system and the operating method thereof can directly control the at least one attached device via the at least one voice signal.
    Type: Grant
    Filed: November 24, 2016
    Date of Patent: August 13, 2019
    Assignee: DARTPOINT TECH. CO., LTD.
    Inventors: Chi-Lin Kang, Chao-Yuan Huang