Specialized Equations Or Comparisons Patents (Class 704/236)
  • Patent number: 11514354
    Abstract: An Artificial Intelligence (AI) based performance prediction system predicts the performance and behavior of an entity via a complex structure made of iterative and parallel machine learning (ML) model rebuilds with real time data collection. The engine selects a best model at every level and scores the entity to help in predicting the behavior of the entity. Model selection is based on various model selection criteria. The selected model determines a propensity score that indicates a likelihood of the entity migrating from a currently categorized segment to another segment of higher or lower value. Accordingly, messages or alerts with one or more of corrective actions or system enhancements can be transmitted based on the status of the entity via various targeting channels and a post treatment analysis is carried out to find the effect of the corrective actions on the entity.
    Type: Grant
    Filed: June 4, 2018
    Date of Patent: November 29, 2022
    Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED
    Inventors: Mamta Aggarwal Rajnayak, Charu Nahata, Sorabh Kalra, Harshila Srivastav
  • Patent number: 11482213
    Abstract: Systems, methods, and computer-readable media for correcting transcriptions created through automatic speech recognition. A transcription of speech created using an automatic speech recognition system can be received. One or more domain-specific contexts associated with the speech can be identified and a text span that includes a mistranscribed entry can be recognized from the speech based on the one or more domain-specific contexts. Additionally, features can be extracted from the mistranscribed entry and the extracted features can be matched against an index of domain-specific entries to identify a correct entry of the mistranscribed entry. Subsequently, the transcription can be corrected by replacing with the mistranscribed entry with the correct entry.
    Type: Grant
    Filed: January 29, 2019
    Date of Patent: October 25, 2022
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Karthik Raghunathan, Arushi Raghuvanshi, Vijay Ramakrishnan Thimmaiyah, Lucien Serapio Carroll, Varsha Ravikumar Embar
  • Patent number: 11468243
    Abstract: A computing device can receive a communication including text that can be presented on a display screen of the computing device. A camera of the computing device can capture image data. The computing device can determine, from the image data, an identity represented in the image data. The computing device can determine an amount of the communication to present on the display screen based on the identity. The computing device can determine, from the image data, user attention is directed toward the display screen. The computing device can present the amount of the communication on the display screen. In some embodiments, the computing device can determine which content of the communication to display based on the identity. The computing device can display a summary of the communication. The computing device can display an amount of the summary and/or the content of the summary based on the identity.
    Type: Grant
    Filed: July 1, 2019
    Date of Patent: October 11, 2022
    Assignee: Amazon Technologies, Inc.
    Inventor: Ryan H. Cassidy
  • Patent number: 11455999
    Abstract: Data is received that encapsulates a spoken response to a prompt text comprising a string of words. Thereafter, the received data is transcribed into a string of words. The string of words is then compared with a prompt so that a similarity grid representation of the comparison can be generated that characterizes a level of similarity between the string of words in the spoken response and the string of words in the prompt text. The grid representation is then scored using at least one machine learning model. The score indicates a likelihood of the spoken response having been off-topic. Data providing the encapsulated score can then be provided. Related apparatus, systems, techniques and articles are also described.
    Type: Grant
    Filed: April 9, 2020
    Date of Patent: September 27, 2022
    Assignee: Educational Testing Service
    Inventors: Xinhao Wang, Su-Youn Yoon, Keelan Evanini, Klaus Zechner, Yao Qian
  • Patent number: 11443734
    Abstract: A text search query including one or more words may be received. An ASR index created for an audio recording may be searched over using the query to produce ASR search results including words, each word associated with a confidence score. For each of the words in the ASR search results associated with a confidence score below a threshold (and in some cases having one or more preceding words in the ASR index and one or more subsequent words in the ASR index), a phonetic representation of the audio recording may be searched for the word having the confidence score below the threshold, where it occurs in the audio recording, possibly after the one or more preceding words and in the audio recording before the one or more subsequent words, to produce phonetic search results. Search results may be returned include ASR and phonetic results.
    Type: Grant
    Filed: August 26, 2019
    Date of Patent: September 13, 2022
    Assignee: NICE LTD.
    Inventors: William Mark Finlay, Robert William Morris, Peter S. Cardillo, Maria Michaela Kunin
  • Patent number: 11423913
    Abstract: An apparatus for generating an error concealment signal, includes: an LPC representation generator for generating a replacement LPC representation; an LPC synthesizer for filtering a codebook information using the replacement LPC representation; and a noise estimator for estimating a noise estimate during a reception of good audio frames, wherein the noise estimate depends on the good audio frames representation generator is configured to use the noise estimate estimated by the noise estimator in generating the replacement LPC representation.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: August 23, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
  • Patent number: 11410686
    Abstract: In one aspect, a computerized method for implementing voice and acupressure-based lifestyle management includes the step of measuring a speed at which a user is speaking. A wearable device records the user's voice with a microphone and communicates a digital recording of the user's voice to a computer processor. The method includes the step of measuring a time spacing between a set of user's words and a length of the set of user's words. The method includes the step of determining at least one anomaly by comparing the digital recording of the user's voice with a benchmark recording of the user's voice. The method includes the step of alerting the user of the detected anomaly.
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: August 9, 2022
    Assignee: VOECE, INC.
    Inventor: Rashmi Panda
  • Patent number: 11405506
    Abstract: Systems and methods are provided for attribute-based client callbacks. A client is prompted to leave a voice message. Attributes are extracted from the voice message and, based on the attributes, tokens created for the selection of an appropriate agent is connected to the client, such as having skills or attributes matching one or more tokens. A callback application server transmits prompts and receives requests for client callbacks. an interaction manager determines agent availability and arranges callback handling, and a session management server initiates callbacks to connect the selected agent with the client.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: August 2, 2022
    Assignee: Avaya Management L.P.
    Inventors: Manish Dusad, Kazim Hussain
  • Patent number: 11393479
    Abstract: An apparatus for generating an error concealment signal includes an LPC (linear prediction coding) representation generator for generating a first replacement LPC representation and a different second replacement LPC representation; an LPC synthesizer for filtering a first codebook information using the first replacement representation to obtain a first replacement signal and for filtering a different second codebook information using the second replacement LPC representation to obtain a second replacement signal; and a replacement signal combiner for combining the first replacement signal and the second replacement signal to obtain the error concealment signal.
    Type: Grant
    Filed: March 3, 2020
    Date of Patent: July 19, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
  • Patent number: 11392773
    Abstract: Techniques for generating conversational training data are described. In some instances, a request to generate conversational training data for a goal-oriented conversation model is received, a transitional graph of intents is traversed to generate a conversation template for each intent of the transitional graph, each intent being a task to fulfill a request and comprising one or more slot to be filled by a user of the bot machine learning model, the conversation template including a path including at least one placeholder for an utterance or a slot level utterance, and at least utterances from one or more dictionaries are sampled to fill in the placeholders for the utterances of the path to generate conversational training data.
    Type: Grant
    Filed: January 31, 2019
    Date of Patent: July 19, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Rashmi Gangadharaiah, Ajay Mishra, Roger Scott Jenke, Meghana Puvvadi
  • Patent number: 11368583
    Abstract: One example method of operation may include identifying call data associated with a received call, identifying call parameters from the call data, and the call parameters include one or more call routing parameters associated with call routing of the call and one or more call session parameters associated with a call session of the call, assigning weights to one or more of the call routing parameters and the call session parameters, determining a scam score for the call based on a sum of the weights applied to the call routing parameters and the call session parameters, and blocking the call when the scam score is greater than or equal to a predetermined threshold scam score.
    Type: Grant
    Filed: November 3, 2020
    Date of Patent: June 21, 2022
    Assignee: FIRST ORION CORP.
    Inventors: Mark Hamilton Botner, Collin Michael Turney, Daniel Francis Kliebhan, Robert Francis Piscopo, Jr., Charles Donald Morgan, Jamelle Adnan Brown, Chee-Fung Choy, Samuel Kenton Welch, Nysia Inet George, Andrew Collin Shaddox
  • Patent number: 11357431
    Abstract: Methods and apparatus to identify an emotion evoked by media are disclosed. An example apparatus includes a synthesizer to generate a first synthesized sample based on a pre-verbal utterance associated with a first emotion. A feature extractor is to identify a first value of a first feature of the first synthesized sample. The feature extractor to identify a second value of the first feature of first media evoking an unknown emotion. A classification engine is to create a model based on the first feature. The model is to establish a relationship between the first value of the first feature and the first emotion. The classification engine is to identify the first media as evoking the first emotion when the model indicates that the second value corresponds to the first value.
    Type: Grant
    Filed: October 16, 2020
    Date of Patent: June 14, 2022
    Assignee: The Nielsen Company (US), LLC
    Inventors: Robert T. Knight, Ramachandran Gurumoorthy, Alexander Topchy, Ratnakar Dev, Padmanabhan Soundararajan, Anantha Pradeep
  • Patent number: 11361675
    Abstract: Provided is a system and a non-transitory computer-readable medium having computer-executable instructions stored thereon which, when executed by one or more processors effectuate operations comprising dividing a text of a plurality of words in a foreign language into one or more Interpretation Phrases, each of the Interpretation Phrases being made of one or more words chosen at an optimal composition for a user to listen to, read along, and maintain comprehension and engagement, wherein the optimal composition is determined based on the biographical data of the user and the historical usage by the user; reading aloud the first Interpretation Phrase by a narrator; and after reading aloud the first Interpretation Phrase, interpreting aloud the first Interpretation Phrase into the said user's native language to provide understanding of the Interpretation Phrase in the user's native language, to maintain the flow of the story, and to create and promote subconscious associations between native and foreign languag
    Type: Grant
    Filed: November 7, 2018
    Date of Patent: June 14, 2022
    Assignee: MAGICAL TRANSLATIONS, INC.
    Inventor: Leslie Omana Begert
  • Patent number: 11347803
    Abstract: Systems and methods for adaptive question answering are provided in which an answer is adaptive to a user's characteristics, goals and needs by continuously learning from user interactions and adapting both the context and data visualization. An exemplary system comprises software modules embodied on a computer network, and the software modules comprise an interpretation engine, an answering engine and a learning engine.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: May 31, 2022
    Assignee: Cuddle Artificial Intelligence Private Limited
    Inventors: Neha Prabhugaonkar, Abhay Parab, Natwar Mall
  • Patent number: 11336972
    Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for automated video preview generation. Example methods may include determining video content, determining a first shot transition, a second shot transition, a third shot transition, and a fourth shot transition in the video content, and determining that human speech is present during the first shot transition and the second shot transition. Example methods may include determining a first timestamp associated with the third shot transition, determining a second timestamp associated with the fourth shot transition, generating a first video preview of the video content, where the first video preview includes a segment of the video content from the first timestamp to the second timestamp, and causing presentation of the first video preview, where the first video preview does not include a segment of the video content between the first shot transition and the second shot transition.
    Type: Grant
    Filed: January 5, 2021
    Date of Patent: May 17, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Muhammad Raffay Hamid, Kewen Chen, Anne TuAnh Thanh Thuy Ho, Guy Friedel, Arun Velayudhan Pillai, Dhaval Damani, Jacob William Jensen, Zuzanna Maria Stepniakowska Coggins, Maciej Tadeusz Golonka, Anantha Krishna Hodrali Srinivasa Bhatta
  • Patent number: 11328733
    Abstract: Systems and methods for speaker verification comprise optimizing a neural network by minimizing a generalized negative log likelihood function, including receiving a training batch of audio samples comprising a plurality of utterances for each of a plurality of speakers, extracting features from the audio samples to generate a batch of features, processing the batch of features using a neural network to generate a plurality of embedding vectors configured to differentiate audio samples by speaker, computing a generalized negative log-likelihood loss (GNLL) value for the training batch based, at least in part, on the embedding vectors, and modifying weights of the neural network to reduce the GNLL value. Computing the GNLL may include generating a centroid vector for each of a plurality of speakers, based at least in part on the embedding vectors.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: May 10, 2022
    Assignee: SYNAPTICS INCORPORATED
    Inventors: Saeed Mosayyebpour Kaskari, Atabak Pouya
  • Patent number: 11308964
    Abstract: Systems, apparatus, methods, and articles of manufacture for cooperatively-overlapped and Artificial Intelligence (AI)-managed interfaces. For example, multiple cooperatively and/or partially overlapped interfaces may be provided (e.g., via an electronic and/or touch-screen device), with such interfaces being dynamically managed by various AI components, such as natural language processing, machine learning techniques, and/or neural network data processing.
    Type: Grant
    Filed: August 18, 2020
    Date of Patent: April 19, 2022
    Assignee: The Travelers Indemnity Company
    Inventors: Douglas Calegari, Stephen Ziegelmayer
  • Patent number: 11270692
    Abstract: A speech recognition method, performed by a computer, with an improved recognition accuracy is disclosed. The method includes: performing speech recognition of an input speech to acquire a plurality of recognition candidates through a plurality of speech recognition processes different from each other for a section having a reliability lower than a predetermined value; verifying similarities between each of the acquired plurality of recognition candidates and meta-information corresponding to the input speech; and determining, based on the verified similarities, a recognition result of the low-reliability section from among the acquired plurality of recognition candidates.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: March 8, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Yusuke Hamada, Keisuke Asakura
  • Patent number: 11264012
    Abstract: Conversations between agents of a contact center and a customer are often transcribed so that text is maintained. However, text conversations consist only of text and omit significant portions of a conversation that are conveyed outside of the specific words spoken. By determining the emotion, tone, or other aspect in a conversation, which may contradict the text content, a data structure may be maintained such that the textual content is annotated with emotion or tonal information and/or utilized in a routing decision to cause a communication network to be altered, such as to include at least one additional node based upon a particular emotion or tone.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: March 1, 2022
    Assignee: Avaya Inc.
    Inventors: Piyush Mital, Nikita Kotak, Asmita Gokhale, Robert E. Braudes
  • Patent number: 11257484
    Abstract: According to some embodiments, a multi-layer speech recognition transcript post processing system may include a data-driven, statistical layer associated with a trained automatic speech recognition model that selects an initial transcript. A rule-based layer may receive the initial transcript from the data-driven, statistical layer and execute at least one pre-determined rule to generate a first modified transcript. A machine learning approach layer may receive the first modified transcript from the rule-based layer and perform a neural model inference to create a second modified transcript. A human editor layer may receive the second modified transcript from the machine learning approach layer along with an adjustment from at least one human editor. The adjustment may create, in some embodiments, a final transcript that may be used to fine-tune the data-driven, statistical layer.
    Type: Grant
    Filed: August 21, 2019
    Date of Patent: February 22, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dimitrios Basile Dimitriadis, Xie Chen, Nanshan Zeng, Yu Shi, Liyang Lu
  • Patent number: 11250840
    Abstract: Some embodiments provide a method of training a MT network to detect a wake expression that directs a digital assistant to perform an operation based on a request that follows the expression. The MT network includes processing nodes with configurable parameters. The method iteratively selects different sets of input values with known sets of output values. Each of a first group of input value sets includes a vocative use of the expression. Each of a second group of input value sets includes a non-vocative use of the expression. For each set of input values, the method uses the MT network to process the input set to produce an output value set and computes an error value that expresses an error between the produced output value set and the known output value set. Based on the error values, the method adjusts configurable parameters of the processing nodes of the MT network.
    Type: Grant
    Filed: April 5, 2019
    Date of Patent: February 15, 2022
    Assignee: PERCEIVE CORPORATION
    Inventor: Steven L. Teig
  • Patent number: 11244698
    Abstract: Systems and methods are provided for analyzing voice-based audio inputs. A voice-based audio input associated with a user (e.g., wherein the voice-based audio input is a prompt or a command) is received and measures of one or more features are extracted. One or more parameters are calculated based on the measures of the one or more features. The occurrence of one or more mistriggers is identified by inputting the one or more parameters into a predictive model. Further, systems and methods are provided for identifying human mental health states using mobile device data. Mobile device data (including sensor data) associated with a mobile device corresponding to a user is received. Measurements are derived from the mobile device data and input into a predictive model. The predictive model is executed and outputs probability values of one or more symptoms associated with the user.
    Type: Grant
    Filed: March 8, 2019
    Date of Patent: February 8, 2022
    Assignee: Cogito Corporation
    Inventors: Joshua Feast, Ali Azarbayejani, Skyler Place
  • Patent number: 11244689
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining voice characteristics are provided. One of the methods includes: obtaining speech data of a speaker; inputting the speech data into a model trained at least by jointly minimizing a first loss function and a second loss function, wherein the first loss function comprises a non-sampling-based loss function and the second loss function comprises a Gaussian mixture loss function with non-unit multi-variant covariance matrix; and obtaining from the trained model one or more voice characteristics of the speaker.
    Type: Grant
    Filed: March 22, 2021
    Date of Patent: February 8, 2022
    Assignee: ALIPAY (HANGZHOU) INFORMATION TECHNOLOGY CO., LTD.
    Inventors: Zhiming Wang, Kaisheng Yao, Xiaolong Li
  • Patent number: 11238859
    Abstract: A natural-language voice chatbot is initiated and a voice session is established between the chatbot and a customer while the customer is operating a vehicle device within a vehicle. A pre-staged order is taken from a customer during the session and the session is suspended until the customer arrives at a store associated with the pre-staged order. A location-based trigger is raised when the customer is detected as being present at a transaction terminal of a store; the session is resumed on the transaction terminal and/or the vehicle device. The pre-stage order is confirmed during the resumed session and payment is obtained from the customer for the order when payment was not already obtained from the customer. The order is sent to a fulfillment station and, in an embodiment, the items associated with the order are delivered to the customer while the customer remains at the terminal.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: February 1, 2022
    Assignee: NCR Corporation
    Inventors: Matthew Robert Burris, Shelby Frances Apps, Andrew Cohen, Gary C. Dalton, Jason Robert Dyer, Jodessiah Sumpter
  • Patent number: 11200217
    Abstract: A method includes searching for data contained in a structured data structure. The method includes receiving a query. The query includes a structured data structure path and a first element related to the structured data structure path. One or more patterns are created comprising at least a portion of the structured data structure path and one or more elements related to the first element. For each of the one or more patterns, a hash is created. The created hashes are looked-up in a hash index to identify one or more structured data structures correlated to the hashes. The one or more structured data structures are identified to a user.
    Type: Grant
    Filed: May 26, 2017
    Date of Patent: December 14, 2021
    Assignee: PERFECT SEARCH CORPORATION
    Inventors: Bruce R. Tietjen, Ronald P. Millett
  • Patent number: 11194998
    Abstract: An intelligent assistant records speech spoken by a first user and determines a self-selection score for the first user. The intelligent assistant sends the self-selection score to another intelligent assistant, and receives a remote-selection score for the first user from the other intelligent assistant. The intelligent assistant compares the self-selection score to the remote-selection score. If the self-selection score is greater than the remote-selection score, the intelligent assistant responds to the first user and blocks subsequent responses to all other users until a disengagement metric of the first user exceeds a blocking threshold. If the self-selection score is less than the remote-selection score, the intelligent assistant does not respond to the first user.
    Type: Grant
    Filed: July 24, 2017
    Date of Patent: December 7, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Kazuhito Koishida, Alexander A Popov, Uros Batricevic, Steven Nabil Bathiche
  • Patent number: 11113607
    Abstract: A response generation apparatus ensures accurate output. A computer stores graph knowledge including a response generation module generating a response to an input document including a plurality of sentences, the graph knowledge database includes graph data that manages a structure of each type of graph knowledge, and the response generation module generates a first graph knowledge from each of the sentences; searches a second graph knowledge similar to each of the plurality of first graph knowledge while referring to the graph data on the basis of the plurality of first graph knowledge; identifies the plurality of second graph knowledge included in a dense location where a density of the second graph knowledge is high in a graph space; searches third graph knowledge for generating the response while referring to the graph data on the basis of the identified second graph knowledge; and generates the response using the third graph knowledge.
    Type: Grant
    Filed: June 7, 2017
    Date of Patent: September 7, 2021
    Assignee: HITACHI, LTD.
    Inventors: Toshinori Miyoshi, Miaomei Lei, Hiroki Sato
  • Patent number: 11102590
    Abstract: A hearing device, e.g. a hearing aid, comprises a) a multitude of input units, each providing an electric input signal representing sound in the environment of the user in a time-frequency representation, wherein the sound is a mixture of speech and additive noise or other distortions, e.g. reverberation, b) a multitude of beamformer filtering units, each being configured to receive at least two, e.g. all, of said multitude of electric input signals, each of said multitude of beamformer filtering units being configured to provide a beamformed signal representative of the sound in a different one of a multitude of spatial segments, e.g. spatial cells, around the user, c) a multitude of speech probability estimators each configured to receive the beamformed signal for a particular spatial segment and to estimate a probability that said particular spatial segment contains speech at a given point in time and frequency, wherein at least one, e.g.
    Type: Grant
    Filed: July 17, 2019
    Date of Patent: August 24, 2021
    Assignee: Oticon A/S
    Inventor: Jesper Jensen
  • Patent number: 11087744
    Abstract: Term masking is performed by generating a time-alignment value for a plurality of identifiable units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of identifiable units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned identifiable units of sound a force-aligned identifiable unit of sound to be muddled, and audio muddling the force-aligned identifiable unit of sound to be muddled.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: August 10, 2021
    Assignee: Spotify AB
    Inventors: Andreas Jansson, Eric J. Humphrey, Rachel Malia Bittner, Sravana K. Reddy
  • Patent number: 11023520
    Abstract: Implementations relate to techniques for providing context-dependent search results. The techniques can include receiving a query and background audio. The techniques can also include identifying the background audio, establishing concepts related to the background audio and obtaining terms related to the concepts related to the background audio. The techniques can also include obtaining search results based on the query and on at least one of the terms. The techniques can also include providing the search results.
    Type: Grant
    Filed: January 10, 2019
    Date of Patent: June 1, 2021
    Assignee: GOOGLE LLC
    Inventors: Jason Sanders, John J. Lee, Gabriel Taubman
  • Patent number: 11024311
    Abstract: The various implementations described herein include methods and systems for determining device leadership among voice interface devices. In one aspect, a method is performed at a first electronic device of a plurality of electronic devices, each having microphones, a speaker, processors, and memory storing programs for execution by the processors. The first device detects a voice input. It determines a device state and a relevance of the voice input. It identifies a subset of electronic devices from the plurality to which the voice input is relevant. In accordance with a determination that the subset includes the first device, the first device determines a first score of a criterion associated with the voice input and receives second scores of the criterion from other devices in the subset. In accordance with a determination that the first score is higher than the second scores, the first device responds to the detected input.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: June 1, 2021
    Assignee: GOOGLE LLC
    Inventors: Kenneth Mixter, Diego Melendo Casado, Alexander Houston Gruenstein, Terry Tai, Christopher Thaddeus Hughes, Matthew Nirvan Sharifi
  • Patent number: 11002789
    Abstract: An analog circuit fault feature extraction method based on a parameter random distribution neighbor embedding winner-take-all method, comprising the following steps: (1) collecting a time-domain response signal of an analog circuit under test, wherein the input of the analog circuit under test is excited by using a pulse signal, a voltage signal is sampled at an output end, and the collected time-domain response signal is an output voltage signal of the analog circuit; (2) applying a discrete wavelet packet transform for the collected time-domain response signal to acquire each wavelet node signal; (3) calculating energy values and kurtosis values of the acquired wavelet node signals to form an initial fault feature data set of the analog circuit; and (4) analyzing the initial fault feature data by the parameter random distribution neighbor embedding winner-take-all method, to acquire optimum low-dimensional feature data.
    Type: Grant
    Filed: October 20, 2018
    Date of Patent: May 11, 2021
    Assignee: WUHAN UNIVERSITY
    Inventors: Yigang He, Wei He, Hui Zhang, Liulu He, Baiqiang Yin, Bing Li
  • Patent number: 10997277
    Abstract: An integrated circuit device such as a neural network accelerator can be programmed to select a numerical value based on a multinomial distribution. In various examples, the integrated circuit device can include an execution engine that includes multiple separate execution units. The multiple execution units can operate in parallel on different streams of data. For example, to make a selection based on a multinomial distribution, the execution units can be configured to perform cumulative sums on sets of numerical values, where the numerical values represent probabilities. In this example, to then obtain cumulative sums across the sets of numerical values, the largest values from the sets can be accumulated, and then added, in parallel to the sets. The resulting cumulative sum across all the numerical values can then be used to randomly select a specific index, which can provide a particular numerical value as the selected value.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: May 4, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Yu Zhou, Vignesh Vivekraja, Ron Diamant
  • Patent number: 10997964
    Abstract: A system, method and computer-readable storage devices are for normalizing text for ASR and TTS in a language-neutral way. The system described herein divides Unicode text into meaningful chunks called “atomic tokens.” The atomic tokens strongly correlate to their actual pronunciation, and not to their meaning. The system combines the tokenization with a data-driven classification scheme, followed by class-determined actions to convert text to normalized form. The classification labels are based on pronunciation, unlike alternative approaches that typically employ Named Entity-based categories. Thus, this approach is relatively simple to adapt to new languages. Non-experts can easily annotate training data because the tokens are based on pronunciation alone.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: May 4, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY 1, L.P.
    Inventors: Ladan Golipour, Alistair D. Conkie
  • Patent number: 10971135
    Abstract: Systems, methods, and computer-readable storage devices for crowd-sourced data labeling. The system requests a respective response from each of a set of entities. The set of entities includes crowd workers. Next, the system incrementally receives a number of responses from the set of entities until one of an accuracy threshold is reached and m responses are received, wherein the accuracy threshold is based on characteristics of the number of responses. Finally, the system generates an output response based on the number of responses.
    Type: Grant
    Filed: July 11, 2019
    Date of Patent: April 6, 2021
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Jason Williams, Tirso Alonso, Barbara B. Hollister, Ilya Dan Melamed
  • Patent number: 10963510
    Abstract: A natural language processing system that includes an artificial intelligence (AI) engine and a tagging engine. The AI engine is configured to receive a set of audio files and to identify concepts within the set of audio files. The AI engine is further configured to determine a usage frequency for each of the identified concepts and to generate an AI-defined tag for concepts with a usage frequency that is greater than a usage frequency threshold. The tagging engine is configured to receive an audio file and to identify observed concepts within the audio file. The tagging engine is further configured to compare the observed concepts to the first set of concepts, to determine one or more observed concepts matches concepts linked with AI-defined tags, and to modify metadata for the audio file to include AI-defined tags.
    Type: Grant
    Filed: August 9, 2018
    Date of Patent: March 30, 2021
    Assignee: Bank of America Corporation
    Inventors: James McCormack, Sean M. Gutman, Manu J. Kurian, Sasidhar Purushothaman, Suki Ramasamy, William P. Jacobson
  • Patent number: 10963679
    Abstract: Methods and systems for recognizing emotions in video are disclosed. One example method includes the steps of receiving a video including images, detecting a face of the individual in the images, mapping the detected face to a model including at least two separated points in space corresponding to detectable emotions, each of the at least two separated points in space representing a plurality of example faces corresponding to one of the detectable emotions, determining the emotion of the individual from the detectable emotions based on a proximity of the detected face to the at least two separated points in space.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: March 30, 2021
    Assignee: Snap Inc.
    Inventors: Victor Shaburov, Yurii Monastyrshyn
  • Patent number: 10957339
    Abstract: The present disclosure provides a speaker recognition method and apparatus, a computer device and a computer-readable medium. The method comprises: receiving target speech data of a to-be-recognized user in a target group; according to the target speech data, a pre-collected speech database and a pre-trained speaker recognition model, obtaining speech output features corresponding to the target speech data and speech output features corresponding to each of said speech data in the speech database; the speaker recognition model employs a convolution neural network model; recognizing the user corresponding to the target speech data according to the speech output features corresponding to the target speech data and the speech output features corresponding to each of said speech data in the speech database.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: March 23, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Ying Cao, Xiao Liu, Peng Hu, Jie Zhou, Shilei Wen
  • Patent number: 10949736
    Abstract: Systems, apparatus and methods are described including operations for a flexible neural network accelerator.
    Type: Grant
    Filed: November 3, 2016
    Date of Patent: March 16, 2021
    Assignee: Intel Corporation
    Inventors: Michael E Deisher, Ohad Falik
  • Patent number: 10924611
    Abstract: An evaluation criterion for a call performed between an operator and a customer is set without taking time and effort. A voice recognition system includes a call recording unit that records a call performed between a customer and an operator, a voice recognition unit that recognizes the call recorded by the call recording unit and a value of non-verbal information indicating a feature of a calling party in the call and accumulates a recognized result in a storage unit, and a voice recognition result managing unit that sets a reference value for evaluating the calling party on the basis of the value of the non-verbal information included in the recognized result.
    Type: Grant
    Filed: November 6, 2018
    Date of Patent: February 16, 2021
    Assignee: HITACHI INFORMATION & TELECOMMUNICATION ENGINEERING, LTD.
    Inventors: Yuko Kanetsuki, Takashi Sugiyama, Terumi Saito
  • Patent number: 10902105
    Abstract: Systems and methods for call detail record (CDR) analysis to determine a risk score for a call and identify fraudulent activity and for fraud detection in Interactive Voice Response (IVR) systems. An example method may store information extracted from received calls. Queries of the stored information may be performed to select data using keys, wherein each key relates to one of the received calls, and wherein the queries are parallelized. The selected data may be transformed into feature vectors, wherein each feature vector relates to one of the received calls and includes a velocity feature and at least one of a behavior feature or a reputation feature. A risk score for the call may be generated during the call based on the feature vectors.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: January 26, 2021
    Assignee: Pindrop Security, Inc.
    Inventors: Scott Strong, Kailash Patil, David Dewey, Raj Bandyopadhyay, Telvis Calhoun, Vijay Balasubramaniyan
  • Patent number: 10896682
    Abstract: A speaker recognition algorithm is trained (one or more of its models are tuned) with samples of a microphone signal produced by an inside microphone of a headphone, while the headphone is worn by a speaker. The trained speaker recognition algorithm then tests other samples of the inside microphone signal and produces multiple speaker identification scores for its given models, or a single speaker verification likelihood score for a single given model. Other embodiments are also described and claimed.
    Type: Grant
    Filed: October 23, 2018
    Date of Patent: January 19, 2021
    Assignee: APPLE INC.
    Inventor: Sorin V. Dusan
  • Patent number: 10884420
    Abstract: A cleaning robot and a shortest path planning method based on a cleaning robot are disclosed, a plurality of cleaning lines are formed by controlling the cleaning robot to perform cleaning in an area according to a zigzag-shaped path; association information of midpoints of at least a part of the cleaning lines is recorded to form a node skeleton tree in which midpoints are represented by nodes, the association information of each midpoint includes: position information of a node corresponding to the midpoint, position information of a parent node, and information of the number of child nodes of the parent node; in the process of traversing upwardly from a current node or traversing upwardly from both a current node and a target node in the node skeleton tree, the node skeleton tree is compressed, so as to determine the shortest planned path from the current node to the target node.
    Type: Grant
    Filed: December 12, 2018
    Date of Patent: January 5, 2021
    Assignee: SHENZHEN SILVER STAR INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Xuyi Deng, Yuxi Liu
  • Patent number: 10878807
    Abstract: The present disclosure relates to speech recognition systems and methods that enable personalized vocal user interfaces. More specifically, the present disclosure relates to combining a self-learning speech recognition system based on semantics with a speech-to-text system optionally integrated with a natural language processing system. The combined system has the advantage of automatically and continually training the semantics-based speech recognition system and increasing recognition accuracy.
    Type: Grant
    Filed: December 1, 2015
    Date of Patent: December 29, 2020
    Assignee: FLuent.AI Inc.
    Inventors: Vikrant Tomar, Mathieu Desruisseaux, Helge Seetzen
  • Patent number: 10878837
    Abstract: An acoustic environment identification system is disclosed that can use neural networks to accurately identify environments. The acoustic environment identification system can use one or more convolutional neural networks to generate audio feature data. A recursive neural network can process the audio feature data to generate characterization data. The characterization data can be modified using a weighting system that weights signature data items. Classification neural networks can be used to generate a classification of an environment.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: December 29, 2020
    Assignee: Snap Inc.
    Inventors: Jinxi Guo, Jia Li, Ning Xu
  • Patent number: 10861464
    Abstract: The present disclosure provides an electronic apparatus having an incremental enrollment unit and a method thereof. The electronic apparatus at least includes a microphone, a storage device, and a processor. The storage device stores a first screening rule, an enrollment database, and a first temporary storage library. The processor receives a command voice transmitted by the microphone, and compare the command voice with enrolled voices in the enrollment database. If determining that a similarity is larger than a threshold value, the processor stores the command voice as a first temporarily stored voice in the first temporary storage library. When a quantity of the first temporarily stored voices in the first temporary storage library is larger than a first predetermined value, the processor screens out a part of the first temporarily stored voices according to the first screening rule, so as to perform incremental enrollment.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: December 8, 2020
    Assignee: ASUSTEK COMPUTER INC.
    Inventor: Hai-Hsing Lin
  • Patent number: 10832684
    Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for providing personalized experiences to a computing device based on user input such as voice, text and gesture input are provided. Acoustic patterns associated with voice input, speech patterns, language patterns and natural language processing may be used to identify a specific user providing input from a plurality of users, identify user background characteristics and traits for the specific user, and topically categorize user input in a tiered hierarchical index. Topically categorized user input may be supplemented with user data and world knowledge and personalized responses and feedback for an identified specific user may be provided reactively and proactively.
    Type: Grant
    Filed: August 31, 2016
    Date of Patent: November 10, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Ruhi Sarikaya
  • Patent number: 10818288
    Abstract: Systems and processes for operating a virtual assistant to provide natural assistant interaction are provided. In accordance with one or more examples, a method includes, at an electronic device with one or more processors and memory: receiving a first audio stream including one or more utterances; determining whether the first audio stream includes a lexical trigger; generating one or more candidate text representations of the one or more utterances; determining whether at least one candidate text representation of the one or more candidate text representations is to be disregarded by the virtual assistant. If at least one candidate text representation is to be disregarded, one or more candidate intents are generated based on candidate text representations of the one or more candidate text representations other than the to be disregarded at least one candidate text representation.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: October 27, 2020
    Assignee: Apple Inc.
    Inventors: Juan Carlos Garcia, Paul S. McCarthy, Kurt Piersol
  • Patent number: 10803879
    Abstract: Apparatus and methods for audio classifying and processing are disclosed. In one embodiment, an audio processing apparatus includes an audio classifier for classifying an audio signal into at least one audio type in real time; an audio improving device for improving experience of audience; and an adjusting unit for adjusting at least one parameter of the audio improving device in a continuous manner based on the confidence value of the at least one audio type.
    Type: Grant
    Filed: November 9, 2017
    Date of Patent: October 13, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lie Lu, Alan J. Seefeldt, Jun Wang
  • Patent number: 10789962
    Abstract: A system and method are presented for the correction of packet loss in audio in automatic speech recognition (ASR) systems. Packet loss correction, as presented herein, occurs at the recognition stage without modifying any of the acoustic models generated during training. The behavior of the ASR engine in the absence of packet loss is thus not altered. To accomplish this, the actual input signal may be rectified, the recognition scores may be normalized to account for signal errors, and a best-estimate method using information from previous frames and acoustic models may be used to replace the noisy signal.
    Type: Grant
    Filed: November 12, 2018
    Date of Patent: September 29, 2020
    Inventors: Srinath Cheluvaraja, Ananth Nagaraja Iyer, Aravind Ganapathiraju, Felix Immanuel Wyss