Patents Examined by Paras D Shah
-
Patent number: 11853695Abstract: Data processing apparatus comprises a data memory; a selection controller comprising a computer processor; and a digital interface between a control process implemented by the selection controller and a text handling process implemented by the computer processor or another processor; in which: the selection controller is configured to provide a text document from the data memory to the text handling process to identify one or more characteristics of words in the text document; the selection controller is configured to provide user selection of one or more of the words in the text document to be substituted and of one or more target characteristics; and the selection controller is configured to request from the text handling process a set of one or more substitute words for the selected words such that the substitute words comply with the selected one or more of the target characteristics.Type: GrantFiled: January 12, 2021Date of Patent: December 26, 2023Assignee: SONY CORPORATIONInventor: Michael Anslow
-
Patent number: 11848012Abstract: Provided are an artificial intelligence (AI) system that utilizes a machine learning algorithm such as deep learning, etc., and an application of the AI system. A method performed by a device for providing a voice assistant service through a voice assistant program includes: receiving, from an external device, a character specialized model for the voice assistant program; receiving a user voice input including a request for a response of the voice assistant program and a word indicating a character; determining the character specialized model according to the word indicating the character; generating a response message to the request for the response of the voice assistant program, using the character specialized model; and outputting the generated response message.Type: GrantFiled: September 19, 2019Date of Patent: December 19, 2023Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Inchul Hwang, Dohee Kang, Seungyoun Kim, Dongchul Ma, Minkyu Park, Daegeun Yi, Dohun Cha
-
Patent number: 11847417Abstract: In some examples, data-driven social media analytics application synthesis may include generating, for each social media analytics application of a plurality of social media analytics applications, a corpus, performing term normalization, and generating a normalized corpus. An actor, an action and an object may be generated for each social media analytics application, which may be mapped into an embedding space. A semantic cohesion network may be generated for each social media analytics application, and a pair-wise semantic cohesion may be determined to identify semantically cohesive groups. A new social media analytics application may be synthesized based on the identified semantically cohesive groups.Type: GrantFiled: March 12, 2021Date of Patent: December 19, 2023Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITEDInventors: Janardan Misra, Vikrant Kaulgud, Sanjay Podder
-
Patent number: 11837214Abstract: Various embodiments of the present disclosure evaluate transcription accuracy. In some implementations, the system normalizes a first transcription of an audio file and a baseline transcription of the audio file. The baseline transcription can be used as an accurate transcription of the audio file. The system can further determine an error rate of the first transcription by aligning each portion of the first transcription with the portion of the baseline transcription, and assigning a label to each portion based on a comparison of the portion of the first transcription with the portion of the baseline transcription.Type: GrantFiled: October 29, 2020Date of Patent: December 5, 2023Assignee: United Services Automobile Association (USAA)Inventors: Michael J. Szentes, Carlos Chavez, Robert E. Lewis, Nicholas S. Walker
-
Patent number: 11830480Abstract: Systems and methods are provided for automatic speech recognition. In the method, the system obtains a padded sequence by processing a plurality of acoustic signals. The system compresses the padded sequence by reducing the size of the padded sequence to obtain a compressed sequence. The system inputs the compressed sequence into a pre-trained encoder neural network to obtain an encoded sequence and then decompresses the encoded sequence by recovering the encoded sequence to an original sequential ordering. The system inputs the encoded sequence to a decoding module to obtain recognition texts.Type: GrantFiled: February 17, 2021Date of Patent: November 28, 2023Assignee: KWAI INC.Inventors: Yongxiong Ren, Yang Liu, Heng Liu, Lingzhi Liu
-
Patent number: 11830477Abstract: An automatic speech recognition (ASR) system that determines a textual representation of a word from a word spoken in a natural language is provided. The ASR system uses an acoustic model, a language model, and a decoder. When the ASR system receives a spoken word, the acoustic model generates word candidates for the spoken word. The language model determines an n-gram score for each word candidate. The n-gram score includes a base score and a bias score. The bias score is based on a logarithmic probability of the word candidate, where the logarithmic probability is derived using a class-based language model where the words are clustered into non-overlapping clusters according to word statistics. The decoder decodes a textual representation of the spoken word from the word candidates and the corresponding n-gram score for each word candidate.Type: GrantFiled: August 14, 2020Date of Patent: November 28, 2023Assignee: Salesforce, Inc.Inventors: Young Mo Kang, Yingbo Zhou
-
Patent number: 11830482Abstract: Embodiments of the present disclosure relate to a method and an apparatus for speech interaction, and a computer readable storage medium. The method may include determining text information corresponding to a received speech signal. The method also includes obtaining label information of the text information by labeling elements in the text information. In addition, the method further includes determining first intention information of the text information based on the label information. The method further includes determining a semantic of the text information based on the first intention information and the label information.Type: GrantFiled: June 8, 2020Date of Patent: November 28, 2023Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTDInventors: Zhen Wu, Yufang Wu, Hua Liang, Jiaxiang Ge, Xingyuan Peng, Jinfeng Bai, Lei Jia
-
Patent number: 11822884Abstract: A method, computer program, and computer system to recover a dropped pronoun is provided for receiving data corresponding to one or more input words and determining contextual representations for the received input word data. The dropped pronoun may be identified based on a probability value associated with the contextual representations, and a span associated with one or more of the received input words may and that corresponds to which of the input words the dropped pronoun refers may be determined.Type: GrantFiled: July 25, 2022Date of Patent: November 21, 2023Assignee: TENCENT AMERICA LLCInventor: Linfeng Song
-
Patent number: 11810559Abstract: Embodiments described herein provide for a computer that detects one or more keywords of interest using acoustic features, to detect or query commonalities across multiple fraud calls. Embodiments described herein may implement unsupervised keyword spotting (UKWS) or unsupervised word discovery (UWD) in order to identify commonalities across a set of calls, where both UKWS and UWD employ Gaussian Mixture Models (GMM) and one or more dynamic time-warping algorithms. A user may indicate a training exemplar or occurrence of call-specific information, referred to herein as “a named entity,” such as a person's name, an account number, account balance, or order number. The computer may perform a redaction process that computationally nullifies the import of the named entity in the modeling processes described herein.Type: GrantFiled: June 6, 2022Date of Patent: November 7, 2023Assignee: PINDROP SECURITY, INC.Inventor: Hrishikesh Rao
-
Patent number: 11810584Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.Type: GrantFiled: February 10, 2021Date of Patent: November 7, 2023Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
-
Patent number: 11810578Abstract: Systems and processes for operating an intercom system via a digital assistant are provided. The intercom system is trigger-free, in that users communicate, in real-time, via devices without employing a trigger to speak. Acoustic fingerprints are employed to associate users with devices. Acoustic fingerprints include vector embeddings of speech input in an acoustic-feature vector space. Speech heard at multiple devices, as embedded in a fingerprint, may be clustered in the vector space, and the structure of the clusters is employed to associate users and devices. Based on the fingerprints, a device is mapped to a user, and the user employs that device to participate in a conversation, via the intercom service.Type: GrantFiled: October 16, 2020Date of Patent: November 7, 2023Assignee: Apple Inc.Inventors: Benjamin S. Phipps, Sachin Kajarekar, Eugene Ray, Mahesh Ramaray Shanbhag, Kisun You, Patrick L. Coffman
-
Patent number: 11806213Abstract: A speech transmission compensation apparatus that assists discrimination of speech heard by a user, includes: one or more computers each including a memory and a processor configured to: accept input of a speech signal, detect a specific type of sound in the speech signal, analyze an acoustic characteristic of the specific type of sound in the speech signal and output the acoustic characteristic; accept input of the acoustic characteristic being output by the memory and the processor, generate a vibration signal of a duration corresponding to the acoustic characteristic and output the vibration signal; and accept input of the vibration signal being output by the memory and the processor and provide the user with vibration for the duration on the basis of the vibration signal.Type: GrantFiled: April 30, 2020Date of Patent: November 7, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Asuka Ono, Momoko Nakatani, Ai Nakane, Yoko Ishii
-
Patent number: 11804232Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.Type: GrantFiled: February 10, 2021Date of Patent: October 31, 2023Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
-
Patent number: 11790931Abstract: A first VAD system outputs a pulse stream for zero crossings in an audio signal. The pulse density of the pulse stream is evaluated to identify speech. The audio signal may have noise added to it before evaluating zero crossings. A second VAD system rectifies each audio signal sample and processes each rectified sample by updating a first statistic and evaluating the rectified sample per a first threshold condition that is a function of the first statistic. Rectified samples meeting the first threshold condition may be used to update a second statistic and the rectified sample evaluated per a second threshold condition that is a function of the second statistic. Rectified samples meeting the second threshold condition may be used to update a third statistic. The audio signal sample may be selected as speech if the second statistic is less than a downscaled third statistic.Type: GrantFiled: October 27, 2020Date of Patent: October 17, 2023Assignee: Ambiq Micro, Inc.Inventor: Roger David Serwy
-
Patent number: 11790928Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.Type: GrantFiled: February 10, 2021Date of Patent: October 17, 2023Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
-
Patent number: 11790926Abstract: A method and apparatus for processing an audio signal are disclosed. According to an example embodiment, a method of processing an audio signal may include acquiring a final audio signal for an initial audio signal using a plurality of neural network models generating output audio signals by encoding and decoding input audio signals, calculating a difference between the initial audio signal and the final audio signal in a time domain, converting the initial audio signal and the final audio signal into Mel-spectra, calculating a difference between the Mel-spectra of the initial audio signal and the final audio signal in a frequency domain, training the plurality of neural network models based on results calculated in the time domain and the frequency domain, and generating a new final audio signal distinguished from the final audio signal from the initial audio signal using the trained neural network models.Type: GrantFiled: January 22, 2021Date of Patent: October 17, 2023Assignees: Electronics and Telecommunications Research Institute, The Trustees of Indiana UniversityInventors: Mi Suk Lee, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Jin Soo Choi, Minje Kim, Kai Zhen
-
Patent number: 11775763Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.Type: GrantFiled: February 25, 2021Date of Patent: October 3, 2023Assignee: Robert Bosch GmbHInventors: Jiacheng Li, Haibo Ding, Zhe Feng
-
Patent number: 11776544Abstract: An embodiment of the present invention provides an artificial intelligence (AI) apparatus for recognizing a speech of a user, the artificial intelligence apparatus includes a memory to store a speech recognition model and a processor to obtain a speech signal for a user speech, to convert the speech signal into a text using the speech recognition model, to measure a confidence level for the conversion, to perform a control operation corresponding to the converted text if the measured confidence level is greater than or equal to a reference value, and to provide feedback for the conversion if the measured confidence level is less than the reference value.Type: GrantFiled: May 18, 2022Date of Patent: October 3, 2023Assignee: LG ELECTRONICS INC.Inventors: Jaehong Kim, Hyoeun Kim, Hangil Jeong, Heeyeon Choi
-
Patent number: 11769488Abstract: A system and method invoke virtual assistant action, which may comprise an argument. From audio, a probability of an intent is inferred. A probability of a domain and a plurality of variable values may also be inferred. Invoking the action is in response to the intent probability exceeding a threshold. Invoking the action may also be in response to the domain probability exceeding a threshold, a variable value probability exceeding a threshold, detecting an end of utterance, and a specific amount of time having elapsed. The intent probability may increase when the audio includes speech of words with the same meaning in multiple natural languages. Invoking the action may also be conditional on the variable value exceeding its threshold within a certain period of time of the intent probability exceeding its threshold.Type: GrantFiled: March 3, 2022Date of Patent: September 26, 2023Assignee: SoundHound AI IP, LLCInventors: Sudharsan Krishnaswamy, Maisy Wieman, Jonah Probell
-
Patent number: 11763821Abstract: Various tools are disclosed for providing assistive or augmentative means to enhance the fluency and accuracy of persons having speech disabilities. These technologies may automatically ascertain and dynamically improve the accuracy with which automatic speech recognition (ASR) systems recognize utterances of persons having impaired speech conditions. In an embodiment, digitized audio information about a speaker’s utterance is processed to determine a set of candidate words matching the utterance. From these candidate words, a set of concepts is determined using a finite state machine model. A pictogram representing each concept is identified and presented to the speaker so that the speaker may select the pictogram corresponding to the best match of his or her intended meaning associated with the utterance. An action corresponding to speaker’s selection then may be performed. For example, displaying or synthesizing speech from textual information describing the selected concept.Type: GrantFiled: June 27, 2019Date of Patent: September 19, 2023Assignee: Cerner Innovation, Inc.Inventor: Douglas S. McNair