Patents Examined by Neeraj Sharma

Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier

Patent number: 11837247

Abstract: An audio decoder for providing a decoded audio signal representation on the basis of an encoded audio signal representation is configured to adjust decoding parameters in dependence on a configuration information, to decode one or more audio frames using a current configuration information, to compare a configuration information in a configuration structure associated with one or more frames to be decoded by the current configuration information, and to make a transition to perform decoding using the configuration information in the configuration structure associated with the one or more frames to be decoded as a new configuration information if the configuration information in the configuration structure associated with the one or more frames to be decoded, or a relevant portion thereof, is different from the current configuration information, and to consider a stream identifier information included in the configuration structure when comparing the configuration information.

Type: Grant

Filed: November 30, 2021

Date of Patent: December 5, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Max Neuendorf, Matthias Felix, Matthias Hildenbrand, Lukas Schuster, Ingo Hofmann, Bernd Herrmann, Nikolaus Rettelbach
Generating and providing inclusivity data insights for evaluating participants in a communication

Patent number: 11830496

Abstract: The present disclosure relates to determining communication inclusivity amongst speakers during a user communication. Communication inclusivity is a targeted analysis that collectively evaluates speaking opportunities (provided and taken by users) during a user communication and thought completion during speech associated with the user communication. To derive communication inclusivity, a user communication is modeled as a probabilistic interaction between speakers, where a sequence of speaking states of the user communication is identified and analyzed. Non-limiting examples of speaking states comprise: active user speech; periods of silence; overlapping speakers; icon indication; questions in corresponding chat windows; combination states; other contextual signals; and any combination thereof. With these observed sequences of speaking states, a probability distribution is modeled over transitions between states to predict inclusivity of a user communication.

Type: Grant

Filed: December 1, 2020

Date of Patent: November 28, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventor: Gaurav Vinayak Tendolkar
Natural language processing system

Patent number: 11803542

Abstract: Provided is a natural language processing system for determining a merchant based on a natural language query. The system may include a processor to receive a natural language query, convert at least one word of the natural language query to a vector using at least one neural network to form a set of vectors, determine a vector distance from the set of vectors to each profile in a set of profiles, rank the set of profiles based on the vector distance of each profile to the set of vectors, communicate merchant data associated with at least one merchant included in the set of profiles to the user device, receive a selection of a first merchant associated with the merchant data from the user device, and schedule an appointment with the first merchant for a user of the user device. A computer program product and method are also disclosed.

Type: Grant

Filed: September 3, 2021

Date of Patent: October 31, 2023

Assignee: Visa International Service Association

Inventors: Abhishek Parasnath Yadav, Rahul Singhal, Alok Yadav, Rajesh Hanumakonda
Systems and methods for integrating voice controls into applications

Patent number: 11798542

Abstract: The disclosed computer-implemented method may include receiving input voice data synchronous with a visual state of a user interface of the third-party application, generating multiple sentence alternatives for the received input voice data, identifying a best sentence of the multiple sentence alternatives, executing a dialog script for the third-party application using the best sentence, the dialog script generating a response to the received voice data comprising output voice data and a corresponding visual response, and providing the visual response and the output voice data to the third-party application, the third-party application playing the output voice data synchronous with updating the user interface based on the visual response. Various other methods, systems, and computer-readable media are also disclosed.

Type: Grant

Filed: March 26, 2021

Date of Patent: October 24, 2023

Assignee: Alan AI, Inc.

Inventors: Andrey Ryabov, Ramu V. Sunkara
Personal electronic captioning based on a participant user's difficulty in understanding a speaker

Patent number: 11783836

Abstract: Providing, using a computer, personalized captioning in response to a user having difficulty in understanding another participant speaking. Detecting, at a computer, the user having difficulty in understanding another participant speaking, or alternatively, receiving, at the computer, a communication from a user indicating that the user requests assistance to understand speech of a particular participant in an electronic group meeting. The speech of the particular participant is identified, and a speech input is captured from the particular participant in the electronic group meeting. The captured speech of the particular participant is transcribed, and an audio assistance output is generated for communication to the user. The audio assistance output is communicated to a device of the user.

Type: Grant

Filed: September 30, 2020

Date of Patent: October 10, 2023

Assignee: International Business Machines Corporation

Inventors: Heather Saunders, Dana L. Price, Kelly Camus
Systems and methods for reading comprehension for a question answering task

Patent number: 11775775

Abstract: Embodiments described herein provide a pipelined natural language question answering system that improves a BERT-based system. Specifically, the natural language question answering system uses a pipeline of neural networks each trained to perform a particular task. The context selection network identifies premium context from context for the question. The question type network identifies the natural language question as a yes, no, or span question and a yes or no answer to the natural language question when the question is a yes or no question. The span extraction model determines an answer span to the natural language question when the question is a span question.

Type: Grant

Filed: November 26, 2019

Date of Patent: October 3, 2023

Assignee: Salesforce.com, Inc.

Inventors: Akari Asai, Kazuma Hashimoto, Richard Socher, Caiming Xiong
Authentication system, authentication method, and, non-transitory computer-readable information recording medium for recording program

Patent number: 11776543

Abstract: An authentication system prevents leakage of a key-reading speech during user authentication based on the key-reading speech of a user reading an authentication key. For each user ID, a storage stores a voiceprint of a user in association with a recorded sound including speech spoken previously by the user. A specifier specifies the user ID of a user attempting to receive authorization. An outputter outputs a masking sound that includes the recorded sound recorded in association with the specified user ID. An acquirer acquires a key-reading speech of the user reading the authentication key and the output masking sound. A remover acquires a second sound by removing the masking sound from the acquired first sound. A determiner determines whether the user has authority pertaining to the specified user ID based on the acquired second sound.

Type: Grant

Filed: May 3, 2021

Date of Patent: October 3, 2023

Assignee: Passlogy Co., Ltd.

Inventors: Motohiko Mitsuno, Hideharu Ogawa
Natural language processing system for context-specific applier interface

Patent number: 11776537

Abstract: A computer-implemented method is provided to optimize natural language processing of voice interaction data in product/service categorization and product/service application. The computer-implemented method receives, from a voice interaction device through a context discovery interface, user voice data corresponding to a user. Furthermore, the computer-implemented method performs, with an NLP engine, natural language processing of the user voice data to determine a context category. Additionally, the computer-implemented method selects, with an AI engine, one of a plurality of context-specific applier interfaces based on the context category. The computer-implemented method automatically transitions, with the AI engine, to said one of the plurality of context-specific applier interfaces. Finally, the computer-implemented method interacts, via the AI engine, with the user via a voice interaction to initiate the product/service application.

Type: Grant

Filed: December 7, 2022

Date of Patent: October 3, 2023

Assignee: Blue Lakes Technology, Inc.

Inventors: Anand Menon, Satyaprashvitha Nara
Method and device for processing a multi-language text

Patent number: 11763102

Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a multi-language text. According to embodiments of the present disclosure, the multi-language text including contents in a plurality of languages may be encoded with a Unicode. The method further comprises splitting the multi-language text into a plurality of parts based on the Unicode of the multi-language text, contents of the plurality of parts having different languages. In addition, the multi-language text may also be processed based on the plurality of parts.

Type: Grant

Filed: February 26, 2021

Date of Patent: September 19, 2023

Assignee: EMC IP Holding Company, LLC

Inventors: Kun Wu Huang, Winston Lei Zhang, Chao Chen, Jingjing Liu, Duke Hongtao Dai
Voice recognition apparatus, voice recognition method, and program

Patent number: 11756552

Abstract: A voice recognition apparatus includes: a communication circuit acquiring a speech sentence that is a result of voice recognition of a speech; a storage storing digit number information indicative of the maximum number of digits; and a control circuit. When the number of digits of a first numerical value indicated by a first numeral included in the speech sentence is larger than the maximum number of digits, the control circuit replaces the first numeral in the speech sentence with a second numeral indicative of a second numerical value having the number of digits equal to or less than the maximum number of digits. The control circuit divides the first numeral into a plurality of numerals and adds numerical values respectively indicated by the plurality of numerals to calculate the second numerical value.

Type: Grant

Filed: October 29, 2020

Date of Patent: September 12, 2023

Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.

Inventor: Natsuki Saeki
Goal-oriented dialog system

Patent number: 11749282

Abstract: A dialog system receives a user request corresponding to a dialog with a user. The dialog system processes the user request to determine multiple service providers capable of responding to the user request. The dialog system selects one service provider based on a request-to-handle score, and selects another service provider based on a satisfaction rating. The dialog system updates the dialog state based on further input provided by the user to determine an output responsive to the user request.

Type: Grant

Filed: May 5, 2020

Date of Patent: September 5, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Arindam Mandal, Devesh Mohan Pandey, Kjel Larsen, Prakash Krishnan, Raefer Christopher Gabriel
Transcription generation technique selection

Patent number: 11741964

Abstract: A method to transcribe communications may include selecting a first transcription generation technique from among multiple transcription generation techniques for generating transcriptions of audio of one or more communication sessions that involve a user device and obtaining performances of the multiple transcription generation techniques with respect to generating the transcriptions of the audio. The method may also include monitoring comparisons between the performances of the multiple transcription generation techniques and obtaining input from the user with respect to the comparisons. The method may further include selecting a second transcription generation technique from among the multiple transcription generation techniques based on the input from the user.

Type: Grant

Filed: May 27, 2020

Date of Patent: August 29, 2023

Assignee: Sorenson IP Holdings, LLC

Inventor: David Thomson
Natural language input processing

Patent number: 11721330

Abstract: Techniques for intelligently selecting a component to execute with respect to a natural language user input are described. A natural language processing (NLP) system may receive first data representing a natural language input. The NLP system may determine first and second scores representing first and second confidences that first and second components are to be invoked to perform actions responsive to the natural language input, respectively. Based on the first and second scores, the NLP system may determine further information is needed to determine which of the first or second component is to be invoked. The NLP system may query a user for the further information. Based on the further information, the NLP system may determine third and fourth scores representing third and fourth confidences that the first and second components are to be invoked to perform actions responsive to the natural language input, respectively.

Type: Grant

Filed: September 4, 2019

Date of Patent: August 8, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Rajesh Kumar Pandey, Julia Kennedy Nemer, David Thomas, Isaac Joseph Madwed, Rashmi Tonge
Intuitive computing methods and systems

Patent number: 11715473

Abstract: A smart phone senses audio, imagery, and/or other stimulus from a user's environment, and acts autonomously to fulfill inferred or anticipated user desires. In one aspect, the detailed technology concerns phone-based cognition of a scene viewed by the phone's camera. The image processing tasks applied to the scene can be selected from among various alternatives by reference to resource costs, resource constraints, other stimulus information (e.g., audio), task substitutability, etc. The phone can apply more or less resources to an image processing task depending on how successfully the task is proceeding, or based on the user's apparent interest in the task. In some arrangements, data may be referred to the cloud for analysis, or for gleaning. Cognition, and identification of appropriate device response(s), can be aided by collateral information, such as context. A great number of other features and arrangements are also detailed.

Type: Grant

Filed: September 1, 2020

Date of Patent: August 1, 2023

Assignee: Digimarc Corporation

Inventors: Tony F. Rodriguez, Geoffrey B. Rhoads, Bruce L. Davis
Link-based audio recording, collection, collaboration, embedding and delivery system

Patent number: 11715455

Abstract: A machine has a processor and a memory connected to the processor. The memory stores instructions executed by the processor to supply a name page in response to a request from an administrator machine. Name page updates are received from the administrator machine. The name page updates include participants and associated network contact information for the participants. A code is utilized to form a link to the name page. Prompts for textual name information and audio name information are supplied to a client machine that activates the link to the name page. Textual name information and audio name information are received from the client machine. The textual name information and audio name information are stored in association with the name page. Navigation tools are supplied to facilitate access to the textual name information and audio name information.

Type: Grant

Filed: October 12, 2020

Date of Patent: August 1, 2023

Assignee: NAMECOACH, INC.

Inventor: Praveen Shanbhag
Decoding apparatus and method, and program

Patent number: 11705140

Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality. A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.

Type: Grant

Filed: May 6, 2020

Date of Patent: July 18, 2023

Assignee: Sony Corporation

Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Runyu Shi
Systems and methods for automatic speech translation

Patent number: 11704507

Abstract: A method for providing automatic interpretation may include receiving, by a processor, audible speech from a speech source, generating, by the processor, in real-time, a speech transcript by applying an automatic speech recognition model on the speech, segmenting, by the processor, the speech transcript into speech segments based on a content of the speech by applying a segmenter model on the speech transcript, compressing, by the processor, the speech segments based on the content of the speech by applying a compressor model on the speech segments, generating, by the processor, a translation of the speech by applying a machine translation model on the compressed speech segments, and generating, by the processor, audible translated speech based on the translation of the speech by applying a text to speech model on the translation of the speech.

Type: Grant

Filed: October 31, 2022

Date of Patent: July 18, 2023

Assignee: KUDO, INC.

Inventor: Claudio Fantinuoli
Automated transcript generation from multi-channel audio

Patent number: 11699456

Abstract: Systems and methods are described for generating a transcript of a legal proceeding or other multi-speaker conversation or performance in real time or near-real time using multi-channel audio capture. Different speakers or participants in a conversation may each be assigned a separate microphone that is placed in proximity to the given speaker, where each audio channel includes audio captured by a different microphone. Filters may be applied to isolate each channel to include speech utterances of a different speaker, and these filtered channels of audio data may then be processed in parallel to generate speech-to-text results that are interleaved to form a generated transcript.

Type: Grant

Filed: February 12, 2021

Date of Patent: July 11, 2023

Assignee: Veritext, LLC

Inventors: Anthony Donofrio, David Joseph DaSilva, James Andrew Maraska, Jr., Jonathan Mordecai Kaplan
Natural language processing with missing tokens in a corpus

Patent number: 11687723

Abstract: Text blocks are semantically compared, and a semantic score is provided to a user. The semantic score is based on application of a machine learning model trained on a text corpus. One or both of the two text blocks may have one or more words that do not appear in the training text corpus (skip-words). Skip-words are used, rather than discarded, to adjust the semantic score via, for example, a penalization function. The user provides feedback about the accuracy of the adjusted semantic score, and the feedback is used to perform supervised learning model.

Type: Grant

Filed: March 23, 2020

Date of Patent: June 27, 2023

Assignee: International Business Machines Corporation

Inventors: Raj Nagesh, Charles Christopher Walker, Kriteshwar Kaur Kohli
Methods and systems for querying for parameter retrieval

Patent number: 11676496

Abstract: Systems and methods to identify a query parameter in an incoming flight voice or data communication to respond to a request. A processing system configured to: in response to receipt of a clearance message, decode the clearance message to determine whether the clearance message contains a command instruction or clearance data for a flight, and to present the command instruction to a pilot as notice to execute the command instruction or if available, obtain at least one query parameter from the clearance data to configure in a query operation to present in response to a pilot question about the command instruction. In response to receipt of the voice or data communication, determine further an intent within the voice or data communication of a question or instruction voiced by applying an acoustic model for tagging identified parts about the question or instruction voiced with query parameters in response to the pilot.

Type: Grant

Filed: June 2, 2020

Date of Patent: June 13, 2023

Assignee: HONEYWELL INTERNATIONAL INC.

Inventors: Hariharan Saptharishi, Gobinathan Baladhandapani, Mahesh Kumar Sampath, Sivakumar Kanagarajan

prev 1 2 3 4 5 6 … next